Excel question - removing duplicate words
I want to eliminate duplicate words in Excel. Let's say cell A1 reads:
John thinks pizza pizza is better than pappa john
Excel should automatically rewrite it in cell B1 as:
John thinks pizza is better than pappa
It needs to be a universal formula, because it's not just "pizza" and "john" that are duplicated. I've got about 10,000 rows of data with duplicate words, and I am working on a keyword list with a limited length allowance. Hope someone can help. I asked my friend and Google, and they are smart, but they couldn't help. |
There might be a more direct way in Excel that I do not know of, but this is how I would solve it, quick and dirty:
Export everything to a comma-separated file. Search and replace all spaces with commas. Import it back into Excel, with each word in its own column. Then apply some IF conditions: IF B1=A1, then delete B1; ELSE leave B1 as is and proceed to C1. |
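The steps above (split on spaces, compare each word with the one before it, drop it if they match) can be sketched in Python. This is only a rough illustration, not an Excel formula: the function name `drop_adjacent_repeats` is made up for the example, and it assumes words are separated by whitespace and that "pizza" and "Pizza" should count as the same word.

```python
def drop_adjacent_repeats(text):
    """Drop a word when it equals the word immediately before it."""
    kept = []
    for word in text.split():  # split on whitespace, one word per "column"
        # Same test as "IF B1=A1 then delete B1", ignoring case (an assumption)
        if kept and kept[-1].lower() == word.lower():
            continue
        kept.append(word)
    return " ".join(kept)


sample = "John thinks pizza pizza is better than pappa john"
print(drop_adjacent_repeats(sample))
# John thinks pizza is better than pappa john
```

Note that comparing only neighbouring words catches "pizza pizza" but leaves the trailing "john" in place, because it is not next to the first "John".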
Yeah, I guess that's one way to do it :) I wonder if there's a formula for it, though.
|
I doubt there is a formula, because if I understood correctly you have to define which dupe to remove, and that is going to be specific to each particular case.
If you don't care which of the dupes gets removed, then I have a solution for sentences of up to 8 words. First of all, do what Adraco said and put each word in a separate cell. Then here's your formula:
IF(AND(A1<>B1,A1<>C1,A1<>D1, and so on, up to 8 conditions),A1,"blahblah")
Make sure the first output cell is at least 8 columns to the right of the last cell of the input.
When done, copy the output, go to a separate sheet and use Paste Special, selecting "paste values". Then run search and replace for "blahblah" to empty the cells that contain it.
Then copy the output and paste it into Notepad, select the gap between two words (it will be a Tab character) and run search and replace to replace it with a space. This can create double spaces; you can remove them afterwards.
Hope this helps to solve at least a part of your problem ;) |
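For comparison, here is a minimal Python sketch of the same split / compare / rejoin idea, done outside Excel. It rests on a few assumptions of mine: the first occurrence of each word is the one to keep (which matches the example in the question), the comparison ignores case so that "john" counts as a repeat of "John", and the function name `dedupe_words` is just a placeholder.

```python
def dedupe_words(text):
    """Keep only the first occurrence of each word, ignoring case."""
    seen = set()
    kept = []
    for word in text.split():  # whitespace split, any number of words
        key = word.lower()     # case-insensitive comparison (an assumption)
        if key not in seen:
            seen.add(key)
            kept.append(word)
    return " ".join(kept)      # single spaces, so no double-space cleanup


# Hypothetical rows, standing in for the values of column A
rows = ["John thinks pizza pizza is better than pappa john"]
cleaned = [dedupe_words(row) for row in rows]
print(cleaned[0])
# John thinks pizza is better than pappa
```

Unlike the 8-condition formula, this is not limited to 8 words per row, and it needs no placeholder text or Tab replacement afterwards. It still cannot tell whether a repeated word is needed for the sentence to make sense.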
Oh lol... I've just realized that I basically suggested the same thing as Adraco, but from "the other end" :)
Bottom line: if you need to preserve the sense of the sentences, I doubt there is a way to automate the dupe removal. |