Monday, July 4, 2011

Removing Duplicate Rows in Open Office Calc

There is no ready made solution to remove duplicate items in spreadsheet. Use the following method to keep the sheet clean from duplicates.


  1. Create one new column before or after the column needs to be filtered. (ie) if column "A" need to be filtered then create before to "A" Now "A" become "B" and new one become as "A".
  2. Enter '1' in cell "A1" and in "A2", enter the formula =if(B2=B1,0,1). 
  3. Then copy the cell "A2" and paste it through out the rows the sheet have. (ie A2 - Axx)
  4. Now the Column "A" contains only 0's and 1's. It means that the row contains '1' or unique values and '0' means duplicates.
  5. Now, copy the entire "A" column and place the cursor in "A1".
  6. Then select 'Special paste' from Edit menu or else click 'Ctrl+Shift+V' to get that.
  7. In that, un-check the 'Paste all' option and un-check all the options except 'Text and Numbers'. Then press 'OK'. It will paste only the values not formula's.
  8. Now insert one new row at the top of all and enter 1 on cell "A1".
  9. Now place the cursor in "A1" and click sort button in tools bar. This will sort the entire sheet with respect to column "A".
  10. Now, the duplicate values with '0' are all in top of the sheet and unique rows are resides at bottom.
  11. Now, remove all the rows which are all having value '0' in column "A".
  12. Now the sheet is cleaned and remove the column "A" to get the exact list without duplicates.

For MS-Office some steps differs but the concept is same.



No comments:

Post a Comment