Tuesday, 11 May 2010

How to remove duplicate rows from an excel file

The method demonstrated in this post is applicable for Microsoft Office Excel 2007 , Microsoft Office Excel 2003 and Microsoft Excel 2002.
Normally when we import data using dataLoader or toad from excel, we have to clear lots of duplicate from this legacy data. So this post will give an easy way of handle this issue of duplicate.
A duplicate row or a duplicate record means all values in the row are an exact match of all the values in another row.

In this demonstartion, will filter excel list for unique rows, delete the original list, and then replace it with the filtered list. Before proceeding to this demo, I would recommend you to make a backup copy of your excel sheet to avoid loss of your valuable data.

Step by step solution is provided with screenshots

Step#1: Open the excel document. Select all the rows, including the column headers.

Step#2: On the Data menu, click Filter first, immediately the lov boxes will appear in the column headers, and then click Advanced Filter, screen will looks like the one in the picture

Step#3: Select the Unique records only check box, and then click OK.

The filtered list is displayed and the duplicate rows are hidden.

Step#4: On the Edit menu, click Office Clipboard.
The Clipboard task pane is displayed.

Step#5: Copy the filtered list. You filtered list should be still selected and you just need to press CTRL+C or just click Copy button.

In this stage the filtered list is highlighted with bounding outlines and the selection appears as an item at the top of the Clipboard.

Step#6: In this step just delete the original list because we want to overwrite it. To delete the original list on the Data menu, point to Filter, click Show All, original list will be displayed and then press DELETE key.

Step#7: In the Clipboard, click on the filtered list item. The filtered list appears in the same location as the original list.

Related Posts by Categories

No comments: