SIVANANDA REDDY GANGIREDDY: How to eliminate duplicates in a table?

Saturday, June 1, 2013

How to eliminate duplicates in a table?

The following Microsoft SQL Server T-SQL script demonstrates three ways to remove duplicate rows from a table: SELECT DISTINCT INTO, GROUP BY, and a CTE with ROW_NUMBER().

-- Create a test table with SELECT INTO; each ListPrice is increased by $1.00

USE tempdb;

SELECT ProductID=CONVERT(int, ProductID),

ProductName = Name,

ListPrice = ListPrice + 1.00

INTO Product

FROM AdventureWorks2008.Production.Product

WHERE ListPrice > 0.0

-- (304 row(s) affected)

-- Insert full row (line) duplicates

INSERT INTO Product

SELECT TOP (100) ProductID=CONVERT(int, ProductID),

ProductName = Name,

ListPrice = ListPrice + 1.00

FROM AdventureWorks2008.Production.Product

WHERE ListPrice > 0.0

ORDER BY NEWID()

-- (100 row(s) affected)

SELECT COUNT(*) FROM Product

-- 404

------------

-- Eliminate identical duplicates (entire row identical) with SELECT DISTINCT INTO

------------

SELECT DISTINCT *

INTO dedupProduct

FROM Product

-- (304 row(s) affected)

------------

-- Eliminate duplicates with GROUP BY

------------

SELECT ProductID, ProductName, ListPrice

INTO dedupProductGROUPBY

FROM Product

GROUP BY ProductID, ProductName, ListPrice

-- (304 row(s) affected)

------------

-- Eliminating / deleting duplicates based on duplicate keys - CTE / ROW_NUMBER

------------

;WITH CTE AS (

SELECT RN=ROW_NUMBER() OVER (PARTITION BY ProductID

ORDER BY ProductName)

FROM Product)

DELETE CTE

WHERE RN > 1

-- (100 row(s) affected)

SELECT COUNT(ProductID) FROM Product

-- 304

DROP TABLE tempdb.dbo.Product

DROP TABLE tempdb.dbo.dedupProduct

DROP TABLE tempdb.dbo.dedupProductGROUPBY
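
Before deleting by key, it can help to preview which keys are duplicated and to run the delete inside a transaction. The following is a minimal sketch of that idea, not part of the original script: the temporary table #DemoProduct and its sample rows are hypothetical, chosen so the example runs without the AdventureWorks2008 database. It assumes SQL Server 2008 or later (multi-row VALUES).

```sql
-- Hypothetical sample data; #DemoProduct is not part of the original script.
USE tempdb;

IF OBJECT_ID('tempdb..#DemoProduct') IS NOT NULL
    DROP TABLE #DemoProduct;

CREATE TABLE #DemoProduct (
    ProductID   int          NOT NULL,
    ProductName nvarchar(50) NOT NULL,
    ListPrice   money        NOT NULL
);

INSERT INTO #DemoProduct (ProductID, ProductName, ListPrice)
VALUES (1, N'Road Frame',    338.42),
       (1, N'Road Frame',    338.42),  -- one extra copy of ProductID 1
       (2, N'Sport Helmet',   35.99),
       (3, N'Mountain Tire',  25.99),
       (3, N'Mountain Tire',  25.99),  -- two extra copies of ProductID 3
       (3, N'Mountain Tire',  25.99);

-- Step 1: preview the duplicated keys and how many extra rows each one has.
SELECT ProductID, ExtraRows = COUNT(*) - 1
FROM #DemoProduct
GROUP BY ProductID
HAVING COUNT(*) > 1;

-- Step 2: delete the extra rows inside a transaction,
-- using the same CTE / ROW_NUMBER technique as the script above.
BEGIN TRANSACTION;

WITH CTE AS (
    SELECT RN = ROW_NUMBER() OVER (PARTITION BY ProductID
                                   ORDER BY ProductName)
    FROM #DemoProduct
)
DELETE FROM CTE
WHERE RN > 1;

-- Step 3: commit only if no ProductID is still duplicated.
IF EXISTS (SELECT 1
           FROM #DemoProduct
           GROUP BY ProductID
           HAVING COUNT(*) > 1)
BEGIN
    ROLLBACK TRANSACTION;
END
ELSE
BEGIN
    COMMIT TRANSACTION;
END;

-- Six rows were inserted and three were extra copies, so three rows remain.
SELECT RemainingRows = COUNT(*) FROM #DemoProduct;

DROP TABLE #DemoProduct;
```

The preview query in Step 1 is read-only, so it is safe to run against a real table before choosing which of the three techniques to apply.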

How to remove duplicate rows from a table in SQL Server

SQL SERVER Delete Duplicate Records / Rows by Pinal Dave
