How to drop duplicated rows in a DataFrame with Pandas using Python

February 1, 20221 min readDataFrame Pandas Fundamentals Getting Started

7-Day Challenge

Land Your First Data Science Job

A proven roadmap to prepare for $75K+ entry-level data roles. Perfect for Data Scientist ready to level up their career.

Build portfolios that hiring managers love

Master the Python and SQL essentials to be industry-ready

Practice with real interview questions from tech companies

Access to the $100k/y Data Scientist Cheatsheet

In order to drop duplicated rows or columns, you can use the DataFrame.drop_duplicates() method.

The example

Imagine we have a DataFrame that has two similar rows but you want to eliminate one of the duplicates.

Here is the code

# To work with dataframes
import pandas as pd

# We create a sample dataframe
df = pd.DataFrame({"col1" : [0, 10, 3, 11, 3, 12],
                   "col2" : [1, 2, 2, 12, 2, 12]})

# We assign the dataframe
df = df.drop_duplicates()

# We print the dataframe
print(df)

Here you are! You now know how to drop duplicated rows in a DataFrame with Pandas using Python.

More on DataFrames

If you want to know more about DataFrame and Pandas. Check out the other articles I wrote on the topic, just here :

Pandas - The Python You Need

We gathered the only Python essentials that you will probably ever need.

The Python You Need

7-Day Challenge

Land Your First Data Science Job

A proven roadmap to prepare for $75K+ entry-level data roles. Perfect for Data Scientist ready to level up their career.

Build portfolios that hiring managers love

Master the Python and SQL essentials to be industry-ready

Practice with real interview questions from tech companies

Access to the $100k/y Data Scientist Cheatsheet

Continue your learning journey with these related topics

Getting Started

3 min read

How long does it take to learn Python for Data Science in 2023

Learn how long it takes to learn Python for Data Science in 2021. What does it take to be a Data Scientist in 2021?

4/18/2023Read More

DataFrame

1 min read

How to merge two time-series DataFrames with different time intervals

Learn how to merge time-series DataFrames with different frequencies in Pandas using the resample and merge/concat methods.

4/8/2023Read More

Finance

2 min read

How to do Financial Analysis with Pandas using Python

Learn how to use yfinance, a Python library to fetch financial market data such as stock prices, dividends, financial statements, options, and sustainability data for a company or index. With yfinance, you can easily retrieve and analyze financial data for informed decision-making.

4/6/2023Read More

Free Newsletter

Master Data Science in Days, Not Months 🚀

Skip the theoretical rabbit holes. Get practical data science skills delivered in bite-sized lessons – Approach used by real data scientist. Not bookworms. 📚

Weekly simple and practical lessons

Access to ready to use code examples

Skip the math, focus on results

Learn while drinking your coffee

Land Your First Data Science Job

The example

Here is the code

More on DataFrames

Land Your First Data Science Job

Related Articles

Master Data Science in Days, Not Months 🚀