I have a large csv file and I want to delete rows with words containing "Names" [duplicate]

Question

I have a large CSV file that contains repeated rows. I want to delete all of the repeated rows that contain the word "Names".

1 Names Dates Picture
2 Alex 6-12 4364.jpg
3 Names Dates Picture
4 Jade 8-11 7435.jpg
5 Names Dates Picture
6 Dread 1-5 8635.jpg

The CSV file looks like the sample above. I want to delete every row that repeats the header values "Names", "Dates", "Picture".

I have tried several methods that I found online, but none of them solved the problem.

I'm using pandas to import the CSV file:

df = pd.read_csv('file2022.csv')

The "Names" row seems to be the column header, but it's repeated in the content. How is your file generated?
– Ynjxsjmh, commented Apr 17, 2022 at 13:42

NYC Coder · Accepted Answer · 2022-04-17 12:59:08Z

You can use drop_duplicates here:

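import pandas as pd
# r'\s+' splits on runs of whitespace (' *' can also match the empty string, which breaks the split on Python 3.7+).
df = pd.read_csv('test2.csv', sep=r'\s+', engine='python', header=None, index_col=0)
# keep=False drops every row that occurs more than once, i.e. the repeated header rows.
df.drop_duplicates(keep=False, inplace=True)
df.reset_index(inplace=True, drop=True)
print(df)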

Output:

       1     2         3
0   Alex  6-12  4364.jpg
1   Jade  8-11  7435.jpg
2  Dread   1-5  8635.jpg
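To try this without the original file, here is a minimal sketch that runs the same idea on an in-memory copy of the sample from the question. The whitespace-separated layout with a leading row number is an assumption based on how the sample is displayed; the real file may be delimited differently. Note that `keep=False` removes every row that occurs more than once, so genuine data rows that happen to repeat would be dropped along with the header rows.

```python
import io
import pandas as pd

# Assumed layout: whitespace-separated, with a leading row number,
# mirroring the sample shown in the question.
sample = """\
1 Names Dates Picture
2 Alex 6-12 4364.jpg
3 Names Dates Picture
4 Jade 8-11 7435.jpg
5 Names Dates Picture
6 Dread 1-5 8635.jpg
"""

# Column 0 (the row number) becomes the index, so it is ignored
# when drop_duplicates compares rows.
df = pd.read_csv(io.StringIO(sample), sep=r"\s+", header=None, index_col=0)

# keep=False removes every member of a duplicated group, not just the extras.
cleaned = df.drop_duplicates(keep=False).reset_index(drop=True)
print(cleaned)
```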

LetsSeo · Accepted Answer · 2022-04-17 12:52:59Z

df = df[df["Names"] != "Names"]

This should drop every row whose value in the "Names" column is the literal string "Names".
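A minimal sketch of this filter on an in-memory sample, assuming the file is comma-separated and that its first line is the real header. Both are assumptions, since the question does not show the raw file, and the `raw` string below is a hypothetical stand-in for it.

```python
import io
import pandas as pd

# Hypothetical comma-separated version of the sample from the question.
raw = """\
Names,Dates,Picture
Alex,6-12,4364.jpg
Names,Dates,Picture
Jade,8-11,7435.jpg
Names,Dates,Picture
Dread,1-5,8635.jpg
"""

# The first line is used as the header; later copies are read as data rows.
df = pd.read_csv(io.StringIO(raw))

# Keep only rows whose "Names" cell is not the literal header text.
df = df[df["Names"] != "Names"].reset_index(drop=True)
print(df)
```

Unlike `drop_duplicates(keep=False)`, this filter leaves legitimately repeated data rows in place, because it only targets rows that match the header text.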

