deleting pandas dataframe row based condition

Question

My pandas dataframe has a column where each row is a string which corresponds to a filename. I read my data from a JSON file and extract the column like this:

df = pd.read_json("mergedJSON.txt",lines=True,orient='columns') df2 = df.set_index("subject") for key,value in some_dict.iteritems(): df2.loc[value,"file_name"].to_csv(outfile,index=False, header=False)

I need to drop certain rows from this dataframe based on whether the file is found on disk. Not sure how to do this. Appreciate help.

DJK · Accepted Answer · 2017-08-23 02:01:20Z

1

Just use this as the last line

df2[df2.file_name.str.contains('stringValue')].loc[value,:].to_csv()

answered Aug 23, 2017 at 2:01

DJK

9,3424 gold badges28 silver badges41 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Shihe Zhang · Accepted Answer · 2017-08-23 02:38:42Z

0

First, set_index,reindex use the filename as index,and then do df.drop(filename).

answered Aug 23, 2017 at 2:38

Shihe Zhang

2,7815 gold badges40 silver badges58 bronze badges

Collectives™ on Stack Overflow

deleting pandas dataframe row based condition

2 Answers 2

Comments

Comments

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Related