CSV new-line character seen in unquoted field error

Question

the following code worked until today when I imported from a Windows machine and got this error:

new-line character seen in unquoted field - do you need to open the file in universal-newline mode?

import csv class CSV: def __init__(self, file=None): self.file = file def read_file(self): data = [] file_read = csv.reader(self.file) for row in file_read: data.append(row) return data def get_row_count(self): return len(self.read_file()) def get_column_count(self): new_data = self.read_file() return len(new_data[0]) def get_data(self, rows=1): data = self.read_file() return data[:rows]

How can I fix this issue?

def upload_configurator(request, id=None): """ A view that allows the user to configurator the uploaded CSV. """ upload = Upload.objects.get(id=id) csvobject = CSV(upload.filepath) upload.num_records = csvobject.get_row_count() upload.num_columns = csvobject.get_column_count() upload.save() form = ConfiguratorForm() row_count = csvobject.get_row_count() colum_count = csvobject.get_column_count() first_row = csvobject.get_data(rows=1) first_two_rows = csvobject.get_data(rows=5)

rectummelancolique's answer below is what solved my similar issue. stackoverflow.com/a/17315726/3131666 — kmantel
– kmantel, Commented Jan 9, 2015 at 0:10

alecxe · Accepted Answer · 2013-06-26 09:14:48Z

185

It'll be good to see the csv file itself, but this might work for you, give it a try, replace:

file_read = csv.reader(self.file)

with:

file_read = csv.reader(self.file, dialect=csv.excel_tab)

Or, open a file with universal newline mode and pass it to csv.reader, like:

reader = csv.reader(open(self.file, 'rU'), dialect=csv.excel_tab)

Or, use splitlines(), like this:

def read_file(self): with open(self.file, 'r') as f: data = [row for row in csv.reader(f.read().splitlines())] return data

edited Jun 26, 2013 at 9:14

answered Jun 26, 2013 at 9:09

alecxe

476k127 gold badges1.1k silver badges1.2k bronze badges

Sign up to request clarification or add additional context in comments.

7 Comments

GrantU Over a year ago

This now gives the same error, but on line starting upload.num_records = csvobject.get_row_count() now

GrantU Over a year ago

and when I try the split lines version (which is very cools thanks) I get coercing to Unicode: need string or buffer, S3BotoStorageFile found

alecxe Over a year ago

What option eventually worked? Btw, you are reading the file twice: in get_row_count() and in get_column_count() - consider reading the file in __init__ and remember data in self.data, then use it in other methods.

pythonjsgeo Over a year ago

+1 for splitlines() which avoids messing around with different formatting options on OSX. Hope it works across other platforms too...

Murphy Over a year ago

Great answer. Using - "dialect=csv.excel_tab" however, screws up the output when used with csv.DictReader. Just the 'rU' options works magically though

|

g.kovatchev · Accepted Answer · 2015-01-11 18:44:19Z

I realize this is an old post, but I ran into the same problem and don't see the correct answer so I will give it a try

Python Error:

_csv.Error: new-line character seen in unquoted field

Caused by trying to read Macintosh (pre OS X formatted) CSV files. These are text files that use CR for end of line. If using MS Office make sure you select either plain CSV format or CSV (MS-DOS). Do not use CSV (Macintosh) as save-as type.

My preferred EOL version would be LF (Unix/Linux/Apple), but I don't think MS Office provides the option to save in this format.

MS DOS Comma Separated didn't work for me (same error), but Windows Comma Separated.
I get the same issue on OS X. I find myself having to make a new CSV file. Simply saving the current as plain CSV format or CSV (MS-DOS) does not fix the issue.
On OS X, Windows Comma Separated csv worked, MS DOS Comma Separated didn't.

BoltzmannBrain · Accepted Answer · 2015-05-21 17:33:27Z

33

For Mac OS X, save your CSV file in "Windows Comma Separated (.csv)" format.

answered May 21, 2015 at 17:33

BoltzmannBrain

5,49213 gold badges55 silver badges83 bronze badges

1 Comment

travelingbones Over a year ago

thanks, that was the needed ingredient, as I'm using Mac w/ MS office.

Nimo · Accepted Answer · 2015-09-28 15:53:53Z

If this happens to you on mac (as it did to me):

Save the file as CSV (MS-DOS Comma-Separated)

Run the following script

with open(csv_filename, 'rU') as csvfile: csvreader = csv.reader(csvfile) for row in csvreader: print ', '.join(row)

rectummelancolique · Accepted Answer · 2013-06-26 09:00:34Z

5

Try to run dos2unix on your windows imported files first

answered Jun 26, 2013 at 9:00

rectummelancolique

2,24717 silver badges13 bronze badges

2 Comments

GrantU Over a year ago

no really an option I need to allow user to upload csv from both Windows and Macs without any special modification. The import was saved from Excel (Windows) as a CSV so maybe there is something extra that needs to be done in Python to read these?

Damian Yerrick Over a year ago

@GrantU You are referring to Mac OS X 10.0 or later, not Mac OS 9 or earlier, correct? Between 9 and 10, Mac OS switched from \x0d (ProDOS) line endings to \x0a (UNIX) line endings.

Suraj · Accepted Answer · 2017-03-08 01:19:07Z

This is an error that I faced. I had saved .csv file in MAC OSX.

While saving, save it as "Windows Comma Separated Values (.csv)" which resolved the issue.

Resonance · Accepted Answer · 2016-10-28 16:49:53Z

This worked for me on OSX.

# allow variable to opened as files from io import StringIO # library to map other strange (accented) characters back into UTF-8 from unidecode import unidecode # cleanse input file with Windows formating to plain UTF-8 string with open(filename, 'rb') as fID: uncleansedBytes = fID.read() # decode the file using the correct encoding scheme # (probably this old windows one) uncleansedText = uncleansedBytes.decode('Windows-1252') # replace carriage-returns with new-lines cleansedText = uncleansedText.replace('\r', '\n') # map any other non UTF-8 characters into UTF-8 asciiText = unidecode(cleansedText) # read each line of the csv file and store as an array of dicts, # use first line as field names for each dict. reader = csv.DictReader(StringIO(cleansedText)) for line_entry in reader: # do something with your read data

Dougyfresh · Accepted Answer · 2018-12-01 00:40:03Z

I know this has been answered for quite some time but not solve my problem. I am using DictReader and StringIO for my csv reading due to some other complications. I was able to solve problem more simply by replacing delimiters explicitly:

with urllib.request.urlopen(q) as response: raw_data = response.read() encoding = response.info().get_content_charset('utf8') data = raw_data.decode(encoding) if '\r\n' not in data: # proably a windows delimited thing...try to update it data = data.replace('\r', '\r\n')

Might not be reasonable for enormous CSV files, but worked well for my use case.

p699 · Accepted Answer · 2018-12-26 19:04:03Z

Alternative and fast solution : I faced the same error. I reopened the "wierd" csv file in GNUMERIC on my lubuntu machine and exported the file as csv file. This corrected the issue.

Collectives™ on Stack Overflow

CSV new-line character seen in unquoted field error

9 Answers 9

7 Comments

4 Comments

1 Comment

Comments

2 Comments

Comments

Comments

1 Comment

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

9 Answers 9

7 Comments

4 Comments

1 Comment

Comments

2 Comments

Comments

Comments

1 Comment

Comments

Linked

Related