TypeError: "delimiter" must be string, not unicode #36

carlos-jenkins · 2014-11-17T20:16:02Z

Python 2.7:

from __future__ import unicode_literals
from unicodecsv import DictReader

reader = DictReader(
    StringIO(content),
    fieldnames=CSV_HEADER,
    delimiter=';',
    quotechar='"'
)

The text was updated successfully, but these errors were encountered:

pymaldebaran · 2015-03-04T08:36:10Z

Same problem for me: delimiter and quotechar keywordargs for DictReader() and Reader() have to be string, not unicode. This is kind of anoying when you want to work only with unicode...
Workaround:

delimiter=str(u';'), quotechar=str(u'"')

Here the u prefix is just for clarity, most of the time my 2.7 module have from __future__ import unicode_literals to ensure Python 3 compatibility (and to ensure correct utf8 handling everywhere).

jruere · 2015-06-11T19:17:05Z

Can't this be handled by the library? The encoding of the CSV is provided.

jdunck · 2015-06-11T21:14:26Z

Yes, it can. I'll try to work this up soon.

ryanhiebert · 2015-12-13T18:41:09Z

In addition to making sure it's a string, it also need to make sure it's a one-byte string when it's done. The reader only support single character str (bytes) for these. It's a reasonable limitation, but perhaps one that we should guard for when converting these characters.

akaIDIOT · 2016-08-11T11:23:42Z

Hate to pull the old '+1', but just ran in to this trying to make my lib work with both py2 and py3. Many things seem to be a lot easier than with the builtin csv module, but delimiter (and lineterminator) arguments are required to be str. My current issue is with the writer, not the reader, but I'm assuming this is the same issue at its core.

As I was already using six, my current workaround looks like this:

from __future__ import unicode_literals
import csv
import six

if six.PY2:
    import unicodecsv as csv

def export(fname, …, delimiter='\t', …):
    if six.PY2:
        output = open(fname, 'wb')
        delimiter = delimiter.encode('utf-8')
    else:
        output = open(fname, 'w')
    …
    with output:
        writer = csv.DictWriter(output, …, delimiter=delimiter, …)
        …

Getting rid of the first bit would be awesome.

artwr mentioned this issue Aug 11, 2015

add helper function for unicode delimiter and quotechar #60

Open

jdunck mentioned this issue Sep 21, 2015

Sniffer returns unicode but Reader expects bytes #35

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TypeError: "delimiter" must be string, not unicode #36

TypeError: "delimiter" must be string, not unicode #36

carlos-jenkins commented Nov 17, 2014

pymaldebaran commented Mar 4, 2015

jruere commented Jun 11, 2015

jdunck commented Jun 11, 2015

ryanhiebert commented Dec 13, 2015

akaIDIOT commented Aug 11, 2016

TypeError: "delimiter" must be string, not unicode #36

TypeError: "delimiter" must be string, not unicode #36

Comments

carlos-jenkins commented Nov 17, 2014

pymaldebaran commented Mar 4, 2015

jruere commented Jun 11, 2015

jdunck commented Jun 11, 2015

ryanhiebert commented Dec 13, 2015

akaIDIOT commented Aug 11, 2016