How to find out the byte position of a CSV file line in Python?

Question

I work with huge CSV files (20-25 million lines) and, for many reasons, do not want to split them into smaller parts.

My script reads the file line by line using the csv module. I now need to determine the position (byte offset) of the line that will be read on the next iteration (or of the line that has just been read).

I tried:

>>> import csv
>>> f = open("uscompany.csv","rU")
>>> reader = csv.reader(f)
>>> reader.next()
....
>>> f.tell()
8230

But it looks like the csv module reads the file in blocks, because when I continue iterating I get the same position:

>>> reader.next()
....
>>> f.tell()
8230

Any suggestions? Please advise.

+5

python file csv

Maksym polshcha Aug 24 '12 at 12:43

2 answers

Short answer: impossible. The byte position is not available through the csv reader API.

+6

Andreas Jung Aug 24 '12 at 12:48

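John Y · Accepted Answer · 2012-08-24T13:17:07+0000

If by "byte position" you mean the offset of each line within the file, you can iterate over the file yourself and keep a running count of the bytes read. Each line can then be parsed as CSV with the csv module: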

for line in myfile:
    # next(...) works on Python 2.6+ and 3; reader.next() is Python 2 only
    row = next(csv.reader([line]))

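Note that this is the position in the raw CSV text, not in the data that the CSV represents. For example, if a line starts with the quoted field "data", then in the raw CSV the d is the 2nd byte of the line rather than the 1st, because the opening quote comes first.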
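The line-by-line approach above could be sketched for Python 3 roughly as follows. This is only an illustration under a few assumptions: the file is opened in binary mode so that len() counts bytes, the encoding is UTF-8, and no quoted field contains an embedded newline (otherwise one record would span several physical lines and parsing each line separately would break). The helper name rows_with_offsets and the sample data are made up for the example.

```python
import csv
import io


def rows_with_offsets(binary_file, encoding="utf-8"):
    """Yield (byte_offset, row) for each physical line of a binary file object.

    Assumption: no quoted field contains an embedded newline, so one
    physical line is exactly one CSV record.
    """
    offset = 0
    for raw_line in binary_file:
        # Decode this single line and let the csv module parse it.
        row = next(csv.reader([raw_line.decode(encoding)]))
        yield offset, row
        # len() of a bytes object counts bytes, line terminator included.
        offset += len(raw_line)


# Hypothetical in-memory data standing in for open("uscompany.csv", "rb").
sample = io.BytesIO(b'name,city\r\n"Acme, Inc.",Kyiv\r\nFoo,Lviv\r\n')
offsets = list(rows_with_offsets(sample))

for position, row in offsets:
    print(position, row)
```

Each recorded offset can later be passed to seek() on a file opened in binary mode to jump straight to the start of that line.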
