PERF: don't create the skiprows set if using the c-parser #13005

jreback · 2016-04-26T23:38:06Z

In [4]: DataFrame(np.random.randn(1000000,1)).to_csv('test.csv',index=False)

branch

In [1]: %memit pd.read_csv('test.csv',skiprows=999999)
peak memory: 65.74 MiB, increment: 1.59 MiB

In [2]: %memit pd.read_csv('test.csv',skiprows=999999)
peak memory: 65.89 MiB, increment: 0.22 MiB

In [3]: %memit pd.read_csv('test.csv',skiprows=999999)
peak memory: 65.98 MiB, increment: 0.28 MiB

master

In [1]: %memit pd.read_csv('test.csv',skiprows=999999)
peak memory: 169.84 MiB, increment: 105.79 MiB

In [2]: %memit pd.read_csv('test.csv',skiprows=999999)
peak memory: 171.27 MiB, increment: 24.11 MiB

In [3]: %memit pd.read_csv('test.csv',skiprows=999999)
peak memory: 173.39 MiB, increment: 24.63 MiB

closes pandas-dev#13005

jreback · 2016-04-26T23:39:20Z

@gfyoung I believe this is handled internally in the c-parser.

gfyoung · 2016-04-27T02:18:06Z

@jreback : Travis and I both agree. LGTM otherwise.

jreback added Performance Memory or execution speed performance IO CSV read_csv, to_csv labels Apr 26, 2016

jreback added this to the 0.18.1 milestone Apr 26, 2016

PERF: don't create the skiprows set if using the c-parser

9506a3c

closes pandas-dev#13005

jreback force-pushed the skiprows branch from 393a2fe to 9506a3c Compare April 26, 2016 23:38

jreback closed this in b8921ac Apr 27, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PERF: don't create the skiprows set if using the c-parser #13005

PERF: don't create the skiprows set if using the c-parser #13005

jreback commented Apr 26, 2016 •

edited

Loading

jreback commented Apr 26, 2016

gfyoung commented Apr 27, 2016 •

edited

Loading

PERF: don't create the skiprows set if using the c-parser #13005

PERF: don't create the skiprows set if using the c-parser #13005

Conversation

jreback commented Apr 26, 2016 • edited Loading

jreback commented Apr 26, 2016

gfyoung commented Apr 27, 2016 • edited Loading

jreback commented Apr 26, 2016 •

edited

Loading

gfyoung commented Apr 27, 2016 •

edited

Loading