Skip to content

Commit 7624809

Browse files
authored
Chunkstore performance enhancements (pandas-dev#182)
- new serializer for ChunkStore - supports by column serialization - Significantly faster than the record serializer for this use case - Supports DataFrames and Series only - Changes to chunker that boost performance - Ability to read subset of columns - Also fixes pandas-dev#164
1 parent 5ffd781 commit 7624809

File tree

13 files changed

+761
-440
lines changed

13 files changed

+761
-440
lines changed

CHANGES.md

+5
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,10 @@
11
## Changelog
22

3+
### 1.27
4+
5+
* Bugfix: #187 Compatibility with latest version of pytest-dbfixtures
6+
* Feature: #182 Improve ChunkStore read/write performance
7+
38
### 1.26 (2016-07-20)
49

510
* Bugfix: Faster TickStore querying for multiple symbols simultaneously

arctic/chunkstore/_chunker.py

+16
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,7 @@
1+
START = 's'
2+
END = 'e'
3+
4+
15
class Chunker(object):
26

37
def to_chunks(self, data, *args, **kwargs):
@@ -60,3 +64,15 @@ def exclude(self, data, range_obj):
6064
data, filtered by range_obj
6165
"""
6266
raise NotImplementedError
67+
68+
def chunk_to_str(self, chunk_id):
69+
"""
70+
Converts parts of a chunk range (start or end) to a string. These
71+
chunk ids/indexes/markers are produced by to_chunks.
72+
(See to_chunks)
73+
74+
returns
75+
-------
76+
string
77+
"""
78+
raise NotImplementedError

0 commit comments

Comments
 (0)