-
-
Notifications
You must be signed in to change notification settings - Fork 18.4k
ENH: PyTables Enhancements for future #2391
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
what about allowing creation/access of groups by using "/" in the key. i.e.,
would create/access the groups some, path, to and finally df. Right now I can only save the data on one level within an hdf5 file On Thu, Nov 29, 2012 at 6:20 PM, jreback [email protected] wrote:
|
good idea...shouldn't be too hard to implement |
Here are things that are most interesting/beneficial to my current workload: Full Float32 support & full pandas dtype support I like the way Term's work. Is there support for ORing Terms or other logical operations in the Selection? I can pick up work on any of these issues, but I would absolutely to like to discuss some of the details first. |
Scott send me an email and I'll send u offline so we can correspond |
Term language: perhaps it makes sense to piggyback on existing syntax. SQL comes to mind, but also XESAM (whole http://xesam.org is down at the time, but one can get the gist of it here: http://banshee.fm/support/guide/searching/. |
It would be nice if attribute access (e.g. |
see #2485, this is actually somewhat easy in HDFStore, the problem is that pandas in general doesnt' propogate these attributes; you can easily store/retrieve attributes if you want on the nodes themselves something like:
|
sorry...misundestood your comment....(though you meant saving attributes) attribute access on the store is not a big deal, will add to the list |
Thank you for considering this, dotted access will save my pinky a lot of strain Regarding attributes on DFs actually this would preempt a number of cases for specialization of DataFrame (see recent MetaDataFrame PR #2695) and in particular perhaps support the addition for metadata that would facilitate automated merges (foreign keys...). EDIT: there was a discussion about this topic in the mailing list |
see #2755 , was pretty easy to add dotted access, so i did! |
@scottkidder did you get a chance to look at issue 13. #2852 |
dated |
open (not in any particular order)
3a. implement
or
in Terms (maybe use pyparsing like syntax)export
function, to export to different PyTables formats(an easy to read table for R (partially done), and output a GenericTable)read_column
, expand this to the entire table (if possible), allows one to avoid selecting all columns in a table (and then reindexing), this works ifcolumns
argument is provided to select or inferred from the where.create_table
)?, w/o actually appending, so don't have to add parms in each call to append.Splitter
? that a user can specify how to split (rather than a dict); then store this object, so can automatically recreate the resulting table (enable for both Storer and Table objects)itemsize='truncate'
to allow subsquent appends to proceed with string truncation (on specific columns)select_column
, return a properly indexed Series, add option to include the index (use_index=True?
)Term
, but running multipleor
sub-queriescarray
, http://carray.pytables.org/docs/manual/done
http://sourceforge.net/mailarchive/message.php?msg_id=30190886
5a. DONE real dtype integration is coming on PR ENH/BUG/DOC: allow propogation and coexistance of numeric dtypes #2708 (eg even though 0.10.1 will actually read/write float32 columns u can't really do much with them w/o having them upcasted) - in any event I think HDFStore will accommodate this already. but more testing needed
select
, http://stackoverflow.com/questions/14614512/merging-two-tables-with-millions-of-rows-in-python (GH ENH: support iteration on returned results in select and select_as_multiple in HDFStore #3078)The text was updated successfully, but these errors were encountered: