add methods to doc #1355

timmie · 2012-05-30T21:18:29Z

Please add a list of sampling methods to the docs at:

http://pandas.pydata.org/pandas-docs/dev/timeseries.html#up-and-downsampling

changhiskhan · 2012-05-30T21:22:33Z

There are no separate upsample or downsample methods. Everything is done
through "resample" with various frequency aliases specified in the
documentation.

On Wed, May 30, 2012 at 5:18 PM, timmie <
[email protected]

wrote:

Please add a list of sampling methods to the docs at:

http://pandas.pydata.org/pandas-docs/dev/timeseries.html#up-and-downsampling

Reply to this email directly or view it on GitHub:
#1355

Chang She
Lambda Foundry http://www.lambdafoundry.com

timmie · 2012-05-30T21:33:19Z

OK, but how do I find out what I can use as how argument In [956]: ts.resample('D', how='mean')?

changhiskhan · 2012-05-30T21:56:36Z

sum, mean, std, max, min, median, first, last, ohlc

note that you don't have to use strings at all if you supply your own method.

I'll add a note to the timeseries docs about this. Thanks

timmie · 2012-05-30T22:18:36Z

I also wondered where how='ohlc' comes from...

are you not using sphinx autodoc together with the docstrings?

wesm · 2012-05-30T22:19:36Z

OHLC is a pretty standard way to aggregate financial data (http://en.wikipedia.org/wiki/Open-high-low-close_chart)

timmie · 2012-05-30T22:23:15Z

yes, but wanted to suggest to point new users better to the possibilities that exists: which methods are available (standard, extended via numpy, etc.).

wesm · 2012-05-30T22:24:30Z

The docstring could use work. I'm hopeful that users (ahem) will help in this regard

timmie · 2012-05-30T22:27:30Z

Maybe such low hanging fruits could be tagged in the issues?
Then, user would have the possibility to find easy fixes.

changhiskhan · 2012-05-30T22:30:41Z

Most "DOC" issues are similar low hanging fruit so I don't think it's necessary to have a distinct tag for this. We'd really welcome a pull request on docstrings :)

timmie · 2012-05-31T21:06:24Z

The docstring could use work. I'm hopeful that users (ahem) will help in this regard
So I gained / provoked a post on twitter by my questions.

I can give you hope: users just have to find an entry point: aplicability to own work, enough functionality and suffiecient documentation to know how to make use of it.

When Pandas was published first time (more or less together with the larry packages) it dd not see much applicability for my codig. Two things came together: the discontinuation of the scikits.timeseries and your ambitions to develop the best python time series library with the bridge to statistsics.
You can even not imagine the number applications and use case you would gain once this is more promoted in science and engineering.

Last but not least, the nature of open source shows that the interest and participation indeally also follow a leaning curve: first after publishing, you'll get mostly bug reports and feature request. Once the user base is there, people's contribution gets more colourful: some wirte docs, others new features, others just report bugs...

And pandas being a base libarary, many people will start building their libraries on top of it. together with statsmodels it's such a solid foundatdtion for data analysis, that other coders will then extend its capabilities domain specific (bioinformatics, earth observation, geoscience, etc.).

For my own libs (not yet published), I became quite good at docstring and Sphinx'ing. But first you have to understand how it works.

Also, I advice to add a link to the Numpy documentation standards to the contributors page.

Uff rather long here But I ain't got a blog or such...

wesm · 2012-05-31T21:17:47Z

I completely agree on all points. And don't worry, you aren't the first person to make tons of requests / suggestions but offer little help ;) Keep in mind how much coding work has gone into pandas over the last year and that I'm about 300 pages into a 400 page book on data analysis in Python. Plate rather full at the moment. Docs, etc will improve over time, and much faster if people who are not me or Chang get involved in the process-- we have to really be focused on shipping bug-free code and new features which is a much less accessible area of work for people to get involved (since grokking the pandas codebase, while not that complicated, is not a brief affair)

timmie · 2012-06-01T08:46:23Z

Most "DOC" issues are similar low hanging fruit so I don't think it's necessary to have a distinct tag for this. We'd really welcome a pull request on docstrings :)

I made also experience in the past that contributed where not added to the codebased due to

API changes
continous re-organisation of the docs by core developers

Consequently, I think #1370 could avoid such disapointment.

* origin/master: DOC: string args to how in resample pandas-dev#1355 BLD: turning isreleased to False DOC: release notes to close pandas-dev#1349 ENH: Cython nancorr speeds up DataFrame.corr with method='pearson' by > 100x DOC: new parser functionality pandas-dev#1347 DOC: another pass at release notes DOC: what's new DOC: release notes, what's new DOC: release notes and what's new BUG: parser with multiple date col and multiple index col pandas-dev#1344 TST: additional tests for parsers and minor code cleanup ENH: KdePlot with DataFrame pandas-dev#1342. TST: frame kde and kde with logy pandas-dev#1341 BUG: respect logy argument in KdePlot, close pandas-dev#1341 BUG: set_xlim for time series plots pandas-dev#1339 BUG: PeriodIndex.map tries to get super(DatetimIndex, self) BUG: fixed doc bug that caused latex build to fail DOC: cleaned up parser doc string to stop sphinx from complaining ENH: better error msg for fillna() with invalid method

changhiskhan pushed a commit that referenced this issue May 30, 2012

DOC: string args to how in resample #1355

8948f49

changhiskhan closed this as completed May 30, 2012

changhiskhan mentioned this issue May 30, 2012

Use parsed date column as index #1251

Closed

timmie mentioned this issue Jun 1, 2012

accompany each release with a document: entry points for help / input #1370

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add methods to doc #1355

add methods to doc #1355

timmie commented May 30, 2012

changhiskhan commented May 30, 2012

timmie commented May 30, 2012

changhiskhan commented May 30, 2012

timmie commented May 30, 2012

wesm commented May 30, 2012

timmie commented May 30, 2012

wesm commented May 30, 2012

timmie commented May 30, 2012

changhiskhan commented May 30, 2012

timmie commented May 31, 2012

wesm commented May 31, 2012

timmie commented Jun 1, 2012

add methods to doc #1355

add methods to doc #1355

Comments

timmie commented May 30, 2012

changhiskhan commented May 30, 2012

timmie commented May 30, 2012

changhiskhan commented May 30, 2012

timmie commented May 30, 2012

wesm commented May 30, 2012

timmie commented May 30, 2012

wesm commented May 30, 2012

timmie commented May 30, 2012

changhiskhan commented May 30, 2012

timmie commented May 31, 2012

wesm commented May 31, 2012

timmie commented Jun 1, 2012