Jaydyou
diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md (+33 -124)
diff --git a/MANIFEST.in b/MANIFEST.in (-2)
diff --git a/ci/print_versions.py b/ci/print_versions.py (+11 -3)
diff --git a/doc/source/10min.rst b/doc/source/10min.rst (+1)
diff --git a/doc/source/_static/df_repr_truncated.png b/doc/source/_static/df_repr_truncated.png (7.85 KB)
diff --git a/doc/source/api.rst b/doc/source/api.rst (+36)
diff --git a/doc/source/basics.rst b/doc/source/basics.rst (+47 -15)
@@ -4,41 +4,38 @@ All contributions, bug reports, bug fixes, documentation improvements,
 enhancements and ideas are welcome.
 
 The [GitHub "issues" tab](https://github.com/pydata/pandas/issues)
-contains some issues labeled "Good as first PR"; these are
-tasks which do not require deep knowledge of the package. Look those up if you're
+contains some issues labeled "Good as first PR"; look those up if you're
 looking for a quick way to help out.
 
-Please try and follow these guidelines, as this makes it easier for us to accept
-your contribution or address the issue you're having.
-
 #### Bug Reports
 
   - Please include a short, self-contained Python snippet reproducing the problem.
   You can have the code formatted nicely by using [GitHub Flavored Markdown](http://github.github.com/github-flavored-markdown/) :
 
         ```python
-    
+
         print("I ♥ pandas!")
 
         ```
 
-  - A [test case](https://github.com/pydata/pandas/tree/master/pandas/tests) may be more helpful.
-  - Specify the pandas (and NumPy) version used. (check `pandas.__version__`
-    and `numpy.__version__`)
-  - Explain what the expected behavior was, and what you saw instead.
-  - If the issue seems to involve some of [pandas' dependencies](https://github.com/pydata/pandas#dependencies)
-    such as
-    [NumPy](http://numpy.org),
-    [matplotlib](http://matplotlib.org/), and
-    [PyTables](http://www.pytables.org/)
-    you should include (the relevant parts of) the output of
+  - Specify the pandas version used and those of its dependencies. You can simply include the output of
     [`ci/print_versions.py`](https://github.com/pydata/pandas/blob/master/ci/print_versions.py).
+  - Explain what the expected behavior was, and what you saw instead.
 
 #### Pull Requests
 
-  - **Make sure the test suite passes** for both python2 and python3.
-    You can use `test_fast.sh`, **tox** locally, and/or enable **Travis-CI** on your fork.
-    See "Getting Travis-CI going" below.
+  - **Make sure the test suite passes** on your box. Use the provided `test_*.sh` scripts or tox.
+  - Use [proper commit messages](http://tbaggery.com/2008/04/19/a-note-about-git-commit-messages.html):
+    - a subject line with `< 80` chars.
+    - One blank line.
+    - Optionally, a commit message body.
+  - Please reference relevant GitHub issues in your commit message using `GH1234`
+    or `#1234`. Either style is fine, but the '#' style generates noise when you rebase your PR.
+  - `doc/source/release.rst` and `doc/source/vx.y.z.txt` contain an ongoing
+    changelog for each release. Add entries to these files
+    as needed in a separate commit in your PR: document the fix, enhancement,
+    or (unavoidable) breaking change.
+  - Keep style fixes to a separate commit to make your PR more readable.
   - An informal commit message format is in effect for the project. Please try
     and adhere to it. Check `git log` for examples. Here are some common prefixes
     along with general guidelines for when to use them:
@@ -49,119 +46,31 @@ your contribution or address the issue you're having.
       - **BLD**: Updates to the build process/scripts
       - **PERF**: Performance improvement
       - **CLN**: Code cleanup
-  - Commit messages should have:
-    - a subject line with `< 80` chars
-    - one blank line
-    - a commit message body, if there's a need for one
-  - If you are changing any code, you should enable Travis-CI on your fork
-    to make it easier for the team to see that the PR does indeed pass all the tests.
-  - **Backward-compatibility really matters**. Pandas already has a large user base and
-    a lot of existing user code.
-    - Don't break old code if you can avoid it.
-    - If there is a need, explain it in the PR.
-    - Changes to method signatures should be made in a way which doesn't break existing
-      code. For example, you should beware of changes to ordering and naming of keyword
-      arguments.
+  - Maintain backward compatibility. Pandas has lots of users with lots of existing code. Don't break it.
+    - If you think breakage is required, clearly state why as part of the PR.
+    - Be careful when changing method signatures.
     - Add deprecation warnings where needed.
-  - Performance matters. You can use the included `test_perf.sh`
-    script to make sure your PR does not introduce any new performance regressions
-    in the library.
+  - Performance matters. Make sure your PR hasn't introduced perf regressions by using `test_perf.sh`.
   - Docstrings follow the [numpydoc](https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt) format.
-  - **Don't** merge upstream into a branch you're going to submit as a PR.
-    This can create all sorts of problems. Use `git rebase` instead. This ensures
-    no merge conflicts occur when your code is merged by the core team.
-  - Please reference the GH issue number in your commit message using `GH1234`
-    or `#1234`. Either style is fine.
-  - Use `raise AssertionError` rather then plain `assert` in library code (`assert` is fine
-    for test code). `python -o` strips assertions. Better safe than sorry.
-  - When writing tests, don't use "new" assertion methods added to the `unittest` module
-    in 2.7 since pandas currently supports 2.6. The most common pitfall is:
-
-        with self.assertRaises(ValueError):
-            foo
-
-
-    which fails with Python 2.6. You need to use `assertRaises` from
-    `pandas.util.testing` instead (or use `self.assertRaises(TheException,func,args)`).
-
-  - `doc/source/release.rst` and `doc/source/vx.y.z.txt` contain an ongoing
-    changelog for each release. Add entries to these files
-    as needed in a separate commit in your PR: document the fix, enhancement,
-    or (unavoidable) breaking change.
-  - For extra brownie points, use `git rebase -i` to squash and reorder
-    commits in your PR so that the history makes the most sense. Use your own
-    judgment to decide what history needs to be preserved.
-  - Pandas source code should not -- with some exceptions, such as 3rd party licensed code --
-    generally speaking, include an "Authors" list or attribution to individuals in source code.
-    `RELEASE.rst` details changes and enhancements to the code over time.
-    A "thanks goes to @JohnSmith." as part of the appropriate entry is a suitable way to acknowledge
-    contributions. The rest is `git blame`/`git log`.
-    Feel free to ask the commiter who merges your code to include such an entry
-    or include it directly yourself as part of the PR if you'd like to.
-    **We're always glad to have new contributors join us from the ever-growing pandas community.**
-    You may also be interested in the copyright policy as detailed in the pandas [LICENSE](https://github.com/pydata/pandas/blob/master/LICENSE).
+  - When writing tests, use 2.6-compatible `self.assertFoo` methods. Some polyfills such as `assertRaises`
+    can be found in `pandas.util.testing`.
+  - Generally, pandas source files should not contain attributions. You can include a "thanks to..."
+    in the release changelog. The rest is `git blame`/`git log`.
+  - When you start working on a PR, start by creating a new branch pointing at the latest
+    commit on GitHub master.
+  - **Do not** merge upstream into a branch you're going to submit as a PR.
+    Use `git rebase` against the current GitHub master.
+  - For extra brownie points, you can squash and reorder the commits in your PR using `git rebase -i`.
+    Use your own judgment to decide what history needs to be preserved. If git frightens you, that's OK too.
+  - Use `raise AssertionError` over `assert` unless you want the assertion stripped by `python -O`.
+  - The pandas copyright policy is detailed in the pandas [LICENSE](https://github.com/pydata/pandas/blob/master/LICENSE).
   - On the subject of [PEP8](http://www.python.org/dev/peps/pep-0008/): yes.
-  - On the subject of massive PEP8 fix PRs touching everything, please consider the following:
-    - They create noisy merge conflicts for people working in their own fork.
-    - They make `git blame` less effective.
-    - Different tools / people achieve PEP8 in different styles. This can create
-      "style wars" and churn that produces little real benefit.
-    - If your code changes are intermixed with style fixes, they are harder to review
-      before merging. Keep style fixes in separate commits.
-    - It's fine to clean-up a little around an area you just worked on.
-    - Generally it's a BAD idea to PEP8 on documentation.
-
-    Having said that, if you still feel a PEP8 storm is in order, go for it.
+  - On the subject of a massive PEP8-storm touching everything: not too often (once per release works).
 
 ### Notes on plotting function conventions
 
 https://groups.google.com/forum/#!topic/pystatsmodels/biNlCvJPNNY/discussion
 
-### Getting Travis-CI going
-
-Instructions for getting Travis-CI installed are available [here](http://about.travis-ci.org/docs/user/getting-started/).
-For those users who are new to Travis-CI and [continuous integration](https://en.wikipedia.org/wiki/Continuous_integration) in particular,
-Here's a few high-level notes:
-- Travis-CI is a free service (with premium account upgrades available) that integrates
-  well with GitHub.
-- Enabling Travis-CI on your GitHub fork of a project will cause any *new* commit
-  pushed to the repo to trigger a full build+test on Travis-CI's servers.
-- All the configuration for Travis-CI builds is already specified by `.travis.yml` in the repo.
-  That means all you have to do is enable Travis-CI once, and then just push commits
-  and you'll get full testing across py2/3 with pandas' considerable
-  [test-suite](https://github.com/pydata/pandas/tree/master/pandas/tests).
-- Enabling Travis-CI will attach the test results (red/green) to the Pull-Request
-  page for any PR you submit. For example:
-
-    https://github.com/pydata/pandas/pull/2532,
-
-See the Green "Good to merge!" banner? that's it.
-
-This is especially important for new contributors, as members of the pandas dev team
-like to know that the test suite passes before considering it for merging.
-Even regular contributors who test religiously on their local box (using tox
-for example) often rely on a PR+travis=green to make double sure everything
-works ok on another system, as occasionally, it doesn't.
-
-#### Steps to enable Travis-CI
-
-- Open https://travis-ci.org/
-- Select "Sign in with GitHub" (Top Navbar)
-- Select \[your username\] -> "Accounts" (Top Navbar)
-- Select 'Sync now' to refresh the list of repos from your GH account.
-- Flip the switch for the repos you want Travis-CI enabled for.
-  "pandas", obviously.
-- Then, pushing a *new* commit to a certain branch on that repo
-  will trigger a build/test for that branch. For example, the branch
-  might be `master` or `PR1234_fix_everything__atomically`, if that's the
-  name of your PR branch.
-
-You can see the build history and current builds for your fork
-at: https://travis-ci.org/(your_GH_username)/pandas.
-
-For example, the builds for the main pandas repo can be seen at:
-https://travis-ci.org/pydata/pandas.
-
 ####More developer docs
 
 * See the [developers](http://pandas.pydata.org/developers.html) page on the
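
The guideline above about preferring `raise AssertionError` over `assert` can be illustrated with a small sketch. It assumes a CPython interpreter reachable through `sys.executable`; the two one-line snippets and the `exit_code` helper are hypothetical and exist only for this illustration, not in pandas.

```python
import subprocess
import sys

# Two hypothetical one-line checks that should both fail:
# one uses a bare `assert`, the other an explicit `raise AssertionError`.
BARE_ASSERT = "assert 1 == 2, 'bare assert fired'"
EXPLICIT_RAISE = "if 1 != 2: raise AssertionError('explicit raise fired')"


def exit_code(snippet, optimize):
    """Run `snippet` in a fresh interpreter and return its exit status."""
    cmd = [sys.executable]
    if optimize:
        cmd.append("-O")  # -O removes assert statements at compile time
    cmd += ["-c", snippet]
    # An uncaught exception makes the child interpreter exit with a non-zero status.
    return subprocess.call(cmd, stderr=subprocess.DEVNULL)


results = {
    "assert": exit_code(BARE_ASSERT, optimize=False),
    "assert -O": exit_code(BARE_ASSERT, optimize=True),
    "raise": exit_code(EXPLICIT_RAISE, optimize=False),
    "raise -O": exit_code(EXPLICIT_RAISE, optimize=True),
}

for name, code in sorted(results.items()):
    print("%-10s exit status %d" % (name, code))
```

Only the bare `assert` run under `-O` exits cleanly, which is why library code that must always perform the check should raise explicitly.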
 
@@ -2,9 +2,7 @@ include MANIFEST.in
 include LICENSE
 include RELEASE.md
 include README.rst
-include TODO.rst
 include setup.py
-include setupegg.py
 
 graft doc
 prune doc/build
 
@@ -1,16 +1,24 @@
 #!/usr/bin/env python
 
 
-def show_versions():
+
+def show_versions(as_json=False):
     import imp
     import os
     fn = __file__
     this_dir = os.path.dirname(fn)
     pandas_dir = os.path.abspath(os.path.join(this_dir,".."))
     sv_path = os.path.join(pandas_dir, 'pandas','util')
     mod = imp.load_module('pvmod', *imp.find_module('print_versions', [sv_path]))
-    return mod.show_versions()
+    return mod.show_versions(as_json)
 
 
 if __name__ == '__main__':
-    show_versions()
+    # optparse is 2.6-safe
+    from optparse import OptionParser
+    parser = OptionParser()
+    parser.add_option("-j", "--json", action="store_true", help="Format output as JSON")
+
+    (options, args) = parser.parse_args()
+
+    show_versions(as_json=options.json)
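
As a rough sketch of how the `--json` flag added above behaves, the parser setup from the hunk can be exercised with explicit argument lists. The `parse_cli` wrapper is a hypothetical helper for this illustration and is not part of `ci/print_versions.py`.

```python
from optparse import OptionParser


def parse_cli(argv):
    """Parse `argv` with the same option definition used in the diff."""
    parser = OptionParser()
    parser.add_option("-j", "--json", action="store_true",
                      help="Format output as JSON")
    options, args = parser.parse_args(argv)
    return options, args


# Without the flag, optparse leaves the attribute at its default of None,
# which is falsy, so show_versions(as_json=None) behaves like as_json=False.
no_flag, _ = parse_cli([])
print(no_flag.json)

# Both the short and the long spelling set the attribute to True.
short_flag, _ = parse_cli(["-j"])
long_flag, _ = parse_cli(["--json"])
print(short_flag.json, long_flag.json)
```

Passing an explicit list to `parse_args` keeps the sketch independent of `sys.argv`, which makes it easy to try in an interactive session.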
@@ -13,6 +13,7 @@
    import pandas as pd
    np.set_printoptions(precision=4, suppress=True)
    options.display.mpl_style='default'
+   options.display.max_rows=15
 
    #### portions of this were borrowed from the
    #### Pandas cheatsheet
 
@@ -1112,6 +1112,42 @@ Conversion
    DatetimeIndex.to_pydatetime
 
 
+GroupBy
+-------
+.. currentmodule:: pandas.core.groupby
+
+GroupBy objects are returned by groupby calls: :func:`pandas.DataFrame.groupby`, :func:`pandas.Series.groupby`, etc.
+
+Indexing, iteration
+~~~~~~~~~~~~~~~~~~~
+.. autosummary::
+   :toctree: generated/
+   
+   GroupBy.__iter__
+   GroupBy.groups
+   GroupBy.indices
+   GroupBy.get_group
+
+Function application
+~~~~~~~~~~~~~~~~~~~~
+.. autosummary::
+   :toctree: generated/
+
+   GroupBy.apply
+   GroupBy.aggregate
+   GroupBy.transform
+
+Computations / Descriptive Stats
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+.. autosummary::
+   :toctree: generated/
+   
+   GroupBy.mean
+   GroupBy.median
+   GroupBy.std
+   GroupBy.var
+   GroupBy.ohlc
+
 ..
     HACK - see github issue #4539. To ensure old links remain valid, include
     here the autosummaries with previous currentmodules as a comment and add
 
@@ -9,6 +9,7 @@
    randn = np.random.randn
    np.set_printoptions(precision=4, suppress=True)
    from pandas.compat import lrange
+   options.display.max_rows=15
 
 ==============================
  Essential Basic Functionality
@@ -103,8 +104,8 @@ a set of specialized cython routines that are especially fast when dealing with
 Here is a sample (using 100 column x 100,000 row ``DataFrames``):
 
 .. csv-table::
-    :header: "Operation", "0.11.0 (ms)", "Prior Vern (ms)", "Ratio to Prior"
-    :widths: 30, 30, 30, 30
+    :header: "Operation", "0.11.0 (ms)", "Prior Version (ms)", "Ratio to Prior"
+    :widths: 25, 25, 25, 25
     :delim: ;
 
     ``df1 > df2``; 13.32; 125.35;  0.1063
@@ -556,12 +557,11 @@ will either be of lower dimension or the same dimension.
 about a data set. For example, suppose we wanted to extract the date where the
 maximum value for each column occurred:
 
-
 .. ipython:: python
 
    tsdf = DataFrame(randn(1000, 3), columns=['A', 'B', 'C'],
                     index=date_range('1/1/2000', periods=1000))
-   tsdf.apply(lambda x: x[x.idxmax()])
+   tsdf.apply(lambda x: x.idxmax())
 
 You may also pass additional arguments and keyword arguments to the ``apply``
 method. For instance, consider the following function you would like to apply:
@@ -1029,7 +1029,7 @@ with more than one group returns a DataFrame with one column per group.
 
    Series(['a1', 'b2', 'c3']).str.extract('([ab])(\d)')
 
-Elements that do not match return a row of ``NaN``s.
+Elements that do not match return a row filled with ``NaN``.
 Thus, a Series of messy strings can be "converted" into a
 like-indexed Series or DataFrame of cleaned-up or more useful strings,
 without necessitating ``get()`` to access tuples or ``re.match`` objects.
@@ -1051,18 +1051,35 @@ can also be used.
 Testing for Strings that Match or Contain a Pattern
 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 
-In previous versions, *extracting* match groups was accomplished by ``match``,
-which returned a not-so-convenient Series of tuples. Starting in version 0.14,
-the default behavior of match will change. It will return a boolean
-indexer, analagous to the method ``contains``.
 
-The distinction between
-``match`` and ``contains`` is strictness: ``match`` relies on
-strict ``re.match`` while ``contains`` relies on ``re.search``.
+You can check whether elements contain a pattern:
+
+.. ipython:: python
+
+   pattern = r'[a-z][0-9]'
+   Series(['1', '2', '3a', '3b', '03c']).str.contains(pattern)
+
+or match a pattern:
 
-In version 0.13, ``match`` performs its old, deprecated behavior by default,
-but the new behavior is availabe through the keyword argument
-``as_indexer=True``.
+
+.. ipython:: python
+
+   Series(['1', '2', '3a', '3b', '03c']).str.match(pattern, as_indexer=True)
+
+The distinction between ``match`` and ``contains`` is strictness: ``match`` 
+relies on strict ``re.match``, while ``contains`` relies on ``re.search``.
+
+.. warning::
+
+   In previous versions, ``match`` was for *extracting* groups,
+   returning a not-so-convenient Series of tuples. The new method ``extract``
+   (described in the previous section) is now preferred.
+
+   This old, deprecated behavior of ``match`` is still the default. As
+   demonstrated above, use the new behavior by setting ``as_indexer=True``.
+   In this mode, ``match`` is analogous to ``contains``, returning a boolean
+   Series. The new behavior will become the default behavior in a future 
+   release.
 
 Methods like ``match``, ``contains``, ``startswith``, and ``endswith`` take
  an extra ``na`` arguement so missing values can be considered True or False:
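
The strictness difference between ``match`` and ``contains`` described in this hunk comes from the underlying ``re`` functions. The sketch below uses only the standard library rather than the pandas string methods. The ``values`` list is made up for illustration, while ``pattern`` and ``doc_values`` are copied from the documentation example.

```python
import re

pattern = r'[a-z][0-9]'

# Hypothetical inputs chosen so that the two functions disagree.
values = ['a1', 'xa1', '1a']

# re.match only succeeds if the pattern matches at the start of the string.
matched = [bool(re.match(pattern, v)) for v in values]

# re.search succeeds if the pattern matches anywhere in the string.
searched = [bool(re.search(pattern, v)) for v in values]

print(matched)   # [True, False, False]
print(searched)  # [True, True, False]

# The sample values from the documentation contain no lowercase letter
# followed by a digit, so both approaches report False for every element.
doc_values = ['1', '2', '3a', '3b', '03c']
print([bool(re.search(pattern, v)) for v in doc_values])
```

The middle element, ``'xa1'``, shows the distinction: the pattern occurs inside the string but not at its start.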
@@ -1457,6 +1474,21 @@ It's also possible to reset multiple options at once (using a regex):
    reset_option("^display")
 
 
+.. versionadded:: 0.13.1
+
+   Beginning with v0.13.1 the ``option_context`` context manager has been exposed through
+   the top-level API, allowing you to execute code with given option values. Option values
+   are restored automatically when you exit the ``with`` block:
+
+.. ipython:: python
+
+   with option_context("display.max_rows", 10, "display.max_columns", 5):
+       print(get_option("display.max_rows"))
+       print(get_option("display.max_columns"))
+
+   print(get_option("display.max_rows"))
+   print(get_option("display.max_columns"))
+
 
 Console Output Formatting
 -------------------------