@@ -68,35 +68,53 @@ data into a DataFrame object. They can take a number of arguments:
   whitespace.
 - ``header``: row number to use as the column names, and the start of the data.
   Defaults to 0 (first row); specify None if there is no header row.
-- ``names``: List of column names to use. If passed, header will be
-  implicitly set to None.
 - ``skiprows``: A collection of numbers for rows in the file to skip. Can
   also be an integer to skip the first ``n`` rows
-- ``index_col``: column number, or list of column numbers, to use as the
-  ``index`` (row labels) of the resulting DataFrame. By default, it will number
-  the rows without using any column, unless there is one more data column than
-  there are headers, in which case the first column is taken as the index.
-- ``parse_dates``: If True, attempt to parse the index column as dates. False
-  by default.
+- ``index_col``: column number, column name, or list of column numbers/names,
+  to use as the ``index`` (row labels) of the resulting DataFrame. By default,
+  it will number the rows without using any column, unless there is one more
+  data column than there are headers, in which case the first column is taken
+  as the index.
+- ``names``: List of column names to use. If passed, header will be
+  implicitly set to None.
+- ``na_values``: optional list of strings to recognize as NaN (missing values),
+  in addition to a default set.
+- ``parse_dates``: if True then the index will be parsed as dates
+  (False by default). You can specify more complicated options to parse
+  a subset of columns or a combination of columns into a single date column
+  (list of ints or names, list of lists, or dict):
+  [1, 2, 3] -> try parsing columns 1, 2, 3 each as a separate date column
+  [[1, 3]] -> combine columns 1 and 3 and parse as a single date column
+  {'foo' : [1, 3]} -> parse columns 1, 3 as a date and call the result 'foo'
+- ``keep_date_col``: if True, then date component columns passed into
+  ``parse_dates`` will be retained in the output (False by default).
 - ``date_parser``: function to use to parse strings into datetime
   objects. If ``parse_dates`` is True, it defaults to the very robust
   ``dateutil.parser``. Specifying this implicitly sets ``parse_dates`` as True.
-- ``na_values``: optional list of strings to recognize as NaN (missing values),
-  in addition to a default set.
+  You can also use functions from community supported date converters from
+  date_converters.py
+- ``dayfirst``: if True then uses the DD/MM international/European date format
+  (False by default)
+- ``thousands``: specifies the thousands separator. If not None, the parser
+  will look for it in the data and parse the relevant values to integers.
+  Because this essentially requires scanning through the data again, it causes
+  a significant performance hit, so use it only if necessary.
+- ``comment``: denotes the start of a comment; the rest of the line is ignored.
+  Currently line commenting is not supported.
 - ``nrows``: Number of rows to read out of the file. Useful to only read a
   small portion of a large file
+- ``iterator``: If True, return a ``TextParser`` to enable reading a file
+  into memory piece by piece
 - ``chunksize``: The number of rows to be used to "chunk" a file into
   pieces. Will cause a ``TextParser`` object to be returned. More on this
   below in the section on :ref:`iterating and chunking <io.chunking>`
-- ``iterator``: If True, return a ``TextParser`` to enable reading a file
-  into memory piece by piece
 - ``skip_footer``: number of lines to skip at bottom of file (default 0)
 - ``converters``: a dictionary of functions for converting values in certain
   columns, where keys are either integers or column labels
 - ``encoding``: a string representing the encoding to use if the contents are
   non-ascii
-- ``verbose`` : show number of NA values inserted in non-numeric columns
-
+- ``verbose``: show number of NA values inserted in non-numeric columns
+- ``squeeze``: if True then output with only one column is turned into a Series
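Several of the keyword arguments listed above can be combined in a single call. The following is a minimal sketch (not taken from the original docs); the file contents and column names are hypothetical, and the data is fed in through ``StringIO`` so the snippet is self-contained:

```python
# Sketch: combining skiprows, na_values, and index_col in one read_csv call.
# The data and column names here are illustrative, not from the docs.
from io import StringIO
from pandas import read_csv

data = """# generated by an export tool
date,value,flag
2012-01-01,1000,ok
2012-01-02,n/a,ok
"""

df = read_csv(
    StringIO(data),
    skiprows=1,            # skip the comment line at the top of the file
    na_values=["n/a"],     # treat "n/a" as NaN, in addition to the defaults
    index_col=0,           # use the first column ("date") as the row labels
)
print(df)
```

The result is a two-row DataFrame indexed by the ``date`` column, with the second ``value`` entry read as NaN.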

 .. ipython:: python
    :suppress:
@@ -117,8 +135,22 @@ The default for `read_csv` is to create a DataFrame with simple numbered rows:

    read_csv('foo.csv')

-In the case of indexed data, you can pass the column number (or a list of
-column numbers, for a hierarchical index) you wish to use as the index.
+In the case of indexed data, you can pass the column number or column name you
+wish to use as the index:
+
+.. ipython:: python
+
+   read_csv('foo.csv', index_col=0)
+
+.. ipython:: python
+
+   read_csv('foo.csv', index_col='date')
+
+You can also use a list of columns to create a hierarchical index:
+
+.. ipython:: python
+
+   read_csv('foo.csv', index_col=[0, 'A'])

 The parsers make every attempt to "do the right thing" and not be very
 fragile. Type inference is a pretty big deal. So if a column can be coerced to
@@ -127,6 +159,9 @@ columns will come through as object dtype as with the rest of pandas objects.

 .. _io.parse_dates:

+Specifying Date Columns
+~~~~~~~~~~~~~~~~~~~~~~~
+
 To better facilitate working with datetime data, :func:`~pandas.io.parsers.read_csv` and :func:`~pandas.io.parsers.read_table`
 use the keyword arguments ``parse_dates`` and ``date_parser`` to allow users
 to specify a variety of columns and date/time formats to turn the input text
@@ -139,6 +174,7 @@ The simplest case is to just pass in ``parse_dates=True``:

    # Use a column as an index, and parse it as dates.
    df = read_csv('foo.csv', index_col=0, parse_dates=True)
    df
+
    # These are python datetime objects
    df.index
@@ -184,6 +220,12 @@ to retain them via the ``keep_date_col`` keyword:
                  keep_date_col=True)
    df

+Note that if you wish to combine multiple columns into a single date column, a
+nested list must be used. In other words, ``parse_dates=[1, 2]`` indicates that
+the second and third columns should each be parsed as separate date columns,
+while ``parse_dates=[[1, 2]]`` means the two columns should be parsed into a
+single column.
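The distinction can be sketched without the parser itself. This is a rough illustration of the semantics described above, not pandas internals; with ``[[1, 2]]``, columns 1 and 2 are joined into one string before a single parse:

```python
# Rough illustration of the parse_dates nested-list semantics: join the
# two component columns first, then parse the result as one datetime.
from datetime import datetime

row = ["id1", "1999-01-27", "19:00:00"]  # columns 0, 1, 2 of one record

# parse_dates=[[1, 2]] behaves like: combine columns 1 and 2, parse once
combined = datetime.strptime(row[1] + " " + row[2], "%Y-%m-%d %H:%M:%S")
print(combined)
```

By contrast, ``parse_dates=[1, 2]`` would parse each of the two columns on its own, yielding two separate date columns.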
+
 You can also use a dict to specify custom names for the resulting columns:

 .. ipython:: python
@@ -192,6 +234,8 @@ You can also use a dict to specify custom names for the resulting columns:
    df = read_csv('tmp.csv', header=None, parse_dates=date_spec)
    df

+Date Parsing Functions
+~~~~~~~~~~~~~~~~~~~~~~
 Finally, the parser allows you to specify a custom ``date_parser`` function to
 take full advantage of the flexibility of the date parsing API:
@@ -204,7 +248,124 @@ take full advantage of the flexibility of the date parsing API:

 You can explore the date parsing functionality in ``date_converters.py`` and
 add your own. We would love to turn this module into a community supported set
-of date/time parsers.
+of date/time parsers. To get you started, ``date_converters.py`` contains
+functions to parse dual date and time columns, year/month/day columns,
+and year/month/day/hour/minute/second columns. It also contains a
+``generic_parser`` function so you can curry it with a function that deals with
+a single date rather than the entire array.
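The currying idea can be sketched as follows. This mirrors ``generic_parser`` in spirit only; it is an assumption about the shape of the helper, not the actual ``date_converters.py`` implementation (which operates on arrays):

```python
# Sketch of the generic_parser idea: take a function that builds ONE date
# from scalar components and apply it elementwise across parallel columns.
# This is illustrative, not the pandas date_converters implementation.
from datetime import datetime

def generic_parser(parse_func, *cols):
    # zip the columns into rows and parse each row with the scalar function
    return [parse_func(*row) for row in zip(*cols)]

def parse_ymd(year, month, day):
    return datetime(int(year), int(month), int(day))

dates = generic_parser(parse_ymd, ["2012", "2012"], ["1", "2"], ["15", "20"])
print(dates)
```

The benefit is that ``parse_ymd`` only has to deal with a single date, while the wrapper handles iterating over the whole set of columns.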
+
+.. ipython:: python
+   :suppress:
+
+   os.remove('tmp.csv')
+
+.. _io.convenience:
+
+Thousand Separators
+~~~~~~~~~~~~~~~~~~~
+
+For large integers that have been written with a thousands separator, you can
+set the ``thousands`` keyword to the separator character (e.g. ``','``) so
+that integers will be parsed correctly:
+
+.. ipython:: python
+   :suppress:
+
+   data = ("ID|level|category\n"
+           "Patient1|123,000|x\n"
+           "Patient2|23,000|y\n"
+           "Patient3|1,234,018|z")
+
+   with open('tmp.csv', 'w') as fh:
+       fh.write(data)
+
+By default, integers with a thousands separator will be parsed as strings
+
+.. ipython:: python
+
+   print open('tmp.csv').read()
+   df = read_csv('tmp.csv', sep='|')
+   df
+
+   df.level.dtype
+
+The ``thousands`` keyword allows integers to be parsed correctly
+
+.. ipython:: python
+
+   print open('tmp.csv').read()
+   df = read_csv('tmp.csv', sep='|', thousands=',')
+   df
+
+   df.level.dtype
+
+.. ipython:: python
+   :suppress:
+
+   os.remove('tmp.csv')
+
+Comments
+~~~~~~~~
+
+Sometimes comments or metadata may be included in a file:
+
+.. ipython:: python
+   :suppress:
+
+   data = ("ID,level,category\n"
+           "Patient1,123000,x # really unpleasant\n"
+           "Patient2,23000,y # wouldn't take his medicine\n"
+           "Patient3,1234018,z # awesome")
+
+   with open('tmp.csv', 'w') as fh:
+       fh.write(data)
+
+.. ipython:: python
+
+   print open('tmp.csv').read()
+
+By default, the parser includes the comments in the output:
+
+.. ipython:: python
+
+   df = read_csv('tmp.csv')
+   df
+
+We can suppress the comments using the ``comment`` keyword:
+
+.. ipython:: python
+
+   df = read_csv('tmp.csv', comment='#')
+   df
+
+.. ipython:: python
+   :suppress:
+
+   os.remove('tmp.csv')
+
+Returning Series
+~~~~~~~~~~~~~~~~
+
+Using the ``squeeze`` keyword, the parser will return output with a single
+column as a ``Series``:
+
+.. ipython:: python
+   :suppress:
+
+   data = ("level\n"
+           "Patient1,123000\n"
+           "Patient2,23000\n"
+           "Patient3,1234018")
+
+   with open('tmp.csv', 'w') as fh:
+       fh.write(data)
+
+.. ipython:: python
+
+   print open('tmp.csv').read()
+
+   output = read_csv('tmp.csv', squeeze=True)
+   output
+
+   type(output)

 .. ipython:: python
    :suppress: