Commit 2b0c017

DOC: add documentation for read_spss(pandas-dev#27476)
1 parent d7eb306 commit 2b0c017

2 files changed

+42
-0
lines changed

doc/source/reference/io.rst

+7
@@ -105,6 +105,13 @@ SAS
105105

106106
read_sas
107107

108+
SPSS
109+
~~~~
110+
.. autosummary::
111+
:toctree: api/
112+
113+
read_spss
114+
108115
SQL
109116
~~~
110117
.. autosummary::

doc/source/user_guide/io.rst

+35
@@ -39,6 +39,7 @@ The pandas I/O API is a set of top level ``reader`` functions accessed like
3939
binary;`Msgpack <https://msgpack.org/index.html>`__;:ref:`read_msgpack<io.msgpack>`;:ref:`to_msgpack<io.msgpack>`
4040
binary;`Stata <https://en.wikipedia.org/wiki/Stata>`__;:ref:`read_stata<io.stata_reader>`;:ref:`to_stata<io.stata_writer>`
4141
binary;`SAS <https://en.wikipedia.org/wiki/SAS_(software)>`__;:ref:`read_sas<io.sas_reader>`;
42+
binary;`SPSS <https://en.wikipedia.org/wiki/SPSS>`__;:ref:`read_spss<io.spss_reader>`;
4243
binary;`Python Pickle Format <https://docs.python.org/3/library/pickle.html>`__;:ref:`read_pickle<io.pickle>`;:ref:`to_pickle<io.pickle>`
4344
SQL;`SQL <https://en.wikipedia.org/wiki/SQL>`__;:ref:`read_sql<io.sql>`;:ref:`to_sql<io.sql>`
4445
SQL;`Google Big Query <https://en.wikipedia.org/wiki/BigQuery>`__;:ref:`read_gbq<io.bigquery>`;:ref:`to_gbq<io.bigquery>`
@@ -5477,6 +5478,40 @@ web site.
54775478

54785479
No official documentation is available for the SAS7BDAT format.
54795480

5481+
.. _io.spss:
5482+
5483+
.. _io.spss_reader:
5484+
5485+
SPSS formats
5486+
------------
5487+
5488+
The top-level function :func:`read_spss` can read (but not write) SPSS
5489+
`sav` (.sav) and `zsav` (.zsav) format files (since *v0.25.0*).
5490+
5491+
SPSS files contain column names. By default the
5492+
whole file is read and returned as a ``DataFrame`` with all columns.
5493+
5494+
Specify the ``usecols`` parameter to obtain a subset of columns. Categorical columns are
5494+
converted into ``pd.Categorical`` by default; pass ``convert_categoricals=False`` to skip this conversion.
5495+
5496+
Read an SPSS file:
5497+
5498+
.. code-block:: python
5499+
5500+
   df = pd.read_spss('spss_data.zsav')
5501+
5502+
Extract the subset of columns listed in ``usecols`` from an SPSS file and
5503+
convert categorical columns into ``pd.Categorical``:
5504+
5505+
.. code-block:: python
5506+
5507+
   df = pd.read_spss('spss_data.zsav', usecols=usecols, convert_categoricals=True)
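
Since the reader only handles ``.sav`` and ``.zsav`` files, it can help to check the extension before calling it. The following is a minimal sketch, not part of pandas: ``is_spss_path`` and ``read_spss_checked`` are hypothetical helper names, and matching the extension case-insensitively is an assumption made here for convenience.

```python
from pathlib import Path

# Extensions named in the text above; matched case-insensitively (an assumption).
SPSS_SUFFIXES = {".sav", ".zsav"}


def is_spss_path(path):
    """Return True if ``path`` ends in .sav or .zsav."""
    return Path(path).suffix.lower() in SPSS_SUFFIXES


def read_spss_checked(path, usecols=None):
    """Hypothetical wrapper: validate the extension, then call pd.read_spss."""
    if not is_spss_path(path):
        raise ValueError(f"expected a .sav or .zsav file, got {path!r}")
    # Imported lazily so the extension check works without pandas installed;
    # pd.read_spss additionally requires the optional pyreadstat package.
    import pandas as pd

    return pd.read_spss(path, usecols=usecols)
```

A call such as ``read_spss_checked('spss_data.zsav', usecols=usecols)`` then behaves like the direct ``pd.read_spss`` call, while a path with any other extension raises ``ValueError`` before the file is opened.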
5509+
5510+
More info_ about the sav and zsav file format is available from the IBM
5511+
web site.
5512+
5513+
.. _info: https://www.ibm.com/support/knowledgecenter/en/SSLVMB_22.0.0/com.ibm.spss.statistics.help/spss/base/savedatatypes.htm
5514+
54805515
.. _io.other:
54815516

54825517
Other file formats
