You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
pandas supports the integration with many file formats or data sources out of the box (csv, excel, sql, json, parquet,…). Importing data from each of these
138
-
data sources is provided by function with the prefix ``read_*``. Similarly, the ``to_*`` methods are used to store data.
137
+
pandas supports the integration with many file formats or data sources out of the box (csv, excel, sql, json, parquet,…). The ability to import data from each of these
138
+
data sources is provided by functions with the prefix, ``read_*``. Similarly, the ``to_*`` methods are used to store data.
139
139
140
140
.. image:: ../_static/schemas/02_io_readwrite.svg
141
141
:align:center
@@ -181,7 +181,7 @@ data sources is provided by function with the prefix ``read_*``. Similarly, the
Basic statistics (mean, median, min, max, counts...) are easily calculable. These or custom aggregations can be applied on the entire
325
+
Basic statistics (mean, median, min, max, counts...) are easily calculable across data frames. These, or even custom aggregations, can be applied on the entire
326
326
data set, a sliding window of the data, or grouped by categories. The latter is also known as the split-apply-combine approach.
327
327
328
328
.. image:: ../_static/schemas/06_groupby.svg
@@ -369,8 +369,8 @@ data set, a sliding window of the data, or grouped by categories. The latter is
Change the structure of your data table in multiple ways. You can :func:`~pandas.melt` your data table from wide to long/tidy form or:func:`~pandas.pivot`
373
-
from long to wide format. With aggregations built-in, a pivot table is created with a single command.
372
+
Change the structure of your data table in a variety of ways. You can use :func:`~pandas.melt` to reshape your data from a wide format to a long and tidy one. Use:func:`~pandas.pivot`
373
+
to go from long to wide format. With aggregations built-in, a pivot table can be created with a single command.
374
374
375
375
.. image:: ../_static/schemas/07_melt.svg
376
376
:align:center
@@ -416,7 +416,7 @@ from long to wide format. With aggregations built-in, a pivot table is created w
Multiple tables can be concatenated both column wise and row wise as database-like join/merge operations are provided to combine multiple tables of data.
419
+
Multiple tables can be concatenated column wise or row wise with pandas' database-like join and merge operations.
420
420
421
421
.. image:: ../_static/schemas/08_concat_row.svg
422
422
:align:center
@@ -505,7 +505,7 @@ pandas has great support for time series and has an extensive set of tools for w
Data sets do not only contain numerical data. pandas provides a wide range of functions to clean textual data and extract useful information from it.
508
+
Data sets often contain more than just numerical data. pandas provides a wide range of functions to clean textual data and extract useful information from it.
509
509
510
510
.. raw:: html
511
511
@@ -551,9 +551,9 @@ the pandas-equivalent operations compared to software you already know:
551
551
:class-card: comparison-card
552
552
:shadow: md
553
553
554
-
The `R programming language <https://www.r-project.org/>`__ provides the
555
-
``data.frame`` data structure and multiple packages, such as
556
-
`tidyverse <https://www.tidyverse.org>`__ use and extend ``data.frame``
554
+
The `R programming language <https://www.r-project.org/>`__ provides a
555
+
``data.frame`` data structure as well as packages like
556
+
`tidyverse <https://www.tidyverse.org>`__ which use and extend ``data.frame``
557
557
for convenient data handling functionalities similar to pandas.
558
558
559
559
+++
@@ -572,8 +572,8 @@ the pandas-equivalent operations compared to software you already know:
572
572
:class-card: comparison-card
573
573
:shadow: md
574
574
575
-
Already familiar to ``SELECT``, ``GROUP BY``, ``JOIN``, etc.?
576
-
Most of these SQL manipulations do have equivalents in pandas.
575
+
Already familiar with ``SELECT``, ``GROUP BY``, ``JOIN``, etc.?
576
+
Many SQL manipulations have equivalents in pandas.
577
577
578
578
+++
579
579
@@ -631,10 +631,10 @@ the pandas-equivalent operations compared to software you already know:
631
631
:class-card: comparison-card
632
632
:shadow: md
633
633
634
-
The `SAS <https://en.wikipedia.org/wiki/SAS_(software)>`__statistical software suite
635
-
also provides the ``data set`` corresponding to the pandas ``DataFrame``.
636
-
Also SAS vectorized operations, filtering, string processing operations,
637
-
and more have similar functions in pandas.
634
+
`SAS <https://en.wikipedia.org/wiki/SAS_(software)>`__, the statistical software suite,
635
+
uses the ``data set`` structure, which closely corresponds pandas' ``DataFrame``.
636
+
Also SAS vectorized operations such as filtering or string processing operations
Copy file name to clipboardExpand all lines: doc/source/user_guide/missing_data.rst
+2-4
Original file line number
Diff line number
Diff line change
@@ -337,10 +337,8 @@ When taking the product, NA values or empty data will be treated as 1.
337
337
pd.Series([], dtype="float64").prod()
338
338
339
339
Cumulative methods like :meth:`~DataFrame.cumsum` and :meth:`~DataFrame.cumprod`
340
-
ignore NA values by default preserve them in the result. This behavior can be changed
341
-
with ``skipna``
342
-
343
-
* Cumulative methods like :meth:`~DataFrame.cumsum` and :meth:`~DataFrame.cumprod` ignore NA values by default, but preserve them in the resulting arrays. To override this behaviour and include NA values, use ``skipna=False``.
340
+
ignore NA values by default, but preserve them in the resulting array. To override
341
+
this behaviour and include NA values in the calculation, use ``skipna=False``.
0 commit comments