Fix for #18178 and #18187 by changing the concat of empty RangeIndex #18191

SmokinCaterpillar · 2017-11-09T15:06:25Z

The _concat_rangeindex_same_dtype function now keeps track of the last non-empty RangeIndex to extract the new stop value.

This fixes two issues with concatenating non-empty and empty DataFrames and Series.

Two regression tests were added as well.

closes BUG: pd.concat raises if called on mixture of empty and non-empty dataframes #18178 and closes BUG: pd.concat returns empty series if called on mixture of empty and non-empty series #18187
2 regression tests added
Submission passes git diff master -u -- "*.py" | flake8 --diff
1 whatsnew entry

…of empty RangeIndex The `_concat_rangeindex_same_dtype` now keeps track of the last non-empty RangeIndex to extract the new stop value. This fixes two issues with concatenating non-empty and empty DataFrames and Series. Two regression tests were added as well.

SmokinCaterpillar · 2017-11-09T15:23:21Z

As an alternative solution, instead of using

if not len(obj):
    continue

and this new last_non_empty variable,
one could simply replace continue as follows:

if not len(obj):
    from pandas import Int64Index
    return _concat_index_same_dtype(indexes, klass=Int64Index)

However, in this case, the returned new index would always be changed to an integer one in case of a single empty RangeIndex in indexes. Accordingly, some already existing tests would need to be changed as well.

jreback · 2017-11-09T17:01:29Z

i am ok with the soln

jreback · 2017-11-09T17:02:14Z

pandas/core/dtypes/concat.py


    for obj in indexes:
        if not len(obj):
            continue
+        # Remember the last non-empty index for the stop value
+        last_non_empty = obj


maybe filter out he non empties first? then i think this loop should be ok?

SmokinCaterpillar · 2017-11-10T08:31:08Z

I made the code a bit nicer by first filtering the empty indexes with a list comprehension

codecov · 2017-11-10T09:05:25Z

Codecov Report

Merging #18191 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master   #18191      +/-   ##
==========================================
+ Coverage    91.4%    91.4%   +<.01%     
==========================================
  Files         163      163              
  Lines       50064    50065       +1     
==========================================
+ Hits        45763    45764       +1     
  Misses       4301     4301

Flag	Coverage Δ
#multiple	`89.21% <100%> (ø)`	⬆️
#single	`40.35% <33.33%> (-0.01%)`	⬇️

Impacted Files	Coverage Δ
pandas/core/dtypes/concat.py	`99.14% <100%> (ø)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 17e0b13...3082d0e. Read the comment docs.

jorisvandenbossche

Thanks!

Side note: the behaviour in case of all empties to take the start/stop/step feels a bit strange to me (why not the first? or convert to empty Int64Index?), but given that this is how it already was before, we can leave that for another PR (if we would want to change that)

SmokinCaterpillar · 2017-11-10T09:32:57Z

@jorisvandenbossche yes, another pull request for these changes is probably a good idea, especially because IMO this involves changing existing RangeIndex tests.

jreback · 2017-11-10T13:54:13Z

thanks @SmokinCaterpillar

yeah I find this logic slightly strange. If you'd like to follow up with a PR to clarify would be great!

SmokinCaterpillar · 2017-11-10T14:33:11Z

Opened a follow up PR #18214 :-)

… of empty RangeIndex (pandas-dev#18191)

… of empty RangeIndex (pandas-dev#18191) (cherry picked from commit 6b3641b)

…#18191) (cherry picked from commit 6b3641b)

Robert Meyer added 2 commits November 9, 2017 15:46

Added whatsnew entry

ffc3b6c

jreback reviewed Nov 9, 2017

View reviewed changes

jreback added Regression Functionality that used to work in a prior pandas version Reshaping Concat, Merge/Join, Stack/Unstack, Explode labels Nov 9, 2017

Filtering the non empty indexes before the main loop

3082d0e

jorisvandenbossche approved these changes Nov 10, 2017

View reviewed changes

jorisvandenbossche added this to the 0.21.1 milestone Nov 10, 2017

jreback merged commit 6b3641b into pandas-dev:master Nov 10, 2017

jreback added the Needs Backport label Nov 10, 2017

SmokinCaterpillar mentioned this pull request Nov 10, 2017

_concat_rangeindex_... now returns an empty RangeIndex for empty ranges #18214

Merged

2 tasks

jreback mentioned this pull request Nov 11, 2017

inconsistency in concat behavior in pandas 0.21.0 #18227

Closed

No-Stream pushed a commit to No-Stream/pandas that referenced this pull request Nov 28, 2017

Fix for pandas-dev#18178 and pandas-dev#18187 by changing the concat…

f7e931b

… of empty RangeIndex (pandas-dev#18191)

TomAugspurger pushed a commit to TomAugspurger/pandas that referenced this pull request Dec 8, 2017

Fix for pandas-dev#18178 and pandas-dev#18187 by changing the concat…

de4be1f

… of empty RangeIndex (pandas-dev#18191) (cherry picked from commit 6b3641b)

TomAugspurger pushed a commit that referenced this pull request Dec 11, 2017

Fix for #18178 and #18187 by changing the concat of empty RangeIndex (…

fc1c2b8

…#18191) (cherry picked from commit 6b3641b)

TomAugspurger removed the Needs Backport label Dec 11, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for #18178 and #18187 by changing the concat of empty RangeIndex #18191

Fix for #18178 and #18187 by changing the concat of empty RangeIndex #18191

SmokinCaterpillar commented Nov 9, 2017 •

edited

Loading

SmokinCaterpillar commented Nov 9, 2017

jreback commented Nov 9, 2017

jreback Nov 9, 2017

SmokinCaterpillar commented Nov 10, 2017

codecov bot commented Nov 10, 2017 •

edited

Loading

jorisvandenbossche left a comment

SmokinCaterpillar commented Nov 10, 2017

jreback commented Nov 10, 2017

SmokinCaterpillar commented Nov 10, 2017

Fix for #18178 and #18187 by changing the concat of empty RangeIndex #18191

Fix for #18178 and #18187 by changing the concat of empty RangeIndex #18191

Conversation

SmokinCaterpillar commented Nov 9, 2017 • edited Loading

SmokinCaterpillar commented Nov 9, 2017

jreback commented Nov 9, 2017

jreback Nov 9, 2017

Choose a reason for hiding this comment

SmokinCaterpillar commented Nov 10, 2017

codecov bot commented Nov 10, 2017 • edited Loading

Codecov Report

jorisvandenbossche left a comment

Choose a reason for hiding this comment

SmokinCaterpillar commented Nov 10, 2017

jreback commented Nov 10, 2017

SmokinCaterpillar commented Nov 10, 2017

SmokinCaterpillar commented Nov 9, 2017 •

edited

Loading

codecov bot commented Nov 10, 2017 •

edited

Loading