BUG: dropna changes index even when nothing dropped #43618

debnathshoham · 2021-09-17T10:32:32Z

closes BUG: dropna('rows') changes index type even when no row was dropped #41965
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

pandas/core/frame.py

pandas/tests/frame/methods/test_dropna.py

pandas/core/indexes/range.py

jbrockmendel · 2021-09-21T18:26:54Z

pandas/core/indexes/range.py

- fill_value=fill_value,
- **kwargs,
- )
+ if len(self) == len(indices) and np.all(self == indices):


instead, could use lib.maybe_indices_to_slice; if you do get a slice then can return self[slc]

In general, checking this might give a considerable performance bottleneck in the take function. Can you check some cases how much this impact the performance of idx.take(..) with a few examples?

yes there is perf drop.
Would the slice approach help with that?

before after ratio [f3d48175] [7d511e16] <master> <gh41965> + 1.97±0.2ms 3.28±1ms 1.67 indexing.Take.time_take('int')

Made the change suggested by @jbrockmendel, and this no longer hits perf

@jbrockmendel , thoughts why this is failing on 32bit?

pandas/core/indexes/range.py

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

jreback · 2021-09-25T21:28:59Z

pandas/core/indexes/range.py

+ slc = maybe_indices_to_slice(indices, len(indices))
+ return self[slc]
+ except (ValueError, TypeError):
+ pass


when does this raise (and you are catching)?

this was raising in cases where I was not getting a slice (either because indices was BlockPlacement -> TypeError, or when indices contained anything other than intp_t -> ValueError).

can you just do something like

if isinstance(indices, slice): .....

to avoid the try/except

tried to do that.. but there can be a lot of possibilities of the contents of indices, and that's where the error is thrown

pandas/core/indexes/range.py

jreback · 2021-09-28T18:13:34Z

pandas/core/indexes/range.py

+ slc = maybe_indices_to_slice(indices, len(indices))
+ return self[slc]
+ except (ValueError, TypeError):
+ pass


can you just do something like

if isinstance(indices, slice): .....

to avoid the try/except

pandas/tests/reshape/concat/test_concat.py

jreback · 2021-09-29T13:21:21Z

pandas/core/indexes/range.py

 with rewrite_exception("Int64Index", type(self).__name__):
+ if (
+ fill_value is None
+ and isinstance(indices, np.ndarray)


this is way too complicated and likely hiding bugs. @jbrockmendel can you suggest anything here.

take a look at what we do in DatetimeTimedeltaMixin.take; should be possible to do this without the try/except

jbrockmendel · 2021-10-18T22:00:08Z

pandas/tests/frame/methods/test_dropna.py

 expected = DataFrame({"a": [1, 2, 3]})
 tm.assert_frame_equal(result, expected)
+
+ def test_dropna_retains_RangeIndex_when_nothing_dropped(self):


RangeIndex -> rangeindex

jreback · 2021-10-21T01:57:17Z

need to merge master and address comments

github-actions · 2021-11-21T00:13:54Z

This pull request is stale because it has been open for thirty days with no activity. Please update or respond to this comment if you're still interested in working on this.

jreback · 2021-11-28T20:45:18Z

closing as stale if you want to continue, pls ping to reopen.

debnathshoham added 2 commits September 17, 2021 15:59

BUG: dropna changes index even when nothing dropped

4ad5f13

added whatsnew

d7cfef2

jreback requested changes Sep 17, 2021

View reviewed changes

pandas/core/frame.py Outdated Show resolved Hide resolved

pandas/tests/frame/methods/test_dropna.py Outdated Show resolved Hide resolved

jreback added the Indexing Related to indexing on series/frames, not to indexes themselves label Sep 17, 2021

debnathshoham added 2 commits September 17, 2021 22:50

changed repr to check_exact

39a2f5c

revert change in dropna

2e11c4d

debnathshoham marked this pull request as draft September 17, 2021 18:09

jreback requested changes Sep 17, 2021

View reviewed changes

pandas/tests/frame/methods/test_dropna.py Outdated Show resolved Hide resolved

amend in RangeIndex.take

ec9344f

debnathshoham marked this pull request as ready for review September 18, 2021 08:26

debnathshoham requested a review from jreback September 18, 2021 08:26

mzeitlin11 reviewed Sep 18, 2021

View reviewed changes

pandas/core/indexes/range.py Outdated Show resolved Hide resolved

changed all to np.all

ce90c65

jbrockmendel reviewed Sep 21, 2021

View reviewed changes

jorisvandenbossche reviewed Sep 21, 2021

View reviewed changes

pandas/core/indexes/range.py Outdated Show resolved Hide resolved

debnathshoham and others added 9 commits September 23, 2021 21:19

Update pandas/core/indexes/range.py

939c0d8

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

Merge branch 'master' into gh41965

7d511e1

use lib.maybe_indices_to_slice

5cbe1f0

Merge branch 'master' into gh41965

4ef2825

try except if slice raises

e875e74

handle fill_value separately

aa5320c

fixed incorrect RangeIndex concat

2cff764

Merge branch 'master' into gh41965

1a1cabb

spelling correction in whatsnew

afd20b8

jreback requested changes Sep 25, 2021

View reviewed changes

Merge branch 'master' into gh41965

8a0089f

jreback requested changes Sep 28, 2021

View reviewed changes

debnathshoham added 3 commits September 29, 2021 00:22

added test with increasing RangeIndex; removed try-except

3bf4a82

Merge branch 'master' into gh41965

df074e4

Merge branch 'master' into gh41965

2c42b44

added try-except for ValueError

81b9c14

jreback reviewed Sep 29, 2021

View reviewed changes

jbrockmendel reviewed Oct 18, 2021

View reviewed changes

github-actions bot added the Stale label Nov 21, 2021

jreback closed this Nov 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG: dropna changes index even when nothing dropped #43618

BUG: dropna changes index even when nothing dropped #43618

Uh oh!

debnathshoham commented Sep 17, 2021 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jbrockmendel Sep 21, 2021

jorisvandenbossche Sep 21, 2021

debnathshoham Sep 23, 2021

debnathshoham Sep 24, 2021

debnathshoham Sep 25, 2021

Uh oh!

jreback Sep 25, 2021

debnathshoham Sep 26, 2021 •

edited

Loading

jreback Sep 28, 2021

debnathshoham Sep 29, 2021

Uh oh!

jreback Sep 28, 2021

Uh oh!

jreback Sep 29, 2021

jreback Sep 29, 2021

jbrockmendel Oct 18, 2021

jbrockmendel Oct 18, 2021

jreback commented Oct 21, 2021

github-actions bot commented Nov 21, 2021

jreback commented Nov 28, 2021

Labels

5 participants

Uh oh!

BUG: dropna changes index even when nothing dropped #43618

BUG: dropna changes index even when nothing dropped #43618

Uh oh!

Conversation

debnathshoham commented Sep 17, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

debnathshoham Sep 26, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jreback commented Oct 21, 2021

github-actions bot commented Nov 21, 2021

jreback commented Nov 28, 2021

Labels

5 participants

debnathshoham commented Sep 17, 2021 •

edited

Loading

debnathshoham Sep 26, 2021 •

edited

Loading