BUG: Set dtypes of new columns when stacking (#36991) #40127

maroth96 · 2021-02-28T20:53:35Z

closes BUG: MultiIndex loses category after .stack() #36991
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

pep8speaks · 2021-02-28T20:53:38Z

Hello @maroth96! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2021-03-10 01:29:17 UTC

pandas/core/reshape/reshape.py

jreback

can you add a whatsnew note. 1.3 bug fixes under reshaping is good.

jreback · 2021-03-01T20:56:41Z

pandas/core/reshape/reshape.py

+        new_columns = MultiIndex.from_arrays(
+            [
+                Index(new_level, dtype=level.dtype)
+                if None not in new_level


do we have any tests that hit the None case?

Yes, specifically test_stack_nan_in_multiindex_columns.

None is a tricky case---it is allowed for some Index types (like CategoricalIndex) but not others (like Int64Index). I am just avoiding levels with None entirely.

I've also looked at the way None is handled elsewhere, and it's not totally consistent:

>>> MultiIndex.from_arrays([[1, 2, None]]).levels[0] Int64Index([1, 2], dtype='int64')

But:

>>> Index([1, 2, None]) Index([1, 2, None], dtype='object')

Perhaps in the future it would be better if Index accepted None values, even when a dtype is specified. Then Index([1, 2, None], dtype='int64') would return Int64Index([1, 2], dtype='int64'). Then we wouldn't need such a condition here. Thoughts?

hmm interesting, ok can you open an issue specifically showing the MI vs Index cases. I agree we should do something about this. ok for here on this PR.

pandas/core/reshape/reshape.py

jreback

@jbrockmendel if you can have a look here

jreback · 2021-03-03T03:11:49Z

pandas/core/reshape/reshape.py

+        new_columns = MultiIndex.from_arrays(
+            [
+                Index(new_level, dtype=level.dtype)
+                if None not in new_level


hmm interesting, ok can you open an issue specifically showing the MI vs Index cases. I agree we should do something about this. ok for here on this PR.

pandas/tests/frame/test_stack_unstack.py

maroth96 · 2021-03-03T07:13:58Z

I extracted a function for stacking the column index. Let me know if you prefer it inlined. Otherwise, the same is possible for the index (lines 714--732).

pandas/core/reshape/reshape.py

pandas/tests/frame/test_stack_unstack.py

jreback

looks good some comments

pandas/core/reshape/reshape.py

jreback · 2021-03-10T13:55:57Z

thanks @maroth96 very nice! keep em coming!

…as-dev#40127)

BUG: Set dtypes of new columns when stacking (pandas-dev#36991)

1b90519

Fix PEP8 issues

f709330

jreback requested changes Feb 28, 2021

View reviewed changes

pandas/core/reshape/reshape.py Outdated Show resolved Hide resolved

maroth96 added 2 commits February 28, 2021 13:05

Fix pre-commit issues and add GH comment

23ff7ba

Use MultiIndex.from_arrays

9cb2055

maroth96 requested a review from jreback February 28, 2021 21:47

Reformat

75cb56c

jreback requested changes Mar 1, 2021

View reviewed changes

jreback added Bug Dtype Conversions Unexpected or buggy dtype conversions Reshaping Concat, Merge/Join, Stack/Unstack, Explode labels Mar 1, 2021

jreback added this to the 1.3 milestone Mar 1, 2021

maroth96 added 3 commits March 1, 2021 17:15

Remove unnecessary list()

61164b2

Fix mypy error

611841a

Add whatsnew entry

d8e7517

maroth96 requested a review from jreback March 2, 2021 01:31

jreback requested changes Mar 3, 2021

View reviewed changes

maroth96 added 3 commits March 2, 2021 22:28

Rewrite test such that it compares the series

8016c7f

Refactor such that unique_groups is no longer needed

ce161c7

Extract method for stacking the column index

bc69788

maroth96 requested a review from jreback March 3, 2021 07:10

Reformat

379bde5

jbrockmendel reviewed Mar 3, 2021

View reviewed changes

pandas/core/reshape/reshape.py Show resolved Hide resolved

jbrockmendel reviewed Mar 3, 2021

View reviewed changes

pandas/core/reshape/reshape.py Outdated Show resolved Hide resolved

Separate complex expression and add a comment

17e841d

maroth96 requested a review from jbrockmendel March 3, 2021 20:42

jbrockmendel reviewed Mar 6, 2021

View reviewed changes

pandas/core/reshape/reshape.py Outdated Show resolved Hide resolved

jbrockmendel reviewed Mar 6, 2021

View reviewed changes

pandas/core/reshape/reshape.py Show resolved Hide resolved

maroth96 and others added 2 commits March 6, 2021 12:23

Add function annotation and explanatory comments

9a7b29e

Merge branch 'master' into b-36991

76b5465

jreback requested changes Mar 8, 2021

View reviewed changes

Rewrite loop as list comprehension

c09697b

maroth96 requested a review from jreback March 9, 2021 04:35

jreback requested changes Mar 9, 2021

View reviewed changes

pandas/core/reshape/reshape.py Show resolved Hide resolved

pandas/core/reshape/reshape.py Outdated Show resolved Hide resolved

Add typing to _stack_multi_column_index

c6ab291

maroth96 requested a review from jreback March 10, 2021 01:29

jreback approved these changes Mar 10, 2021

View reviewed changes

jreback merged commit 7b5957f into pandas-dev:master Mar 10, 2021

jbrockmendel pushed a commit to jbrockmendel/pandas that referenced this pull request Mar 11, 2021

BUG: Set dtypes of new columns when stacking (pandas-dev#36991) (pand…

a9935f1

…as-dev#40127)

maroth96 mentioned this pull request Mar 11, 2021

BUG: Inconsistent handling of None in indices #40366

Open

3 tasks

maroth96 deleted the b-36991 branch March 13, 2021 07:15

This was referenced Feb 1, 2022

Make TSDataset.to_flatten faster tinkoff-ai/etna#475

Merged

Speed up TSDataset.to_flatten tinkoff-ai/etna#472

Closed

Uh oh!

BUG: Set dtypes of new columns when stacking (#36991) #40127

BUG: Set dtypes of new columns when stacking (#36991) #40127

Uh oh!

Conversation

maroth96 commented Feb 28, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pep8speaks commented Feb 28, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Comment last updated at 2021-03-10 01:29:17 UTC

Uh oh!

Uh oh!

jreback left a comment

Choose a reason for hiding this comment

Uh oh!

jreback Mar 1, 2021

Choose a reason for hiding this comment

Uh oh!

maroth96 Mar 2, 2021

Choose a reason for hiding this comment

Uh oh!

jreback Mar 3, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jreback left a comment

Choose a reason for hiding this comment

Uh oh!

jreback Mar 3, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

maroth96 commented Mar 3, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jreback left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

jreback commented Mar 10, 2021

Uh oh!

Uh oh!

maroth96 commented Feb 28, 2021 •

edited

Loading

pep8speaks commented Feb 28, 2021 •

edited

Loading