CLN: DataFrameGroupBy._cython_agg_general #30384

topper-123 · 2019-12-20T22:48:09Z

Some small cleanups that make the code easier to read IMO.

jbrockmendel · 2019-12-20T23:52:07Z

pandas/core/groupby/generic.py

@@ -63,7 +64,7 @@
 )
 from pandas.core.indexes.api import Index, MultiIndex, all_indexes_same
 import pandas.core.indexes.base as ibase
-from pandas.core.internals import BlockManager, make_block
+from pandas.core.internals import Block, BlockManager, make_block


if Block is only used for annotations, can you put it in a if TYPE_CHECKING

jbrockmendel · 2019-12-20T23:52:51Z

pandas/core/groupby/generic.py

@@ -1751,7 +1752,7 @@ def count(self):
        ]
        blk = map(make_block, counted, loc)


while you're at it, can you rename blk to blks or something else clearly plural

jbrockmendel · 2019-12-21T00:26:04Z

pandas/core/groupby/generic.py

@@ -1750,9 +1755,9 @@ def count(self):
        counted = [
            lib.count_level_2d(x, labels=ids, max_bin=ngroups, axis=1) for x in val
        ]
-        blk = map(make_block, counted, loc)
+        blocks = map(make_block, counted, loc)


@WillAyd is getting Block/BlockManager stuff out of here still something you're working on?

jbrockmendel · 2019-12-21T19:56:57Z

LGTM

jbrockmendel · 2019-12-23T01:15:57Z

cc @WillAyd for a second pair of eyes

WillAyd · 2019-12-23T08:57:11Z

pandas/core/groupby/generic.py

@@ -1691,17 +1697,17 @@ def _wrap_transformed_output(

        return result

-    def _wrap_agged_blocks(self, items, blocks):
+    def _agg_blocks_to_frame(self, items: Index, blocks: "List[Block]") -> DataFrame:


I would actually prefer to keep this named _wrap_agged_blocks since it offers the same functionality as the rest of the _wrap_* functions in groupby, albeit with different input types

That's a fair point, I wasn't aware of this systematic naming, I'll change it back. With the added typing we can now see better that a frame is returned

topper-123 · 2019-12-23T10:06:19Z

pandas/core/groupby/generic.py

@@ -1691,17 +1697,17 @@ def _wrap_transformed_output(

        return result

-    def _wrap_agged_blocks(self, items, blocks):
+    def _wrap_agged_blocks(self, blocks: "List[Block]", items: Index) -> DataFrame:


I've inverted the argument order. I think it makes more sense to have the blocks come first, like in BlockManager.

topper-123 · 2019-12-23T10:09:49Z

pandas/core/groupby/generic.py

        ]
-        blk = map(make_block, counted, loc)
+        blocks = [make_block(val, placement=loc) for val, loc in zip(counted, locs)]


Just some minor cleanups above: pluralizing names + use list comprehension instead of map.

i tend to prefer these too

Can this just be a generator expression or does that fail? I see this is equivalent to existing code but maybe adds unnecessary overhead

It would work, but wouldn't make any difference because it's passed into the BlockManager where it's stored.

WillAyd · 2019-12-23T22:39:55Z

lgtm @jreback if you care to look

jreback · 2019-12-24T14:42:10Z

thanks!

jbrockmendel reviewed Dec 20, 2019

View reviewed changes

jbrockmendel reviewed Dec 21, 2019

View reviewed changes

gfyoung added Clean Typing type annotations, mypy/pyright type checking labels Dec 22, 2019

WillAyd requested changes Dec 23, 2019

View reviewed changes

WillAyd added this to the 1.0 milestone Dec 23, 2019

topper-123 commented Dec 23, 2019

View reviewed changes

WillAyd approved these changes Dec 23, 2019

View reviewed changes

topper-123 added 5 commits December 23, 2019 22:42

CLN: DataFrameGroupBy._cython_agg_general

911a119

changes according to comments

cccfd90

avoid flake8 complaint

ba5f8cf

_agg_blocks_to_frame -> _wrap_agged_blocks + cleaning

569cfa7

minor cleanups

e9e6b56

topper-123 force-pushed the cleaupn_DataFrameGroupBy._cython_agg_general branch from a20386b to e9e6b56 Compare December 23, 2019 22:44

jreback merged commit 7a36c79 into pandas-dev:master Dec 24, 2019

AlexKirko pushed a commit to AlexKirko/pandas that referenced this pull request Dec 29, 2019

CLN: DataFrameGroupBy._cython_agg_general (pandas-dev#30384)

a8faa48

		@@ -1751,7 +1752,7 @@ def count(self):
		]
		blk = map(make_block, counted, loc)

Uh oh!

CLN: DataFrameGroupBy._cython_agg_general #30384

CLN: DataFrameGroupBy._cython_agg_general #30384

Uh oh!

Conversation

topper-123 commented Dec 20, 2019

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jbrockmendel commented Dec 21, 2019

Uh oh!

jbrockmendel commented Dec 23, 2019

Uh oh!

Choose a reason for hiding this comment

Uh oh!

topper-123 Dec 23, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

WillAyd commented Dec 23, 2019

Uh oh!

jreback commented Dec 24, 2019

Uh oh!

Uh oh!

topper-123 Dec 23, 2019 •

edited

Loading