
PERF: DataFrame.groupby.nunique #15201

Closed
wants to merge 1 commit into from

jreback
Contributor

@jreback jreback commented Jan 23, 2017

closes #15197

In [2]: %timeit df.groupby(['key1', 'key2']).nunique()
100 loops, best of 3: 9.31 ms per loop

    before     after       ratio
  [be3f2aea] [6d026165]
-  473.67μs   427.27μs      0.90  groupby.GroupBySuite.time_nunique('float', 100)
-     4.41s     9.53ms      0.00  groupby.groupby_nunique.time_groupby_nunique

SOME BENCHMARKS HAVE CHANGED SIGNIFICANTLY.
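
For anyone reproducing the timing, here is a minimal sketch of the call being benchmarked. The PR does not show how `df` was built, so the frame size, the `value` column, and the random data are all assumptions. The per-group `apply` variant is included only as a cross-check of the result. It is not necessarily the code path this PR replaced.

```python
import numpy as np
import pandas as pd

# Hypothetical data: the PR does not include the frame used for the timings.
rng = np.random.default_rng(0)
n = 1_000
df = pd.DataFrame({
    "key1": rng.integers(0, 10, n),
    "key2": rng.integers(0, 5, n),
    "value": rng.integers(0, 50, n),  # assumed column name
})

# The call timed in the PR description: distinct values per group, per column.
fast = df.groupby(["key1", "key2"]).nunique()

# Cross-check: call Series.nunique once per group through apply.
slow = df.groupby(["key1", "key2"])["value"].apply(lambda s: s.nunique())

# Both approaches should agree on the per-group distinct counts.
print((fast["value"] == slow).all())
```

Wrapping each of the two groupby lines in `%timeit` in IPython gives a rough comparison on a local machine. The absolute numbers will differ from the asv figures above.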

@jreback jreback added the Groupby and Performance (memory or execution speed performance) labels Jan 23, 2017
@jreback jreback added this to the 0.20.0 milestone Jan 23, 2017
@jreback jreback closed this in dc40058 Jan 24, 2017
AnkurDedania pushed a commit to AnkurDedania/pandas that referenced this pull request Mar 21, 2017
closes pandas-dev#15197

Author: Jeff Reback <[email protected]>

Closes pandas-dev#15201 from jreback/nunique and squashes the following commits:

6d02616 [Jeff Reback] PERF: DataFrame.groupby.nunique
Successfully merging this pull request may close these issues.

PERF: DataFrame.groupby.nunique is non-performant
1 participant