Skip to content

Issues: pandas-dev/pandas

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

ENH: Add first_inverted and last_inverted options to keep in DataFrame.duplicated Closing Candidate May be closeable, needs more eyeballs duplicated duplicated, drop_duplicates Enhancement Needs Discussion Requires discussion from core team before further action
#60394 opened Nov 22, 2024 by tommycarstensen
1 of 3 tasks
DOC: Add example of drop_duplicates dropping a first-level Docs duplicated duplicated, drop_duplicates
#47813 opened Jul 21, 2022 by smarie
1 of 3 tasks
BUG: duplicated() returns different results on consecutive runs! Bug duplicated duplicated, drop_duplicates
#46864 opened Apr 25, 2022 by emsi
2 of 3 tasks
TYP: Improve typing on pandas.core.algorithms.duplicated() duplicated duplicated, drop_duplicates Typing type annotations, mypy/pyright type checking
#42604 opened Jul 19, 2021 by Dr-Irv
ENH: consider using sets and not maps for isin, unique and duplicated duplicated duplicated, drop_duplicates Enhancement isin isin method Performance Memory or execution speed performance
#39799 opened Feb 13, 2021 by realead
BUG: DataFrame.drop_duplicates confuses NULL bytes Bug duplicated duplicated, drop_duplicates hashing hash_pandas_object Strings String extension data type and string data
#34551 opened Jun 3, 2020 by marco-neumann-by
Feature request: Allow the subset argument of dropna and drop_duplicates to take a single label API - Consistency Internal Consistency of API/Behavior Deprecate Functionality to remove in pandas duplicated duplicated, drop_duplicates
#32654 opened Mar 12, 2020 by tsoernes
API: Allow use of keep="all" in duplicated API Design duplicated duplicated, drop_duplicates
#23251 opened Oct 20, 2018 by svenharris
BUG: df.duplicated treats None as np.nan in object columns Bug duplicated duplicated, drop_duplicates Missing-data np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate
#21720 opened Jul 3, 2018 by h-vetinari
BUG: drop_duplicates() doesn't work for object dtype series containing numpy nans Bug duplicated duplicated, drop_duplicates Missing-data np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate
#16632 opened Jun 8, 2017 by ran404
PERF: use StringHashTable for value_counts / duplicated with strings Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff duplicated duplicated, drop_duplicates Performance Memory or execution speed performance Strings String extension data type and string data
#14860 opened Dec 12, 2016 by jreback
2 tasks
BUG: duplicated() on a empty DataFrame or a DataFrame with an empty subset of columns with a non-empty index Bug duplicated duplicated, drop_duplicates Missing-data np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate
#12869 opened Apr 11, 2016 by sebov
df.duplicated and drop_duplicates raise TypeError with unhashable values. Bug duplicated duplicated, drop_duplicates Nested Data Data where the values are collections (lists, sets, dicts, objects, etc.).
#12693 opened Mar 22, 2016 by Abrosimov-a-a
ProTip! What’s not been updated in a month: updated:<2025-04-18.