-
-
Notifications
You must be signed in to change notification settings - Fork 18.5k
Issues: pandas-dev/pandas
FEEDBACK: PyArrow as a required dependency and PyArrow backed...
#54466
opened Aug 9, 2023 by
phofl
Open
154
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
ENH: Add first_inverted and last_inverted options to keep in DataFrame.duplicated
Closing Candidate
May be closeable, needs more eyeballs
duplicated
duplicated, drop_duplicates
Enhancement
Needs Discussion
Requires discussion from core team before further action
#60394
opened Nov 22, 2024 by
tommycarstensen
1 of 3 tasks
DOC: Add example of drop_duplicates dropping a first-level
Docs
duplicated
duplicated, drop_duplicates
#47813
opened Jul 21, 2022 by
smarie
1 of 3 tasks
BUG: duplicated() returns different results on consecutive runs!
Bug
duplicated
duplicated, drop_duplicates
#46864
opened Apr 25, 2022 by
emsi
2 of 3 tasks
TYP: Improve typing on pandas.core.algorithms.duplicated()
duplicated
duplicated, drop_duplicates
Typing
type annotations, mypy/pyright type checking
#42604
opened Jul 19, 2021 by
Dr-Irv
ENH: consider using sets and not maps for isin, unique and duplicated
duplicated
duplicated, drop_duplicates
Enhancement
isin
isin method
Performance
Memory or execution speed performance
#39799
opened Feb 13, 2021 by
realead
BUG: DataFrame.drop_duplicates confuses NULL bytes
Bug
duplicated
duplicated, drop_duplicates
hashing
hash_pandas_object
Strings
String extension data type and string data
#34551
opened Jun 3, 2020 by
marco-neumann-by
Feature request: Allow the subset argument of dropna and drop_duplicates to take a single label
API - Consistency
Internal Consistency of API/Behavior
Deprecate
Functionality to remove in pandas
duplicated
duplicated, drop_duplicates
#32654
opened Mar 12, 2020 by
tsoernes
Unstable hashtable / duplicated algo for object dtype
Bug
duplicated
duplicated, drop_duplicates
#27035
opened Jun 25, 2019 by
jorisvandenbossche
API: Allow use of keep="all" in duplicated
API Design
duplicated
duplicated, drop_duplicates
#23251
opened Oct 20, 2018 by
svenharris
BUG: df.duplicated treats None as np.nan in object columns
Bug
duplicated
duplicated, drop_duplicates
Missing-data
np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate
#21720
opened Jul 3, 2018 by
h-vetinari
BUG: drop_duplicates() doesn't work for object dtype series containing numpy nans
Bug
duplicated
duplicated, drop_duplicates
Missing-data
np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate
#16632
opened Jun 8, 2017 by
ran404
PERF: use StringHashTable for value_counts / duplicated with strings
Algos
Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff
duplicated
duplicated, drop_duplicates
Performance
Memory or execution speed performance
Strings
String extension data type and string data
#14860
opened Dec 12, 2016 by
jreback
2 tasks
BUG: duplicated() on a empty DataFrame or a DataFrame with an empty subset of columns with a non-empty index
Bug
duplicated
duplicated, drop_duplicates
Missing-data
np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate
#12869
opened Apr 11, 2016 by
sebov
df.duplicated and drop_duplicates raise TypeError with unhashable values.
Bug
duplicated
duplicated, drop_duplicates
Nested Data
Data where the values are collections (lists, sets, dicts, objects, etc.).
#12693
opened Mar 22, 2016 by
Abrosimov-a-a
ENH: add axis argument to .drop_duplicates and .duplicated
duplicated
duplicated, drop_duplicates
Enhancement
#11250
opened Oct 6, 2015 by
121onto
ENH: drop_duplicates(consecutive=True) to drop only consecutive duplicates
API Design
duplicated
duplicated, drop_duplicates
Enhancement
#10540
opened Jul 10, 2015 by
bwillers
ProTip!
What’s not been updated in a month: updated:<2025-04-18.