API: argmin/argmax behaviour for nullable dtypes with skipna=False

Now we are adding ExtensionArray.argmin/argmax (#24382 / #27801), we also need to decide on its behaviour for nullable / masked arrays. 
For the default of `skipna=True`, I think the behaviour is clear (we skip those values, and calculate the argmin/argmax of the remaining values), but we need to decide on the behaviour in case of `skipna=False` in presence of missing values.

I don't think it makes much sense to follow the current behaviour for Series with NaNs:

```
In [25]: pd.Series([1, 2, 3, np.nan]).argmin() 
Out[25]: 0

In [26]: pd.Series([1, 2, 3, np.nan]).argmin(skipna=False)  
Out[26]: -1
```

(the -1 is discussed separately in https://github.com/pandas-dev/pandas/issues/33941 as well)

We also can't really compare with numpy, as there NaNs are regarded as the largest values (which follows somewhat from the sorting behaviour with NaNs).

For other reductions, the logic is: once there is an NA present, the result is also an NA with `skipna=False` (with the logic here that the with NAs, the min/max is not known, and thus also not the location). 

So something like:

```python
>>> pd.Series([1, 2, 3, pd.NA], dtype="Int64").argmin(skipna=False)  
<NA>
```

However, this also means that `argmin`/`argmax` return something that is not an integer, and since those methods are typically used to get a value that can be used to index, that might also be annoying. 
An alternative could also be to raise an error (similar as for empty arrays).



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API: argmin/argmax behaviour for nullable dtypes with skipna=False #33942

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

API: argmin/argmax behaviour for nullable dtypes with skipna=False #33942

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions