Default string dtype should not raise fallback performance warnings

Issue related to the new default `"string"` dtype (https://github.com/pandas-dev/pandas/issues/54792)

As long as the pyarrow-backed string dtype was opt-in, so users could get the better performance, it might have made sense to warn for certain cases where it was not actually using pyarrow but was falling back to the slower object-dtype implementation.

However, when we make the pyarrow-backed dtype the default string dtype for pandas 3.0, I don't think the current warnings make sense:

```python
>>> pd.options.future.infer_string = True
>>> s = pd.Series(["a", "b", "c"])
>>> import re
>>> s.str.contains("a", flags=re.IGNORECASE)
PerformanceWarning: Falling back on a non-pyarrow code path which may decrease performance.
0     True
1    False
2    False
dtype: bool
```

Of course, it's still taking a slower code path, but 1) the user cannot do anything about this, and 2) the user also didn't choose this explicitly, it just got this dtype by default, so we don't need to warn the user that their choice for the faster dtype didn't have any effect. 

So I think we should remove the warning _by default_ (maybe still give users a way to get them optionally, now we have an option for this, xref https://github.com/pandas-dev/pandas/pull/56921), and better document in the docstring if a certain option will cause a slowdown.

Or if we keep a warning, it should at least be an _actionable_ warning, if that is possible. For example in the specific case above, a user could pass `case=False` instead of `flags=re.IGNORECASE` for the same result and without fallback. But then 1) the warning should actually point users to that better option, and 2) I don't know if such alternative will always exist.

cc @mroeschke @phofl 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Default string dtype should not raise fallback performance warnings #58581

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Default string dtype should not raise fallback performance warnings #58581

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions