ENH: make infer_datetime_format strict

[to_datetime](https://pandas.pydata.org/docs/reference/api/pandas.to_datetime.html) has an argument `infer_datetime_format` which, if set to `True`, will guess the format from the first non-NaN row.

People ([users](https://github.com/pandas-dev/pandas/issues/46210), but also core devs, e.g. [here](https://github.com/pandas-dev/pandas/pull/47745#discussion_r940141720) and [here](https://github.com/pandas-dev/pandas/pull/35428#pullrequestreview-456225397)), expect that the format inferred from the first row will be applied to the rest of the series. i.e. that the following two should behave the same:

```
pd.to_datetime(['01-31-2000', '20-01-2000'], infer_datetime_format=True)
pd.to_datetime(['01-31-2000', '20-01-2000'], format='%m-%d-%Y')
```

However, they don't: the latter raises, whilst the first one swaps format midway.

Although[ this is documented in the user guide](https://pandas.pydata.org/docs/user_guide/io.html#inferring-datetime-format), it's not what people expect.

Making this argument strict would align more to people's expectations, but also simplify the codebase, as it would get rid of special-casing such as

https://github.com/pandas-dev/pandas/blob/ac648eeaf5c27ab957e8cd284eb7e49a45232f00/pandas/core/tools/datetimes.py#L488-L499

**TL;RD** I'm suggesting that when using `infer_datetime_format=True`, the format detected from the first non-NaN value should be used to parse the rest of the Series, exactly as if the user had passed it to `format=`

This would be one step towards addressing #12585

@pandas-dev/pandas-core any thoughts here?

----

EDIT: I'm hoping that https://github.com/pandas-dev/pandas/pull/48621 can supersede this

	if not infer_datetime_format:
	if errors == "raise":
	raise
	elif errors == "coerce":
	result = np.empty(arg.shape, dtype="M8[ns]")
	iresult = result.view("i8")
	iresult.fill(iNaT)
	else:
	result = arg
	else:
	# Indicates to the caller to fallback to objects_to_datetime64ns
	return None

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: make infer_datetime_format strict #48596

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

ENH: make infer_datetime_format strict #48596

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions