Description
Is your feature request related to a problem? Please describe.
Handling cftime and numpy time coordinates can sometimes be exhausting. Here I am thinking of the following common problems:
- Querying the calendar type from a time coordinate.
- Converting a dataset from a calendar type to another.
- Generating a time coordinate in the correct calendar.
Describe the solution you'd like
- `ds.time.dt.calendar` would be magic.
- `xr.convert_calendar(ds, "new_cal")` could be nice?
- `xr.date_range(start, stop, calendar=cal)`, same as pandas' (see context below).
Describe alternatives you've considered
We have implemented all this (and more) in [xclim](https://xclim.readthedocs.io/en/stable/api.html#calendar-handling-utilities). But it seems to make sense that some of the simplest things there could move to xarray? We had this discussion in xarray-contrib/cf-xarray#193, and the suggestion was made to see what fits here before implementing this there.
Additional context
In xclim, to differentiate numpy `datetime64` from cftime types, we call the former "default". This way, a time coordinate using cftime's "proleptic_gregorian" calendar is distinct from one using numpy's `datetime64`.
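
As a rough illustration of this convention, the sketch below returns "default" for `datetime64` data and otherwise reads the calendar from the first non-null element. It is not xclim's actual code: `get_calendar` is only a stand-in name here, and the sketch assumes that non-numpy elements expose a `calendar` attribute, as `cftime.datetime` objects do.

```python
import numpy as np


def _is_null(value):
    """True for None or a float NaN, the two null markers handled in this sketch."""
    return value is None or (isinstance(value, float) and np.isnan(value))


def get_calendar(times):
    """Return "default" for datetime64 data, else the calendar of the first non-null element."""
    values = np.asarray(times)
    if np.issubdtype(values.dtype, np.datetime64):
        return "default"
    for value in values.ravel():
        if not _is_null(value):
            # Assumption: the element carries a `calendar` attribute (cftime-like object).
            return value.calendar
    raise ValueError("no non-null time value to infer a calendar from")


# numpy-backed time values fall under the "default" label.
print(get_calendar(np.array(["2000-01-01", "2000-01-02"], dtype="datetime64[ns]")))
```

Keeping "default" separate from "proleptic_gregorian" lets the caller tell the two storage types apart even though they describe the same calendar rules.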
- Querying the calendar is easy (xclim function). If the data type is numpy's, return "default"; if it is cftime, look at the first non-null value and get its calendar.
- Converting calendars (xclim function): the calendar type of each time element is transformed to the new calendar. Our way is to drop any dates that do not exist in the new calendar (like Feb 29th when going to "noleap"). In the other direction, there is an option to either fill the missing dates with some fill value or simply not include them. It can't be a DataArray method, but could be a Dataset one, or simply a top-level function. Related to "Converting `cftime.datetime` objects to `np.datetime64` values through `astype`" (#5107).
  We also have an `interp_calendar` function that reinterpolates data on a yearly basis. This is a bit narrower, because it only makes sense on daily data (or coarser).
- With the definition of a "default" calendar, `date_range` and `date_range_like` simply choose between `pd.date_range` and `xr.cftime_range` according to the target calendar.
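
To make the drop-or-fill behaviour of the calendar conversion more concrete, here is a minimal sketch using only the standard library. It is an illustration under simplifying assumptions, not the xclim implementation: a plain dict of `datetime.date` keys stands in for a daily time coordinate, only the standard and "noleap" calendars are covered, and all function names are hypothetical.

```python
from datetime import date, timedelta


def daily_range(start, end):
    """Every day from start to end (inclusive) in the standard calendar."""
    n_days = (end - start).days + 1
    return [start + timedelta(days=i) for i in range(n_days)]


def is_leap_day(day):
    """Feb 29th is the only standard-calendar date that "noleap" lacks."""
    return day.month == 2 and day.day == 29


def standard_to_noleap(series):
    """Drop the dates that do not exist in the target calendar."""
    return {day: value for day, value in series.items() if not is_leap_day(day)}


def noleap_to_standard(series, fill_missing=True, fill_value=float("nan")):
    """Either fill the dates absent from the source calendar or leave them out."""
    if not fill_missing or not series:
        return dict(series)
    days = daily_range(min(series), max(series))
    return {day: series.get(day, fill_value) for day in days}


# Five days around the 2020 leap day, with a dummy value per day.
days = daily_range(date(2020, 2, 27), date(2020, 3, 2))
daily = {day: float(i) for i, day in enumerate(days)}

noleap = standard_to_noleap(daily)                     # Feb 29th is dropped
filled = noleap_to_standard(noleap, fill_value=-1.0)   # Feb 29th comes back as -1.0
sparse = noleap_to_standard(noleap, fill_missing=False)  # Feb 29th stays absent
```

A calendar such as "360_day" would need a date type that can hold values like Feb 30th, which is where cftime comes in; the sketch deliberately avoids that case.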
What do you think? I have time to move whatever code makes sense to move.