Skip to content

ENH: col descriptions that'd save in df schemas, helping users avoid creating separate documentation? #42582

Open
@chrisjdixon

Description

@chrisjdixon

Asked at SO: I need to share well described data and want to do this in a modern way that avoids managing bureaucratic documentation no one will read. Fields require some description or note (eg. "values don't include ABC because XYZ") which I'd like to associate to columns that'll be saved with pd.to_<whatever>().

Looks like JSON supports annotations and I'd love to have the option of using them with pandas, but couldn't figure out how to.

Could we please develop functionality to add descriptions in a convenient way (eg. df[col].description = 'string') and have that save in output schemas? And maybe have that be selectable and show with df.info(verbose=True) or similar?

I know documentation is boring but maintaining bureaucratic paperwork is even worse. Also, data documentation is a requirement common to big orgs and schools / unis, and I reckon providing innovative functionality to make boring tasks more enjoyable is an efficient way of getting more people to stop using excel use pandas and newer technology in general, making the world a better place.

Unfortunately I don't understand pandas enough to see how this might be a stupid idea. Is this possible?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions