Skip to content

ENH: add gzip/bz2 compression to relevant read_* methods #15644

Closed
@gfairchild

Description

@gfairchild

This issue is a branch off of #11666, which implemented compression support for read_pickle. There are still a few other read_* methods that could possibly benefit from compression support. Looking at the I/O API reference, this jump out at me:

  • read_json - This can definitely benefit from compression. I've stored very large gzipped JSON files before. As a general rule, any read_* method that supports any kind of plaintext format should support compression.
  • read_stata- I don't use Stata, but it looks like a .dta file is not a plaintext file. Is it naturally compressed, or can they be compressed significantly like pickles?
  • read_sas - I've also never used SAS, and like Stata's .dta files, it looks like .xpt and .sas7bdat files are both some binary format. Can they be compressed well?

Metadata

Metadata

Assignees

No one assigned

    Labels

    IO DataIO issues that don't fit into a more specific labelIO JSONread_json, to_json, json_normalizeIO SASSAS: read_sasIO Stataread_stata, to_stata

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions