Closed
Description
Code Sample, a copy-pastable example if possible
import pandas as pd
df = pd.DataFrame({
'a': [1, 2, 3, 4],
'b': ['foo', 'bar', 'baz', 'qux'],
'c': [5, 6, 7, 8]
})
print(df['a'].values.strides)
store = pd.HDFStore('example.h5')
store['df'] = df
print(store['df']['a'].values.strides)
## -- End pasted text --
(8,)
(16,)
Problem description
I ran across this when doing some benchmarking. This has some rather serious performance implications for large DataFrames. Is this the result of an underlying limitation in HDF5?