PERF: Slow hdf_read on 10+ million row data + My fix

### Pandas version checks

- [X] I have checked that this issue has not already been reported.

- [X] I have confirmed this issue exists on the [latest version](https://pandas.pydata.org/docs/whatsnew/index.html) of pandas.

- [x] I have confirmed this issue exists on the main branch of pandas.


### Reproducible Example

```python
import pandas as pd
from time import time
file = "./example_file.h5"
t1 = time()
df = pd.read_hdf(file, "example_data")
t2 = time()
print(t2-t1)
```


I've noticed that pandas seems to be very slow at reading certain data from a hdf5 file when there are 10s of millions of rows. I frequently work with data that can get into the 100s of millions so this has significantly slowed me down. 
I did some digging into the pytables module and found that there is a concat that occurs on line  3221 of the **BlockManagerFixed** **read()** method as in the image bellow. By default copy is True so I experimented with setting this to False and noticed an instant 10x performance gain. 

I have included a zipped h5 file which you can use to repeat this test. With copy set to False in the concat it loaded for me in less than 2 seconds, otherwise it took over 20 seconds. 

I have tested this on Linux, Windows and Mac and get the same level of performance improvement every time. 

![image](https://user-images.githubusercontent.com/43964549/179116276-f539e072-b3e8-49dc-b9a9-93e848e4c31c.png)
[example_file.zip](https://github.com/pandas-dev/pandas/files/9116521/example_file.zip)



### Installed Versions

<details>

commit           : e8093ba372f9adfe79439d90fe74b0b5b6dea9d6
python           : 3.8.8.final.0
python-bits      : 64
OS               : Linux
OS-release       : 3.10.0-1062.9.1.el7.x86_64
Version          : #1 SMP Fri Dec 6 15:49:49 UTC 2019
machine          : x86_64
processor        : 
byteorder        : little
LC_ALL           : en_US.UTF-8
LANG             : en_US.UTF-8
LOCALE           : en_US.UTF-8

pandas           : 1.4.3
numpy            : 1.21.5
pytz             : 2021.3
dateutil         : 2.8.2
setuptools       : 61.2.0
pip              : 22.1.2
Cython           : None
pytest           : None
hypothesis       : None
sphinx           : None
blosc            : None
feather          : None
xlsxwriter       : None
lxml.etree       : None
html5lib         : None
pymysql          : None
psycopg2         : None
jinja2           : None
IPython          : 8.4.0
pandas_datareader: None
bs4              : None
bottleneck       : 1.3.4
brotli           : None
fastparquet      : None
fsspec           : None
gcsfs            : None
markupsafe       : None
matplotlib       : 3.3.4
numba            : None
numexpr          : 2.8.1
odfpy            : None
openpyxl         : None
pandas_gbq       : None
pyarrow          : None
pyreadstat       : None
pyxlsb           : None
s3fs             : None
scipy            : None
snappy           : None
sqlalchemy       : None
tables           : 3.6.1
tabulate         : None
xarray           : None
xlrd             : None
xlwt             : None
zstandard        : None

</details>


### Prior Performance

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PERF: Slow hdf_read on 10+ million row data + My fix #47726

Pandas version checks

Reproducible Example

Installed Versions

Prior Performance

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

PERF: Slow hdf_read on 10+ million row data + My fix #47726

Description

Pandas version checks

Reproducible Example

Installed Versions

Prior Performance

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions