skip rows after header leads to shifted header #10898

Closed
@dacoex

Here's my file:

 ;;;;
 ;;;;
 ;;;;
 ;;;;
 ;;;;
 ;;;;
;;;;
 ;;;;
 ;;;;
;;;;
date;Param1;Param2;Param4;Param5
    ;m²;°C;m²;A
;;;;
01.01.1990 00:00;1;1;2;3
01.01.1990 01:00;5;3;4;5
01.01.1990 02:00;9;5;6;7
01.01.1990 03:00;13;7;8;9
01.01.1990 04:00;17;9;10;11
01.01.1990 05:00;21;11;12;13

If I read it with the following line:

 pd.read_csv(file, sep=';', header=10, parse_dates=True, skip_blank_lines=True, skiprows=0)

It picks up the header correctly, but then treats the two rows between the header and the data (the units row and the empty row) as data values.

               date Param1 Param2 Param4 Param5
0                       m²     °C     m²      A
1               NaN    NaN    NaN    NaN    NaN
2  01.01.1990 00:00      1      1      2      3
3  01.01.1990 01:00      5      3      4      5
4  01.01.1990 02:00      9      5      6      7

If I use

 pd.read_csv(file, sep=';', header=10, parse_dates=True, skip_blank_lines=True, skiprows=2)

It simply moves the header down by two rows, i.e. it has the same effect as

pd.read_csv(file, sep=';', header=12, parse_dates=True, skip_blank_lines=True, skiprows=0)

I think skiprows should be able to skip rows either before or after the header.

But how do I skip rows only after the header?
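
One possible workaround, offered as a sketch rather than a confirmed fix: skiprows also accepts a list of 0-based file line numbers, so the filler lines before the header and the two lines after it can be listed explicitly, leaving the header as the first line that is kept (header=0). The names raw, header_line and rows_after_header are made up for this illustration, and the in-memory string stands in for the real file. The date column is converted afterwards with pd.to_datetime, which is a choice made here to keep the day-first format explicit.

```python
import io

import pandas as pd

# Stand-in for the file in the report: 10 filler lines, the header,
# a units line, a line of empty fields, then the data rows.
raw = "\n".join(
    [";;;;"] * 10
    + [
        "date;Param1;Param2;Param4;Param5",
        "    ;m²;°C;m²;A",
        ";;;;",
        "01.01.1990 00:00;1;1;2;3",
        "01.01.1990 01:00;5;3;4;5",
        "01.01.1990 02:00;9;5;6;7",
        "01.01.1990 03:00;13;7;8;9",
        "01.01.1990 04:00;17;9;10;11",
        "01.01.1990 05:00;21;11;12;13",
    ]
)

header_line = 10        # 0-based file line holding the column names
rows_after_header = 2   # units line + empty-field line to drop

# Skip everything before the header, plus the lines right after it.
skip = list(range(header_line)) + [
    header_line + 1 + i for i in range(rows_after_header)
]

# The header is now the first line that survives skipping, hence header=0.
df = pd.read_csv(io.StringIO(raw), sep=";", skiprows=skip, header=0)

# Parse the day-first timestamps with an explicit format.
df["date"] = pd.to_datetime(df["date"], format="%d.%m.%Y %H:%M")

print(df)
```

With the real file, io.StringIO(raw) would be replaced by the file path. The drawback is that the position of the header has to be known in advance.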
