Skip to content

Column Selection: Compatibility with sklearn.compose.make_column_selector #239

Closed
@mdancho84

Description

@mdancho84

Hey, awesome package. I'm planning to use it in my python course I'm developing.

Question

Is it possible to apply transformations by data type or with a column selection method similar to sklearn.compose.make_column_selector?

Example

A simple example is to select all numeric columns for scaling. With sklearn I'd do something like this:

column_transformer_scale_numeric = make_column_transformer(
    (StandardScaler(), make_column_selector(dtype_include = np.number)), 
    remainder = 'passthrough'
)

Ideally I'd be able to use the make_column_selector() from sklearn and apply it within the sklearn_pandas.DataFrameMapper().

mapper_scale = DataFrameMapper(
    features  = [
        (make_column_selector(dtype_include = np.number), StandardScaler())
    ]
)

Any thoughts on this? Also let me know if this has been addressed previously and if I just missed searching for it.

Thanks!

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions