SparkLGBSimpleFeatures

class sparklightautoml.pipelines.features.lgb_pipeline.SparkLGBSimpleFeatures

Bases: SparkFeaturesPipeline, SparkTabularDataFeatures

Creates a simple feature pipeline for tree-based models.

Simple, but adequate for feature selection. Numeric features are left as is, datetime features are converted to numeric, and categorical features are label encoded. Input features map to output features exactly one-to-one.
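The per-type handling described above can be illustrated with a minimal, library-free sketch. It is not the sparklightautoml implementation: the helper names (label_encode, datetime_to_numeric, simple_tree_features), the role strings, and the choice of epoch seconds and first-appearance integer codes are all assumptions made for illustration. The actual pipeline runs on Spark DataFrames and may encode values differently.

```python
from datetime import datetime, timezone


def label_encode(values):
    """Assign each distinct category an integer code, in order of first appearance."""
    codes = {}
    return [codes.setdefault(v, len(codes)) for v in values]


def datetime_to_numeric(values):
    """Convert datetimes to seconds since the Unix epoch (an assumed encoding)."""
    return [v.timestamp() for v in values]


def simple_tree_features(columns, roles):
    """Return one output column per input column, transformed by its role."""
    out = {}
    for name, values in columns.items():
        role = roles[name]
        if role == "numeric":
            out[name] = list(values)  # numeric features stay as is
        elif role == "datetime":
            out[name] = datetime_to_numeric(values)
        elif role == "category":
            out[name] = label_encode(values)
        else:
            raise ValueError(f"unsupported role for column {name!r}: {role!r}")
    return out


# Hypothetical toy data: column name -> list of values.
columns = {
    "age": [31, 45, 27],
    "signup": [
        datetime(2021, 1, 1, tzinfo=timezone.utc),
        datetime(2021, 1, 2, tzinfo=timezone.utc),
        datetime(2021, 1, 3, tzinfo=timezone.utc),
    ],
    "city": ["Oslo", "Rome", "Oslo"],
}
roles = {"age": "numeric", "signup": "datetime", "city": "category"}

features = simple_tree_features(columns, roles)
```

Every input column yields exactly one output column under the same name, which is the one-to-one property the class description states.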

create_pipeline(train)

Create the feature pipeline for tree-based models.

Parameters:

train (SparkDataset) – Dataset with train features.

Return type:

Union[SparkUnionTransformer, SparkSequentialTransformer]

Returns:

Composite datetime, categorical, numeric transformer.
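The return type suggests a composite built from union and sequential steps. The sketch below shows that composition pattern in plain Python, under the assumption that a union merges the outputs of independent branches and a sequential chains steps. The functions select_then, union, and sequential are hypothetical stand-ins, not SparkUnionTransformer or SparkSequentialTransformer, and the per-value functions are placeholders for real encoders.

```python
from typing import Callable, Dict, List, Sequence

Columns = Dict[str, List]
Transformer = Callable[[Columns], Columns]


def select_then(names: Sequence[str], fn: Callable) -> Transformer:
    """Build a transformer that keeps only `names` and applies fn to each value."""
    def transform(columns: Columns) -> Columns:
        return {n: [fn(v) for v in columns[n]] for n in names}
    return transform


def sequential(steps: Sequence[Transformer]) -> Transformer:
    """Chain transformers: each step receives the previous step's output."""
    def transform(columns: Columns) -> Columns:
        for step in steps:
            columns = step(columns)
        return columns
    return transform


def union(branches: Sequence[Transformer]) -> Transformer:
    """Run every branch on the same input and merge their output columns."""
    def transform(columns: Columns) -> Columns:
        out: Columns = {}
        for branch in branches:
            out.update(branch(columns))
        return out
    return transform


# One branch per feature type; the category branch is a two-step sequence.
numeric_branch = select_then(["age"], lambda v: v)
category_branch = sequential([
    select_then(["city"], str.strip),  # placeholder cleaning step
    select_then(["city"], str.upper),  # placeholder for a real encoder
])

pipeline = union([numeric_branch, category_branch])
result = pipeline({"age": [31, 45], "city": [" Oslo", "Rome "]})
```

Keeping each feature type in its own branch means a branch only touches the columns it selects, so the merged output has one column per input column.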