mlnext.pipeline.Sorter¶
- class mlnext.pipeline.Sorter(*, columns: Sequence[str], ascending: bool = True, axis: int = 0)[source]¶
Bases:
BaseEstimator,TransformerMixinSorts the dataframe by a list of columns. Wrapper for pd.DataFrame.sort_values.
- Parameters:
columns (T.List[str]) – List of column names to sort by.
ascending (bool) – Whether to sort ascending. Defaults to True.
axis (int) – Axis to sort by.
Example
>>> data = pd.DataFrame({'a': [0, 1], 'b': [1, 0]}) >>> Sorter(columns=['b'], ascending=True).transform(data) pd.DataFrame({'a': [1, 0], 'b': [0, 1]})
Methods
fitFit to data, then transform it.
Get metadata routing of this object.
Get parameters for this estimator.
Set output container.
Set the parameters of this estimator.
Sorts
Xbycolumns.- fit_transform(X, y=None, **fit_params)¶
Fit to data, then transform it.
Fits transformer to X and y with optional parameters fit_params and returns a transformed version of X.
- Parameters:
X (array-like of shape (n_samples, n_features)) – Input samples.
y (array-like of shape (n_samples,) or (n_samples, n_outputs), default=None) – Target values (None for unsupervised transformations).
**fit_params (dict) – Additional fit parameters.
- Returns:
X_new – Transformed array.
- Return type:
ndarray array of shape (n_samples, n_features_new)
- get_metadata_routing()¶
Get metadata routing of this object.
Please check User Guide on how the routing mechanism works.
- Returns:
routing – A
MetadataRequestencapsulating routing information.- Return type:
MetadataRequest
- get_params(deep=True)¶
Get parameters for this estimator.
- Parameters:
deep (bool, default=True) – If True, will return the parameters for this estimator and contained subobjects that are estimators.
- Returns:
params – Parameter names mapped to their values.
- Return type:
dict
- set_output(*, transform=None)¶
Set output container.
See sphx_glr_auto_examples_miscellaneous_plot_set_output.py for an example on how to use the API.
- Parameters:
transform ({"default", "pandas", "polars"}, default=None) –
Configure output of transform and fit_transform.
”default”: Default output format of a transformer
”pandas”: DataFrame output
”polars”: Polars output
None: Transform configuration is unchanged
Added in version 1.4: “polars” option was added.
- Returns:
self – Estimator instance.
- Return type:
estimator instance
- set_params(**params)¶
Set the parameters of this estimator.
The method works on simple estimators as well as on nested objects (such as
Pipeline). The latter have parameters of the form<component>__<parameter>so that it’s possible to update each component of a nested object.- Parameters:
**params (dict) – Estimator parameters.
- Returns:
self – Estimator instance.
- Return type:
estimator instance