Preparation

Creating a HyperTransformer

Use a HyperTransformer to manage all the transformers you're applying to a multi-column dataset.

Create one by importing it from the rdt library. There are no parameters.

from rdt import HyperTransformer
ht = HyperTransformer()

Loading your data

The RDT library uses pandas -- a popular open source library for data manipulation. The HyperTransformer expects your data is a pandas DataFrame object.

There are a variety of ways to load your data into the expected format. The most common case is your dataset being a csv file:

import pandas as pd
customers = pd.read_csv('./datasets/customers.csv')

Refer to the pandas documentation for more information about reading csv files or other types of files.

PreviousHyperTransformer NextConfiguration

Last updated 10 months ago