A transformer is responsible for transforming a single column in a larger dataset. Each transformer uses a specific technique to change the data.
For example, the LabelEncoder is a transformer that converts categories into numbers using a simple labeling scheme (0, 1, 2, ...).
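The labeling scheme can be illustrated with a minimal plain-Python sketch. This is not the RDT implementation: the function names `fit_labels`, `transform`, and `reverse_transform` are hypothetical helpers, and the sketch assumes labels are assigned in order of first appearance.

```python
def fit_labels(values):
    """Assign each distinct category an integer label (0, 1, 2, ...).

    Assumption: labels follow the order in which categories first appear.
    """
    mapping = {}
    for value in values:
        if value not in mapping:
            mapping[value] = len(mapping)
    return mapping


def transform(values, mapping):
    """Replace every category with its integer label."""
    return [mapping[value] for value in values]


def reverse_transform(codes, mapping):
    """Recover the original categories from their integer labels."""
    inverse = {code: category for category, code in mapping.items()}
    return [inverse[code] for code in codes]


column = ["red", "green", "red", "blue"]
mapping = fit_labels(column)           # {'red': 0, 'green': 1, 'blue': 2}
encoded = transform(column, mapping)   # [0, 1, 0, 2]
decoded = reverse_transform(encoded, mapping)
```

Because the mapping is kept after fitting, the encoded numbers can be converted back into the original categories.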
Use this glossary to browse through the different transformers that you can apply to your dataset.
Each transformer is designed to work with a specific semantic data type (sdtype). Use the tabs below to explore transformers by sdtype.
boolean & categorical
Transformers that work on text:
Your choice of transformer may depend on what you want to achieve.
These transformers format the data into numerical values. These are the basic transformers that you'll likely use to clean and prepare your data for data science.
By default, RDT automatically assigns a transformer to each column based on the column's sdtype.
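The idea of a default assignment by sdtype can be sketched as a simple lookup. This is an illustration only: the `DEFAULT_TRANSFORMERS` table, the placeholder transformer names, and the `assign_transformers` helper are all hypothetical and do not reflect the defaults RDT actually uses.

```python
# Hypothetical defaults: the names below are placeholders, not RDT's real choices.
DEFAULT_TRANSFORMERS = {
    "numerical": "PlaceholderNumericalTransformer",
    "categorical": "PlaceholderCategoricalTransformer",
    "boolean": "PlaceholderBooleanTransformer",
    "datetime": "PlaceholderDatetimeTransformer",
}


def assign_transformers(column_sdtypes, overrides=None):
    """Pick a transformer name per column from its sdtype.

    An explicit entry in `overrides` wins over the default for that column.
    """
    overrides = overrides or {}
    config = {}
    for column, sdtype in column_sdtypes.items():
        config[column] = overrides.get(column, DEFAULT_TRANSFORMERS[sdtype])
    return config


sdtypes = {"age": "numerical", "city": "categorical", "active": "boolean"}

# Accept the defaults everywhere except "city", which is overridden by hand.
config = assign_transformers(sdtypes, overrides={"city": "LabelEncoder"})
```

The override argument mirrors the point that a default is only a starting choice: a column can be given a different transformer when the goal calls for it.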