Categorical
Last updated
Last updated
Categorical columns contain distinct categories. The defining aspect of a categorical column is that there is a set number of predefined categories.
The categories may be ordered or unordered. Ordered categories are known as ordinal while unordered categories are known as nominal.
Encode categories as numerical labels.
Encode categories as numerical labels using a pre-determined order.
Encode categories as a smooth distribution. Useful for imbalanced values.
Encode categories as a smooth distribution using a pre-determined order. Useful for imbalanced values.
Encode categories into a multi-modal distribution by using a frequency-based analysis.
Encode categories into multiple, binary columns using one hot encoding.
Encode boolean data as binary 0/1
labels.
"VISA"
, "AMEX"
or "DISCOVER"
. This is nominal because the categories don't have any order.