FloatFormatter
Compatibility: numerical data
The FloatFormatter transforms numerical data. By default, it leaves the data unchanged because numerical data is already usable for data science. Optionally, it can handle missing values and learn rounding schemes and min/max bounds.
Parameters
missing_value_replacement: Add this argument to replace missing values during the transform phase.
(default) | Replace missing values with a random value. The value is chosen uniformly at random from the min/max range. |
| Replace all missing values with the average value. |
| Replace all missing values with the most frequently occurring value. |
| Replace all missing values with the specified number. |
| Deprecated. Do not replace missing values. The transformed data will continue to have missing values. |
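The replacement strategies described above can be illustrated with a minimal plain-Python sketch. This is not RDT's implementation: the function name `replace_missing` and the strategy labels `"random"`, `"mean"` and `"mode"` are names chosen for this sketch only, and missing values are represented as `None`.

```python
import random
import statistics


def replace_missing(values, strategy, seed=None):
    """Return a copy of `values` with every None filled in.

    `strategy` is a label used by this sketch only (an assumption):
    "random", "mean", "mode", or a literal number to fill with.
    """
    present = [v for v in values if v is not None]

    if strategy == "random":
        # Draw each replacement uniformly from the observed min/max range.
        rng = random.Random(seed)
        low, high = min(present), max(present)
        return [rng.uniform(low, high) if v is None else v for v in values]

    if strategy == "mean":
        fill = statistics.mean(present)
    elif strategy == "mode":
        fill = statistics.mode(present)
    else:
        # Anything else is treated as the specific number to use.
        fill = strategy

    return [fill if v is None else v for v in values]


ages = [20.0, None, 40.0, 40.0]
print(replace_missing(ages, "mean"))
print(replace_missing(ages, "mode"))
print(replace_missing(ages, -1))
```

The random strategy keeps the filled values inside the range already seen in the column, while the mean, mode and fixed-number strategies fill every gap with the same value.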
(deprecated) model_missing_values: Use the missing_value_generation parameter instead.
missing_value_generation: Add this argument to determine how to recreate missing values during the reverse transform phase.
(default) | Randomly assign missing values in roughly the same proportion as the original data. |
| Create a new column to store whether the value should be missing. Use it to recreate missing values. Note: Adding extra columns uses more memory and increases the RDT processing time. |
| Do not recreate missing values. |
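The two ways of recreating missing values can be contrasted with a short plain-Python sketch. It is only an illustration under stated assumptions, not RDT's code: both function names are hypothetical, missing values are represented as `None`, and the extra column is modeled as a list of booleans.

```python
import random


def restore_missing_randomly(values, missing_fraction, seed=None):
    """Blank out entries at random, at roughly the learned proportion."""
    rng = random.Random(seed)
    return [None if rng.random() < missing_fraction else v for v in values]


def restore_missing_from_flags(values, is_missing_flags):
    """Blank out exactly the entries that the extra flag column marks."""
    return [None if flag else v for v, flag in zip(values, is_missing_flags)]


values = [1.0, 2.0, 3.0, 4.0]

# Random approach: only the overall proportion of missing values is kept.
print(restore_missing_randomly(values, missing_fraction=0.25, seed=42))

# Flag-column approach: the position of each missing value is kept too.
print(restore_missing_from_flags(values, [False, True, False, False]))
```

The flag-column approach preserves which rows were missing at the cost of carrying an additional column, which is the memory and processing trade-off noted above.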
enforce_min_max_values: Add this argument to allow the transformer to learn the min and max allowed values from the data.
(default) | Do not learn any min or max values from the dataset. When reverse transforming the data, the values may be above or below what was originally present. |
| Learn the min and max values from the input data. When reverse transforming the data, any out-of-bounds values will be clipped to the min or max value. |
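The clipping behavior can be sketched in a few lines of plain Python. This is a simplified illustration rather than RDT's implementation, and the helper names `learn_bounds` and `clip_to_bounds` are hypothetical.

```python
def learn_bounds(values):
    """Learn the smallest and largest value seen in the input data."""
    return min(values), max(values)


def clip_to_bounds(values, low, high):
    """Pull any out-of-bounds value back to the nearest learned bound."""
    return [min(max(v, low), high) for v in values]


low, high = learn_bounds([3.0, 10.0, 7.5])

# 1.0 is raised to the learned min and 12.0 is lowered to the learned max.
print(clip_to_bounds([1.0, 5.0, 12.0], low, high))
```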
learn_rounding_scheme: Add this argument to allow the transformer to learn about rounded values in your dataset.
(default) | Do not learn or enforce any rounding scheme. When reverse transforming the data, there may be many decimal places present. |
| Learn the rounding rules from the input data. When reverse transforming the data, round the number of digits to match the original. |
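One way to picture a learned rounding scheme is to find the largest number of decimal places in the input and round reverse-transformed values to that many digits. The sketch below makes that assumption explicit. It is not taken from RDT, the helper names are hypothetical, and it assumes finite values.

```python
from decimal import Decimal


def learn_decimal_places(values):
    """Return the largest number of decimal places among finite values."""
    places = 0
    for v in values:
        # normalize() drops trailing zeros, so 10.0 counts as 0 places.
        exponent = Decimal(str(v)).normalize().as_tuple().exponent
        if exponent < 0:
            places = max(places, -exponent)
    return places


def apply_rounding(values, places):
    """Round every value to the learned number of decimal places."""
    return [round(v, places) for v in values]


places = learn_decimal_places([1.25, 3.5, 10.0])
print(places)
print(apply_rounding([2.34567, 7.0001], places))
```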
computer_representation: Add this argument when the original data has a specific representation, even if it's not loaded that way into Python. The transformer will make sure that any reverse transformed data is compatible with this representation.
(default) | The data is a float. |
| The data is a signed integer represented as an 8, 16, 32 or 64-bit number. |
| The data is an unsigned integer represented as an 8, 16, 32 or 64-bit number. |
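To make "compatible with this representation" concrete, the sketch below computes the value range of a signed or unsigned integer of a given bit width, then rounds and clips values into that range. Rounding followed by clipping is an assumption made for illustration, not a confirmed description of RDT's behavior, and both helper names are hypothetical.

```python
def integer_range(bits, signed=True):
    """Return the (min, max) values an integer of this width can hold."""
    if signed:
        return -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    return 0, 2 ** bits - 1


def fit_to_representation(values, bits, signed=True):
    """Round to whole numbers, then clip into the representable range."""
    low, high = integer_range(bits, signed)
    return [min(max(round(v), low), high) for v in values]


print(integer_range(8))                # signed 8-bit range
print(integer_range(8, signed=False))  # unsigned 8-bit range

# Values outside the unsigned 8-bit range are pulled back inside it.
print(fit_to_representation([300.2, -5.7, 12.4], bits=8, signed=False))
```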
Examples
FAQs