Multiclass Classification
Multiclass Classification metrics calculate the success of using synthetic data to perform an ML classification prediction task. Each metric uses a different ML algorithm for the computation:
MulticlassDecisionTreeClassifier
MulticlassMLPClassifier
Data Compatibility
Boolean: This metric convert boolean columns into 0/1 values. Missing values are replaced by the most commonly occurring value (mode).
Categorical: These metric convert categorical columns into multiple, one hot encoded columns.
Datetime: These metrics convert datetime columns into numerical values using the Unix timestamp. They impute missing values using the mean.
Numerical: These metrics are designed to work with numerical columns. They impute missing values using the mean
This metric should not be used with any other column, such as primary keys or anonymized columns.
Score
(best) 1.0: Given the training data with the provided ML algorithm, you will be able to perform ML tasks with 100% accuracy on the test data
(worst) 0.0: Given the training data with the provided ML algorithm, you will not be able to predict any of the test data correctly
How does it work?
All ML efficacy metrics perform the same steps:
Train the ML algorithm using the training data (usually synthetic data). The output is an ML model that can predict the value of a given target column.
Test the ML model by making predictions on the testing data (usually real data) and comparing against the actual values.
Return the F1 [1] test score.
Usage
Access this metric from the single_table
module and use the compute
method.
Parameters
(required)
test_data
: A pandas.DataFrame containing the full data to test on. This should include the column that you are trying to predict.(required)
train_data
: A pandas.DataFrame containing the full data to train on. This should include the column that you are trying to predict.(required)
target
: A string representing the name of the column that you want to predict. This must be a categorical column.metadata
: A description of the dataset. See Single Table Metadata
FAQs
This is metric is in Beta. Be careful when using the metric and interpreting its score.
The score depends on underlying algorithm used to model the data. If the dataset is not suited for a particular machine learning method, then the predicted values may not be valid.
In a real world scenario, you may spend more effort in building an ML model. These metrics only allow you to select from specific algorithms (Tree, MLP)
References
[1] https://en.wikipedia.org/wiki/F-score
Last updated