Metrics Glossary
Each metric in this section quantifies a different aspect of the synthetic data.
Browse by use case
We recommend applying the metrics using SDMetrics Reports. Reports determine which exact metrics to use based on your data.
These metrics capture basic, statistical properties of synthetic data that we expect to be captured.
Get started using the Diagnostic Report.
Or apply these metrics individually:
BoundaryAdherence, CategoryAdherence: measure the validity of statistical values
KeyUniqueness: measure the validity of primary keys
ReferentialIntegrity, CardinalityBoundaryAdherence: measure the validity of a connection between a foreign and primary key
TableStructure: measure whether the overall structure of the data is the same
MissingValueSimilarity, StatisticSimilarity: compare data using basic statistics
CategoryCoverage, RangeCoverage: measure whether the overall synthetic data spans the possibilities
SequenceLengthSimilarity, StatisticMSAS: compares the quality of real and synthetic data that represents sequential information
Browse by data granularity
Different metrics compare data at various levels of granularity.
These metrics compare single columns of data.
CategoryCoverage, RangeCoverage: measure data coverage
BoundaryAdherence, CategoryAdherence: measure whether the synthetic data is chosen from valid boundaries or lists of options
KSComplement, TVComplement: compare shapes (aka marginal distributions, histograms)
StatisticSimilarity, MissingValueSimilarity: compare data statistics
KeyUniqueness: measures whether the primary keys are unique
These metrics compare single columns that represent sequential information.
SequenceLengthSimilarity: compares the length of sequences between real and synthetic data
More Metrics
Can't find what you're looking for? Check out the Metrics in Beta for additional options.
Last updated