Explore SDV
Last updated
Last updated
SDV is available in Public or Enterprise formats. Use this page to determine which one is right for your project needs.
These synthesizers use AI to learn patterns from your data and use them to recreate synthetic data.
These synthesizers create random test data based on metadata alone. They do not use AI so you do not need to input any training data.
These features make it easy to integrate the SDV into your application and pipeline.
Transformers are used to pre-process your data, which can improve data quality. SDV synthesizers select transformers by default, but you can always customize these to your dataset.
Transformers are used to pre-process your data, which can improve data quality. SDV synthesizers select transformers by default, but you can always customize these to your dataset.
These transformers are geared towards columns that correspond to industry or domain-specific concepts. Their structure may be human-created.
Constraints represent business rules and logic that you can apply to your synthesizer.
Support for custom constraints and additional predefined logic
Evaluate your synthetic data by comparing it against the real data.
statistical AI
, , neural networks
for sequential data
multi-table for limited tables (<5)
multi-table for unlimited tables
multi-table for unlimited tables
single table
multi table
using data CSVs or DataFrames
with a DDL file from an SQL schema
for missing value imputation, numerical columns
and Normalizers statistical transforms
, , and Encoding for discrete variables ( and )
Encoding including datetime format parsing
for numerical outliers
, for keys and IDs
general-purpose anonymization
for general pseudo-anonymization with a mapping
understanding domains
understanding locations
understanding country and area codes
[Coming soon!] understanding geographical areas and distances
Predefined logic for individual columns: , , , ,
Predefined logic for multiple columns: , , ,
Write your own
Advanced, predefined logic:
Access to library vendor-agnostic, open source
basic data validity checks , single and multi-table
statistical similarity, single and multi-table
Privacy Metrics: , ,
1D and 2D bars, scatterplots, heatmaps and more
Use case-specific metrics: ,
Public SDV
Explore Synthetic Data. Train a generative AI with your own, simple datasets as a proof-of-concept. Create synthetic data that has the same patterns.
Publicly available with a Business Source License. Get started today!
SDV Enterprise
Ready for scale? Expand synthetic data solutions in your enterprise. Create generate AIs for more complex datasets.
To learn more about pricing and plans, visit our website.