Synthesizers
Last updated
Last updated
The SDV offers a variety of synthesizers, which use different algorithms to generate synthetic data.
These synthesizers are available in the SDV Community package. They build a generative AI model using your real data, and use it to create synthetic data.
We recommend starting with for fast performance, good quality, and customization.
For higher fidelity, try a neural network-based synthesizer such as or . Modeling and sampling performance may be slower for these synthesizers, especially if you have categorical columns with many different values (high cardinality).
Experimental synthesizer: The combines classical statistics with GAN-based modeling.
Specialty synthesizers are available for special situations. You can also use them to improve speed and performance.
Generate synthetic data from scratch. Use this when you don't have a lot of real data.
Use a classical ML algorithm to learn from real data. This is fast, transparent, and customizable.
Use GAN-based ML algorithm to learn from real data. This may take longer to learn and be harder to debug.
Use a variational autoencoder ML model to learn from real data. This may take longer to learn and be harder to debug.
Use extra features on top of Gaussian Copula for higher quality synthetic data and improved performance.
Use this synthesizer when your real data is highly segmented, with different patterns for each.
*
❖
❖