SDGym
GitHubSlackDataCebo
  • Welcome to SDGym!
  • Installation
  • Benchmarking
    • Running a Benchmark
    • Interpreting Results
  • Customization
    • Synthesizers
      • SDV Synthesizers
      • Basic Synthesizers
      • 3rd Party Synthesizers
      • Custom Synthesizers
    • Datasets
      • Public SDV Datasets
      • Custom Datasets
    • AWS Integration
  • Resources
    • Metadata
Powered by GitBook

© Copyright 2023, DataCebo, Inc.

On this page
  1. Customization

Synthesizers

Last updated 3 months ago

A synthesizer is able to learn information about your real dataset, and use it to create synthetic data that matches the format and mathematical properties. Each synthesizer runs through two steps:

  1. Model: In this stage, the synthesizer uses various methods (typically machine learning) to learn a model from the real data

  2. Sample: Once the model has been learned, the synthesizer can use it to create synthetic data

All synthesizers in SDGym follow these steps. The benchmarking scripts will return several, descriptive statistics about each phase.

Which synthesizers can I use?

The SDGym library includes synthesizers that are ready to use. You can also create a custom synthesizer based on your own methods.

SDV Synthesizers

Use a synthesizer from the open source SDV library.

Basic Synthesizers

Use a basic data generator as a baseline to compare other synthesizers.

3rd Party Synthesizer

Use a synthesizer from an external library for benchmarking.

Your Custom Synthesizer

Define your own, custom technique for creating synthetic data.