SDGym
GitHubSlackDataCebo
  • Welcome to SDGym!
  • Installation
  • Benchmarking
    • Running a Benchmark
    • Interpreting Results
  • Customization
    • Synthesizers
      • SDV Synthesizers
      • Basic Synthesizers
      • 3rd Party Synthesizers
      • Custom Synthesizers
    • Datasets
      • Public SDV Datasets
      • Custom Datasets
    • AWS Integration
  • Resources
    • Metadata
Powered by GitBook

© Copyright 2023, DataCebo, Inc.

On this page
  1. Customization

Datasets

Last updated 11 months ago

To get a good understanding of how a synthesizer performs, we recommend running it on a wide variety of datasets that represent real world scenarios.

Which datasets can I use?

The SDGym library is compatible with any single table dataset that is stored in a zipped, CSV format.

The SDV library offers a variety of demo datasets that are publicly available. Browse these datasets and apply any combination during your benchmarking run.

Provide your own datasets to use for benchmarking. These can be stored on your own computer or on an Amazon S3 bucket.

Public Datasets
Custom Datasets