Datasets

To get a good understanding of how a synthesizer performs, we recommend running it on a wide variety of datasets that represent real-world scenarios.

Which datasets can I use?

The SDGym library is compatible with any single-table dataset that is stored as a zipped CSV file.
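As a rough illustration of the zipped CSV idea, the sketch below packages one small table into a zip archive using only the Python standard library. The helper name `write_zipped_csv`, the folder `my_datasets`, the file names and the sample rows are all hypothetical. This page does not spell out the exact archive layout SDGym expects (for example, whether a metadata file must sit alongside the CSV), so treat this as a starting point, not the library's required format.

```python
import csv
import io
import tempfile
import zipfile
from pathlib import Path


def write_zipped_csv(header, rows, zip_path, csv_name):
    """Write a single table as one CSV file inside a new zip archive.

    Hypothetical helper for illustration; it is not part of SDGym.
    """
    # Build the CSV text in memory first.
    buffer = io.StringIO(newline="")
    writer = csv.writer(buffer)
    writer.writerow(header)
    writer.writerows(rows)

    # Create the target folder if needed, then store the CSV in the archive.
    zip_path = Path(zip_path)
    zip_path.parent.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        archive.writestr(csv_name, buffer.getvalue())
    return zip_path


# Example usage with made-up data, written to a temporary folder.
with tempfile.TemporaryDirectory() as tmp:
    archive_path = write_zipped_csv(
        header=["id", "age"],
        rows=[[1, 34], [2, 58]],
        zip_path=Path(tmp) / "my_datasets" / "patients.zip",
        csv_name="patients.csv",
    )
    with zipfile.ZipFile(archive_path) as archive:
        print(archive.namelist())
```

Building the CSV in memory keeps the sketch short. For large tables, writing the CSV to disk first and adding it with `ZipFile.write` would avoid holding the whole file in memory.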

The SDV library offers a variety of demo datasets that are publicly available. Browse these datasets and apply any combination during your benchmarking run.

You can also provide your own datasets for benchmarking. These can be stored on your own computer or in an Amazon S3 bucket.

© Copyright 2023, DataCebo, Inc.