Synthetic Data Vault
GitHubSlackDataCebo
  • Welcome to the SDV!
  • Tutorials
  • Explore SDV
    • SDV Community
    • SDV Enterprise
      • ⭐Compare Features
    • SDV Bundles
      • ❖ AI Connectors
      • ❖ CAG
      • ❖ Differential Privacy
      • ❖ XSynthesizers
  • Single Table Data
    • Data Preparation
      • Loading Data
      • Creating Metadata
    • Modeling
      • Synthesizers
        • GaussianCopulaSynthesizer
        • CTGANSynthesizer
        • TVAESynthesizer
        • ❖ XGCSynthesizer
        • ❖ BootstrapSynthesizer
        • ❖ SegmentSynthesizer
        • * DayZSynthesizer
        • ❖ DPGCSynthesizer
        • ❖ DPGCFlexSynthesizer
        • CopulaGANSynthesizer
      • Customizations
        • Constraints
        • Preprocessing
    • Sampling
      • Sample Realistic Data
      • Conditional Sampling
    • Evaluation
      • Diagnostic
      • Data Quality
      • Visualization
      • Privacy
        • Empirical Differential Privacy
        • SDMetrics: Privacy Metrics
  • Multi Table Data
    • Data Preparation
      • Loading Data
        • Demo Data
        • CSV
        • Excel
        • ❖ AlloyDB
        • ❖ BigQuery
        • ❖ MSSQL
        • ❖ Oracle
        • ❖ Spanner
      • Cleaning Your Data
      • Creating Metadata
    • Modeling
      • Synthesizers
        • * DayZSynthesizer
        • * IndependentSynthesizer
        • HMASynthesizer
        • * HSASynthesizer
      • Customizations
        • Constraints
        • Preprocessing
      • * Performance Estimates
    • Sampling
    • Evaluation
      • Diagnostic
      • Data Quality
      • Visualization
  • Sequential Data
    • Data Preparation
      • Loading Data
      • Cleaning Your Data
      • Creating Metadata
    • Modeling
      • PARSynthesizer
      • Customizations
    • Sampling
      • Sample Realistic Data
      • Conditional Sampling
    • Evaluation
  • Concepts
    • Metadata
      • Sdtypes
      • Metadata API
      • Metadata JSON
    • Constraint-Augmented Generation (CAG)
      • Predefined Constraints
        • FixedCombinations
        • FixedIncrements
        • Inequality
        • OneHotEncoding
        • Range
        • ❖ CarryOverColumns
        • * ChainedInequality
        • ❖ CompositeKeys
        • ❖ FixedNullCombinations
        • ❖ ForeignToForeignKey
        • ❖ ForeignToPrimaryKeySubset
        • ❖ MixedScales
        • ❖ PrimaryToPrimaryKey
        • ❖ PrimaryToPrimaryKeySubset
        • ❖ ReferenceTable
        • ❖ SelfReferentialHierarchy
        • ❖ UniqueBridgeTable
      • Program Your Own Constraint
      • Constraints API
  • Support
    • Troubleshooting
      • Help with Installation
      • Help with SDV
    • Versioning & Backwards Compatibility Policy
Powered by GitBook

Copyright (c) 2023, DataCebo, Inc.

On this page
  • AI-Based Synthesizers
  • Test Data Synthesizers
  • Data Integrations
  • Pre-Process Statistical Information
  • Understand & Anonymize Real-World Concepts
  • Constraint-Augmented Generation
  • Synthetic Data Evaluation
  1. Explore SDV
  2. SDV Enterprise

Compare Features

PreviousSDV EnterpriseNextSDV Bundles

Last updated 13 days ago

Compare the features available across SDV Community and SDV Enterprise. SDV Enterprise users also have the option of purchasing , which are optional add-on packages for targeted needs.

AI-Based Synthesizers

These synthesizers use AI to learn patterns from your data and use them to recreate synthetic data.

SDV Community
SDV Enterprise

✅

✅

✅

✅

❌

❌

❌

❌

✅

✅

✅

✅

❌

✅

❌

✅

❌

✅

Test Data Synthesizers

These synthesizers create random test data based on metadata alone. They do not use AI so you do not need to input any training data.

SDV Community
SDV Enterprise

❌

✅

❌

✅

Data Integrations

These features make it easy to integrate the SDV into your application and pipeline.

SDV Community
SDV Enterprise

✅

✅

❌

❌

❌

Pre-Process Statistical Information

Transformers are used to pre-process your data, which can improve data quality. SDV synthesizers select transformers by default, but you can always customize these to your dataset.

SDV Community
SDV Enterprise

✅

✅

✅

✅

❌

❌

✅

✅

✅

✅

❌

✅

❌

❌

Understand & Anonymize Real-World Concepts

Transformers are used to pre-process your data, which can improve data quality. SDV synthesizers select transformers by default, but you can always customize these to your dataset.

These transformers are geared towards columns that correspond to industry or domain-specific concepts. Their structure may be human-created.

SDV Community
SDV Enterprise

✅

✅

✅

✅

✅

✅

❌

✅

❌

✅

❌

✅

❌

✅

Constraint-Augmented Generation

Input business rules into your synthesizer using constraints. This ensures high-quality, valid synthetic data, 100% of the time.

SDV Community
SDV Enterprise

✅

✅

✅

✅

❌

✅

❌

❌

✅

✅

Support for programming your constraint and additional predefined logic

❌

✅

Synthetic Data Evaluation

Evaluate your synthetic data by comparing it against the real data.

Public SDV
SDV Enterprise

✅

✅

✅

✅

✅

✅

✅

✅

❌

✅

✅

❌

✅

statistical AI

, , neural networks

advanced Copula modeling with flexible shapes, faster runtime and more

💠

for separately modeling highly segmented data

💠

for modeling data with a few rows

💠

, and Synthesizers for creating synthetic data with differential privacy guarantees

💠

for sequential data

multi-table for limited tables (<5)

multi-table for unlimited tables

multi-table for unlimited tables

for multi-table synthesizers with various dataset sizes

single table

multi table

using data CSVs or DataFrames

based on your database

💠

Directly connect to a database for and creating metadata

💠

Connect to a database for

💠

for missing value imputation, numerical columns

and statistical transforms

with support for 100+ statistical distributions

💠

to normalize any distribution with high fidelity

💠

, , and Encoding for discrete variables ( and )

Encoding including datetime format parsing

for numerical outliers

, , , for adding noise to a column to guarantee differential privacy

💠

, for normalizing a column while guaranteeing differential privacy

💠

, for keys and IDs

general-purpose anonymization

for general pseudo-anonymization with a mapping

understanding domains

understanding locations

understanding country and area codes

understanding geographical areas and distances

Predefined logic for individual tables: , , , ,

for single tables

Advanced, predefined logic for individual tables:

Advanced predefined logic for individual tables: ,

💠

Advanced, predefined logic for multi-table tables: , , , , and .

💠

for multi-table

Access to library vendor-agnostic, open source

basic data validity checks , single and multi-table

statistical similarity, single and multi-table

Measure the privacy of your data: and

of any synthesizer algorithm

💠

1D and 2D bars, scatterplots, heatmaps and more

Use case-specific metrics: ,

⭐
SDV Bundles
GaussianCopula
CTGAN
TVAE
CopulaGAN
XGC
XSynthesizers bundle
SegmentSynthesizer
XSynthesizers bundle
BootstrapSynthesizer
XSynthesizers bundle
DPGC
DPGCFlex
Differential Privacy bundle
PAR
HMA
HSA
Independent
Performance estimates
DayZSynthesizer
DayZSynthesizer
Auto-detect metadata
Auto-detect metadata
AI Connectors bundle
importing real data
AI Connectors bundle
exporting synthetic data
AI Connectors bundle
FloatFormatter
ClusterBasedNormalizer
GaussianNormalizer
XGaussianNormalizer
XSynthesizers bundle
ECDFNormalizer
XSynthesizers bundle
Uniform
Label
OneHot
Datetime
OutlierEncoder
DPLaplaceNoiser
DPTimestampLaplaceNoiser
DPResponseRandomizer
DPWeightedResponseRandomizer
Differential Privacy bundle
DPECDFNormalizer
DPDiscreteECDFNormalizer
Differential Privacy bundle
RegexGenerator
IDGenerator
AnonymizedFaker
PsuedoAnonymizedFaker
Emails
Addresses
Phone Numbers
GPS Coordinates
FixedIncrements
FixedCombinations
Inequality
OneHotEncoding
Range
Program your own constraint
ChainedInequality
FixedNullCombinations
MixedScales
CAG bundle
CarryOverColumns
CompositeKey
ForeignToPrimaryKeySubset
UniqueBridgeTable
more
CAG bundle
Program your own constraint
SDMetrics
Diagnostic Report
Quality Report
DisclosureProtection
DisclosureProtectionEstimate
Verify the differential privacy
Differential Privacy bundle
Visualization
OutlierCoverage
SmoothnessSimilarity