❖ FixedNullCombinations
Do you want to apply this constraint to PII data? As part of its algorithm, this constraint learns and stores some parameters in the synthesizer. While you can always share your synthetic data, we recommend caution when sharing the synthesizer itself, because it contains some of these learned parameters.
The FixedNullCombinations constraint enforces that the combinations of null values across a set of columns are fixed. That is, the only null patterns allowed across these columns are the ones already observed in the data; null values are not permuted or shuffled into any other arrangement.
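To make the idea of a null combination concrete, here is a minimal plain-Python sketch. It is not part of the SDV and does not show how the constraint is implemented; the helper names `is_null` and `null_patterns` and the sample rows are hypothetical, chosen only for illustration.

```python
import math


def is_null(value):
    """Treat None and float NaN as null (an assumption made for this sketch)."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def null_patterns(rows, column_names):
    """Return the set of null patterns observed across the given columns.

    Each pattern is a tuple of booleans, True where that column is null.
    """
    return {tuple(is_null(row[c]) for c in column_names) for row in rows}


# Hypothetical real data: three rows, three distinct null patterns.
real_rows = [
    {'city': 'Paris', 'country': 'France'},  # neither column is null
    {'city': None, 'country': None},         # both columns are null
    {'city': None, 'country': 'Spain'},      # only city is null
]
columns = ['city', 'country']
observed = null_patterns(real_rows, columns)

# A row with a city but no country was never observed in the real data,
# so it falls outside the fixed set of null combinations.
candidate = {'city': 'Lima', 'country': None}
candidate_pattern = tuple(is_null(candidate[c]) for c in columns)
is_allowed = candidate_pattern in observed
```

In this example the pattern "city present, country null" never occurs in the real rows, so `is_allowed` is `False`. The other three patterns are the only ones the constraint would permit.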
Constraint API
Create a FixedNullCombinations constraint.
Parameters:
(required) column_names: A list of two or more columns whose combinations are fixed when it comes to null values. The SDV will not further shuffle the null values between these column names.
table_name: A string with the name of the table to apply this to. Required if you have a multi-table dataset.
from sdv.cag import FixedNullCombinations
my_constraint = FixedNullCombinations(
column_names=['city', 'country']
)
Usage
Apply the constraint to any SDV synthesizer. Then fit and sample as usual.
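from sdv.single_table import GaussianCopulaSynthesizer

# metadata and data are your own metadata object and training data
synthesizer = GaussianCopulaSynthesizer(metadata)
synthesizer.add_constraints([my_constraint])
synthesizer.fit(data)
synthetic_data = synthesizer.sample(num_rows=100)  # example row count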
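After sampling, you may want to confirm that the synthetic data only uses null combinations seen in the real data. The following is a rough sketch using pandas, not an SDV feature: the helper `unseen_null_patterns` is a hypothetical name, and the small DataFrames stand in for your own `data` and `synthetic_data`.

```python
import pandas as pd


def unseen_null_patterns(real, synthetic, column_names):
    """Return null patterns found in synthetic but never in real.

    Each pattern is a tuple of booleans, True where that column is null.
    An empty set means every synthetic pattern was observed in the real data.
    """
    def patterns(frame):
        # isna() marks nulls; tolist() converts the rows to plain Python bools
        return set(map(tuple, frame[column_names].isna().to_numpy().tolist()))

    return patterns(synthetic) - patterns(real)


# Hypothetical stand-ins for the real and synthetic tables.
real = pd.DataFrame({
    'city': ['Paris', None, None],
    'country': ['France', None, 'Spain'],
})
good_synthetic = pd.DataFrame({
    'city': [None, 'Rome'],
    'country': ['Italy', 'Italy'],
})
bad_synthetic = pd.DataFrame({
    'city': ['Lima'],
    'country': [None],
})

no_violations = unseen_null_patterns(real, good_synthetic, ['city', 'country'])
violations = unseen_null_patterns(real, bad_synthetic, ['city', 'country'])
```

With your own tables, the call would be `unseen_null_patterns(data, synthetic_data, ['city', 'country'])`, and an empty result is what you would expect when the constraint is applied.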
For more information about using predefined constraints, please see the Constraint-Augmented Generation tutorial.