Visualization
Last updated
Last updated
Use these functions to visualize your actual data in 1 or 2-dimensional space. This can help you see what kind of patterns the synthetic data has learned, and identify differences between the real and synthetic data.
Use this function to visualize a real column against the same synthetic column. You can plot any column of type: boolean
, categorical
, datetime
or numerical
.
Parameters
(required) column_name
: The name of the column you want to plot
plot_type
: The type of plot to create
(default) None
: Determine an appropriate plot type based on your data type, as specified in the metadata.
'bar'
: Plot the data as distinct bar graphs
'displot'
: Plot the data as a smooth, continuous curves
Use this utility to visualize the trends between a pair of columns for real and synthetic data. You can plot any 2 columns of type: boolean
, categorical
, datetime
or numerical
. The columns do not have to the be the same type.
Parameters
(required) column_names
: A list with the names of the 2 columns you want to plot
plot_type
: The type of plot to create
(default) None
: Determine an appropriate plot type based on your data type, as specified in the metadata.
'box'
: Create a box plot showing the quartiles, broken down by different attributes
'heatmap'
: Create a side-by-side headmap showing the frequency of each pair of values
'scatter'
: Create a scatter plot that plots each point on a 2D axis
sample_size
: The number of data points to plot
(default) None
: Plot all the data points
<integer>
: Subsample rows from both the real and synthetic data before plotting. Use this if you have a lot of data points.
(required) real_data
: A object containing the table of your real data. To skip plotting the real data, input None
.
(required) synthetic_data
: A object containing the synthetic data. To skip plotting the synthetic data, input None
.
(required) metadata
: A object that describes the columns
Output A object that plots the distribution. This will change based on the sdtype.
(required) real_data
: A object containing the table of your real data. To skip plotting the real data, input None
.
(required) synthetic_data
: A object containing the synthetic data. To skip plotting the synthetic data, input None
.
(required) metadata
: A object that describes the columns
Output A object that plots the 2D distribution. This will change based on the sdtype.