Loading Data
Load your data into Python to use it for SDV modeling. SDV supports many different types of data formats for import and export.
Don't have any data yet? The SDV library contains many different demo datasets that you can use to get started. To learn more, see the SDV Demo Data page.
Local Data
If your data is already available as local files (on your own machine), load them into SDV using the functions below.
* Connect to a database
*SDV Enterprise Feature. This feature is only available for licensed, enterprise users. To learn more about the SDV Enterprise features and purchasing a license, visit our website.
If your data is available in a database, you create a connection to import some data for SDV. Later you can use the same functionality to export synthetic data.
* BigQuery
Connect to a BigQuery database and import a subset of data.
* MSSQL
Connect to a Microsoft SQL server, which you may be using with Azure.
Coming soon!
More database connectors are coming soon.
Do you have data in other formats?
The SDV uses the pandas library for data manipulation and synthesizing. If your data is in any other format, load it in as a pandas.DataFrame object to use in the SDV. For multi table data, make sure you format your data as a dictionary, mapping each table name to a different DataFrame object.
Pandas offers many methods to load in different types of data. For example: SQL table or JSON string.
For more options, see the pandas reference.
Last updated