> For the complete documentation index, see [llms.txt](https://docs.sdv.dev/sdv/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.sdv.dev/sdv/integration/overview.md).

# Overview

In order to use SDV, you'll need to start with two inputs loaded into Python:

* **Data**, which is the training data for SDV. SDV learns statistical patterns from your training data and aim to replicate them when it creates synthetic data. SDV supports single-table, multi-table, and sequential data. For more details, see the [Data Modalities](/sdv/integration/overview/data-modalities.md) doc.
* **Metadata**, which is a description of the dataset you want to synthesize. Metadata includes the names of tables, columns, data types in each of the columns, and relationships between tables. Metadata is represented as a Python dictionary.

Once you have your data and metadata, you'll be able to use SDV's synthesizers to create synthetic data.

## Start Your Integration

For the best experience, we recommend connecting your database to import data and metadata. Alternatively if you already have data locally (eg. in CSV files), you can load it into Python.

### ❖ Connect Your Database

To start your data integration, we recommend **Connecting Your Database** to SDV.&#x20;

{% stepper %}
{% step %}

#### Connect Your Database

Use the **❖ AI Connectors** feature to create a connection from SDV to your database.
{% endstep %}

{% step %}

#### Create Metadata

SDV automatically creates metadata based on your database schema. You can view and edit the metadata.
{% endstep %}

{% step %}

#### Import Data

SDV automatically imports training data by selecting a subset of your data from your database. The imported subset is ready-to-use with SDV.
{% endstep %}
{% endstepper %}

For more information see the [Connect Your Database](/sdv/integration/db.md) docs.

{% hint style="info" %}
❖ **SDV Enterprise Bundle**. This feature is available as part of the **AI Connectors Bundle**, an optional add-on to SDV Enterprise. For more information, please visit the [AI Connectors Bundle](/sdv/explore/sdv-bundles/ai-connectors.md) page.
{% endhint %}

### Import Local Data

Alternatively, you can **load in data from local files** such as CSVs.

{% stepper %}
{% step %}

#### Load Your Data

Load your data directly into Python using SDV's local data loading utilities. We recommend starting with CSV or Excel files.
{% endstep %}

{% step %}

#### Auto-Detect Metadata

SDV is able to auto-detect metadata based on your data. We recommending reviewing it for accuracy. You can view and edit the metadata.
{% endstep %}

{% step %}

#### Clean Your Data

In some cases, may need to further clean your data. For example, you may need to drop any unknown references to ensure that your dataset has referential integrity. Or you may want to subsample it for performance.
{% endstep %}
{% endstepper %}

&#x20;For more information, see the [Import Local Data](/sdv/integration/local.md) docs.

## FAQ

<details>

<summary>Do you have any example data that I can use for testing?</summary>

Yes, the SDV library contains many different demo datasets that you can use to get started. You'll be able to download the data and metadata for each of the demo datasets.

To learn more, see the [SDV Demo Data](/sdv/integration/utils/demo.md) docs.

</details>

<details>

<summary>If I don't have any data, can I generate synthetic data from scratch?</summary>

Yes, we have a specialized synthesizer called DayZSynthesizer that is able to generate data from scratch. You only have to provide metadata to describe the structure of the data you want to generate. The DayZSynthesizer is available in [SDV Enterprise](/sdv/explore/sdv-enterprise.md) for [single-table](/sdv/modeling/single-table-synthesizers/dayzsynthesizer.md) and [multi-table](/sdv/modeling/multi-table-synthesizers/dayzsynthesizer.md) datasets.

Note that SDV has been primarily designed to learn from training data and incorporate those patterns in the synthetic data. With the DayZSynthesizer, you'll receive data that is randomly generated without any specific patterns or correlations.

</details>


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.sdv.dev/sdv/integration/overview.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
