LogoLogo
GitHubSlackDataCebo
  • RDT: Reversible Data Transforms
  • Getting Started
    • Installation
    • Quickstart
  • Usage
    • Basic Concepts
    • HyperTransformer
      • Preparation
      • Configuration
      • Transformation
  • Transformers Glossary
    • Numerical
      • ClusterBasedNormalizer
      • FloatFormatter
      • GaussianNormalizer
      • LogScaler
      • LogitScaler
      • * OutlierEncoder
      • ❖ DPECDFNormalizer
      • ❖ DPLaplaceNoiser
      • ❖ ECDFNormalizer
      • ❖ XGaussianNormalizer
    • Categorical
      • LabelEncoder
      • OrderedLabelEncoder
      • FrequencyEncoder
      • OneHotEncoder
      • OrderedUniformEncoder
      • UniformEncoder
      • BinaryEncoder
      • ❖ DPDiscreteECDFNormalizer
      • ❖ DPResponseRandomizer
      • ❖ DPWeightedResponseRandomizer
    • Datetime
      • OptimizedTimestampEncoder
      • UnixTimestampEncoder
      • ❖ DPTimestampLaplaceNoiser
    • ID
      • AnonymizedFaker
      • IndexGenerator
      • RegexGenerator
      • Treat IDs as categorical labels
    • Generic PII Anonymization
      • AnonymizedFaker
      • PseudoAnonymizedFaker
    • * Deep Data Understanding
      • * Address
        • * RandomLocationGenerator
        • * RegionalAnonymizer
      • * Email
        • * DomainBasedAnonymizer
        • * DomainBasedMapper
        • * DomainExtractor
      • * GPS Coordinates
        • * RandomLocationGenerator
        • * GPSNoiser
        • * MetroAreaAnonymizer
      • * Phone Number
        • * AnonymizedGeoExtractor
        • * NewNumberMapper
        • * GeoExtractor
  • Resources
    • Use Cases
      • Contextual Anonymization
      • Differential Privacy
      • Statistical Preprocessing
    • For Businesses
    • For Developers
Powered by GitBook
On this page
  • Key Features
  • Supported sdtypes
  • Browse Transformers
  1. Transformers Glossary
  2. * Deep Data Understanding

* Address

Previous* Deep Data UnderstandingNext* RandomLocationGenerator

Last updated 6 months ago

Address data represents physical locations around the world, for example an office or personal mailing address. Addresses are usually structured, requiring several columns.

Addresses are also a type of PII data. A key consideration is that you do not want the precise address locations of your dataset to leak such as the exact street address and building number. However, you may want to create realistic, broader regions such as the city, postal code or country.

Key Features

Supported sdtypes

An address comprises of multiple columns, each with specific concept such as a city.

Below are the supported sdtypes. You can supply one or more of these in combination to form an address.

sdtype
Definition

country_code

administrative_unit, state

The full name of the broader regions in a country. In some countries, these are called states. The general name of the term is administrative unit. For example "Massachusetts" or "California".

state_abbr

The shortened abbreviation for the administrative unit. These are determined by the country. For example Massachusetts is "MA" and California is "CA".

city

The full name of the city, for example "Boston" or "San Francisco".

postcode

The internationally-recognized, 5-digit postal service code such as "02116" or "94103".

street_address

The building number and street such as "229 Berkeley St" or "77 Massachusetts Ave".

secondary_address

An optional, second line of the address that further specifies the unit or apartment number. For example "Apt #4" or "Building 204".

Browse Transformers

Create random, realistic regions from anywhere in the world. When you do this, the combinations of all your address columns will make sense when put together. For example a city, state, country combination such as Boston, Massachusetts, USA.

You can also use the the technique on addresses. Preserve the real regions from your data while anonymizing precise locations only, such as the street address and building number.

A 2-character ISO country code such as "US" or "BE". For a full list of ISO country codes, view the.

🌎
🔓
Contextual Anonymization
ISO guide

Create anonymize columns without considering the context of the overall address.

Create a realistic combination of address columns from random, worldwide locations.

Anonymize precise addresses while preserving the overall regional information.

*

*

AnonymizedFaker
RandomLocationGenerator
RegionalAnonymizer

*SDV Enterprise Feature. This feature is available to our licensed users and is not currently in our public library. For more information, visit our page to .

Explore SDV