Numerical


Last updated 12 days ago

Numerical columns contain numbers. The defining aspect of numerical data is that the numbers have an order and that you can apply a variety of mathematical computations to them (average, sum, etc.). The actual values may follow a specific format, such as being rounded to 2 decimal digits and staying within min/max bounds.

Data that is stored as integers is not necessarily numerical. For example, HTTP response codes such as 200 and 404 are stored as integers, but they are distinct categories: they cannot be meaningfully combined or averaged.
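To make the distinction concrete, here is a minimal sketch in plain Python (it does not use RDT, and the sample values are made up for illustration). It contrasts averaging a list of HTTP response codes with counting them as categories.

```python
from collections import Counter
from statistics import mean

# Illustrative sample of HTTP response codes (made-up data).
status_codes = [200, 200, 404, 200, 500, 404]

# Averaging treats the codes as quantities. The result is a number,
# but it does not correspond to any response that actually occurred.
average_code = mean(status_codes)

# Counting treats each code as a distinct category, which matches
# what the values actually represent.
code_counts = Counter(status_codes)

print("average:", average_code)
print("counts:", code_counts.most_common())
```

The average is not one of the observed codes, which is a sign that the column is better treated as categorical even though it is stored as integers.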

General Numerical Transformers

These transformers can format, analyze, and reshape your numerical data into simpler shapes for data science.

  • ClusterBasedNormalizer: Use Gaussian Mixture Models to cluster and normalize the data.
  • GaussianNormalizer: Use a probability integral transform to normalize the data. Choose from several distributions.
  • FloatFormatter: A general transformer that imputes missing values.
  • * OutlierEncoder: Detect and encode outliers for more accurate data science.
  • ❖ XGaussianNormalizer: Use a probability integral transform to normalize the data. Choose from hundreds of distributions.
  • ❖ ECDFNormalizer: Normalize the data directly instead of using a predefined shape.

Differential Privacy Transformers

These transformers use differential privacy techniques to add noise to or reshape your column of numerical data. As a result, your column, and any statistics about it, can be shared with differential privacy guarantees.

  • ❖ DPECDFNormalizer: Normalize the data by computing its CDF and adding noise.
  • ❖ DPLaplaceNoiser: Privatize the data by adding Laplacian noise.
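The idea of privatizing a numerical column by adding Laplacian noise can be illustrated with the textbook Laplace mechanism. The sketch below is plain Python and is not the DPLaplaceNoiser implementation. The function names `laplace_noise` and `privatize`, the parameters `lower`, `upper` and `epsilon`, and the choice of `upper - lower` as the sensitivity are all assumptions made for this example.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two independent exponential draws, each with
    # mean `scale`, follows a Laplace distribution centered at 0.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def privatize(values, lower, upper, epsilon, seed=None):
    """Clip each value to [lower, upper], then add Laplace noise.

    Assumption for this sketch: each value is noised independently and
    the sensitivity is the width of the allowed range.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    if upper <= lower:
        raise ValueError("upper must be greater than lower")

    rng = random.Random(seed)
    sensitivity = upper - lower
    scale = sensitivity / epsilon  # smaller epsilon means more noise

    noisy = []
    for value in values:
        clipped = min(max(value, lower), upper)
        noisy.append(clipped + laplace_noise(scale, rng))
    return noisy


ages = [23, 35, 47, 61]
print(privatize(ages, lower=18, upper=90, epsilon=1.0, seed=0))
```

A smaller `epsilon` gives stronger privacy at the cost of noisier values. Clipping to known bounds before adding noise is what keeps the sensitivity finite in this sketch.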
As an example of formatted numerical values, you might store product purchase amounts (USD) with 2 decimal digits, or store the ages of your customers as whole numbers that must be 18 or above.
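A format like this can be thought of as two learned properties: the number of decimal digits and the min/max bounds. The following is a minimal sketch in plain Python, assuming a column with no missing values. The helper names `learn_format` and `apply_format` are hypothetical and are not part of the RDT API.

```python
from decimal import Decimal


def learn_format(values):
    """Learn the decimal digits and min/max bounds seen in `values`.

    Assumes `values` is non-empty and contains no missing values.
    """
    digits = max(
        # The exponent of the decimal representation is negative when
        # there are digits after the decimal point.
        max(-Decimal(str(value)).as_tuple().exponent, 0)
        for value in values
    )
    return {"digits": digits, "min": min(values), "max": max(values)}


def apply_format(value, fmt):
    """Clip a value to the learned bounds, then round it."""
    clipped = min(max(value, fmt["min"]), fmt["max"])
    return round(clipped, fmt["digits"])


# Illustrative purchase amounts in USD (made-up data).
purchases = [9.99, 120.5, 35.0]
purchase_format = learn_format(purchases)

print(purchase_format)
print(apply_format(57.123456, purchase_format))
print(apply_format(500.0, purchase_format))
```

Applying the learned format to new values keeps them consistent with the original column: no extra decimal digits, and nothing outside the observed range.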

* SDV Enterprise Feature. This feature is available to our licensed users and is not currently in our public library. For more information, visit our page to Explore SDV.

❖ SDV Enterprise Bundle. This feature is available as part of the Differential Privacy Bundle, an optional add-on to SDV Enterprise. For more information, please visit the Differential Privacy Bundle page. Coming soon!

❖ SDV Enterprise bundle. This feature is available for purchase as an SDV Enterprise bundle. For more information, visit our page to Explore SDV.