Data Generation and Test Data Management Within the One Tool

Synthetic yet Privacy-Preserving Data Generation

Fault-Tolerant Data Replication and Advanced Sub-setting

Plain Integrations with Popular Databases

Synthetic yet Privacy-Preserving Data Generation

Replace Your Real Data with Anonymized

TDspora is a privacy-preserving solution that replaces real data with semantically similar, yet anonymous data. The algorithm uses state-of-the-art differential privacy techniques to ensure the privacy of sensitive information while still providing meaningful insights. Differential privacy is widely used by government agencies and large enterprises, including FAANG companies, to safely release datasets to the public.

Choose Size of Data Set

The choice of the size of the dataset is decided based on balancing privacy and utility. Smaller datasets prioritize privacy, larger datasets prioritize utility with proper privacy measures. You can choose it by considering privacy budget, sensitivity of data and project goals.

Preserve Secured Perimeter Premises

TDspora users do not have access to production data and it remains within a secure perimeter, ensuring that sensitive information is protected.

Privacy-Preserving Data Generation

How does Your Team Benefit From the Generated Data?

Testing Team

Run an automated test suite against data that is as close to production as possible while preserving their ability to inspect the system in case of errors

Data Migration and Data Integration Teams

Get data with glitches and inconsistencies, including broken table relationships to test data validation rules and error handling routines

Development Team

Distribute data that mirrors production to individual developers to ensure that new features are developed with the existing data in mind

Performance Testing Team

Produce a vast amount of data that repeats several behavioral patterns, but with slight variations

Product Team

Get instant social portraits and behavioral patterns of product users by addressing the statistical properties of the dataset without the risk of exposing any user’s personal information

Data Scientists

Who use scarce datasets can improve performance of classification and regression models by generation of more frequent case representation and using adjustable noise levels

Fault-Tolerant Data Replication and Advanced Sub-setting

Produce Representative Samples

Training of data generation models requires representative samples extracted from the highly repetitive production data. TDspora provides an advanced sub-setting algorithm that walks through relationships between tables and extracts dependent business entities.

Design Data Set

You can tailor the representative sample and make the replica as simple as a random sample or as complex as a subset of tables with filters and relationships:

Choose columns, tables, and relationships to send to the target database;

Create tables and relationships in the target database;

Filter tables in the sub-set;

Manually define relationships.

Ensure Data Delivery

Automatic restarts ensure fault-tolerance in data delivery by preventing failure of the data copy process.

Plain Integrations with Popular Databases

Supported Authentication and Encryption Technologies:

Secure Sockets Layer (SSL) for encryption and authentication

Kerberos

LDAP and LDAPS

Supported Databases

All editions of Oracle starting from version 10g and later;

PostgreSQL version 9.3 and later.

On-demand Database Integrations

Implement a Database integration in less than 4 weeks due to flexible architecture and technologies like Apache Spark, JDBC, and jOOQ implemented in the tool.