Skip to content

Getting Started with YData SDK

The ydata-sdk is a Python package designed to simplify data access, processing, and synthetic data generation within the YData ecosystem. It enables users to manage datasets, run profiling, and generate high-quality synthetic data for analytics, machine learning, and data privacy applications.

This section introduces the SDK’s core capabilities, structured into four key areas:

  • Connectors – Learn how to connect to different data sources, including databases, cloud storage, and local files.
  • Metadata – Explore how to extract, manage, and enhance dataset metadata for improved data quality.
  • Synthetic Data – Generate high-fidelity synthetic datasets for analytics, machine learning, and privacy-preserving applications.
  • Report – Automate and generate reports on data quality, profiling insights, and integrity checks.

Click on any section to dive deeper and start working with ydata-sdk right away.