Skip to content

Metadata

YData's Datasources are entities that represent specific data assets such as databases, tables, files, or other structured formats within the YData ecosystem. They offer a centralized framework for managing, cataloging, and profiling data, enhancing data management and quality.

Benefits

  • Summarized metadata information: YData's Datasources provide comprehensive metadata management, offering detailed information about each datasource, including schema details, descriptions, tags, and data lineage. This metadata helps users understand the structure and context of their data.

  • Data Quality Management: Users can find data quality warnings, validation results, cleansing suggestions, and quality scores. These features help in identifying and addressing data quality issues automatically, ensuring reliable data for analysis and decision-making.

  • Data Profiling: Data profiling tools analyze the content and structure of datasources, providing statistical summaries, detecting patterns, assessing completeness, and evaluating data uniqueness. These insights help in understanding and improving data quality.

  • PII Identification and Management: YData detects and manages Personally Identifiable Information (PII) within datasources. It includes automatic PII detection, masking tools, and compliance reporting to protect sensitive data and ensure regulatory compliance.

The YData SDK provides powerful metadata management capabilities to help you understand and manage your data effectively.

Core Features

Data Understanding

  • Dataset metadata extraction
  • Data quality metrics
  • Data lineage tracking
  • Schema analysis

Enhanced Management

  • Automated metadata collection
  • Version control
  • Quality monitoring
  • Custom metadata fields

Getting Started Examples

For practical examples of using metadata features, check out our Getting Started guides: