What's in your data? Extract schema, statistics and entities from datasets
-
Updated
Jan 15, 2025 - Python
What's in your data? Extract schema, statistics and entities from datasets
DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling, new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring
edaSQL is a python library to bridge the SQL with Exploratory Data Analysis where you can connect to the Database and insert the queries. The query results can be passed to the EDA tool which can give greater insights to the user.
SQL based data profiling & data quality checks, which will help you to perform data profiling & data quality checks on SQL database at table & database level.
Data Cleansing Basics
DATA PROFILING is a process of examining, analyzing, and creating useful summaries of data. The process yields a high-level overview which aids in the discovery of data quality issues, risks, and overall trends.
sales_analysis
customer review on jupyter notebook
Add a description, image, and links to the dataprofiling topic page so that developers can more easily learn about it.
To associate your repository with the dataprofiling topic, visit your repo's landing page and select "manage topics."