ydata-profiling

ydata-profiling primary goal is to provide a one-line Exploratory Data Analysis (EDA) experience in a consistent and fast solution. Like pandas df.describe() function, that is so handy, ydata-profiling delivers an extended analysis of a DataFrame while allowing the data analysis to be exported in different formats such as html and json.

Features

Automatic detection of columns’ data types (Categorical, Numerical, Date, etc.)
A summary of the problems/challenges in the data that you might need to work on (missing data, inaccuracies, skewness, etc.)
Descriptive statistics (mean, median, mode, etc) and informative visualizations such as distribution histograms
Correlations, a detailed analysis of missing data, duplicate rows, and visual support for variables pairwise interaction
Different statistical information relative to time dependent data such as auto-correlation and seasonality, along ACF and PACF plots
Most common categories (uppercase, lowercase, separator), scripts (Latin, Cyrillic) and blocks (ASCII, Cyrilic)

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow ydata-profiling

ydata-profiling Web Site

User Reviews

Be the first to post a review of ydata-profiling!

Additional Project Details

Programming Language

Python

Related Categories

Python Data Quality Tool

Registered

2023-06-12

Similar Business Software

dbt

dbt helps data teams transform raw data into trusted, analysis-ready datasets faster. With dbt, data analysts and data engineers can collaborate on version-controlled SQL models, enforce testing and documentation standards, lean on detailed metadata to troubleshoot and optimize pipelines, and...

See Software
DataHub

DataHub Cloud is an event-driven AI & Data Context Platform that uses active metadata for real-time visibility across your entire data ecosystem. Unlike traditional data catalogs that provide outdated snapshots, DataHub Cloud instantly propagates changes, automatically enforces policies, and...

See Software
D&B Connect

Realize the true potential of your first-party data. D&B Connect is a customizable, self-service master data management solution built to scale. Eliminate data silos across the organization and bring all your data together using the D&B Connect family of products. Benchmark, cleanse, and enrich...

See Software
Semarchy xDM

Use Semarchy unified data platform to experience xDM. Discover, govern, enrich, enlighten and manage data. You can easily transform data into insights with xDM and rapidly deliver data-rich applications with automated master data management. Its business-centric interfaces provide for rapid...

See Software
DataBuck

DataBuck is an AI-powered data validation platform that automates risk detection across dynamic, high-volume, and evolving data environments. DataBuck empowers your teams to: ✅ Enhance trust in analytics and reports, ensuring they are built on accurate and reliable data. ✅ Reduce maintenance...

See Software
BDEX

BDEX’S Omni IQ helps you find more customers identical to your ideal clients using a proprietary extended audience AI technology. The BDEX Identity Graph helps companies identify consumers across all channels. We authenticate over 470 million hashed email-MAID-IP matches linked to 113M...

See Software

Report inappropriate content

ydata-profiling

Create HTML profiling reports from pandas DataFrame objects

Get an email when there's a new version of ydata-profiling

Features

Project Samples

Project Activity

Categories

License

Follow ydata-profiling

User Reviews

Additional Project Details

Programming Language

Related Categories

Registered