ydata-profiling primary goal is to provide a one-line Exploratory Data Analysis (EDA) experience in a consistent and fast solution. Like pandas df.describe() function, that is so handy, ydata-profiling delivers an extended analysis of a DataFrame while allowing the data analysis to be exported in different formats such as html and json.

Features

  • Automatic detection of columns’ data types (Categorical, Numerical, Date, etc.)
  • A summary of the problems/challenges in the data that you might need to work on (missing data, inaccuracies, skewness, etc.)
  • Descriptive statistics (mean, median, mode, etc) and informative visualizations such as distribution histograms
  • Correlations, a detailed analysis of missing data, duplicate rows, and visual support for variables pairwise interaction
  • Different statistical information relative to time dependent data such as auto-correlation and seasonality, along ACF and PACF plots
  • Most common categories (uppercase, lowercase, separator), scripts (Latin, Cyrillic) and blocks (ASCII, Cyrilic)

Project Samples

Project Activity

See All Activity >

Categories

Data Quality

License

MIT License

Follow ydata-profiling

ydata-profiling Web Site

You Might Also Like
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of ydata-profiling!

Additional Project Details

Programming Language

Python

Related Categories

Python Data Quality Tool

Registered

2023-06-12