Open Access Open Access  Restricted Access Subscription Access

Exploratory Data Analysis on Autopilot: Python's Automatic Solutions

I.V. Dwaraka Srihith, A. David Donald, T. Aditya Sai Srinivas, G. Thippanna, P. Vijaya Lakshmi

Abstract


Python has gained immense popularity in the fields of data science and machine learning due to its extensive libraries and efficient coding capabilities, enabling time-saving solutions. This article presents a comprehensive tutorial on Automatic Exploratory Data Analysis (EDA) using Python. By leveraging Python libraries, we can swiftly extract valuable insights and statistical information from datasets, reducing the manual effort involved in data exploration. The article aims to equip readers with the knowledge and tools to efficiently analyze data, revealing hidden patterns and trends, all accomplished through just a few lines of code. By the end of this article, readers will have a clear understanding of how Python's automated EDA techniques can revolutionize the data analysis process, maximizing efficiency and productivity.

Full Text:

PDF

References


Seshadri, R., Chandakkar, P. S., & Mathur, A. (2019). AutoViz: Automatic Visualization of Data. IEEE Transactions on Visualization and Computer Graphics, 26(1), 1018-1028.

Polfliet, J., Van den Bossche, J., & De Wachter, S. (2020). Pandas Profiling: Automatic Exploratory Data Analysis. Journal of Open Source Software, 5(54), 2591.

Watson, A. (2021). AutoEDA: Automatic Exploratory Data Analysis with a Focus on Statistical Testing. Journal of Open Source Software, 6(57), 2892.

Bertrand, F. (2020). Sweetviz: Automated EDA and Visualization. Journal of Open Source Software, 5(55), 2763.

DataPrep - A Python Library for Data Preprocessing. Retrieved from: https://github.com/sfu-db/DataPrep


Refbacks

  • There are currently no refbacks.