You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

1.0 KiB

D03 Piscine AI - Data Science

Author:

Introduction

While working on a dataset it is important to check the distribution of the data. Obviously, for most of humans it is difficult to visualize the data in more than 3 dimensions

Viz is important to understand the data and to show results. We have already seen there are some basinc viz functionalities in Pandas. Now we'll discover two of the most know viz libraries in Python:

  • Pandas viz
  • Matplotlib
  • Plotly

Pandas viz is pratique: rapid plot, relies on Matplotlib. (check matplotlib doc sometimes not all params are detailed in pandas doc) For more elaborate plots Matplotlib is necessary

And finaly Plotly is a interactive plot library.s

Rules

Always a title, legend, ...s

Ressources

s https://matplotlib.org/3.3.3/tutorials/index.html https://towardsdatascience.com/matplotlib-tutorial-learn-basics-of-pythons-powerful-plotting-library-b5d1b8f67596

https://github.com/rougier/matplotlib-tutorial https://jakevdp.github.io/PythonDataScienceHandbook/05.13-kernel-density-estimation.html