Difference between revisions of "Python for Data Science"

From Sinfronteras
Jump to: navigation, search
(Some of the most popular Python Data Science Libraries)
(Replaced content with "~ Migrated")
(Tag: Replaced)
 
Line 1: Line 1:
<br />
+
~ Migrated
==Anaconda==
 
Anaconda is a free and open source distribution of the Python and R programming languages for data science and machine learning related applications.
 
 
 
En otras palabras, Anaconda puede ser visto como un paquete (a distribution) que incluye no solo Python (or R) but many libraries that are used in Data Science, as well as its own virtual environment system. It's an "all-in-one" install that is extremely popular in data science and Machine Learning.
 
 
 
 
 
Anaconda comes with a few IDE:
 
* Jupyter Lab
 
* Jupyter Notebook
 
* Spyder
 
* Qtconsole
 
* and others
 
 
 
 
 
<br />
 
==Jupyter==
 
Jupyter comes with Anaconda. It is a development environment (IDE) where we can write codes; but it also allows us to display images, and write down markdown notes.
 
 
 
 
 
<br />
 
===Share Jupyter Notebook online===
 
* '''GitHub:'''
 
: https://docs.github.com/en/github/managing-files-in-a-repository/working-with-jupyter-notebook-files-on-github
 
: Example: https://github.com/adeloaleman/AmazonLaptopsDashboard/blob/master/DataAnalysis/data_analysis2.ipynb
 
 
 
* '''Nbviewer'''
 
: https://nbviewer.jupyter.org/
 
: Example: https://nbviewer.jupyter.org/github/bokeh/bokeh-notebooks/blob/main/tutorial/06%20-%20Linking%20and%20Interactions.ipynb
 
 
 
 
 
<br />
 
 
 
===Online Jupyter===
 
There are many sites that provides solutions to run your Jupyter Notebook in the cloud: https://www.dataschool.io/cloud-services-for-jupyter-notebook/
 
 
 
For example: https://colab.research.google.com
 
 
 
 
 
<br />
 
 
 
==Popular Python Data Science Libraries==
 
 
 
* NumPy
 
* SciPy
 
* Pandas
 
* Seaborn
 
* SciKit'Learn
 
* MatplotLib
 
* Plotly
 
* PySpartk
 
 
 
 
 
<br />
 
 
 
==[[NumPy and Pandas]]==
 
 
 
 
 
<br />
 
==[[Data Visualization with Python]]==
 
 
 
 
 
<br />
 
 
 
==[[Natural Language Processing]]==
 
 
 
 
 
<br />
 
==[[Plotly Dash]]==
 
 
 
 
 
<br />
 
==Using SQL in Jupyter==
 
Connecting to a database in Jupyter
 
 
 
 
 
https://pypi.org/project/ipython-sql/
 
 
 
https://stackoverflow.com/questions/454854/no-module-named-mysqldb
 
 
 
https://stackoverflow.com/questions/5178292/pip-install-mysql-python-fails-with-environmenterror-mysql-config-not-found
 
 
 
 
 
pip install ipython-sql
 
 
sudo apt install default-libmysqlclient-dev
 
 
pip install mysqlclient
 
 
sudo apt-get install python3-mysqldb
 
 
 
 
 
Luego adding SQL syntax highlighting to Jupyter as describe above in the corrrespoinding source.
 
 
 
 
 
<br />
 

Latest revision as of 11:33, 28 February 2026

~ Migrated