Difference between revisions of "Python for Data Science"

From Sinfronteras
Jump to: navigation, search
(Share Jupyter Notebook online)
(Replaced content with "~ Migrated")
(Tag: Replaced)
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
<br />
+
~ Migrated
==Anaconda==
 
Anaconda is a free and open source distribution of the Python and R programming languages for data science and machine learning related applications.
 
 
 
En otras palabras, Anaconda puede ser visto como un paquete (a distribution) que incluye no solo Python (or R) but many libraries that are used in Data Science, as well as its own virtual environment system. It's an "all-in-one" install that is extremely popular in data science and Machine Learning.
 
 
 
 
 
Anaconda comes with a few IDE:
 
* Jupyter Lab
 
* Jupyter Notebook
 
* Spyder
 
* Qtconsole
 
* and others
 
 
 
 
 
<br />
 
==Jupyter==
 
Jupyter comes with Anaconda. It is a development environment (IDE) where we can write codes; but it also allows us to display images, and write down markdown notes.
 
 
 
 
 
<br />
 
===Share Jupyter Notebook online===
 
* '''GitHub:'''
 
: https://docs.github.com/en/github/managing-files-in-a-repository/working-with-jupyter-notebook-files-on-github
 
: Example: https://github.com/adeloaleman/AmazonLaptopsDashboard/blob/master/DataAnalysis/data_analysis2.ipynb
 
 
 
* '''Nbviewer'''
 
: https://nbviewer.jupyter.org/
 
: Example: https://nbviewer.jupyter.org/github/bokeh/bokeh-notebooks/blob/main/tutorial/06%20-%20Linking%20and%20Interactions.ipynb
 
 
 
 
 
<br />
 
 
 
===Online Jupyter===
 
There are many sites that provides solutions to run your Jupyter Notebook in the cloud: https://www.dataschool.io/cloud-services-for-jupyter-notebook/
 
 
 
For example: https://colab.research.google.com
 
 
 
 
 
<br />
 
 
 
==Some of the most popular Python Data Science Libraries==
 
 
 
* NumPy
 
* SciPy
 
* Pandas
 
* Seaborn
 
* SciKit'Learn
 
* MatplotLib
 
* Plotly
 
* PySpartk
 
 
 
 
 
<br />
 
==[[NumPy and Pandas]]==
 
 
 
 
 
<br />
 
==[[Data Visualization with Python]]==
 
 
 
 
 
<br />
 
 
 
==[[Natural Language Processing]]==
 
 
 
 
 
<br />
 
==[[Plotly Dash]]==
 
 
 
 
 
<br />
 
==Using SQL in Jupyter==
 
Connecting to a database in Jupyter
 
 
 
 
 
https://pypi.org/project/ipython-sql/
 
 
 
https://stackoverflow.com/questions/454854/no-module-named-mysqldb
 
 
 
https://stackoverflow.com/questions/5178292/pip-install-mysql-python-fails-with-environmenterror-mysql-config-not-found
 
 
 
https://docs.kyso.io/guides/sql-interface-within-jupyterlab
 
 
 
https://www.datacamp.com/community/tutorials/sql-interface-within-jupyterlab
 
 
 
https://stackoverflow.com/questions/43641362/adding-syntax-highlighting-to-jupyter-notebook-cell-magic
 
 
 
https://www.sqlshack.com/learn-jupyter-notebooks-for-sql-server/
 
 
 
 
 
Verificar las fuentes above. Creo que lo único que tuve que hacer la última vez que lo instalé fue basado en las 3 primeras sources:
 
 
 
pip install ipython-sql
 
 
sudo apt install default-libmysqlclient-dev
 
 
pip install mysqlclient
 
 
sudo apt-get install python3-mysqldb
 
 
 
 
 
Luego adding SQL syntax highlighting to Jupyter as describe above in the corrrespoinding source.
 
 
 
 
 
<br />
 

Latest revision as of 11:33, 28 February 2026

~ Migrated