Skip to content

WellWang/pythonfordatascience

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 

Repository files navigation

Python for Data Science

Notebooks for Python for R Users: A Data Science Approach

Interfaces

  • Command Line
  • Rodeo
  • IDLE
  • Jupyter
  • Beaker

Cloud

Basic Python

Introductory Python https://nbviewer.jupyter.org/gist/decisionstats/ce2c16ee98abcf328177 Selecting Data in Pandas https://nbviewer.jupyter.org/gist/decisionstats/01fc540363f1081c5358

Data Input

Web Scraping

Databases

  • MySQL
  • MongoDB
  • HDFS
  • Spark

Data Manipulation

Data Exploration

Data Visualization

Regression Modeling

  • Using Statsmodels (Boston Dataset)
  • Using Pandas
  • Using Scikit-learn

Data Mining

  • Decision Trees
  • Association Analysis
  • Clustering Kmeans and Hierarchical
  • Neural Networks
  • ROC Curves for Models

Time Series Forecasting

  • ETS Models
  • Arima Models

Optimizing Code

  • Measuring Code Speed
  • Measuring Code Performance

Text Mining

  • Word Cloud (corpus,stopwords,association,tdm)
  • Sentiment Analysis

Complete Tutorial

Current To Dos

Spatial Data using Python http://sensitivecities.com/so-youd-like-to-make-a-map-using-python-EN.html#.V4EneVgrJ-8 http://nbviewer.jupyter.org/gist/urschrei/74c6223d9f6a5dea4e75 http://spatialdemography.org/essential-python-geospatial-libraries/

New Version of Python ggplot

http://ggplot.yhathq.com/

#Datasets http://www.gunviolencearchive.org/ Washington Post https://github.com/washingtonpost/data-police-shootings

About

Python for Data Science

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors