Skip to content

anshulkgupta93/pythonfordatascience

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

Python for Data Science

Notebooks for Python for R Users: A Data Science Approach

Interfaces

  • Command Line
  • Rodeo
  • IDLE
  • Jupyter
  • Beaker

Cloud

Basic Python

Introductory Python https://nbviewer.jupyter.org/gist/decisionstats/ce2c16ee98abcf328177 Selecting Data in Pandas https://nbviewer.jupyter.org/gist/decisionstats/01fc540363f1081c5358

Data Input

Web Scraping

Databases

  • MySQL
  • MongoDB
  • HDFS
  • Spark

Data Manipulation

Data Exploration

Data Visualization

Regression Modeling

  • Using Statsmodels (Boston Dataset)
  • Using Pandas
  • Using Scikit-learn

Data Mining

  • Decision Trees
  • Association Analysis
  • Clustering Kmeans and Hierarchical
  • Neural Networks
  • ROC Curves for Models

Time Series Forecasting

  • ETS Models
  • Arima Models

Optimizing Code

  • Measuring Code Speed
  • Measuring Code Performance

Text Mining

  • Word Cloud (corpus,stopwords,association,tdm)
  • Sentiment Analysis

Complete Tutorial

About

Python for Data Science

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors