Skip to content

Commit 111f8b1

Browse files
committed
update and fix typos in readme
1 parent 0062f1f commit 111f8b1

1 file changed

Lines changed: 8 additions & 19 deletions

File tree

README.md

Lines changed: 8 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -4,24 +4,11 @@
44

55
This Jupyter notebook introduces you to some basic principles data exploration and visualization using the Python language. The lessons contained in the notebook, revolve around exploration of the characteristics of a number of automobiles. The ultimate goal is to build a model for predicting the price of a car from its characteristics. The goal is to understand the relationships in these data using visualization methods.
66

7-
## About this Jupyter Notebook
8-
The Jupyter notebook contains material to help you
9-
10-
``pip install seaborn``
11-
127

138

149
## About the author
1510

16-
These lessons have been prepared by **Steve Elston**, co-founder and principle consultant at Quantia Analytics, LLC. Steve is a long time data geek and analytical software pioneer, having lead R&D at the SPlus companies and co-fou learn how to explore data visually. This notebook and the data set can be downloaded from GitHub:
17-
18-
**https://github.com/StephenElston/ExploringDataWithPython**
19-
20-
This notebook was constructed using the Anaconda 3.5 Python distribution. If you are not running version Anaconda 3.5 or higher, we suggest you update your Anaconda distribution now. You can download the Python 3 Anaconda distribution for your operating system from the [Continum Analytics web site](https://www.continuum.i
21-
22-
``conda install seaborn``
23-
24-
or nding FinAnalytica, among other positions. He assists clients with staff training and analytics problems in a number of areas. A summary of his activities include:
11+
These lessons have been prepared by **Steve Elston**, co-founder and principle consultant at Quantia Analytics, LLC. Steve is a long time data geek and analytical software pioneer, having lead R&D at the SPlus companies and co-founder of FinAnalytica, among other positions. He assists clients with staff training and analytics problems in a number of areas. A summary of his activities include:
2512

2613
- Instructor, University of Washington and Harvard University data science programs
2714
- O'Reilly author and creator of four edX data science courses
@@ -30,15 +17,17 @@ or nding FinAnalytica, among other positions. He assists clients with staff trai
3017
- Experience in several industries: payment, telecom, capital markets, logistics, energy
3118
- PhD, MS in Geophysics from Princeton University
3219

33-
## Why visualization?
20+
**https://github.com/StephenElston/ExploringDataWithPython**
3421

35-
Visualization is an essential method in any data scientist’s toolbox. Visualization is a key first step in the exploration of most data sets. As a general rule, you should never start creating models until you have an understanding of the relationships in your data.
22+
This notebook was constructed using the Anaconda 3.5 Python distribution. If you are not running version Anaconda 3.5 or higher, we suggest you update your Anaconda distribution now. You can download the Python 3 Anaconda distribution for your operating system from the [Continum Analytics web site](https://www.continuum.io
3623

37-
To run this notebook you need the Seaborn graphics packages. If you have not done so, you will need to install Seaborn as it is not in the Anaconda distribution as of now. From a command prompt on your computer type the following command. If no errors occur, you will have installed Seaborn.
24+
Run ``conda install pandas numpy seaborn`` from your terminal or Anaconda Prompt.
3825

39-
``pip install seaborn``
4026

41-
or you have examined the data and understand the relationships. Otherwise, you risk wasting your time creating models blindly. Visualization is also a powerful tool for presentation of results and for determining sources of problems with analytics.
27+
28+
## Why visualization?
29+
30+
Visualization is an essential method in any data scientist’s toolbox. Visualization is a key first step in the exploration of most data sets. As a general rule, you should never start creating models until you have an understanding of the relationships in your data or you have examined the data and understand the relationships. Otherwise, you risk wasting your time creating models blindly. Visualization is also a powerful tool for presentation of results and for determining sources of problems with analytics.
4231

4332
The concepts of exploring a data set visually were pioneered by John Tukey in the 1960s and 1970s. Tukey consolidated his many ideas on data exploration into a book in the late 1970s, ***John Tukey, Exploratory Data Analysis, 1977, Addison-Westley***.
4433

0 commit comments

Comments
 (0)