This assignment uses data from "https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip", which contains 561 motion measurements recorded by a Samsung Galaxy S II cell phone while the subjects in the study perform six different activities. The script produces a tidy data set of the mean and standard deviation columns from the original data, and outputs the mean of each of those columns, grouped and arranged by subject and activity.
This script does the following:
- Merges the training and the test sets to create one data set.
- Extracts only the measurements on the mean and standard deviation for each measurement.
- Uses descriptive activity names to name the activities in the data set.
- Appropriately labels the data set with descriptive variable names.
- From the data set in step 4, creates a second, independent tidy data set with the average of each variable for each activity and each subject.

To accomplish this, the script performs the following steps:

- The zipped file is downloaded and its files are extracted.
- All relevant data files are read in using read.table.
- Columns that contain mean and standard deviation measures are identified (meanstdcolumns).
- The test and training data are merged using rbind.
- The merged data is subset by meanstdcolumns.
- Subject and activity columns are prepended using cbind.
- Column names are made more descriptive and user friendly: all are converted to lower case and all punctuation is removed.
- The activity column is changed from numeric codes to character labels for clarity.
- The mean of each column is calculated, grouped by subject and activity.
- The tidy data set is written out as a text file.
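
The processing steps listed above can be sketched in base R roughly as follows. This is a minimal illustration on made-up toy data rather than the script itself: the download and read.table steps are replaced by small in-memory objects, and the names `features`, `activitylabels`, `xtrain`, `xtest`, `tidy`, and the output file name are assumptions. Only `meanstdcolumns`, read.table, rbind, and cbind come from the description above. The pattern used to pick mean and standard deviation columns, and the use of `aggregate` for the grouped means, are likewise assumptions.

```r
# Toy stand-ins for the real data files (assumed names, made-up values).
features <- c("tBodyAcc-mean()-X", "tBodyAcc-std()-X", "tBodyAcc-max()-X")
activitylabels <- data.frame(code = 1:2, label = c("WALKING", "SITTING"))

xtrain <- data.frame(c(0.1, 0.3), c(0.5, 0.7), c(0.9, 1.1))
xtest  <- data.frame(c(0.2, 0.4), c(0.6, 0.8), c(1.0, 1.2))
names(xtrain) <- names(xtest) <- features
subjecttrain <- c(1, 1)
subjecttest  <- c(2, 2)
ytrain <- c(1, 2)
ytest  <- c(1, 1)

# Identify the mean and standard deviation columns by feature name.
# Assumption: only names containing "mean()" or "std()" are wanted.
meanstdcolumns <- grepl("mean\\(\\)|std\\(\\)", features)

# Merge training and test rows, then keep only the selected columns.
merged <- rbind(xtrain, xtest)[, meanstdcolumns]

# Prepend the subject and activity columns.
merged <- cbind(subject  = c(subjecttrain, subjecttest),
                activity = c(ytrain, ytest),
                merged)

# Strip punctuation from the column names and convert them to lower case.
names(merged) <- tolower(gsub("[[:punct:]]", "", names(merged)))

# Replace the numeric activity codes with descriptive character labels.
merged$activity <- as.character(
  activitylabels$label[match(merged$activity, activitylabels$code)]
)

# Average every measurement column for each subject/activity pair,
# then arrange the rows by subject and activity.
tidy <- aggregate(. ~ subject + activity, data = merged, FUN = mean)
tidy <- tidy[order(tidy$subject, tidy$activity), ]

# Write the tidy data set to a text file (hypothetical file name).
outfile <- file.path(tempdir(), "tidydata.txt")
write.table(tidy, file = outfile, row.names = FALSE)
```

With the real data, the toy objects would instead be filled by read.table calls on the extracted files, and the result would have one row per subject and activity combination.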