Skip to content

Commit b259806

Browse files
authored
Update README.rst
1 parent 4a1870b commit b259806

1 file changed

Lines changed: 2 additions & 0 deletions

File tree

README.rst

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -374,9 +374,11 @@ Dimensionality Reduction
374374
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
375375
Principal Component Analysis (PCA)
376376
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
377+
Principle component analysis~(PCA) is the most popular technique in multivariate analysis and dimensionality reduction. PCA is a method to identify a subspace in which the data approximately lies. This means finding new variables that are uncorrelated and maximizing the variance to preserve as much variability as possible.
377378

378379

379380
Example of PCA on text dataset (20newsgroups) from tf-idf with 75000 features to 2000 components:
381+
380382
.. code:: python
381383
382384
from sklearn.feature_extraction.text import TfidfVectorizer

0 commit comments

Comments
 (0)