Page 444 - Jolliffe I. Principal Component Analysis
P. 444

A.1. Numerical Calculation of Principal Components
                                Finding PCs reduces to finding the eigenvalues and eigenvectors of a
                              positive-semidefinite matrix. We now look briefly at some of the possible
                              algorithms that can be used to solve such an eigenproblem.    409
                              The Power Method
                              A form of the power method was described by Hotelling (1933) in his
                              original paper on PCA, and an accelerated version of the technique was
                              presented in Hotelling (1936). In its simplest form, the power method is a
                              technique for finding the largest eigenvalue and the corresponding eigen-
                              vector of a (p × p) matrix T. The idea is to choose an initial p-element
                              vector u 0 , and then form the sequence
                                                     u 1 = Tu 0
                                                                    2
                                                     u 2 = Tu 1  = T u 0
                                                        .        .
                                                        .        .
                                                        .        .
                                                                    r
                                                     u r = Tu r−1 = T u 0
                                                        .        .
                                                        .        .
                                                        .        .
                              lf α 1 , α 2 ,..., α p are the eigenvectors of T, then they form a basis for
                              p-dimensional space, and we can write, for arbitrary u 0 ,

                                                              p

                                                        u 0 =   κ k α k
                                                             k=1
                              for some set of constants κ 1 ,κ 2 ,...,κ p . Then
                                                         p           p

                                             u 1 = Tu 0 =  κ k Tα k =   κ k λ k α k ,
                                                        k=1         k=1
                              where λ 1 ,λ 2 ,...,λ p are the eigenvalues of T. Continuing, we get for r =
                              2, 3,...
                                                            p

                                                       u r =      r
                                                               κ k λ α k
                                                                  k
                                                           k=1
                              and

                                                             r                  r
                                                     κ 2  λ 2           κ p  λ p
                                            =   α 1 +         α 2 + ··· +         α p .
                                        u r
                                      (κ 1 λ )       κ 1  λ 1           κ 1  λ 1
                                          r
                                          1
                              Assuming that the first eigenvalue of T is distinct from the remaining
                              eigenvalues, so that λ 1 >λ 2 ≥ ··· ≥ λ p , it follows that a suitably nor-
                              malized version of u r → α 1 as r →∞. It also follows that the ratios of
                              corresponding elements of u r and u r−1 → λ 1 as r →∞.
                                The power method thus gives a simple algorithm for finding the first
                              (largest) eigenvalue of a covariance or correlation matrix and its corre-
                              sponding eigenvector, from which the first PC and its variance can be
   439   440   441   442   443   444   445   446   447   448   449