Page 184 - Jolliffe I. Principal Component Analysis
P. 184

7.2. Estimation of the Factor Model
                                                                                            153
                              be used, and it also means that there is indeterminacy in the solutions—the
                              ‘best-fitting’ solution is not unique.
                                Estimation of the model is usually done initially in terms of the pa-
                              rameters in Λ and Ψ, while estimates of f are found at a later stage.
                              Given the assumptions of the previous section, the covariance matrix can
                              be calculated for both sides of (7.1.2) giving

                                                        Σ = ΛΛ + Ψ.                      (7.2.1)
                              In practice, we have the sample covariance (or correlation) matrix S, rather
                              than Σ,and Λ and Ψ are found so as to satisfy

                                                        S = ΛΛ + Ψ,
                              (which does not involve the unknown vector of factor scores f) as closely as
                              possible. The indeterminacy of the solution now becomes obvious; if Λ, Ψ
                              is a solution of (7.2.1) and T is an orthogonal matrix, then Λ , Ψ is also
                                                                                    ∗
                                               ∗
                              a solution, where Λ = ΛT. This follows since

                                                      ∗
                                                     Λ Λ ∗  =(ΛT)(ΛT)

                                                           = ΛTT Λ

                                                           = ΛΛ ,
                              as T is orthogonal.
                                Because of the indeterminacy, estimation of Λ and Ψ typically proceeds
                              in two stages. In the first, some restrictions are placed on Λ in order to find
                              a unique initial solution. Having found an initial solution, other solutions
                              which can be found by rotation of Λ, that is, multiplication by an orthog-
                              onal matrix T, are explored. The ‘best’ of these rotated solutions is chosen
                              according to some particular criterion. There are several possible criteria,
                              but all are designed to make the structure of Λ as simple as possible in some
                              sense, with most elements of Λ either ‘close to zero’ or ‘far from zero,’ and
                              with as few as possible of the elements taking intermediate values. Most
                              statistical computer packages provide options for several different rotation
                              criteria, such as varimax, quartimax and promax. Cattell (1978, p. 136),
                              Richman (1986) give non-exhaustive lists of eleven and nineteen automatic
                              rotation methods, respectively, including some like oblimax that enable the
                              factors to become oblique by allowing T to be not necessarily orthogonal.
                              For illustration, we give the formula for what is probably the most popular
                              rotation criterion, varimax. It is the default in several of the best known
                              software packages. For details of other rotation criteria see Cattell (1978,
                              p. 136), Lawley and Maxwell (1971, Chapter 6), Lewis-Beck (1994, Section
                              II.3), Richman (1986) or Rummel (1970, Chapters 16 and 17) An example
                              illustrating the results of using two rotation criteria is given in Section 7.4.
                                Suppose that B = ΛT and that B has elements b jk ,j =1, 2,... ,p; k =
                              1, 2,... ,m. Then for varimax rotation the orthogonal rotation matrix T is
   179   180   181   182   183   184   185   186   187   188   189