Page 167 - Jolliffe I. Principal Component Analysis
P. 167

6. Choosing a Subset of Principal Components or Variables
                              136





























                              Figure 6.2. LEV diagram for the covariance matrix: gas chromatography data.


                              logue of the scree diagram. However, although this interpretation may be
                              valid for the correlation matrices in his simulations, it does not seem to
                              hold for the dominant variance structures exhibited in Tables 6.2 and 6.3.
                                For correlation matrices, and presumably for covariance matrices with
                              less extreme variation among eigenvalues, the ad hoc methods and the
                              cross-validatory criteria are likely to give more similar results. This is illus-
                              trated by a simulation study in Krzanowski (1983), where W is compared
                                                                          ∗
                                                                                        ∗
                              with the first two ad hoc rules with cut-offs at t = 75% and l =1,
                              respectively. Bartlett’s test, described in Section 6.1.4, is also included in
                              the comparison but, as expected from the earlier discussion, it retains too
                              many PCs in most circumstances. The behaviour of W compared with the
                              two ad hoc rules is the reverse of that observed in the example above. W
                              retains fewer PCs than the t m > 75% criterion, despite the fairly low cut-
                              off of 75%. Similar numbers of PCs are retained for W and for the rule
                              based on l k > 1. The latter rule retains more PCs if the cut-off is lowered
                              to 0.7 rather than 1.0, as suggested in Section 6.1.2. It can also be argued
                              that the cut-off for W should be reduced below unity (see Section 6.1.5),
                              in which case all three rules will give similar results.
   162   163   164   165   166   167   168   169   170   171   172