Page 149 - Six Sigma Advanced Tools for Black Belts and Master Black Belts
P. 149

2:57
                               Char Count= 0
          August 31, 2006
 JWBK119-10
        134       Process Capability Analysis for Non-Normal Data with MINITAB
                              Box-Cox Plot of Data
                                   Upper CL
                                                                 Lambda
                  1.9
                                                             (using 95.0% confidence)
                                                             Estimate  −3.53725
                  1.8                                        Lower CL
                                                             Upper CL  −0.50142
                  1.7
                                                             Best Value  −3.53725
                 StDev  1.6
                  1.5

                  1.4
                  1.3                                   Limit

                  1.2
                     −5.0    −2.5    0.0     2.5     5.0
                                   Lambda
                          Figure 10.2 Box--Cox plot for case study data.



        10.2.1.2 Verifying the transformed data
        Having transformed the data, we need to check that the transformation is a good one
        using both a normality test and visual assessment (fitting a normal curve over the
        histogram).
          There are many normality tests available. The Anderson--Darling test is the test
                                 1
        recommended by Stephens. It tests the null hypothesis that the data come from a
        specified normal distribution by measuring the area between the fitted line based
        on the distribution and the nonparametric step function based on the plotted points.
        The statistic is a squared distance and is weighted more heavily in the tails of the
        distribution. It is given by
                     1
           2
          A =−N −         (2i − 1)(ln F(Y i ) + ln(1 − F(Y N+1−i )),
                     N
        where the Y i are the ordered data and

                     Y i − ¯x
          F(Y i ) =
                       s
        is the cumulative distribution function of the standard normal distribution. Smaller
        Anderson--Darling values indicate that there is are fewer differences between the data
        and the normal distribution, and hence that the data fits the specific distribution better.
          Another quantitative measure for reporting the result of the Anderson--Darling
        normality test is the p-value, representing the probability of concluding that the null
                                                   2
        hypothesis is false when it is true. If you know A you can calculate the p-value. Let

                        0.75  2.25
            2    2
          A = A * 1 +       +   2   ,
                         N     N
   144   145   146   147   148   149   150   151   152   153   154