650 likes | 660 Views
Statistical Tools for Multivariate Six Sigma. Dr. Neil W. Polhemus CTO & Director of Development StatPoint, Inc. The Challenge. The quality of an item or service usually depends on more than one characteristic.
E N D
Statistical Tools for Multivariate Six Sigma Dr. Neil W. Polhemus CTO & Director of Development StatPoint, Inc.
The Challenge The quality of an item or service usually depends on more than one characteristic. When the characteristics are not independent, considering each characteristic separately can give a misleading estimate of overall performance.
The Solution Proper analysis of data from such processes requires the use of multivariate statistical techniques.
Outline • Multivariate SPC • Multivariate control charts • Multivariate capability analysis • Data exploration and modeling • Principal components analysis (PCA) • Partial least squares (PLS) • Neural network classifiers • Design of experiments (DOE) • Multivariate optimization
Example #1 Textile fiber Characteristic #1: tensile strength - 115 ± 1 Characteristic #2: diameter - 1.05 ± 0.05
Sample Data n = 100
Multivariate Capability Determines joint probability of being within the specification limits on all characteristics
Mult. Capability Indices Defined to give the same DPM as in the univariate case.
More than 2 Characteristics Calculate T-squared: where S = sample covariance matrix = vector of sample means
T-Squared Decomposition Subtracts the value of T-squared if each variable is removed. Large values indicate that a variable has an important contribution.
Generalized Variance Chart Plots the determinant of the variance-covariance matrix for data that is sampled in subgroups.
Data Exploration and Modeling When the number of variables is large, the dimensionality of the problem often makes it difficult to determine the underlying relationships. Reduction of dimensionality can be very helpful.
Analysis Methods • Predicting certain characteristics based on others (regression and ANOVA) • Separating items into groups (classification) • Detecting unusual items
Principal Components The goal of a principal components analysis (PCA) is to construct k linear combinations of the p variables X that contain the greatest variance.
Scree Plot Shows the number of significant components.
Partial Least Squares (PLS) Similar to PCA, except that it finds components that minimize the variance in both the X’s and the Y’s. May be used with many X variables, even exceeding n.
Component Extraction Starts with number of components equal to the minimum of p and (n-1).
Classification Principal components can also be used to classify new observations. A useful method for classification is a Bayesian classifier, which can be expressed as a neural network.
Bayesian Classifier • Begins with prior probabilities for membership in each group • Uses a Parzen-like density estimator of the density function for each group
Options • The prior probabilities may be determined in several ways. • A training set is usually used to find a good value for s.