Locked lesson.
About this lesson
Statistical tests are often used to aid the problem analysis. The result of the test is a statistical measure of the validity of a hypothesis about the problem in the sample with an inference about that problem throughout the data population.
Exercise files
Download this lesson’s related exercise files.
Statistical Analysis Exercise.docx58.3 KB Statistical Analysis Solution.docx
59.4 KB
Quick reference
Statistical Analysis
Statistical tests are often used to aid the problem analysis by inferring attributes of the problem based on an analysis of a sample set of data. The statistical analysis provides a statistical measure, the P value, that is used to determine whether or not to accept the Null hypothesis.
When to use
In some cases, the problem analysis will point to an obvious root cause, but often the cause is not obvious. Statistical analysis is used to confirm a hypothesis when doing hypothesis testing as part of the Analyse stage of a Lean Six Sigma project.
Instructions
There is a context to statistical analysis and it is important to understand that context. Normally, statistical analysis in Lean Six Sigma projects is used in conjunction with hypothesis testing. When the statistical analysis is completed, a “P” value, or probability value, will be generated. Based upon the “P” value, the Null hypothesis is accepted or rejected.
Inferential Statistics
In most cases, the data set being used in the Lean Six Sigma project will not represent the entire possible population of the problem. It is difficult or impossible to get data from all potential instances with all relevant customers, relevant products, relevant locations, across all time periods – past, present, and future. Therefore, a subset, or sample, of the data is used. The statistical analysis is applied to the data from the sample and then the results are inferred to apply to the entire population.
In order to do this inference, additional analysis should be done based on sampling approach and confidence levels. By confidence level, we mean being able to state something about the entire population based on the sample data with a 90%, 95%, or 99% confidence. The detailed analysis of confidence levels and confidence intervals is addressed in the Hypothesis Testing course.
The desired Confidence Level and the descriptive statistics of the sample data (mean and standard deviation) can be used to calculate a confidence interval for the location of the total population mean. The confidence interval calculation can be used in reverse to determine the confidence that a particular value is the mean of the total population based on the sample mean
Hints & tips
- Don’t blindly apply a statistical analysis to your project. If the root cause is obvious, no analysis is needed. If the root cause is not obvious, create a set of hypotheses and based upon the hypothesis and characteristics of the data apply the one right test for each hypothesis.
- Confidence level is not precisely the probability that the population statistics falls within the range associated with that confidence level, but is essentially that.
- Most organizations use a 95% confidence level and that is the default in Minitab.
Lesson notes are only available for subscribers.
PMI, PMP, CAPM and PMBOK are registered marks of the Project Management Institute, Inc.