Locked lesson.
About this lesson
There are many types of continuous data distributions. These are often associated with physical characteristics of the data or system being studied. The ability to recognize the type of distribution aid in the selection and analysis of a hypothesis test.
Exercise files
Download this lesson’s related exercise files.
Continuous Data Distribution Exercise.xlsx10.8 KB Continuous Data Distribution Exercise Solution.docx
128.8 KB
Quick reference
Continuous Data Distributions
Datasets are often displayed in distributions. Different continuous data distributions are indicative of different physical phenomena. The ability to recognize a distribution will aid id the identification of process performance issues.
When to use
Visualizations of datasets are often easier to use when explaining the characteristics of data than tables of numbers. In addition, the different continuous data distributions have specific characteristics which will dictate what type of hypothesis test is appropriate for that data.
Instructions
Continuous Distribution
Continuous data is that which can take on an infinite number of values. Between any two data values, there is another data value that could be detected if the measurement system could accurately discriminate that level of fraction or decimal. The plots are characterized by a smooth curve, not histogram bars. In all these plots, the horizontal axis is the independent variable and the vertical axis is the process performance dependent variable.
Gaussian (Normal) Distribution
This is the bell-shaped curve that represents common cause or random variation. It is symmetric, peaked in the center and the tails approach zero. This is normally our desired distribution for analysis because we know that it represents random variation around the typical process performance.
Uniform Distribution
This is a horizontal line or essentially equal vertical value for all horizontal axis values. This represents the case where the process performance does not depend upon the independent variable.
Exponential Distribution
This is an asymmetric curve. One end starts a point on the vertical axis and the other end of the curve approaches – but never reaches – zero value. A typical physical phenomenon that follows this pattern is the failure rates of a product or system that is subject to infant mortality.
Log-normal Distribution
This is also an asymmetric curve. Both ends of the curve are at zero. However, one end quickly shoots up and then it slowly decays back to zero. This is also a commonly occurring pattern in the real world. For instance, machine downtime follows this pattern, It takes a finite amount of time to do a repair which is the major spike, and some repairs then take longer.
F Distribution
The F Distribution is a graph of the F statistic. The F statistic is used for comparing continuous data distributions with statistical techniques such as ANOVA. The actual shape of the curve will depend upon the number of degrees of freedom. The horizontal axis goes from zero to one. The vertical axis is probability.
Chi-Squared Distribution
This distribution is an asymmetrical distribution of the Chi-Squared statistic. The level of skewness will vary based on the application statistics. The horizontal axis is the number of factors in the test. The vertical axis is the Chi-squared value.
Beta Distribution
The Beta distribution is a family of curves based on several shape factors. The general form of the Beta distribution can vary from exponential to normal or even a bathtub with ends and a low center. The horizontal axis ranges from zero to one. The vertical axis is probability.
Gamma Distribution
The Gamma distribution is a family curve based on shape factors. The Exponential distribution and Chi-squared distribution are special cases of the Gamma distribution. The horizontal axis is continuous and the vertical axis is probability.
Weibull Distribution
The Weibull distribution is a family of curves based upon the Beta curve that can take on many shapes including an exponential, log-normal, or even normal. The actual shape varies based on factors or constants in the Weibull equation. This equation has proven very effective at modeling reliability in complex systems. The factors are based on the system design parameters.
Login to downloadLesson notes are only available for subscribers.
PMI, PMP, CAPM and PMBOK are registered marks of the Project Management Institute, Inc.