Locked lesson.
About this lesson
One of the most important statistical measures of a data set is the mean or average value. For inferential statistics to be valid, the mean of the sample should be approximately the same as the mean of the entire population of data. The Standard Error is the measure of how accurately the sample mean will approximate the population mean. In this lesson, we will determine how to calculate the standard error and how the sampling process can affect that error.
Exercise files
Download this lesson’s related exercise files.
Standard Error Exercise.xlsx11.3 KB Standard Error Exercise Solution.xlsx
12.4 KB
Quick reference
Standard Error
Standard error, also known as sampling error, is a quick method to calculate the amount of uncertainty between a sample mean value and the population mean value.
When to use
Standard error can be calculated whenever a sample of a population is statistically analyzed. The standard error is a quick calculation to test for the “goodness” of the sample metrics. It is often used with data that is comprised of means of subset data from many samples.
Instructions
Sampling Error (aka Standard Error) is a measure of the uncertainty of a population parameter for a sample based on the sample size and variability in the sample data.
Standard Error is computed by dividing the sample standard deviation by the square root of the number of items in the sample.
The Standard Error is often used with sample statistics, such as sample mean, to indicate the level of uncertainty between the sample statistic and the population statistic. In this regard, it serves a similar purpose as confidence interval. The confidence interval formula also includes a term that is a standard deviation divided by the number of points in the sample. The difference is that the confidence interval multiplies that ratio by a Z factor based on the confidence level that is being used.
Because of the similarity, an alternative formula for Confidence Interval is sometimes given as
The Z term is approximated at a value of 2. This is because the Z term for a 95% confidence level, which is the most common level in hypothesis testing, is 1.96. The other minor difference is the use of the population standard deviation in the Confidence Interval and the use of the sample standard deviation with the Standard Error. When the sample size becomes reasonably large, that is greater than 20 items, these values will typically be almost identical.
The last point to consider is the sources of sampling error and what to do about them:
- Sampling errors – the uncertainty due to random selection. The Standard Error calculation address this.
- Sampling bias – selecting the sample from a segment of the population and not the entire population. Examples could be always taking the first piece manufactured during a shift or taking all the samples from the same line, even though the organization has multiple lines in operation. Since some items can never be selected, this is not a truly random sample. The sampling approach needs to change to address as much of the full population as possible.
- Error in measurement – this is due to issues associated with the design and maintenance of the measurement equipment. A major aspect of this is calibration, but it also includes problems with operator use and linearity. A measurement systems analysis is needed to ensure that the system is capable and accurate.
Lack of measurement validity – the measurement approach is inappropriate for the characteristic being measured. It may be that it is measured at the wrong step in the process, that the measurement system used does not measure the correct parameter, or the measurement system has become contaminated in some manner. When this is the problem, it is normally necessary to change the measurement system.
Hints & tips
- Standard error is often reported by statistical software as a measure of the uncertainty in the results that have been calculated.
- Standard error is more frequently used in sociological statistical analysis than with Lean Six Sigma problem-solving.
- Standard error is often used when the data set is actually the descriptive statistics of numerous unique samples.
Lesson notes are only available for subscribers.
PMI, PMP, CAPM and PMBOK are registered marks of the Project Management Institute, Inc.