Locked lesson.
About this lesson
There are three types of data: variable, attribute and ranked. Each type is useful in measuring process performance by analyzing process problems, but they need to be treated differently.
Exercise files
Download this lesson’s related exercise files.
Data Types Exercise -2023.docx61 KB Data Types Exercise Solution - 2023.docx
61.1 KB
Quick reference
Data Types
There are three types of data, variable, attribute, and ranked. Each type is useful in measuring process performance and analyzing process problems, but they need to be treated differently.
When to use
Whenever data is collected or analyzed – which can happen in all phases of a Lean Six Sigma project – the differentiation between the data types should be clear so that the data is handled correctly.
Instructions
Data is the set of real-world facts about the problem or process that is being studied. Statistics are the numerical interpretation of those facts. We will spend many lessons discussing ways to interpret the data. However, it is important to recognize that depending upon the category of the data, different types of statistical analysis can be done.
There are three categories of data, attribute, variable, and ranked. All three are meaningful for our analysis.
Attribute data is used to indicate the status or condition of the process or problem. It is often expressed in descriptive terms such as: on/off, true/false, Brazil/Argentina/Chile, First place/Second place/Third place, or Good/Better/Best. It is often easier to collect than variable data because typically there are no measurements to be made, it is just a check of the status. Attribute data may be expressed in numerical terms, but it is still a status. For instance, on/off may be represented as 1/0.
Variable data is measured data along some scale. Because it is along a scale, it can take on many values, essentially an infinite number of values depending upon the ability of the scale to discriminate. Variable data is considered to be richer data than attribute because in the analysis we can see values moving and drifting and the movement can be analyzed. Examples of variable data include time, temperature, distance, dimensions, and percentages.
Ranked data is a combination of variables and attributes. The variable data points are sorted from ascending to descending order and then categories are applied to the unit scale of the data. The number of data points that lie within each category is counted. In this way, the variable data is converted into a count that can be used in graphical data analysis.
Hints & tips
- When collecting variable data, be sure you are using consistent units. There have been some disastrous analyses because one team member measured in meters and another with feet and the two data sets were combined without converting to common units.
- During the Measure phase, more data is good. Collect any and all data, even if you don’t know whether you will need it or not. When you get to the Analyse phase, it may become very valuable.
- Some in the Lean Six Sigma community treat attribute or ranked data with disdain because it is not as rich. However, it is much easier to collect. In addition, when the problem is a special cause problem – meaning a unique or isolated event - the attribute data will often provide a clearer picture of what is happening. Attribute data can identify massive changes very quickly whereas variable data is much more powerful for finding trends and slow changes. Ranked data is most useful with graphical data analysis.
Lesson notes are only available for subscribers.
PMI, PMP, CAPM and PMBOK are registered marks of the Project Management Institute, Inc.