Locked lesson.
About this lesson
When designing the study, there are several critical considerations based upon the type of test (destructive or non-destructive), the discrimination of the test system, and whether attribute tests can designate an item into multiple categories beyond Pass/Fail. These considerations impact the accuracy and performance of the analysis.
Exercise files
Download this lesson’s related exercise files.
Gage R&R Study Design Considerations.docx203.7 KB Gage R&R Study Design Considerations - Solution.docx
205 KB
Quick reference
Gage R&R Study Design Considerations
When designing a Gage R&R study there can be issues that create unusual of special considerations. These are often caused by the test equipment or the nature of the inspection or test. This lesson will review some of these considerations and explore approaches for addressing the issue.
When to use
Each of the considerations are based upon special circumstances. Nested studies are done when it is impossible for all operators and all equipment to test all items multiple times. If the discrimination of the test is inadequate, the study should either be redesigned, or the test method rejected. Finally, when attribute testing has multiple categories (more than just pass/fail) an additional test needs to be done to ensure the attribute data Gage R&R will be effective.
Instructions
The special conditions addressed in this lesson require a design modification or additional calculations in order to ensure the Gage R&R results are trustworthy.
Crossed and Nested Designs
The typical Gage R&R study is a crossed study. That means that all appraisers will test all items multiple times and if there are multiple pieces of equipment, they will all test all items. However, in some cases this can not be done. Some types of tests are destructive or consume the test item. This is often the case in chemical analysis or in some cases it is impossible to move the item between test centers. In these cases, a nested design is used.
The nested design is based upon a critical assumption. The assumption is that items can be batched and that items from the same batch are identical. This is often a valid assumption for chemicals. An advantage of the nested design is that you don’t need all operators to test all parts multiple times. The problem with a nested design is that it loses some accuracy. Interaction effects cannot be easily modelled, and the level of uncertainty is higher. If you need to do a nested design, I recommend that you refer to an advanced statistical book or rely on a statistical software program such as Minitab to assist in the design and analysis.
Number of Distinct Categories (NDC)
Another concern is the accuracy and the discrimination of the test equipment used within the study. Be sure to check the calibration status of your measuring equipment and if appropriate do a calibration to limit bias errors. Also, be sure the equipment is accurate in the region of the parts being measured. Check the linearity study to determine this. The NDC value is a number that is calculated after a study is completed to inform you whether the study can be trusted or if you either need to redesign the study or reject the measurement system.
NDC looks at the discrimination of the measurement system in the region of the items that were being measured. The higher the NDC the better. If NDC is less than 2 then the measurement system is unable to discriminate the value of one item from another. If the value is between 2 and 5, the system can divide the items into two or three categories, such small/large or low/medium/high. This may be acceptable for an attribute data Gage R&R study but is definitely not for a variable data Gage R&R study. If the NDC is between 5 and 14 it is considered marginal. If the item being measured is a critical parameter, you should consider improving the test method. A value over 14 is good and the Gage R&R results can be relied upon.
The NDC calculation is:The standard deviation of the part is based upon the parts that are used in the Gage R&R study. The standard deviation of the gage is calculated in the Gage R&R study. A common mistake that is made in variable data Gage R&R studies is that the parts used do represent the full range of allowable parts. If the parts are almost identical, the standard deviation for the parts will be very small and the NDC will also be small. For this reason, be sure that you are using the full tolerance range when selecting items for the Gage R&R study. If the full range is used and the value is still low, that is an indication that the measurement system should be rejected.
Fleis’ Kappa
The third special consideration is associated with attribute data Gage R&R studies. A typical attribute study uses two categories for analysis – pass/fail. However, in some cases that attribute analysis is categories, such as Likert scale good – better – best. In this case, there are multiple selection categories, so a different measure is used known as Fleis’ Kappa.
Fleis’ Kappa will have a value ranging form plus one to minus one. Plus one means that the appraisers perfectly agree. Minus one means that the appraisers perfectly disagree. A value of zero indicates that the likelihood of their agreement is based upon random chance. The Fleis’ Kappa value must be greater than 0.7 for the Gage R&R study to provide reliable results. A value of 0.9 or greater is ideal.
The Fleis’ Kappa ratio is:
In this formula the P is the percentage of time that the appraisers agreed. Pe is the expected percentage of agreement if the it was based upon random chance effects. So, if there are four choices and all are equally likely, the Pe is 25%.
The Fleis’ Kappa ratio has several beneficial attributes. One is that it can be used with multiple appraisers, you just need to do the calculation for each pair. Second, it does not require that all appraisers evaluate all the items. The percentages will be based upon those items they have in common.
Hints & tips
- When structuring you Gage R&R seek to avoid the special conditions mentioned. They add complications which increases uncertainty in the results.
- Don’t hesitate to use Minitab or other statistical support material when facing the special considerations.
Lesson notes are only available for subscribers.
PMI, PMP, CAPM and PMBOK are registered marks of the Project Management Institute, Inc.