Abstract
Bartko (1966, 1974) has presented some analysis of variance (ANOVA) intraclass correlation reliability coefficients that avoid some serious deficiencies not uncommonly found in reliability measures. In his second edition, Winer (1971, pp. 289-296) presented some intraclass correlation results which appear to have deficiencies. His so-called anchor points approach will produce an intraclass correlation reliability coefficient of unity (as expected) for the case in which the judges (raters) agree perfectly about a group of subjects. However, the method will also yield an intraclass correlation of unity for the case in which the judges display a constant additive bias. In general, with Winer's approach, any adjustment of the original rating data that leaves the raters' variance-covariance matrix unaltered will produce the same intraclass correlation coefficient, and thus numerous variations of the original data set (of which additive bias is one) can and will yield the same intraclass correlation.
Bias and Unity Reliability
As a first illustration on a more elementary level, the phenomenon discussed above can be observed with the product-moment correlation, which is sometimes used as a measure of reliability but is not recommended. Consider
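The effect of a constant additive bias can be checked numerically. The following is a minimal sketch and is not taken from the article: the ratings are made-up illustrative data, and `icc_oneway` implements the standard one-way ANOVA intraclass correlation, (MSB - MSW) / (MSB + (k - 1) MSW), chosen here as an assumption about a reasonable comparison coefficient rather than as the exact coefficient the article recommends.

```python
from statistics import mean


def pearson(x, y):
    """Product-moment correlation between two equal-length rating lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def icc_oneway(ratings):
    """One-way ANOVA intraclass correlation.

    `ratings` is a list of rows, one row per subject, one value per rater.
    """
    n = len(ratings)        # number of subjects
    k = len(ratings[0])     # number of raters
    grand = mean([v for row in ratings for v in row])
    row_means = [mean(row) for row in ratings]
    # Between-subjects and within-subjects mean squares.
    msb = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    msw = sum(
        (v - m) ** 2 for row, m in zip(ratings, row_means) for v in row
    ) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)


# Hypothetical data: judge B rates every subject exactly 2 points higher.
judge_a = [1, 2, 3, 4, 5]
judge_b = [v + 2 for v in judge_a]

r = pearson(judge_a, judge_b)
icc_biased = icc_oneway([list(pair) for pair in zip(judge_a, judge_b)])
icc_perfect = icc_oneway([[v, v] for v in judge_a])

print("product-moment r with additive bias:", r)
print("one-way ICC with additive bias:", icc_biased)
print("one-way ICC with perfect agreement:", icc_perfect)
```

With these data the product-moment correlation is 1 despite the 2-point bias, while the one-way intraclass correlation drops to 3/7 (about 0.43) and returns to 1 only when the two judges give identical ratings.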
Related Publications
Intraclass correlations: Uses in assessing rater reliability.
Reliability coefficients often take the form of intraclass correlation coefficients. In this article, guidelines are given for choosing among six different forms of the intracla...
A Generalized Family of Coefficients of Relational Agreement for Numerical Scales
A family of coefficients of relational agreement for numerical scales is proposed. The theory is a generalization to multiple judges of the Zegers and ten Berge theory of associ...
The Intraclass Correlation Coefficient as a Measure of Reliability
A procedure for estimating the reliability of sets of ratings in terms of the intraclass correlation coefficient is discussed. The procedure is based upon the analysis of varian...
Skill Scores Based on the Mean Square Error and Their Relationships to the Correlation Coefficient
Several skill scores are defined, based on the mean-square-error measure of accuracy and alternative climatological standards of reference. Decompositions of these skill scores ...
Measuring Agreement between Two Judges on the Presence or Absence of a Trait
At least a dozen indexes have been proposed for measuring agreement between two judges on a categorical scale. Using the binary (positive-negative) case as a model, this paper p...
Publication Info
- Year: 1976
- Type: article
- Volume: 83
- Issue: 5
- Pages: 762-765
- Citations: 807
- Access: Closed
Identifiers
- DOI: 10.1037/0033-2909.83.5.762