Evaluation & the Health Professions

Judge Consistency and Severity Across Grading Periods

Mary E. Lunz

John A. Stahl

American Society of Clinical Pathologists

The purpose of this research project was to confirm that differences in the severity of judges and the stringency of grading periods occur regardless of the nature of the assessment or the examination materials used. Three rather different examinations that require judges were analyzed using an extended Rasch model to determine whether differences in judge severity and grading-period stringency were observable for all three examinations. Significant variation in judge severity and some variation across grading periods were found on all three examinations. This implies that, regardless of the nature of the examination, items, or judges, examinee measures cannot be considered independent of the particular judges involved unless correction for severity is made systematically. Accounting for judge severity and grading-period stringency is extremely important when pass/fail decisions that are meant to generalize to competence are made, as in certification examinations.
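To illustrate why examinee measures depend on the judge unless severity is corrected, the following is a minimal sketch, not the authors' analysis. It assumes a simplified dichotomous (pass/fail) many-facet Rasch form in which the log-odds of a passing rating equal examinee ability minus item difficulty, judge severity, and grading-period stringency, all in logits. The function name `pass_probability` and every numeric value are hypothetical.

```python
import math


def pass_probability(ability, item_difficulty, judge_severity, period_stringency=0.0):
    """Probability of a passing rating under an assumed dichotomous
    many-facet Rasch form. All arguments are in logits."""
    logit = ability - item_difficulty - judge_severity - period_stringency
    return 1.0 / (1.0 + math.exp(-logit))


# Hypothetical values: the same examinee and item, rated by two different judges.
lenient = pass_probability(ability=0.5, item_difficulty=0.0, judge_severity=-1.0)
severe = pass_probability(ability=0.5, item_difficulty=0.0, judge_severity=1.0)

print(f"lenient judge: {lenient:.2f}, severe judge: {severe:.2f}")
```

Under these assumed values the same examinee is much more likely to pass with the lenient judge than with the severe one, which is the dependence that a systematic severity correction is meant to remove.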

Evaluation & the Health Professions, Vol. 13, No. 4, 425-444 (1990)
DOI: 10.1177/016327879001300405




This article has been cited by other articles:


Y.-H. Kim
An investigation into native and non-native teachers' judgments of oral English performance: A mixed methods approach
Language Testing, April 1, 2009; 26(2): 187-217.

C. Elder, G. Barkhuizen, U. Knoch, and J. von Randow
Evaluating rater responses to an online training program for L2 writing assessment
Language Testing, January 1, 2007; 24(1): 37-64.

B. Garrett, E. Towles, H. Kleinert, and J. Kearns
Portfolios in Large-Scale Alternate Assessment Systems: Frameworks for Reliability
Assessment for Effective Intervention, January 1, 2003; 28(2): 17-27.

K. Kondo-Brown
A FACETS analysis of rater bias in measuring Japanese second language writing performance
Language Testing, January 1, 2002; 19(1): 3-31.

J. A. Upshur and C. E. Turner
Systematic effects in the rating of second-language speaking ability: test method and learner discourse
Language Testing, January 1, 1999; 16(1): 82-111.

S. C. Weigle
Using FACETS to model rater training effects
Language Testing, April 1, 1998; 15(2): 263-287.

L. H. Ludlow and S. M. Haley
Effect of Context in Rating of Mobility Activities in Children with Disabilities: An Assessment Using the Pediatric Evaluation of Disability Inventory
Educational and Psychological Measurement, February 1, 1996; 56(1): 122-129.

T. Lumley and T. F. McNamara
Rater characteristics and rater bias: implications for training
Language Testing, March 1, 1995; 12(1): 54-71.