Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here to sign up for SAGE Journal Email Alerts today!

Sign In to gain access to subscriptions and/or personal tools.
Evaluation & the Health Professions
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
0163278708324444v1
31/4/419    most recent
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Fernandez, S. A.
Right arrow Articles by Jarjoura, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Fernandez, S. A.
Right arrow Articles by Jarjoura, D.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Reliability of Surgical Skills Scores in Otolaryngology Residents

Analysis Using Generalizability Theory

Soledad A. Fernandez

Ohio State University, Columbus, Soledad.Fernandez{at}osumc.edu

Gregory J. Wiet

Columbus Children's Hospital, Columbus, Ohio Ohio State University

Nancy N. Butler

Ohio State University, Columbus

Bradley Welling

Ohio State University, Columbus

David Jarjoura

Ohio State University, Columbus

Assessments of temporal bone dissection performance among otolaryngology residents have not been adequately developed. At the Ohio State College of Medicine, an instrument (Welling Scale, Version 1 [WS1]) is used to evaluate residents' end-product performance after drilling a temporal bone. In this study, the authors evaluate the components that contribute to measurement error using this scale. Generalizability theory was used to reveal components of measurement error that allow for better understanding of test results. A major component of measurement error came from inconsistency in performance across the two cadaveric test bones each resident was assigned. In contrast, ratings of performance using the WS1 were highly consistent across raters and rating sessions within raters. The largest source of measurement error was caused by residents' inconsistent performance across bones. Rater disagreement introduced only small error into scores. The WS1 provides small measurement error, with two raters and two bones for each participant.

Key Words: measurement error variance • mixed models • performance measures

This version was published on December 1, 2008

Evaluation & the Health Professions, Vol. 31, No. 4, 419-436 (2008)
DOI: 10.1177/0163278708324444


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?