ABSTRACT

This comprehensive Handbook focuses on the most used polytomous item response theory (IRT) models. These models help us understand the interaction between examinees and test questions where the questions have various response categories.  The book reviews all of the major models and includes discussions about how and where the models originated, conceptually and in practical terms. Diverse perspectives on how these models can best be evaluated are also provided. Practical applications provide a realistic account of the issues practitioners face using these models. Disparate elements of the book are linked through editorial sidebars that connect common ideas across chapters, compare and reconcile differences in terminology, and explain variations in mathematical notation. These sidebars help to demonstrate the commonalities that exist across the field. By assembling this critical information, the editors hope to inspire others to use polytomous IRT models in their own research so they too can achieve the type of improved measurement that such models can provide.

Part 1 examines the most commonly used polytomous IRT models, major issues that cut across these models,  and a common notation for calculating functions for each model. An introduction to IRT software is also provided.  Part 2 features distinct approaches to evaluating the effectiveness of polytomous IRT models in various measurement contexts. These chapters appraise evaluation procedures and fit tests and demonstrate how to implement these procedures using IRT software. The final section features groundbreaking applications.  Here the goal is to provide solutions to technical problems to allow for the most effective use of these models in measuring educational, psychological, and social science abilities and traits. This section also addresses the major issues encountered when using polytomous IRT models in computerized adaptive testing.  Equating test scores across different testing contexts is the focus of the last chapter. The various contexts include personality research, motor performance, health and quality of life indicators, attitudes, and educational achievement.

Featuring contributions from the leading authorities, this handbook will appeal to measurement researchers, practitioners, and students who want to apply polytomous IRT models to their own research. It will be of particular interest to education and psychology assessment specialists who develop and use tests and measures in their work, especially researchers in clinical, educational, personality, social, and health  psychology.  This book also serves as a supplementary text in graduate courses on educational measurement, psychometrics, or item response theory.

Part 1. Development of Polytomous IRT Models. R. Ostini, M. Nering, New Perspectives and Applications. R. Hambleton, W. van der Linden, C. Well, IRT Models for the Analysis of Polytomously-Scored Data: Brief and Selected History of Model Building Advances. D. Thissen, L. Cai, R.D. Bock, The Nominal Categories Item Response Model. F. Samejima, The General Graded Response Model. G. Masters, The Partial Credit Model. D. Andrich, Understanding the Response Structure and Process in the Polytomous Rasch Model. Part 2. Polytomous IRT Model Evaluation. R.D. Bock, R. Gibbons, Factor Analysis of Categorical Item Responses. C.A.W. Glas, Testing Fit to IRT Models for Polytomously Scored Items. Part 3. Application of Polytomous IRT Models. C. Huang, R. Mislevy, An Application of the Polytomous Rasch Model to Mixed Strategies. A. Boyd, B. Dodd, S. Choi, Polytomous Models in Computerized Adaptive Testing. S. Kim, D. Harris, M. Kolen, Equating with Polytomous Item Response Models.