Unmixing Rasch scales: How to score an educational test

Maria Bolsinova, Gunter Maris, Herbert Hoijtink

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

One of the important questions in the practice of educational testing is how a particular test should be scored. In this paper we consider what an appropriate simple scoring rule should be for the Dutch as a second language test, which consists of listening and reading items. As in many other applications, the Rasch model, which allows the test to be scored with a simple sumscore, is too restrictive to adequately represent the data. In this study we propose an exploratory algorithm that clusters the items into subscales, each fitting a Rasch model, and thus provides a scoring rule based on the observed data. The scoring rule produces either a weighted sumscore based on equal weights within each subscale or a set of sumscores (one for each of the subscales). An MCMC algorithm that determines the number of Rasch scales constituting the test and unmixes these scales is introduced and evaluated in simulations. Using the results of unmixing, we conclude that the Dutch language test can be scored with a weighted sumscore with three different weights.
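The two forms of the scoring rule described in the abstract can be illustrated with a minimal Python sketch. It assumes the item-to-subscale assignment is already known (in the paper it is the output of the unmixing algorithm, which is not reproduced here). The function names, the six-item response pattern, and the three weights are hypothetical values chosen for illustration, not results from the paper.

```python
from typing import Dict, List


def subscale_sumscores(responses: List[int],
                       subscale_of_item: List[int]) -> Dict[int, int]:
    """Return one sumscore per subscale for a single test taker.

    responses[i] is 1 if item i was answered correctly, else 0.
    subscale_of_item[i] is the (assumed known) subscale label of item i.
    """
    scores: Dict[int, int] = {}
    for x, k in zip(responses, subscale_of_item):
        scores[k] = scores.get(k, 0) + x
    return scores


def weighted_sumscore(responses: List[int],
                      subscale_of_item: List[int],
                      weight_of_subscale: Dict[int, float]) -> float:
    """Combine subscale sumscores into one score.

    Every item in a subscale gets the same weight, so the total is the
    sum over subscales of (subscale weight) * (subscale sumscore).
    """
    per_subscale = subscale_sumscores(responses, subscale_of_item)
    return sum(weight_of_subscale[k] * s for k, s in per_subscale.items())


# Hypothetical example: six items in three subscales with made-up weights.
responses = [1, 0, 1, 1, 0, 1]
subscale_of_item = [0, 0, 1, 1, 2, 2]
weight_of_subscale = {0: 1.0, 1: 1.5, 2: 2.0}

per_subscale = subscale_sumscores(responses, subscale_of_item)
total = weighted_sumscore(responses, subscale_of_item, weight_of_subscale)
print(per_subscale)  # {0: 1, 1: 2, 2: 1}
print(total)         # 6.0
```

Reporting `per_subscale` corresponds to the "set of sumscores" option, while `total` corresponds to the weighted sumscore with one weight per subscale.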

Original language: English
Pages (from-to): 925-945
Number of pages: 21
Journal: Annals of Applied Statistics
Volume: 10
Issue number: 2
Publication status: Published - 1 Jun 2016

Keywords

  • Educational testing
  • Markov chain Monte Carlo
  • Mixture model
  • Multi-dimensional IRT
  • One parameter logistic model
  • Rasch model
  • Scoring rule
