TY - JOUR
T1 - Validity of Comparative Judgment Scores
T2 - How Assessors Evaluate Aspects of Text Quality When Comparing Argumentative Texts
AU - Lesterhuis, Marije
AU - Bouwer, Renske
AU - Van Daal, Tine
AU - Donche, Vincent
AU - De Maeyer, Sven
N1 - Funding Information:
This work was supported by the Flanders Innovation and Entrepreneurship and the Research Foundation under (Grant No. 130043).
Publisher Copyright:
Copyright © 2022 Lesterhuis, Bouwer, van Daal, Donche and De Maeyer.
PY - 2022/5/13
Y1 - 2022/5/13
N2 - The advantage of comparative judgment is that it is particularly suited to assess multidimensional and complex constructs as text quality. This is because assessors are asked to compare texts holistically and to make a quality judgment for each text in a pairwise comparison based upon on the most salient and critical differences. Also, the resulted rank order is based on the judgment of all assessors, representing the shared consensus. In order to be able to select the right number of assessors, the question is to what extent the conceptualization of assessors prevails in the aspects they base their judgment on, or whether comparative judgment minimizes the differences between assessors. In other words, can we detect types of assessors who tend to consider certain aspects of text quality more often than others? A total of 64 assessors compared argumentative texts, after which they provided decision statements on what aspects of text quality had informed their judgment. These decision statements were coded on six overarching themes of text quality: argumentation, organization, language use, language conventions, source use, references, and layout. Using a multilevel-latent class analysis, four different types of assessors could be distinguished: narrowly focused, broadly focused, source-focused, and language-focused. However, the analysis also showed that all assessor types mainly focused on argumentation and organization, and that assessor types only partly explained whether the aspect of text quality was mentioned in a decision statement. We conclude that comparative judgment is a strong method for comparing complex constructs like text quality. First, because the rank order combines different views on text quality, but foremost because the method of comparative judgment minimizes differences between assessors.
AB - The advantage of comparative judgment is that it is particularly suited to assess multidimensional and complex constructs as text quality. This is because assessors are asked to compare texts holistically and to make a quality judgment for each text in a pairwise comparison based upon on the most salient and critical differences. Also, the resulted rank order is based on the judgment of all assessors, representing the shared consensus. In order to be able to select the right number of assessors, the question is to what extent the conceptualization of assessors prevails in the aspects they base their judgment on, or whether comparative judgment minimizes the differences between assessors. In other words, can we detect types of assessors who tend to consider certain aspects of text quality more often than others? A total of 64 assessors compared argumentative texts, after which they provided decision statements on what aspects of text quality had informed their judgment. These decision statements were coded on six overarching themes of text quality: argumentation, organization, language use, language conventions, source use, references, and layout. Using a multilevel-latent class analysis, four different types of assessors could be distinguished: narrowly focused, broadly focused, source-focused, and language-focused. However, the analysis also showed that all assessor types mainly focused on argumentation and organization, and that assessor types only partly explained whether the aspect of text quality was mentioned in a decision statement. We conclude that comparative judgment is a strong method for comparing complex constructs like text quality. First, because the rank order combines different views on text quality, but foremost because the method of comparative judgment minimizes differences between assessors.
KW - assessor cognition
KW - comparative judgment
KW - latent class analysis
KW - validity
KW - writing assessment
UR - http://www.scopus.com/inward/record.url?scp=85131319322&partnerID=8YFLogxK
U2 - 10.3389/feduc.2022.823895
DO - 10.3389/feduc.2022.823895
M3 - Article
SN - 2504-284X
VL - 7
JO - Frontiers in Education
JF - Frontiers in Education
M1 - 823895
ER -