International application of PROMIS computerized adaptive tests: US versus country-specific item parameters can be consequential for individual patient scores

Caroline B. Terwee*, Martine H.P. Crins, Leo D. Roorda, Karon F. Cook, David Cella, Niels Smits, Benjamin D. Schalet

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Objective: PROMIS offers computerized adaptive tests (CAT) of patient-reported outcomes, using a single set of US-based IRT item parameters across populations and language-versions. The use of country-specific item parameters has local appeal, but also disadvantages. We illustrate the effects of choosing US or country-specific item parameters on PROMIS CAT T-scores. Study design and setting: Simulations were performed on response data from Dutch chronic pain patients (n = 1110) who completed the PROMIS Pain Behavior item bank. We compared CAT T-scores obtained with (1) US parameters; (2) Dutch item parameters; (3) US item parameters for DIF-free items and Dutch item parameters (rescaled to the US metric) for DIF items; (4) Dutch item parameters for all items (rescaled to the US metric). Results: Without anchoring to a common metric, CAT T-scores cannot be compared. When scores were rescaled to the US metric, mean differences in CAT T-scores based on US vs. Dutch item parameters were negligible. However, 0.9%–4.3% of the T-score differences were larger than 5 points (0.5 SD). Conclusion: The choice of item parameters can be consequential for individual patient scores. We recommend more studies of translated CATs to examine if strategies that allow for country-specific item parameters should be further investigated.

Original languageEnglish
Pages (from-to)1-13
Number of pages13
JournalJournal of Clinical Epidemiology
Volume134
DOIs
Publication statusPublished - Jun 2021

Cite this