David Thissen
Publications on Testing and Measurement
Thissen, D. & Steinberg, L. (in press). Item response theory. In R. Millsap & A. Maydeu-Olivares, Handbook of quantitative methods in psychology. London: Sage Publications.
Langer, M.M., Hill, C.D., Thissen, D., Burwinkle, T.M., Varni, J.W., & DeWalt, D.A. (in press). Item response theory detects differential item functioning between healthy and ill children in QoL measures. Journal of Clinical Epidemiology.
Thissen, D., Cai, L., & Bock, R.D. (in press). The nominal item response model. In M. Nering & R. Ostini (Eds.), Handbook of polytomous item response theory models: Developments and applications.
Thissen, D. & Steinberg, L. (in press). Using item response theory to disentangle constructs at different levels of generality. In S. Embretson & J. Roberts (Eds.), New directions in psychological measurement with model-based approaches.
Bjorner, J.B., & Chang, C.-H., Thissen, D., Reeve, B.B. (2007). Developing tailored instruments: item banking and computerized adaptive assessment. Quality of Life Research, 16, 95-108.
Thissen, D., Reeve, B.B., Bjorner, J.B., & Chang, C.-H. (2007). Methodological issues for building item banks and computerized adaptive scales. Quality of Life Research, 16, 109-116.
Thissen, D. (2007). Linking assessments based on aggregate reporting: Background and issues. In N.J. Dorans, M. Pommerich, & P.W. Holland (Eds.) Linking and aligning scores and scales (Pp. 287-312). New York, NY: Springer.
Hill, C.D., Edwards, M.C., Thissen, D., Langer, M.M., Wirth, R.J., Burwinkle, T.M., & Varni, J.W. (2007). Practical issues in the application of item response theory: A demonstration using items from the Pediatric Quality of Life Inventoryª (PedsQLª) 4.0 Generic Core Scales. Medical Care, 45, S39-47.
Reeve, B.B., Hays, R.D, Bjorner, J.B., Cook K.F., Crane, P.K., Teresi, J.A., Thissen, D., Revicki, D.A., Weiss, D.J., Hambleton, R.K., Liu, H., Gershon, R., Reise, S.P., & Cella, D (2007). Psychometric evaluation and calibration of health-related quality of life items banks: Plans for the patient-reported outcome measurement information system (PROMIS). Medical Care, 45, S22-31.
Jones, L.V. & Thissen, D. (2007). A history and overview of psychometrics. In C.R. Rao and S. Sinharay, Handbook of Statistics, 26: Psychometrics (Pp. 1-27) Amsterdam: North Holland.
Edelen, M.O., Thissen, D., Teresi, J.A., Kleinman, M., & Ocepek-Welikson, K. (2006). Identification of differential item functioning using item response theory and the likelihood-based model comparison approach: application to the Mini-Mental Status Examination. Medical Care, 44, S134-142.
Steinberg, L., & Thissen, D. (2006) Using Effect Sizes for Research Reporting: Examples using Item Response Theory to Analyze Differential Item Functioning. Psychological Methods, 11, 402-415.
Cai, L., Maydeu-Olivares, A., Coffman, D.L., & Thissen, D. (2006). Limited information goodness-of-fit testing of item response theory models for sparse 2p tables. British Journal of Mathematical and Statistical Psychology, 59, 173-194.
Woods, C.M. & Thissen, D. (2006). Item response theory with estimation of the latent population distribution using spline-based densities. Psychometrika, 71, 281-301.
Bethke, A., Hill, C., McLeod, L., VanDyk, P., Zhao, L., Zhou, X., & Thissen, D. (2004). North Carolina Computerized Adaptive Testing System: 2003 comparability study results. Research Triangle Park, NC: RTI International.
Rodebaugh, T.L., Woods, C.M., Thissen, D., Heimberg, R.G., Chambless, D.L., & Rapee, R.M. (2004). More information from fewer questions: The factor structure and item properties of the original and brief fear of negative evaluation scale. Psychological Assessment, 16, 169-181.
Orlando, M. & Thissen, D. (2003). Further invesigation of the performance of S-X2: An item fit index for use with dichotomous item response theory models. Applied Psychological Measurement, 27, 289-298
McLeod, L., Lewis, C, & Thissen, D. (2003). A Bayesian method for the detection of item preknowledge in computerized adaptive testing. Applied Psychological Measurement, 27, 121-137.
Thissen, D. (2003). Psychometric engineering as art: Variations on a theme. In H. Yanai, A. Okada, and K. Shigemasu, Y. Kano, & J.J. Meulman (Eds), New developments in psychometrics: Proceedings of the International Meeting of the Psychometric Society IMPS 2001 (Pp. 3-18). Tokyo: Springer-Verlag.
Vevea, J.L., Edwards, M.C., Thissen, D., Reeve, B.B., Flora, D.B., Sathy, V., & Coon, C. (2002). User's guide for Augment v.2: Emperical Bayes Subscore Augmentation Software. Electronic Research Memorandum #2002-2. Chapel Hill, NC: University of North Carolina, L.L. Thurstone Psychometric Laboratory.
Flora, D.B., & Thissen, D. (2002). User's guide for IRTScore: Item response theory score approximation Software. Electronic Research Memorandum #2002-1. Chapel Hill, NC: University of North Carolina, L.L. Thurstone Psychometric Laboratory.
Thissen, D. (2001). Psychometric engineering as art. Psychometrika, 66, 473-486.
Thissen, D. & Wainer, H. (Eds) (2001) Test Scoring. Hillsdale, NJ: Lawrence Erlbaum Associates.
Thissen, D. & Wainer, H. (2001). Overview of Test Scoring. In D. Thissen & H. Wainer (Eds), Test Scoring (Pp. 1-19). Hillsdale, NJ: Lawrence Erlbaum Associates.
Wainer, H. & Thissen, D (2001). True score theory: The traditional method. In D. Thissen & H. Wainer (Eds), Test Scoring (Pp. 23-72). Hillsdale, NJ: Lawrence Erlbaum Associates.
Thissen, D., & Orlando, M. (2001). Item response theory for items scored in two categories. In D. Thissen & H. Wainer (Eds), Test Scoring (Pp. 73-140). Hillsdale, NJ: Lawrence Erlbaum Associates.
Thissen, D., Nelson, L., Rosa, K., & McLeod, L.D. (2001). Item response theory for items scored in more than two categories. In D. Thissen & H. Wainer (Eds), Test Scoring (Pp. 141-186). Hillsdale, NJ: Lawrence Erlbaum Associates.
McLeod, L.D., Swygert, K.A., & Thissen, D (2001). Factor analysis for items scored in two categories. In D. Thissen & H. Wainer (Eds), Test Scoring (Pp. 189-216). Hillsdale, NJ: Lawrence Erlbaum Associates.
Swygert, K.A., McLeod, L.D., & Thissen, D (2001). Factor analysis for items scored in more than two categories. In D. Thissen & H. Wainer (Eds), Test Scoring (Pp. 217-250). Hillsdale, NJ: Lawrence Erlbaum Associates.
Rosa, K., Swygert, K.A., Nelson, L., & Thissen, D. (2001). Item response theory applied to combinations of multiple-choice and constructed-response itemsÑscale scores for patterns of summed scores. In D. Thissen & H. Wainer (Eds), Test Scoring (Pp. 253-292). Hillsdale, NJ: Lawrence Erlbaum Associates.
Thissen, D., Nelson, L., & Swygert, K.A. (2001). Item response theory applied to combinations of multiple-choice and constructed-response itemsÑapproximation methods for scale scores. In D. Thissen & H. Wainer (Eds), Test Scoring (Pp. 293-341). Hillsdale, NJ: Lawrence Erlbaum Associates.
Wainer, H., Vevea, J.L., Camacho, F., Reeve, B.B., Rosa, K., Nelson, L., Swygert, K.A., & Thissen, D. (2001). Augmented scores---"borrowing strength" to compute scores based on small numbers of items. In D. Thissen & H. Wainer (Eds), Test Scoring (Pp. 343-387). Hillsdale, NJ: Lawrence Erlbaum Associates.
Orlando, M., Sherbourne, C.D., & Thissen, D. (2000). Summed-score linking using item response theory: Application to depression measurement. Psychological Assessment, 12, 354-359.
Thissen, D. & Mislevy, R.J. (2000). Testing algorithms. In H. Wainer, N. Dorans, D. Eignor, R. Flaugher, B. Green, R. Mislevy, L. Steinberg & D. Thissen (Eds.), Computerized adaptive testing: A primer (Second Edition). Hillsdale, NJ: Lawrence Erlbaum Associates, 101-133.
Thissen, D. (2000). Reliability and measurement precision. In H. Wainer, N. Dorans, D. Eignor, R. Flaugher, B. Green, R. Mislevy, L. Steinberg & D. Thissen (Eds.), Computerized adaptive testing: A primer (Second Edition). Hillsdale, NJ: Lawrence Erlbaum Associates, 159-184.
Steinberg, L., Thissen, D. & Wainer, H. (2000). Validity. In H. Wainer, N. Dorans, D. Eignor, R. Flaugher, B. Green, R. Mislevy, L. Steinberg & D. Thissen (Eds.), Computerized adaptive testing: A primer (Second Edition). Hillsdale, NJ: Lawrence Erlbaum Associates, 185-229.
Wainer, H., Dorans, N., Green, B., Mislevy, R.J., Steinberg, L. & Thissen, D. (2000). Future challenges. In H. Wainer, N. Dorans, D. Eignor, R. Flaugher, B. Green, R. Mislevy, L. Steinberg & D. Thissen (Eds.), Computerized adaptive testing: A primer (Second Edition). Hillsdale, NJ: Lawrence Erlbaum Associates, 231-270.
Orlando, M., & Thissen, D. (2000). New item fit indices for dichotomous item response theory models. Applied Psychological Measurement, 24, 50-64.
Yung, Y.F., McLeod, L.D., & Thissen, D. (1999). On the relationship between the higher-order factor model and the hierarchical factor model. Psychometrika, 64, 113-128.
Chen, W.H., & Thissen, D. (1999). Estimation of Item Parameters for The Three-Parameter Logistic Model Using The Marginal Likelihood of Summed Scores. British Journal of Mathematical and Statistical Psychology, 52, 19-37.
Thissen, D., Nelson, L., Billeaud, K., & McLeod, L. (1998). A brief introduction to item response theory for items scored in more than two categories. In Bourque, M.L. (Ed.), Proceedings of achievement levels workshop (Pp. 47-61). Washington, DC: National Assessment Governing Board.
Billeaud, K., Swygert, K., Nelson, L., & Thissen, D. (1998). Some ideas about item response theory applied to combinations of multiple-choice and constructed-response items---Scale scores for patterns of summed scores. In Bourque, M.L. (Ed.), Proceedings of achievement levels workshop (Pp. 65-76). Washington, DC: National Assessment Governing Board.
Williams, V.S.L., Billeaud, K., Davis, L.A., Thissen, D., & Sanford, E. (1998). Projecting to the NAEP scale: Results from the North Carolina End-of-Grade testing program. Journal of Educational Measurement, 35, 277-296..
Williams, V.S.L., Pommerich, M., & Thissen, D. (1998). A comparison of developmental scales based on Thurstone methods and item response theory. Journal of Educational Measurement, 35, 93-107.
Bock, R.D., Thissen, D., & Zimowski, M.F. (1997). IRT estimation of domain scores. Journal of Educational Measurement, 34, 197-211.
Chen, W.H. & Thissen, D. (1997). Local dependence indices for item pairs using item response theory. Journal of Educational and Behavioral Statistics, 22, 265-289.
Thissen, D. & Steinberg, L. (1997). A response model for multiple choice items. In W.J. van der Linden & Ronald K. Hambleton (Eds), Handbook of item response theory (Pp. 51-65). New York: Springer-Verlag.
Steinberg, L. & Thissen, D. (1996). Uses of item response theory and the testlet concept in the measurement of psychopathology, Psychological Methods, 1, 81-97.
Wainer, H. & Thissen, D. (1996). How is reliability related to the quality of test scores? What is the effect of local dependence on reliability? Educational Measurement: Issues and Practice, 15, 22-29.
Thissen, D., Pommerich, M., Billeaud, K., & Williams, V.S.L. (1995). Item response theory for scores on tests including polytomous items with ordered responses. Applied Psychological Measurement, 19, 39-49.
Wang, X.B., Wainer, H., & Thissen, D. (1995). On the viability of some untestable assumptions in equating exams that allow examinee choice. Applied Measurement in Education, 8, 211-225.
Steinberg, L. & Thissen, D. (1995). Item response theory in personality research. In P. Shrout & S. Fiske (Eds.), Personality research, methods & theory: A Festschrift honoring Donald W. Fiske. Hillsdale, NJ: Lawrence Erlbaum Associates, 161-181.
Wainer, H., Wang, X.B., & Thissen, D. (1994). How well can we compare scores on test forms that are constructed by examinees' choice? Journal of Educational Measurement, 31, 183-199.
Lukhele, R., Thissen, D., & Wainer, H. (1994). On the relative value of multiple-choice, constructed-response, and examinee-selected items on two achievement tests. Journal of Educational Measurement, 31, 234-250.
Thissen, D., Wainer, H., & Wang, X.B. (1994). Are tests comprising both multiple-choice and free-response items necessarily less unidimensional than multiple-choice tests? An analysis of two tests. Journal of Educational Measurement, 31, 113-123.
Wainer, H., & Thissen, D. (1994). On examinee choice in educational testing. Review of Educational Research, 64, 159-195.
Pommerich, M., Billeaud, K., Williams, V.S.L., & Thissen, D. (1993). User's guide for the North Carolina End of Grade Tests. Raleigh, NC: North Carolina Department of Public Instruction.
Wainer, H. & Thissen, D. (1993). Combining multiple-choice and constructed response test scores: Toward a Marxist theory of test construction. Applied Measurement in Education, 6, 103-118.
Thissen, D. (1993). Repealing rules that no longer apply to psychological measurement. In N. Frederiksen, R.J. Mislevy & I. Bejar (Eds.), Test theory for a new generation of tests. Hillsdale, NJ: Lawrence Erlbaum Associates, 79-97.
Thissen, D., Steinberg, L. & Wainer, H. (1993) Detection of differential item functioning using the parameters of item response models. In P.W. Holland & H. Wainer (Eds.), Differential item functioning. Hillsdale, NJ: Lawrence Erlbaum Associates, 67-113.
Sireci, S.G., Thissen, D. & Wainer, H. (1991). On the reliability of testlet-based tests. Journal of Educational Measurement, 28, 237-247.
Wainer, H., Sireci, S.G. & Thissen, D. (1991). DIFferential testlet functioning: Definitions and detection. Journal of Educational Measurement, 28, 197-219.
Thissen, D. & Wainer, H. (1990). Confidence envelopes for item response theory. Journal of Educational Statistics, 15, 113-128.
Thissen, D. & Mislevy, R.J. (1990). Testing algorithms. In H. Wainer, N. Dorans, R. Flaugher, B. Green, R. Mislevy, L. Steinberg & D. Thissen, Computerized adaptive testing: A primer. Hillsdale, NJ: Lawrence Erlbaum Associates, 103-135.
Thissen, D. (1990). Reliability and measurement precision. In H. Wainer, N. Dorans, R. Flaugher, B. Green, R. Mislevy, L. Steinberg & D. Thissen, Computerized adaptive testing: A primer. Hillsdale, NJ: Lawrence Erlbaum Associates, 161-186.
Steinberg, L., Thissen, D. & Wainer, H. (1990). Validity. In H. Wainer, N. Dorans, R. Flaugher, B. Green, R. Mislevy, L. Steinberg & D. Thissen, Computerized adaptive testing: A primer. Hillsdale, NJ: Lawrence Erlbaum Associates, 187-231.
Wainer, H., Dorans, N., Green, B., Mislevy, R.J., Steinberg, L. & Thissen, D. (1990). Future challenges. In H. Wainer, N. Dorans, R. Flaugher, B. Green, R. Mislevy, L. Steinberg & D. Thissen, Computerized adaptive testing: A primer. Hillsdale, NJ: Lawrence Erlbaum Associates, 233-272.
Thissen, D. & Mooney, J.A. (1989). Loglinear item response models, with applications to data from social surveys. Sociological Methodology 1989, 299-330.
Thissen, D., Steinberg, L. & Mooney, J.A. (1989). Trace lines for testlets: A use of multiple-categorical-response models. Journal of Educational Measurement 26, 247-260.
Thissen, D., Steinberg, L. & Fitzpatrick, A.R. (1989). Multiple choice models: The distractors are also part of the item. Journal of Educational Measurement, 26, 161176.
Thissen, D. & Steinberg, L. (1988). Data analysis using item response theory. Psychological Bulletin, 104, 385-395.
Thissen, D., Steinberg, L. & Wainer, H. (1988). Use of item response theory in the study of group differences in trace lines. In H. Wainer & H. Braun (Eds.), Test Validity. Hillsdale, NJ: Erlbaum, pp. 147-169.
Wainer, H. & Thissen, D. (1987). Estimating ability with the wrong model. Journal of Educational Statistics, 12, 339-368.
Thissen, D. & Steinberg, L. (1986). A taxonomy of item response models. Psychometrika, 51, 567-577.
Thissen, D. (1986). Measurement precision and "reliability": Some considerations of metrics and stopping rules in CAT. Proceedings of the 27th Annual Conference of the Military Testing Association. San Diego: NPRDC.
Thissen, D., Steinberg, L. & Gerrard, M. (1986). Beyond group mean differences: The concept of item bias. Psychological Bulletin, 99, 118-128.
Thissen, D. & Steinberg, L. (1984). A response model for multiple choice items. Psychometrika, 49, 501-519.
Thissen, D. & Wainer, H. (1983). Toward the measurement and prediction of victim proneness. Journal of Research in Crime and Delinquency, 20, 243-261.
Thissen, D. (1983). Timed testing: An approach using item response theory. In D. Weiss (Ed.), New Horizons in Testing: Latent Trait Test Theory and Computerized Adaptive Testing. N.Y.: Academic Press, pp. 179-203.
Thissen, D., Steinberg, L., Pyszczynski, T. & Greenberg, J. (1983). An item response theory for personality and attitude scales: Item analysis using restricted factor analysis. Applied Psychological Measurement, 7, 211-226.
Thissen, D. & Wainer, H. (1982). Some standard errors in item response theory. Psychometrika, 47, 397-412.
Thissen, D. (1982). Marginal maximum likelihood estimation for the one-parameter logistic model. Psychometrika, 47, 175-186.
Thissen, D. (1976). Information in wrong responses to the Raven Progressive Matrices. Journal of Educational Measurement, 13, 201-214.