1. Stevens, S. S. (1946). On the theory of scales of measurement. Science, 103, 677-680.
2. Messick, S. (1995). Validity of psychological assessment. American Psychologist, 50(9), 741-749.
3. Michell, J. (1990). An introduction to the logic of psychological measurement. Hillsdale, NJ: Lawrence Erlbaum Associates.
4. Wilson, M. (2005). Constructing measures: An item response modeling approach. Mahwah, NJ: Lawrence Erlbaum Associates.
5. Camilli, G., & Shepard, L. A. (1994). Methods for identifying biased test items. Thousand Oaks, CA: Sage Publications.
6. Haladyna, T. M. (1997). Writing test items to evaluate higher order thinking. Needham Heights, MA: Allyn and Bacon.
7. Embretson, S. E., & Yang, X. (2006). Item response theory. In J. Green, G. Camilli, & P. Elmore (Eds.), Complementary research methods for education (3rd ed., pp. 385-410). Washington, DC: American Educational Research Association.
8. American Psychological Association. (1953). Ethical standards for psychologists. Washington, DC: American Psychological Association, Inc.
9. Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52, 281-302.
10. Messick, S. (1989). Meaning and values in test validation: The science and ethics of assessment. Educational Researcher, 18, 5-11.