Brown, M. L., Hart, K. M., & Küchemann, D. (1984). Chelsea diagnostic mathematics
tests: algebra. Windsor, UK: NFER-Nelson.
Cronbach, L. J., Gleser, G. C., Nanda, H., & Rajaratnam, N. (1972). The dependability
of behavioural measurements: theory of generalizability for scores and profile
reporting. New York, NY: Wiley.
Linn, R. L. (1994). Assessment-based reform: challenges to educational measurement.
Paper presented at Angoff Memorial Lecture. Princeton, NJ: Educational Testing
Service.
Linn, R. L., & Baker, E. L. (1996). Can performance-based student assessment be
psychometrically sound? In J. B. Baron & D. P. Wolf (Eds.), Performance-based
assessment—challenges and possibilities: 95th yearbook of the National Society for the
Study of Education part 1 (Vol. 95(1), pp. 84-103). Chicago, IL: National Society for
the Study of Education.
Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading,
MA: Addison-Wesley.
Shavelson, R. J., Ruiz-Primo, M. A., & Wiley, E. W. (1999). Note on sources of
sampling variability in science performance assessments. Journal of Educational
Measurement, 36(1), 61-71.
Toulmin, S. (2001). Return to reason. Cambridge, MA: Harvard University Press.
Wiliam, D. (1993). Technical issues in the development and implementation of a system
of criterion-referenced age-independent levels of attainment in the National
Curriculum of England and Wales. Unpublished PhD thesis, King’s College, University
of London.
Wiliam, D. (2000, November). Integrating summative and formative functions of
assessment. Paper presented at First annual conference of the Association for
Educational Assessment-Europe held at Prague, Czech Republic. London, UK: King’s
College London School of Education.
Wiliam, D. (2000b). The meanings and consequences of educational assessments.
Critical Quarterly, 42(1), 105-127.
Wood, R. (1991). Assessment and testing: a survey of research. Cambridge, UK: Cambridge
University Press.