Evidence of Content Validity of the Enade Psychology Assessment

Authors

  • Girlene Ribeiro de Jesus Universidade de Brasília (UnB) e Centro Brasileiro de Pesquisa em Avaliação e Seleção e de Promoção de Eventos (Cebraspe), Brasília-DF, Brasil
  • Renata Manuelly de Lima Rêgo Universidade de Brasília (UnB) e Centro Brasileiro de Pesquisa em Avaliação e Seleção e de Promoção de Eventos (Cebraspe), Brasília-DF, Brasil
  • Victor Vasconcelos de Souza Universidade de Brasília (UnB) e Centro Brasileiro de Pesquisa em Avaliação e Seleção e de Promoção de Eventos (Cebraspe), Brasília-DF, Brasil

DOI:

https://doi.org/10.18222/eae.v29i72.4897

Keywords:

Validity, National Exam of Student Proficiency (Enade), Blueprint, Cognitive Demand

Abstract

This study aims to analyze evidence of validity based on the content of the psychology test applied in the Exame Nacional de Desempenho dos Estudantes (Enade) [National Exam of Student Proficiency] of 2015. We used Blueprint, a tool recommended in the international literature for test design. There were significant discrepancies between the skills and abilities presented in the test specifications, and in the cognitive demand demanded in almost all multiple-choice questions. Furthermore, there are skills and abilities presented in test specifications that were not in the test, and other skills and abilities that were represented by only one item, which compromise measurement reliability. We conclude that the evidence based on the content for this test shows the necessity to use tools to improve this primary source of evidence.

 

Downloads

Download data is not yet available.

Author Biographies

Girlene Ribeiro de Jesus, Universidade de Brasília (UnB) e Centro Brasileiro de Pesquisa em Avaliação e Seleção e de Promoção de Eventos (Cebraspe), Brasília-DF, Brasil

Girlene Ribeiro de Jesus é professora da Faculdade de Educação e do Programa de Pós-Graduação em Educação da Universidade de Brasília e coordenadora de Pesquisa no Cebraspe.

Renata Manuelly de Lima Rêgo, Universidade de Brasília (UnB) e Centro Brasileiro de Pesquisa em Avaliação e Seleção e de Promoção de Eventos (Cebraspe), Brasília-DF, Brasil

Renata Manuelly de Lima Rego é doutoranda do Programa de Pós-Graduação em Psicologia Social, do Trabalho e das Organizações na Universidade de Brasília e atua como pesquisadora no Cebraspe.

Victor Vasconcelos de Souza, Universidade de Brasília (UnB) e Centro Brasileiro de Pesquisa em Avaliação e Seleção e de Promoção de Eventos (Cebraspe), Brasília-DF, Brasil

Victor Vasconcelos de Souza é mestrando do Programa de Pós-Graduação em Psicologia Social, do Trabalho e das Organizações na Universidade de Brasília e atua como analista de resultados no Cebraspe.

References

ALDERMAN, J. Test development process at ETS. Princeton: ETS Global Institute Course, 2015.

AMERICAN EDUCATIONAL RESEARCH ASSOCIATION; AMERICAN PSYCHOLOGICAL ASSOCIATION; NATIONAL COUNCIL ON MEASUREMENT IN EDUCATION. Standards for educational and psychological testing. Washington, DC: APA, 2014. AMERICAN PSYCHOLOGICAL ASSOCIATION. Standards for educational and psychological tests and manuals. Washington, DC: APA, 1966.

ANDERSON, L. W. et al. (Ed.). A taxonomy for learning, teaching, and assessing: a revision of Bloom’s Taxonomy of educational objectives. 2. ed. New York: Longman, 2001.

BORSBOOM, D.; MELLENBERGH, G. J.; VAN HEERDEN, J. The concept of validity. Psychological Review, Washington, v. 111, n. 4, p. 1061-1071, nov. 2004.

BRASIL. Instituto Nacional de Estudos e Pesquisas Educacionais Anísio Teixeira. Exame nacional do desempenho dos estudantes: psicologia. Brasília, DF: Inep, 2015a. Disponível em: <http://download.inep.gov.br/educacao_superior/enade/provas/2015/09_psicologia.pdf>. Acesso em: 8 ago. 2017.

BRASIL. Portaria Inep n. 243, de 10 de junho de 2015. Estabelece as diretrizes da área de psicologia. Diário Oficial da União, Brasília, DF, 12 jun. 2015b. Seção 1, p. 27.

CHAPELLE, C. A.; ENRIGHT, M. K.; JAMIESON, J. M. Building a validity argument for the test of English as a foreign language. New York: Routledge, 2011.

CIZEK, G. J. Validating test score meaning and defending test score use: different aims, different methods. Assessment in Education: Principles, Policy & Practice, London, v. 23, n. 2, p. 212-225, Aug. 2016.

CIZEK, G. J.; WEBB, L. C.; KALOHN, J. C. The use of cognitive taxonomies in licensure and certification test development: reasonable or customary? Evaluation & The Health Professions, Thousand Oaks, v. 18, n. 1, p. 77-91, Mar. 1995.

COLLEGE BOARD. Test specifications for the redesigned SAT. New York: College Board, 2015.

CRONBACH, L. J.; MEEHL, P. E. Construct validity in psychological tests. Psychological Bulletin, Washington, DC, v. 52, p. 281-302, July 1955.

DEVILLE, C. W. An empirical link of content and construct validity evidence. Applied Psychological Measurement, Thousand Oaks, v. 20, n. 2, p. 127-139, June 1996.

DEVILLE, C. W.; PROMETRIC, S. An empirical link of content and construct validity evidence. Applied Psychological Measurement, Thousand Oaks, v. 20, n. 2, p. 127-139, 1996.

DOWNING, S. M. Twelve steps for effective test development. In: DOWNING, S. M.; HALADYNA, T. M. (Org.). Handbook of test development. New Jersey: Lawrence Erlbaum Associates, 2006. p. 3-25.

DOWNING, S. M.; HALADYNA, T. M. Test item development: validity evidence from quality assurance procedures. Applied Measurement in Education, Oxford, v. 10, n. 1, p. 61-82, Dec. 2009.

EDUCATIONAL TESTING SERVICE. ETS standards for quality and fairness. Princeton: ETS, 2014. FERRAZ, A. P. C. M.; BELHOT, R. V. Taxonomia de Bloom: revisão teórica e apresentação das adequações do instrumento para definição de objetivos instrucionais. Gestão & Produção, São Carlos, v. 17, n. 2, p. 421-431, 2010.

HALADYNA, T. M. Developing and validating multiple-choice test items. 3. ed. New Jersey: Lawrence Erlbaum Associates, 2004.

HALADYNA, T. M.; RODRIGUEZ, M. C. Developing and validating test items. New York: Taylor & Francis Group, 2013.

HOGAN, T. P. Introdução à prática de testes psicológicos. São Paulo: LTC, 2006.

HUDDLESTON, E. M. Test development on the basis of content validity. Educational and Psychological Measurement, Thousand Oaks, v. 16, n. 3, p. 283-293, Oct. 1956.

HUTZ, C. S. Avanços e polêmicas em avaliação psicológica. Itatiba: Casa do Psicólogo, 2009.

HUTZ, C. S.; BANDEIRA, D. R; TRENTINI, C. M. Psicometria. Porto Alegre: Artmed, 2015.

JENSEN, J. L. et al. Teaching to the test… or testing to teach: exams requiring higher order thinking skills encourage greater conceptual understanding. Educational Psychology Review, New York, v. 26, n. 2, p. 307-329, Jan. 2014.

KANE, M. Content-related validity evidence in test development. In: DOWNING, S. M.; HALADYNA, T. M. (Org.). Handbook of test development. New Jersey: Lawrence Erlbaum Associates, 2006. p. 131-153.

KANE, M. Validating the interpretations and uses of test scores. Journal of Educational Measurement, New Jersey, v. 50, n. 1, p. 1-73, mar. 2013.

KELLEY, T. L. Interpretations of educational measurement. Yonkers-on-Hudson: World Book, 1927.

LAWSHE, C. H. A quantitative approach to content validity. Personnel Psychology, Hoboken, v. 28, n. 4, p. 563-575, dez. 1975.

LEUNG, S. F.; MOK, E.; WONG, D. The impact of assessment methods on the learning of nursing students. Nurse Education Today, v. 28, n. 6, p. 711-719, Aug. 2008.

LI, X.; SIRECI, S. G. A new method for analyzing content validity data using multidimensional scaling. Educational and Psychological Measurement, Thousand Oaks, v. 73, n. 3, p. 365-385, Jan. 2013.

LINN, R. L. The standards for educational and psychological testing: guidance in test development. In: DOWNING, S. M.; HALADYNA, T. M. (Org.). Handbook of test development. New Jersey: Lawrence Erlbaum Associates, 2006. p. 27-38.

LYNN, M. R. Determination and quantification of content validity. Nursing Research, London, v. 35, n. 6, p. 382-385, Nov./Dec. 1986.

MARZANO, R. J.; KENDALL, J. S. The new taxonomy of educational objectives. 2. ed. Thousand Oaks: Corwin, 2007. MESSICK, S. Validity. In: LINN, R. L. (Ed.). Educational measurement. Washington, DC: American Council on Education; National Council on Measurement in Education, 1989. p. 13-103.

MISLEVY, R. J. Validity by design. Educational Reseacher, Washington, v. 36, n. 8, p. 463-469, Nov. 2007.

MISLEVY, R. J.; ALMOND, R. G.; LUKAS, J. F. A brief introduction to Evidence-Centered Design. ETS Research Report Series, Princeton, v. 03-16, n. 1, p. i-29, July 2003.

MOMSEN, J. L. et al. Just the facts? Introductory undergraduate biology courses focus on low-level cognitive skills. CBE-Life Sciences Education, Bethesda, v. 9, n. 4, p. 435-440, Dec. 2010.

NASCIMENTO, T. G.; SOUZA, E. C. L. Escala trifatorial da identidade social (ETIS): evidências de sua adequação psicométrica. Psico-USF, Bragança Paulista, v. 22, n. 2, p. 217-234, May/Aug. 2017.

NATIONAL ASSESSMENT OF EDUCATIONAL PROGRESS. Reading Framework for the 2015 National Assessment of Educational Progress. Washington: U.S. Government Printing Office, Jan. 2015.

NEWTON, P. E. Macro- and micro-validation: Beyond the “five sources” framework for classifying validation evidence and analysis. Practical Assessment, Research & Evaluation, College Park, v. 21, n. 12, p. 1-13, Dec. 2016.

PARTNERSHIP FOR ASSESSMENT OF READINESS FOR COLLEGE AND CAREERS. PARCC Grades 6-11 High Level Blueprints. EUA: 2017. Disponível em:<http://www.parcconline.org/files/83/Spring%202016/388/Grades%206-11%20High%20Level%20Blueprint%20(Updated).pdf>. Acesso em: 26 jun. 2017.

PASQUALI, L. Instrumentos psicológicos: manual prático de elaboração. Brasília, DF: LabPAM/IBAPP, 1999.

PASQUALI, L. Psicometria. Revista da Escola de Enfermagem da USP, São Paulo, v. 43, p. 992-999, dez. 2009. Edição especial.

PASQUALI, L. Instrumentação psicológica: fundamentos e prática. Porto Alegre: Artmed, 2010.

PASQUALI, L. Validade dos testes. Examen: Pesquisa em Avaliação, Certificação e Seleção, Brasília, DF, v. 1, n. 1, p. 14-48, jul./dez. 2017.

POLIT, D. F.; BECK, C. T.; OWEN, S. T. Is the CVI an acceptable indicator of content validity? Appraisal and recommendations. Research in Nursing & Health, Thousand Oaks, v. 30, n. 4, p. 459-567, Aug. 2007.

RULON, P. J. On the validity of educational tests. Harvard Educational Review, Washington, DC, v. 16, p. 290-296, 1946.

RUTKOWSKI, L.; VON DAVIER, M.; RUTKOWSKI, D. Handbook of international large-scale assessment: background, technical issues, and methods of data analysis. Boca Raton: CRC/Taylor & Francis Group, 2014. SCULLY, D. Constructing multiple-choice items to measure higher-order thinking. Practical Assessment, Research & Evaluation, College Park, v. 22, n. 4, p. 1-13, May 2017. Disponível em: <http://pareonline.net/getvn.asp?v=22&n=4>. Acesso em: 27 ago. 2017.

SIRECI, S. G. The construct of content validity. Social Indicators Research, New York, v. 45, n. 1-3, p. 83-117, Nov. 1998.

SIRECI, S. G. Agreeing on validity arguments. Journal of Educational Measurement, New Jersey, v. 50, n. 1, p. 99-104, Mar. 2013.

SIRECI, S. G.; GEISINGER, K. F. Analyzing test content using cluster analysis and multidimensional scaling. Applied Psychological Measurement, Thousand Oaks, v. 16, n. 1, p. 17-31, Mar. 1992.

ZIEKY, M. J. An introduction to the use of Evidence-Centered Design in test development. Psicología Educativa, Madrid, v. 20, n. 2, p. 79-87, dic. 2014.

ZUMBO, B. D. What role does, and should, the test standards play outside of the United States of America? Educational Measurement: Issues and Practice, Philadelphia, v. 33, n. 4, p. 31-33, Dec. 2014.

Published

2021-05-21

How to Cite

Jesus, G. R. de, Rêgo, R. M. de L., & Souza, V. V. de. (2021). Evidence of Content Validity of the Enade Psychology Assessment. Estudos Em Avaliação Educacional, 29(72), 858–884. https://doi.org/10.18222/eae.v29i72.4897

Most read articles by the same author(s)