Multiple imputation in big identifiable data for educational research: An example from the Brazilian education assessment system

Maria Eugénia Ferrão; Paula Prata; Maria Teresa Gonzaga Alves

doi:10.1590/s0104-40362020002802346

Ensaio (Jul 2020)

Multiple imputation in big identifiable data for educational research: An example from the Brazilian education assessment system

Maria Eugénia Ferrão,
Paula Prata,
Maria Teresa Gonzaga Alves

Affiliations

Maria Eugénia Ferrão: ORCiD; University of Beira Interior, Covilhã/Center for Mathematics Applied to Economic Forecasting and Decision Making, Lisboa, Portugal
Paula Prata: ORCiD; University of Beira Interior, Instituto de Telecomunicações, Covilhã, Portugal
Maria Teresa Gonzaga Alves: ORCiD; Federal University of Minas Gerais, Belo Horizonte, MG, Brazil

DOI: https://doi.org/10.1590/s0104-40362020002802346
Journal volume & issue: Vol. 28, no. 108
pp. 599 – 621

Abstract

Read online

Almost all quantitative studies in educational assessment, evaluation and educational research are based on incomplete data sets, which have been a problem for years without a single solution. The use of big identifiable data poses new challenges in dealing with missing values. In the first part of this paper, we present the state-of-art of the topic in the Brazilian education scientific literature, and how researchers have dealt with missing data since the turn of the century. Next, we use open access software to analyze real-world data, the 2017 Prova Brasil , for several federation units to document how the naïve assumption of missing completely at random may substantially affect statistical conclusions, researcher interpretations, and subsequent implications for policy and practice. We conclude with straightforward suggestions for any education researcher on applying R routines to conduct the hypotheses test of missing completely at random and, if the null hypothesis is rejected, then how to implement the multiple imputation, which appears to be one of the most appropriate methods for handling missing data.

Published in Ensaio

ISSN: 0104-4036 (Print); 1809-4465 (Online)
Publisher: Fundação CESGRANRIO
Country of publisher: Brazil
LCC subjects: Education: Education (General)
Website: http://www.scielo.br/scielo.php?script=sci_serial&pid=0104-4036&lng=en&nrm=iso

About the journal

Abstract

Keywords