On the Impossibility of Learning the Missing Mass

Elchanan Mossel; Mesrob  I. Ohannessian

doi:10.3390/e21010028

Entropy (Jan 2019)

On the Impossibility of Learning the Missing Mass

Elchanan Mossel,
Mesrob I. Ohannessian

Affiliations

Elchanan Mossel: Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
Mesrob I. Ohannessian: Toyota Technological Institute at Chicago, Chicago, IL 60637, USA

DOI: https://doi.org/10.3390/e21010028
Journal volume & issue: Vol. 21, no. 1
p. 28

Abstract

Read online

This paper shows that one cannot learn the probability of rare events without imposing further structural assumptions. The event of interest is that of obtaining an outcome outside the coverage of an i.i.d. sample from a discrete distribution. The probability of this event is referred to as the “missing mass”. The impossibility result can then be stated as: the missing mass is not distribution-free learnable in relative error. The proof is semi-constructive and relies on a coupling argument using a dithered geometric distribution. Via a reduction, this impossibility also extends to both discrete and continuous tail estimation. These results formalize the folklore that in order to predict rare events without restrictive modeling, one necessarily needs distributions with “heavy tails”.

Published in Entropy

ISSN: 1099-4300 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Astronomy: Astrophysics; Science: Physics
Website: http://www.mdpi.com/journal/entropy

About the journal

Abstract

Keywords