Galaxy Image Classification Based on Citizen Science Data: A Comparative Study

Manuel Jimenez; Mercedes Torres Torres; Robert John; Isaac Triguero

doi:10.1109/ACCESS.2020.2978804

IEEE Access (Jan 2020)

Galaxy Image Classification Based on Citizen Science Data: A Comparative Study

Manuel Jimenez,
Mercedes Torres Torres,
Robert John,
Isaac Triguero

Affiliations

Manuel Jimenez: ORCiD; Computational Optimisation and Learning Laboratory (COL), School of Computer Science, University of Nottingham, Nottingham, U.K.
Mercedes Torres Torres: ORCiD; Computer Vision Laboratory (CVL), School of Computer Science, University of Nottingham, Nottingham, U.K.
Robert John: Computational Optimisation and Learning Laboratory (COL), School of Computer Science, University of Nottingham, Nottingham, U.K.
Isaac Triguero: ORCiD; Computational Optimisation and Learning Laboratory (COL), School of Computer Science, University of Nottingham, Nottingham, U.K.

DOI: https://doi.org/10.1109/ACCESS.2020.2978804
Journal volume & issue: Vol. 8
pp. 47232 – 47246

Abstract

Read online

Many research fields are now faced with huge volumes of data automatically generated by specialised equipment. Astronomy is a discipline that deals with large collections of images difficult to handle by experts alone. As a consequence, astronomers have been relying on the power of the crowds, as a form of citizen science, for the classification of galaxy images by amateur people. However, the new generation of telescopes that will produce images at a higher rate highlights the limitations of this approach, and the use of machine learning methods for automatic classification is considered essential. The goal of this paper is to shed light on the automated classification of galaxy images exploring two distinct machine learning strategies. First, following the classical approach consisting of feature extraction together with a classifier, we compare the state-of-the-art feature extractor for this problem, the WND-CHARM, with our proposal based on autoencoders for feature extraction on galaxy images. We then compare these results with an end-to-end classification using convolutional neural networks. To better leverage the available citizen science data, we also investigate a pre-training scheme that exploits both amateur- and expert-labelled data. Our experiments reveal that autoencoders greatly speed up feature extraction in comparison with WND-CHARM and both classification strategies, either using convolutional neural networks or feature extraction, reach comparable accuracy. The use of pre-training in convolutional neural networks, however, has allowed us to provide even better results.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords