Comparison of neural network architectures for feature extraction from binary black hole merger waveforms

Osvaldo Gramaxo Freitas; Juan Calderón Bustillo; José A Font; Solange Nunes; Antonio Onofre; Alejandro Torres-Forné

doi:10.1088/2632-2153/ad2972

Machine Learning: Science and Technology (Jan 2024)

Affiliations

Osvaldo Gramaxo Freitas: Centro de Física das Universidades do Minho e do Porto (CF-UM-UP), Universidade do Minho, 4710-057 Braga, Portugal; Departamento de Astronomía y Astrofísica, Universitat de València, Dr Moliner 50, 46100 Burjassot (València), Spain
Juan Calderón Bustillo: Instituto Galego de Física de Altas Enerxías, Universidade de Santiago de Compostela, 15782 Santiago de Compostela, Galicia, Spain
José A Font: Departamento de Astronomía y Astrofísica, Universitat de València, Dr Moliner 50, 46100 Burjassot (València), Spain; Observatori Astronòmic, Universitat de València, Catedrático José Beltrán 2, 46980 Paterna (València), Spain
Solange Nunes: Centro de Física das Universidades do Minho e do Porto (CF-UM-UP), Universidade do Minho, 4710-057 Braga, Portugal
Antonio Onofre: Centro de Física das Universidades do Minho e do Porto (CF-UM-UP), Universidade do Minho, 4710-057 Braga, Portugal
Alejandro Torres-Forné: Departamento de Astronomía y Astrofísica, Universitat de València, Dr Moliner 50, 46100 Burjassot (València), Spain; Observatori Astronòmic, Universitat de València, Catedrático José Beltrán 2, 46980 Paterna (València), Spain

DOI: https://doi.org/10.1088/2632-2153/ad2972
Journal volume & issue: Vol. 5, no. 1, p. 015036

Abstract

We evaluate several neural-network architectures, both convolutional and recurrent, for gravitational-wave time-series feature extraction by performing point parameter estimation on noisy waveforms from binary-black-hole mergers. We build datasets of 100 000 elements for each of four different waveform models (or approximants) in order to test how approximant choice affects feature extraction. Our choices include SEOBNRv4P and IMRPhenomPv3, which contain only the dominant quadrupole emission mode, alongside IMRPhenomPv3HM and NRHybSur3dq8, which also account for high-order modes. Each dataset element is injected into detector noise corresponding to the third observing run of the LIGO-Virgo-KAGRA (LVK) collaboration. We identify the temporal convolutional network architecture as the overall best performer in terms of training and validation losses and absence of overfitting to data. Comparison of results between datasets shows that the choice of waveform approximant used to create a dataset conditions the feature-extraction ability of a trained network. Hence, care should be taken when building a dataset for the training of neural networks, as certain approximants may result in better network convergence of evaluation metrics. However, this performance does not necessarily translate to data that is more faithful to numerical relativity simulations. We also apply this network to actual signals from LVK runs, finding that its feature-extracting performance can be effective on real data.
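
The abstract names the temporal convolutional network as the best performer but does not describe its layers. As a generic illustration of the building block usually associated with that architecture (a causal, dilated 1D convolution), the following minimal NumPy sketch may help. It is not the authors' implementation: the function names, kernel values, and dilation schedule are hypothetical choices made only for this example.

```python
import numpy as np

def causal_dilated_conv1d(x, kernel, dilation):
    """Causal dilated convolution: y[t] = sum_i kernel[i] * x[t - i*dilation].

    Samples before the start of the series are treated as zero, so the
    output at time t never depends on inputs later than t.
    """
    x = np.asarray(x, dtype=float)
    pad = (len(kernel) - 1) * dilation
    xp = np.concatenate([np.zeros(pad), x])  # zero-pad on the left only
    y = np.zeros(len(x))
    for i, w in enumerate(kernel):
        # Tap i looks i*dilation samples into the past.
        start = pad - i * dilation
        y += w * xp[start:start + len(x)]
    return y

def receptive_field(kernel_size, dilations):
    """Number of input samples seen by one output of a stack of such layers."""
    return 1 + (kernel_size - 1) * sum(dilations)

# Illustrative values only (assumed, not taken from the paper).
impulse = np.zeros(8)
impulse[0] = 1.0
response = causal_dilated_conv1d(impulse, kernel=[1.0, 2.0], dilation=2)
span = receptive_field(kernel_size=2, dilations=[1, 2, 4])
```

With the impulse input, the response shows each kernel tap spaced `dilation` samples apart, and stacking layers with growing dilations widens the receptive field without adding kernel weights, which is one reason this kind of layer suits long time series.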

Published in Machine Learning: Science and Technology

ISSN: 2632-2153 (Online)
Publisher: IOP Publishing
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://iopscience.iop.org/journal/2632-2153