The Astrophysical Journal Letters (Jan 2024)

CEERS Key Paper. IX. Identifying Galaxy Mergers in CEERS NIRCam Images Using Random Forests and Convolutional Neural Networks

  • Caitlin Rose,
  • Jeyhan S. Kartaltepe,
  • Gregory F. Snyder,
  • Marc Huertas-Company,
  • L. Y. Aaron Yung,
  • Pablo Arrabal Haro,
  • Micaela B. Bagley,
  • Laura Bisigello,
  • Antonello Calabrò,
  • Nikko J. Cleri,
  • Mark Dickinson,
  • Henry C. Ferguson,
  • Steven L. Finkelstein,
  • Adriano Fontana,
  • Andrea Grazian,
  • Norman A. Grogin,
  • Benne W. Holwerda,
  • Kartheik G. Iyer,
  • Lisa J. Kewley,
  • Allison Kirkpatrick,
  • Dale D. Kocevski,
  • Anton M. Koekemoer,
  • Jennifer M. Lotz,
  • Ray A. Lucas,
  • Lorenzo Napolitano,
  • Casey Papovich,
  • Laura Pentericci,
  • Pablo G. Pérez-González,
  • Nor Pirzkal,
  • Swara Ravindranath,
  • Rachel S. Somerville,
  • Amber N. Straughn,
  • Jonathan R. Trump,
  • Stephen M. Wilkins,
  • Guang Yang

DOI
https://doi.org/10.3847/2041-8213/ad8dd4
Journal volume & issue
Vol. 976, no. 1
p. L8

Abstract

Read online

A crucial yet challenging task in galaxy evolution studies is the identification of distant merging galaxies, a task that suffers from a variety of issues ranging from telescope sensitivities and limitations to the inherently chaotic morphologies of young galaxies. In this paper, we use random forests and convolutional neural networks to identify high-redshift JWST Cosmic Evolution Early Release Science Survey (CEERS) galaxy mergers. We train these algorithms on simulated 3 < z < 5 CEERS galaxies created from the IllustrisTNG subhalo morphologies and the Santa Cruz SAM light cone. We apply our models to observed CEERS galaxies at 3 < z < 5. We find that our models correctly classify ∼60%–70% of simulated merging and nonmerging galaxies; better performance on the merger class comes at the expense of misclassifying more nonmergers. We could achieve more accurate classifications, as well as test for a dependency on physical parameters such as gas fraction, mass ratio, and relative orbits, by curating larger training sets. When applied to real CEERS galaxies using visual classifications as ground truth, the random forests correctly classified 40%–60% of mergers and nonmergers at 3 < z < 4 but tended to classify most objects as nonmergers at 4 < z < 5 (misclassifying ∼70% of visually classified mergers). On the other hand, the CNNs tended to classify most objects as mergers across all redshifts (misclassifying 80%–90% of visually classified nonmergers). We investigate what features the models find most useful, as well as the characteristics of false positives and false negatives, and also calculate merger rates derived from the identifications made by the models.

Keywords