Scientific Data (Sep 2024)
A dataset of manually annotated filaments from H-alpha observations
Abstract
We present the Manually Annotated GONG Filaments in H-alpha Observations (MAGFiLO v1.0) dataset. This dataset contains 10,244 annotated filaments from 1,593 observations captured by the Global Oscillation Network Group (GONG), spanning the years 2011 through 2022. Each annotation details one filament’s segmentation, minimum bounding box, spine, and magnetic field chirality. With over one thousand person-hours of annotation and a double-blind review process, we ensured high-quality ground-truth data. Our inter-annotator agreement reaches a Kappa score of 0.66. We also verified that the hemispheric preference of filaments as annotated in MAGFiLO aligns with the findings from similar datasets of much smaller sample sizes. MAGFiLO is the first dataset of its size, enabling advanced deep learning models to identify filaments and their features with unprecedented precision. It also provides a testbed for solar physicists interested in large-scale analysis of filaments. In this report, we document the details of the annotation and post-processing phases.