International Journal of Infectious Diseases (Nov 2020)
Predicting the dominant influenza A serotype by quantifying mutation activities
Abstract
Objectives: Characterizing and predicting the evolutionary process of influenza, which remain challenging, are of importance in capturing the patterns of influenza activities and for developing prevention and control strategies. This study quantified genetic mutation activity and developed a statistical model to predict the dominant influenza A serotype with limited sequencing data. Data and methods: A total number of 8097 and 7090 HA sequences for A/H1N1 and A/H3N2 were collected from the 2008/09 to 2018/19 flu seasons in seven countries or regions. A g-measure, which reflected the overall level of genetic activity through time, was considered to predict the dominant flu serotype in the population. Results: The model discriminated the influenza serotypes well with a sensitivity = 0.84, precision = 0.79 and AUC = 0.78 (95% CI: 0.54–0.97), and explained 42% of the serotypes variability with the R2. Conclusions: Our study suggests that the dominance of a flu serotype in a population can be well discriminated by genetic mutation activities from sample strains. By the data-driven computational framework, the genetic mutation can be quantified to trace the genetic activities on a real-time basis and provide early warning for the coming flu season.