Borkenhagen LK, Runstadler JA. Examining the Influenza A Virus Sialic Acid Binding Preference Predictions of a Sequence-Based Convolutional Neural Network. Influenza Other Respir Viruses. 2024 Dec;18(12):e7
Background: Though receptor binding specificity is well established as a contributor to host tropism and spillover potential of influenza A viruses, determining receptor binding preference of a specific virus still requires expensive and time-consuming laboratory analyses. In this study, we pilot a machine learning approach for prediction of binding preference.
Methods: We trained a convolutional neural network to predict the α2,6-linked sialic acid preference of influenza A viruses given the hemagglutinin amino acid sequence. The model was evaluated with an independent test dataset to assess the standard performance metrics, the impact of missing data in the test sequences, and the prediction performance on novel subtypes. Further, features found to be important to the generation of predictions were tested via targeted mutagenesis of H9 and H16 proteins expressed on pseudoviruses.
Results: The final model developed in this study produced predictions on a test dataset correctly 94% of the time and an area under the receiver operating characteristic curve of 0.93. The model tolerated about 10% missing test data without compromising accurate prediction performance. Predictions on novel subtypes revealed that the model can extrapolate feature relationships between subtypes when generating binding predictions. Finally, evaluation of the features important for model predictions helped identify positions that alter the sialic acid conformation preference of hemagglutinin proteins in practice.
Conclusions: Ultimately, our results provide support to this in silico approach to hemagglutinin receptor binding preference prediction. This work emphasizes the need for ongoing research efforts to produce tools that may aid future pandemic risk assessment.
Methods: We trained a convolutional neural network to predict the α2,6-linked sialic acid preference of influenza A viruses given the hemagglutinin amino acid sequence. The model was evaluated with an independent test dataset to assess the standard performance metrics, the impact of missing data in the test sequences, and the prediction performance on novel subtypes. Further, features found to be important to the generation of predictions were tested via targeted mutagenesis of H9 and H16 proteins expressed on pseudoviruses.
Results: The final model developed in this study produced predictions on a test dataset correctly 94% of the time and an area under the receiver operating characteristic curve of 0.93. The model tolerated about 10% missing test data without compromising accurate prediction performance. Predictions on novel subtypes revealed that the model can extrapolate feature relationships between subtypes when generating binding predictions. Finally, evaluation of the features important for model predictions helped identify positions that alter the sialic acid conformation preference of hemagglutinin proteins in practice.
Conclusions: Ultimately, our results provide support to this in silico approach to hemagglutinin receptor binding preference prediction. This work emphasizes the need for ongoing research efforts to produce tools that may aid future pandemic risk assessment.
See Also:
Latest articles in those days:
- WPRO/WHO: Avian Influenza Weekly Update, 17 January 2025 21 hours ago
- Retrospective modelling of the disease and mortality burden of the 1918-1920 influenza pandemic in Zurich, Switzerland 1 days ago
- Avian Influenza incidents and depopulation methods Feb 2022 to June 2024 in U.S. 2 days ago
- [preprint]From Birds to Bovine: A Review and Critical Analysis of the Outbreaks of Highly Pathogenic Avian Influenza A(H5N1) Clade 2.3.4.4b in the United States 2 days ago
- Detection of equine influenza virus gene in the air around infected horses 2 days ago
[Go Top] [Close Window]