Borkenhagen LK, Runstadler JA. Examining the Influenza A Virus Sialic Acid Binding Preference Predictions of a Sequence-Based Convolutional Neural Network. Influenza Other Respir Viruses. 2024 Dec;18(12):e7
Background: Though receptor binding specificity is well established as a contributor to host tropism and spillover potential of influenza A viruses, determining receptor binding preference of a specific virus still requires expensive and time-consuming laboratory analyses. In this study, we pilot a machine learning approach for prediction of binding preference.
Methods: We trained a convolutional neural network to predict the α2,6-linked sialic acid preference of influenza A viruses given the hemagglutinin amino acid sequence. The model was evaluated with an independent test dataset to assess the standard performance metrics, the impact of missing data in the test sequences, and the prediction performance on novel subtypes. Further, features found to be important to the generation of predictions were tested via targeted mutagenesis of H9 and H16 proteins expressed on pseudoviruses.
Results: The final model developed in this study produced predictions on a test dataset correctly 94% of the time and an area under the receiver operating characteristic curve of 0.93. The model tolerated about 10% missing test data without compromising accurate prediction performance. Predictions on novel subtypes revealed that the model can extrapolate feature relationships between subtypes when generating binding predictions. Finally, evaluation of the features important for model predictions helped identify positions that alter the sialic acid conformation preference of hemagglutinin proteins in practice.
Conclusions: Ultimately, our results provide support to this in silico approach to hemagglutinin receptor binding preference prediction. This work emphasizes the need for ongoing research efforts to produce tools that may aid future pandemic risk assessment.
Methods: We trained a convolutional neural network to predict the α2,6-linked sialic acid preference of influenza A viruses given the hemagglutinin amino acid sequence. The model was evaluated with an independent test dataset to assess the standard performance metrics, the impact of missing data in the test sequences, and the prediction performance on novel subtypes. Further, features found to be important to the generation of predictions were tested via targeted mutagenesis of H9 and H16 proteins expressed on pseudoviruses.
Results: The final model developed in this study produced predictions on a test dataset correctly 94% of the time and an area under the receiver operating characteristic curve of 0.93. The model tolerated about 10% missing test data without compromising accurate prediction performance. Predictions on novel subtypes revealed that the model can extrapolate feature relationships between subtypes when generating binding predictions. Finally, evaluation of the features important for model predictions helped identify positions that alter the sialic acid conformation preference of hemagglutinin proteins in practice.
Conclusions: Ultimately, our results provide support to this in silico approach to hemagglutinin receptor binding preference prediction. This work emphasizes the need for ongoing research efforts to produce tools that may aid future pandemic risk assessment.
See Also:
Latest articles in those days:
- Specific determination of influenza virus A by SERS-active spike-like track-etched membranes 21 hours ago
- A Signal-Amplified SERS-LFA Leveraging Multivalent Nanobodies on 2D substrates for Clinical Detection of Influenza A Virus 21 hours ago
- Phylodynamic Reconstruction of H1N1pdm09 Influenza Virus Transmission in Brazil: A Decade of Evolutionary Dynamics 21 hours ago
- Avian Influenza Weekly Update # 1029: 16 January 2026 2 days ago
- Evidence of High Pathogenic Avian Influenza H5N1 Clade 2.3.4.4b Among Poultry in Ghana From 2021 to 2022 2 days ago
[Go Top] [Close Window]


