Frontiers in Bioinformatics (Jun 2022)
A Machine Learning Framework Predicts the Clinical Severity of Hemophilia B Caused by Point-Mutations
Abstract
Blood coagulation is a vital physiological mechanism to stop blood loss following an injury to a blood vessel. This process starts immediately upon damage to the endothelium lining a blood vessel, and results in the formation of a platelet plug that closes the site of injury. In this repair operation, an essential component is the coagulation factor IX (FIX), a serine protease encoded by the F9 gene and whose deficiency causes hemophilia B. If not treated by prophylaxis or gene therapy, patients with this condition are at risk of life-threatening bleeding episodes. In this sense, a deep understanding of the FIX protein and its activated form (FIXa) is essential to develop efficient therapeutics. In this study, we used well-studied structural analysis techniques to create a residue interaction network of the FIXa protein. Here, the nodes are the amino acids of FIXa, and two nodes are connected by an edge if the two residues are in close proximity in the FIXa 3D structure. This representation accurately captured fundamental properties of each amino acid of the FIXa structure, as we found by validating our findings against hundreds of clinical reports about the severity of HB. Finally, we established a machine learning framework named HemB-Class to predict the effect of mutations of all FIXa residues to all other amino acids and used it to disambiguate several conflicting medical reports. Together, these methods provide a comprehensive map of the FIXa protein architecture and establish a robust platform for the rational design of FIX therapeutics.
Keywords