PLoS Computational Biology (May 2023)
An integrated model for predicting KRAS dependency.
Abstract
The clinical approvals of KRAS G12C inhibitors have been a revolutionary advance in precision oncology, but response rates are often modest. To improve patient selection, we developed an integrated model to predict KRAS dependency. By integrating molecular profiles of a large panel of cell lines from the DEMETER2 dataset, we built a binary classifier to predict a tumor's KRAS dependency. Within the training set, Monte Carlo cross-validation of ElasticNet models was used to compare model performance and to tune the parameters α and λ. The final model was then applied to the validation set. We validated the model with genetic depletion assays and with an external dataset of lung cancer cell lines treated with a G12C inhibitor. We then applied the model to several datasets from The Cancer Genome Atlas (TCGA). The final "K20" model contains 20 features: the expression of 19 genes and KRAS mutation status. In the validation cohort, K20 had an AUC of 0.94 and accurately predicted KRAS dependency following genetic depletion in both KRAS-mutant and KRAS wild-type cell lines. It was also highly predictive in the external dataset of lung cancer cell lines treated with a KRAS G12C inhibitor. When applied to TCGA datasets, the model predicted higher KRAS dependency in specific subpopulations, such as the invasive subtype of colorectal cancer and copy-number-high pancreatic adenocarcinoma. The K20 model is simple yet robust in its predictions and may provide a useful tool for selecting patients with KRAS-mutant tumors that are most likely to respond to direct KRAS inhibitors.
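As an illustration of the tuning step described in the abstract, the following is a minimal sketch of Monte Carlo cross-validation over an ElasticNet-penalized logistic classifier, scored by AUC. It is not the authors' pipeline. It assumes a glmnet-style penalty, λ[α‖β‖₁ + (1 − α)/2 ‖β‖₂²], fitted by proximal gradient descent; the data are synthetic, and every function name, grid value, and split setting is hypothetical.

```python
import math
import random


def auc(labels, scores):
    """Area under the ROC curve via the pairwise (Mann-Whitney) identity."""
    pos = [s for lab, s in zip(labels, scores) if lab == 1]
    neg = [s for lab, s in zip(labels, scores) if lab == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def soft_threshold(x, t):
    """Proximal operator of the L1 penalty: shrink x toward zero by t."""
    return math.copysign(max(abs(x) - t, 0.0), x)


def fit_elastic_net_logistic(X, y, alpha, lam, lr=0.1, n_iter=150):
    """Logistic loss + lam * (alpha * L1 + (1 - alpha) / 2 * L2), assumed form."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(n_iter):
        grad_w, grad_b = [0.0] * p, 0.0
        for xi, yi in zip(X, y):
            z = b + sum(wj * xj for wj, xj in zip(w, xi))
            z = max(-30.0, min(30.0, z))  # guard against overflow in exp
            err = 1.0 / (1.0 + math.exp(-z)) - yi
            grad_b += err / n
            for j in range(p):
                grad_w[j] += err * xi[j] / n
        b -= lr * grad_b  # the intercept is not penalized
        for j in range(p):
            # Gradient step on the smooth part (loss + ridge), then L1 shrinkage.
            step = w[j] - lr * (grad_w[j] + lam * (1.0 - alpha) * w[j])
            w[j] = soft_threshold(step, lr * lam * alpha)
    return w, b


def predict_scores(model, X):
    """Linear scores; monotone in the predicted probability, so fine for AUC."""
    w, b = model
    return [b + sum(wj * xj for wj, xj in zip(w, xi)) for xi in X]


def monte_carlo_cv(X, y, alpha, lam, n_splits=8, test_frac=0.3, seed=0):
    """Mean held-out AUC over repeated random train/test splits."""
    rng = random.Random(seed)
    idx = list(range(len(X)))
    aucs = []
    for _ in range(n_splits):
        rng.shuffle(idx)
        cut = int(len(idx) * test_frac)
        test, train = idx[:cut], idx[cut:]
        if len({y[i] for i in test}) < 2 or len({y[i] for i in train}) < 2:
            continue  # AUC is undefined without both classes
        model = fit_elastic_net_logistic(
            [X[i] for i in train], [y[i] for i in train], alpha, lam
        )
        scores = predict_scores(model, [X[i] for i in test])
        aucs.append(auc([y[i] for i in test], scores))
    return sum(aucs) / len(aucs) if aucs else float("nan")


def make_toy_data(n=80, p=6, seed=1):
    """Synthetic stand-in for molecular features; only two columns carry signal."""
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
    y = [1 if row[0] - row[1] + 0.5 * rng.gauss(0, 1) > 0 else 0 for row in X]
    return X, y


X, y = make_toy_data()
grid = [(a, l) for a in (0.5, 1.0) for l in (0.01, 0.1)]  # hypothetical grid
cv_auc = {(a, l): monte_carlo_cv(X, y, a, l) for a, l in grid}
best_alpha, best_lam = max(cv_auc, key=cv_auc.get)
print(f"best alpha={best_alpha}, lambda={best_lam}, "
      f"mean CV AUC={cv_auc[(best_alpha, best_lam)]:.3f}")
```

Every (α, λ) pair is scored on the same random splits because the split seed is fixed, which makes the comparison between settings paired rather than independent. In a setup like the one described, the selected pair would then be refit on the full training set and evaluated once on the held-out validation set.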