Artificial Intelligence in the Life Sciences (Jun 2024)

Rationalism in the face of GPT hypes: Benchmarking the output of large language models against human expert-curated biomedical knowledge graphs

  • Negin Sadat Babaiha,
  • Sathvik Guru Rao,
  • Jürgen Klein,
  • Bruce Schultz,
  • Marc Jacobs,
  • Martin Hofmann-Apitius

Journal volume & issue
Vol. 5
p. 100095

Abstract


Biomedical knowledge graphs (KGs) hold valuable information about biomedical entities such as genes, diseases, biological processes, and drugs. KGs have been successfully employed in challenging biomedical areas such as the identification of pathophysiology mechanisms or drug repurposing. The creation of high-quality KGs typically requires labor-intensive multi-database integration or substantial human expert curation, both of which take time and add to the workload of data processing and annotation. The use of automatic systems for KG building and maintenance is therefore a prerequisite for the wide uptake and utilization of KGs. Technologies supporting the automated generation and updating of KGs typically rely on Natural Language Processing (NLP) optimized for extracting the triples implicitly described in relevant biomedical text sources. At the core of this challenge is improving the accuracy and coverage of the information extraction module by utilizing different models and tools. The emergence of pre-trained large language models (LLMs) such as ChatGPT, whose popularity has grown dramatically, has revolutionized the field of NLP and makes these models potential candidates for text-based graph creation as well. So far, no previous work has investigated the power of LLMs for generating cause-and-effect networks and KGs encoded in the Biological Expression Language (BEL). In this paper, we present initial studies towards one-shot BEL relation extraction using two versions of the Generative Pre-trained Transformer (GPT) model and evaluate their performance by comparing the extracted results to a high-quality BEL KG manually curated by domain experts.
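As a rough illustration only (the abstract does not disclose the authors' actual prompt, model versions, or evaluation pipeline), the sketch below shows how a one-shot BEL relation extraction query might be sent to a GPT model through the OpenAI API. The model name, the prompt wording, the single BEL exemplar, and the test sentence are all assumptions made for demonstration.

```python
# Illustrative sketch only: one-shot prompting a GPT model to emit a BEL
# statement for a biomedical sentence. Model name, prompt wording, and the
# example sentences are assumptions, not the authors' actual setup.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# One-shot prompt: a single worked example, then the sentence to annotate.
prompt = (
    "Extract the relation as a BEL statement.\n"
    "Sentence: TNF-alpha stimulates the inflammatory response.\n"
    'BEL: p(HGNC:TNF) increases bp(GO:"inflammatory response")\n'
    "Sentence: Amyloid precursor protein promotes tau phosphorylation in neurons.\n"
    "BEL:"
)

response = client.chat.completions.create(
    model="gpt-3.5-turbo",       # assumed model; the paper compares two GPT versions
    messages=[{"role": "user", "content": prompt}],
    temperature=0,               # deterministic output eases comparison with the curated KG
)

print(response.choices[0].message.content.strip())
# Expected shape of the output: a BEL triple such as
# p(HGNC:APP) increases p(HGNC:MAPT, pmod(Ph))
```

Triples returned this way could then be normalized and matched against the expert-curated BEL KG to score precision and coverage, which is the kind of comparison the abstract describes.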

Keywords