Testing the reliability of an AI-based large language model to extract ecological information from the scientific literature

Andrew V. Gougherty; Hannah L. Clipp

doi:10.1038/s44185-024-00043-9

npj Biodiversity (May 2024)

Testing the reliability of an AI-based large language model to extract ecological information from the scientific literature

Andrew V. Gougherty,
Hannah L. Clipp

Affiliations

Andrew V. Gougherty: USDA Forest Service Northern Research Station
Hannah L. Clipp: USDA Forest Service Northern Research Station

DOI: https://doi.org/10.1038/s44185-024-00043-9
Journal volume & issue: Vol. 3, no. 1
pp. 1 – 5

Abstract

Read online

Abstract Artificial intelligence-based large language models (LLMs) have the potential to substantially improve the efficiency and scale of ecological research, but their propensity for delivering incorrect information raises significant concern about their usefulness in their current state. Here, we formally test how quickly and accurately an LLM performs in comparison to a human reviewer when tasked with extracting various types of ecological data from the scientific literature. We found the LLM was able to extract relevant data over 50 times faster than the reviewer and had very high accuracy (>90%) in extracting discrete and categorical data, but it performed poorly when extracting certain quantitative data. Our case study shows that LLMs offer great potential for generating large ecological databases at unprecedented speed and scale, but additional quality assurance steps are required to ensure data integrity.

Published in npj Biodiversity

ISSN: 2731-4243 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Science: Natural history (General): General. Including nature conservation, geographical distribution
Website: https://www.nature.com/npjbiodivers/

About the journal