Nature Communications (May 2024)

Large-language models facilitate discovery of the molecular signatures regulating sleep and activity

  • Di Peng,
  • Liubin Zheng,
  • Dan Liu,
  • Cheng Han,
  • Xin Wang,
  • Yan Yang,
  • Li Song,
  • Miaoying Zhao,
  • Yanfeng Wei,
  • Jiayi Li,
  • Xiaoxue Ye,
  • Yuxiang Wei,
  • Zihao Feng,
  • Xinhe Huang,
  • Miaomiao Chen,
  • Yujie Gou,
  • Yu Xue,
  • Luoying Zhang

DOI
https://doi.org/10.1038/s41467-024-48005-w
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Sleep, locomotor and social activities are essential animal behaviors, but their reciprocal relationships and underlying mechanisms remain poorly understood. Here, we elicit information from a cutting-edge large-language model (LLM), generative pre-trained transformer (GPT) 3.5, which interprets 10.2–13.8% of Drosophila genes known to regulate the 3 behaviors. We develop an instrument for simultaneous video tracking of multiple moving objects, and conduct a genome-wide screen. We have identified 758 fly genes that regulate sleep and activities, including mre11 which regulates sleep only in the presence of conspecifics, and NELF-B which regulates sleep regardless of whether conspecifics are present. Based on LLM-reasoning, an educated signal web is modeled for understanding of potential relationships between its components, presenting comprehensive molecular signatures that control sleep, locomotor and social activities. This LLM-aided strategy may also be helpful for addressing other complex scientific questions.