Scientific Reports (Aug 2022)

Transcriptome-guided annotation and functional classification of long non-coding RNAs in Arabidopsis thaliana

  • Jose Antonio Corona-Gomez,
  • Evelia Lorena Coss-Navarrete,
  • Irving Jair Garcia-Lopez,
  • Christopher Klapproth,
  • Jaime Alejandro Pérez-Patiño,
  • Selene L. Fernandez-Valverde

DOI
https://doi.org/10.1038/s41598-022-18254-0
Journal volume & issue
Vol. 12, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Long non-coding RNAs (lncRNAs) are a prominent class of eukaryotic regulatory genes. Despite the numerous available transcriptomic datasets, the annotation of plant lncRNAs remains based on dated annotations that have been historically carried over. We present a substantially improved annotation of Arabidopsis thaliana lncRNAs, generated by integrating 224 transcriptomes in multiple tissues, conditions, and developmental stages. We annotate 6764 lncRNA genes, including 3772 that are novel. We characterize their tissue expression patterns and find 1425 lncRNAs are co-expressed with coding genes, with enriched functional categories such as chloroplast organization, photosynthesis, RNA regulation, transcription, and root development. This improved transcription-guided annotation constitutes a valuable resource for studying lncRNAs and the biological processes they may regulate.