Applications in Plant Sciences (Jan 2025)
Expression‐based machine learning models for predicting plant tissue identity
- Sourabh Palande,
- Jeremy Arsenault,
- Patricia Basurto‐Lozada,
- Andrew Bleich,
- Brianna N. I. Brown,
- Sophia F. Buysse,
- Noelle A. Connors,
- Sikta Das Adhikari,
- Kara C. Dobson,
- Francisco Xavier Guerra‐Castillo,
- Maria F. Guerrero‐Carrillo,
- Sophia Harlow,
- Héctor Herrera‐Orozco,
- Asia T. Hightower,
- Paulo Izquierdo,
- MacKenzie Jacobs,
- Nicholas A. Johnson,
- Wendy Leuenberger,
- Alessandro Lopez‐Hernandez,
- Alicia Luckie‐Duque,
- Camila Martínez‐Avila,
- Eddy J. Mendoza‐Galindo,
- David Cruz Plancarte,
- Jenny M. Schuster,
- Harry Shomer,
- Sidney C. Sitar,
- Anne K. Steensma,
- Joanne Elise Thomson,
- Damián Villaseñor‐Amador,
- Robin Waterman,
- Brandon M. Webster,
- Madison Whyte,
- Sofía Zorilla‐Azcué,
- Beronda L. Montgomery,
- Aman Y. Husbands,
- Arjun Krishnan,
- Sarah Percival,
- Elizabeth Munch,
- Robert VanBuren,
- Daniel H. Chitwood,
- Alejandra Rougon‐Cardoso
Affiliations
- Sourabh Palande
- Department of Computational Mathematics, Science and Engineering Michigan State University East Lansing Michigan USA
- Jeremy Arsenault
- Department of Computer Science and Engineering Michigan State University East Lansing Michigan USA
- Patricia Basurto‐Lozada
- Laboratorio Internacional de Investigación sobre el Genoma Humano (LIIGH) Universidad Nacional Autónoma de México Juriquilla Querétaro Mexico
- Andrew Bleich
- Department of Plant Biology Michigan State University East Lansing Michigan USA
- Brianna N. I. Brown
- Department of Plant Biology Michigan State University East Lansing Michigan USA
- Sophia F. Buysse
- Department of Plant Biology Michigan State University East Lansing Michigan USA
- Noelle A. Connors
- Department of Horticulture Michigan State University East Lansing Michigan USA
- Sikta Das Adhikari
- Department of Computational Mathematics, Science and Engineering Michigan State University East Lansing Michigan USA
- Kara C. Dobson
- Ecology, Evolution, and Behavior Program Michigan State University East Lansing Michigan USA
- Francisco Xavier Guerra‐Castillo
- Unidad de Investigación Médica en Inmunología e Infectología Instituto Mexicano del Seguro Social Ciudad de México Mexico
- Maria F. Guerrero‐Carrillo
- Laboratory of Agrigenomic Sciences, Escuela Nacional de Estudios Superiores Unidad León Universidad Nacional Autónoma de México León Guanajuato Mexico
- Sophia Harlow
- Department of Horticulture Michigan State University East Lansing Michigan USA
- Héctor Herrera‐Orozco
- Posgrado en Ciencias Biológicas Universidad Nacional Autónoma de México Ciudad de México Mexico
- Asia T. Hightower
- Department of Plant Biology Michigan State University East Lansing Michigan USA
- Paulo Izquierdo
- Department of Plant, Soil, and Microbial Sciences Michigan State University East Lansing Michigan USA
- MacKenzie Jacobs
- Department of Biochemistry and Molecular Biology Michigan State University East Lansing Michigan USA
- Nicholas A. Johnson
- Ecology, Evolution, and Behavior Program Michigan State University East Lansing Michigan USA
- Wendy Leuenberger
- Ecology, Evolution, and Behavior Program Michigan State University East Lansing Michigan USA
- Alessandro Lopez‐Hernandez
- Laboratorio Internacional de Investigación sobre el Genoma Humano (LIIGH) Universidad Nacional Autónoma de México Juriquilla Querétaro Mexico
- Alicia Luckie‐Duque
- Laboratory of Agrigenomic Sciences, Escuela Nacional de Estudios Superiores Unidad León Universidad Nacional Autónoma de México León Guanajuato Mexico
- Camila Martínez‐Avila
- Colección Nacional de Aves, Posgrado en Ciencias Biológicas, Instituto de Biología Universidad Nacional Autónoma de México Ciudad de México Mexico
- Eddy J. Mendoza‐Galindo
- Laboratory of Agrigenomic Sciences, Escuela Nacional de Estudios Superiores Unidad León Universidad Nacional Autónoma de México León Guanajuato Mexico
- David Cruz Plancarte
- Departamento de Botánica, Posgrado en Ciencias Biológicas, Instituto de Biología Universidad Nacional Autónoma de México Ciudad de México Mexico
- Jenny M. Schuster
- Molecular Plant Sciences Program Michigan State University East Lansing Michigan USA
- Harry Shomer
- Department of Computer Science and Engineering Michigan State University East Lansing Michigan USA
- Sidney C. Sitar
- Department of Plant, Soil, and Microbial Sciences Michigan State University East Lansing Michigan USA
- Anne K. Steensma
- Department of Plant Biology Michigan State University East Lansing Michigan USA
- Joanne Elise Thomson
- Molecular Plant Sciences Program Michigan State University East Lansing Michigan USA
- Damián Villaseñor‐Amador
- Programa de Posgrado en Ciencias Biológicas, Facultad de Ciencias Universidad Nacional Autónoma de México Ciudad de México Mexico
- Robin Waterman
- Department of Plant Biology Michigan State University East Lansing Michigan USA
- Brandon M. Webster
- Department of Plant Biology Michigan State University East Lansing Michigan USA
- Madison Whyte
- Department of Plant, Soil, and Microbial Sciences Michigan State University East Lansing Michigan USA
- Sofía Zorilla‐Azcué
- Programa de Posgrado en Ciencias Biológicas, Escuela Nacional de Estudios Superiores (ENES) Unidad Morelia, Universidad Nacional Autónoma de México Morelia Michoacán Mexico
- Beronda L. Montgomery
- Department of Biology Grinnell College Grinnell Iowa USA
- Aman Y. Husbands
- Department of Biology University of Pennsylvania Philadelphia Pennsylvania USA
- Arjun Krishnan
- Department of Biomedical Informatics, Center for Health AI University of Colorado Anschutz Medical Campus Aurora Colorado USA
- Sarah Percival
- Department of Computational Mathematics, Science and Engineering Michigan State University East Lansing Michigan USA
- Elizabeth Munch
- Department of Computational Mathematics, Science and Engineering Michigan State University East Lansing Michigan USA
- Robert VanBuren
- Department of Horticulture Michigan State University East Lansing Michigan USA
- Daniel H. Chitwood
- Department of Computational Mathematics, Science and Engineering Michigan State University East Lansing Michigan USA
- Alejandra Rougon‐Cardoso
- Laboratory of Agrigenomic Sciences, Escuela Nacional de Estudios Superiores Unidad León Universidad Nacional Autónoma de México León Guanajuato Mexico
- DOI
- https://doi.org/10.1002/aps3.11621
- Journal volume & issue
-
Vol. 13,
no. 1
pp. n/a – n/a
Abstract
Abstract Premise The selection of Arabidopsis as a model organism played a pivotal role in advancing genomic science. The competing frameworks to select an agricultural‐ or ecological‐based model species were rejected, in favor of building knowledge in a species that would facilitate genome‐enabled research. Methods Here, we examine the ability of models based on Arabidopsis gene expression data to predict tissue identity in other flowering plants. Comparing different machine learning algorithms, models trained and tested on Arabidopsis data achieved near perfect precision and recall values, whereas when tissue identity is predicted across the flowering plants using models trained on Arabidopsis data, precision values range from 0.69 to 0.74 and recall from 0.54 to 0.64. Results The identity of belowground tissue can be predicted more accurately than other tissue types, and the ability to predict tissue identity is not correlated with phylogenetic distance from Arabidopsis. k‐nearest neighbors is the most successful algorithm, suggesting that gene expression signatures, rather than marker genes, are more valuable to create models for tissue and cell type prediction in plants. Discussion Our data‐driven results highlight that the assertion that knowledge from Arabidopsis is translatable to other plants is not always true. Considering the current landscape of abundant sequencing data, we should reevaluate the scientific emphasis on Arabidopsis and prioritize plant diversity.
Keywords