Scientific Reports (Nov 2024)
Selection on synonymous codon usage in soybean (Glycine max) WRKY genes
Abstract
Abstract The WRKY transcription factor gene family in soybean [Glycine max (L.) Merr.] (GmWRKY) is critical for the plant’s development and stress responses. This study examines the evolutionary dynamics of the GmWRKY gene family, focusing on its synonymous codon usage bias (CUB) in a comprehensive set of 179 coding sequences. CUB was analyzed using various indices, revealing a preference for A/T-ending codons and relatively low codon bias. Codon adaptation index (CAI) analysis suggested that these genes are optimized for efficient translation despite relatively low bias, reflecting a balance between codon diversity and translation efficiency. Neutrality and NC plots indicated that selective forces dominate over mutational forces in shaping codon usage, while selection signature analysis showed purifying selection being prevalent across the gene family. However, episodic positive selection was also detected in certain clades, highlighting potential adaptive diversification in response to environmental stress. Additionally, promoter binding site analysis uncovered correlations between codon usage and transcriptional regulation, indicating a context-dependent relationship between CUB and gene expression. Phylogenetic analysis identified 11 well-supported clades in the modern GmWRKY gene family and ancestral sequence reconstruction revealed more relaxed codon preferences and reduced selection constraints in modern GmWRKY genes, potentially linked to neofunctionalization and adaptation to environmental changes. These findings provide a framework for optimizing gene expression in transgenic soybean crops with resilience. Further functional validation of positively selected genes is recommended to elucidate their role in stress responses.
Keywords