RNA-seq data from whole rice grains of pigmented and non-pigmented Malaysian rice varieties
Rabiatul-Adawiah Zainal-Abidin,
Zamri Zainal,
Zeti-Azura Mohamed-Hussein,
Norliza Abu-Bakar,
Mohd Shahril Firdaus Ab Razak,
Sanimah Simoh,
Yun Shin Sew
Affiliations
Rabiatul-Adawiah Zainal-Abidin
Malaysian Agricultural Research & Development Institute (MARDI), 43300 Serdang, Selangor, Malaysia; Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
Zamri Zainal
Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia; Centre for Frontier Sciences, Faculty of Science & Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
Zeti-Azura Mohamed-Hussein
Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia; Centre for Frontier Sciences, Faculty of Science & Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
Norliza Abu-Bakar
Malaysian Agricultural Research & Development Institute (MARDI), 43300 Serdang, Selangor, Malaysia
Mohd Shahril Firdaus Ab Razak
Malaysian Agricultural Research & Development Institute (MARDI), 43300 Serdang, Selangor, Malaysia
Sanimah Simoh
Malaysian Agricultural Research & Development Institute (MARDI), 43300 Serdang, Selangor, Malaysia
Yun Shin Sew
Malaysian Agricultural Research & Development Institute (MARDI), 43300 Serdang, Selangor, Malaysia; Corresponding author.
Pigmented rice is enriched with antioxidants, macro- and micronutrients. A comprehensive investigation of the gene expression patterns among the pigmented rice varieties would help to understand the cellular mechanism and biological processes of rice grain pigmentation. Hence, we performed RNA sequencing and analysis on the whole grain of dehusked mature seeds of selected six Malaysian rice varieties with varying grain pigmentations. These varieties were black rice (BALI and Pulut Hitam 9), red rice (MRM16 and MRQ100) and white rice (MR297 and MRQ76). Illumina HiSeq™ 4000 sequencer was used to generate total raw nucleotides of approximately 53 Gb in size. From 353,937,212 total paired-end raw reads, 340,131,496 total clean reads were obtained. The raw reads were deposited into European Nucleotide Archive (ENA) database and can be accessed via accession number PRJEB34340. This dataset allows us to identify and profile all expressed genes with functions related to nutritional traits (i.e. antioxidants, folate and amylose content) and quality trait (i.e. aroma) across both pigmented and non-pigmented rice varieties. In addition, the transcriptome data obtained will be valuable for discovery of potential gene markers and functional SNPs related to functional traits to assist in rice breeding programme.