MethodsX (Jan 2022)

Method for identification of 10 SSR markers from monkey genomes and its statistical inference with One & Two-way ANOVA

  • Chinta Someswara Rao,
  • G.N.V.G. Sirisha,
  • K. Butchi Raju,
  • N V Ganapathi Raju

Journal volume & issue
Vol. 9
p. 101833

Abstract

Read online

DNA tracts that include simple sequence repeats (SSRs), sometimes known as genetic ''stutters), are composed of a few to many tandem repetitions of a short base-pair motif. These sequences frequently mutate, changing the amount of repetitions. SSRs are frequently found in promoters, untranslated regions, and even coding sequences, therefore these alterations can significantly affect practically every aspect of gene activity. SSR alleles can also contribute to normal diversity in brain and behavioural features. Mutational expansion of certain triplet repeats is the cause of a number of inherited neurodegenerative diseases. Due to its importance in genetic research, in this paper we explored Ten SSR markers TAGA, TCAT, GAAT, AGAT, AGAA, GATA, TATC, CTTT, TCTG and TCTA that are identified from the genomes of Eleven distinct monkeys: A.Nancymaae, C.C.Imitator, C.Atys, M.Leucophaeus, P.Paniscus, R.Bieti, R.Roxellana, S.Boliviensis, T.Syrichta, C.A.Palliatus and M.Nemestrina using pattern matching mechanism. We identified 4bp SSR from eleven monkey dataset's Unchr chromosome mainly in this paper. The proposed approach finds the exact place/location of the SSR's and number of times that it appears in the given genome sequence. The identified patterns are analyzed with One-way and Two-way ANOVA that gives better analysis which is useful for genomic studies. Also, this 4bp Ten SSR markers data is a valuable to illustrate genetic variation of genomic study. • The great specificity of data sets produced from monkey genomes with pattern matching has been demonstrated. • These findings show that SSR identification could be a useful tool for determining genome similarity and comparability. • Researchers can use the raw sequencing data to conduct additional bioinformatics analysis.

Keywords