Genetics and Molecular Biology (Dec 2001)

Trimming and clustering sugarcane ESTs

  • Guilherme P. Telles,
  • Felipe R. da Silva

DOI
https://doi.org/10.1590/S1415-47572001000100004
Journal volume & issue
Vol. 24, no. 1-4
pp. 17 – 23

Abstract

The original clustering procedure adopted in the Sugarcane Expressed Sequence Tag project (SUCEST) had several problems, including an excessive number of clusters and the presence of ribosomal sequences. We therefore redesigned the clustering procedure entirely, including a much more careful initial trimming of the reads. This paper describes the new trimming and clustering strategies in detail and gives the new official figures for the project: 237,954 expressed sequence tags (ESTs) and 43,141 clusters.