BioTechniques (Jun 2012)

Coding SNPs as intrinsic markers for sample tracking in large-scale transcriptome studies

  • Weihong Xu,
  • Hong Gao,
  • Junhee Seok,
  • Julie Wilhelmy,
  • Michael N. Mindrinos,
  • Ronald W. Davis,
  • Wenzhong Xiao

DOI
https://doi.org/10.2144/0000113879
Journal volume & issue
Vol. 52, no. 6
pp. 386 – 388

Abstract

Read online

Large-scale transcriptome profiling in clinical studies often involves assaying multiple samples of a patient to monitor disease progression, treatment effect, and host response in multiple tissues. Such profiling is prone to human error, which often results in mislabeled samples. Here, we present a method to detect mislabeled sample outliers using coding single nucleotide polymorphisms (cSNPs) specifically designed on the microarray and demonstrate that the mislabeled samples can be efficiently identified by either simple clustering of allele-specific expression scores or Mahalanobis distance-based outlier detection method. Based on our results, we recommend the incorporation of cSNPs into future transcriptome array designs as intrinsic markers for sample tracking.

Keywords