Real-time definition of non-randomness in the distribution of genomic events.

Ulrich Abel; Annette Deichmann; Cynthia Bartholomae; Kerstin Schwarzwaelder; Hanno Glimm; Steven Howe; Adrian Thrasher; Alexandrine Garrigue; Salima Hacein-Bey-Abina; Marina Cavazzana-Calvo; Alain Fischer; Dirk Jaeger; Christof von Kalle; Manfred Schmidt

doi:10.1371/journal.pone.0000570

PLoS ONE (Jan 2007)

DOI: https://doi.org/10.1371/journal.pone.0000570
Journal volume & issue: Vol. 2, no. 6, p. e570

Abstract

Features such as mutations or structural characteristics can be distributed non-randomly or non-uniformly within a genome. Until now, statistical inference on the distribution of sequence motifs has required computer simulations. Here, we show that these analyses can be carried out with an analytical, mathematical approach. To assess non-randomness, our calculations require only information such as genome size, the number of (sampled) sequence motifs, and distance parameters. We have developed computer programs that evaluate our analytical formulas to determine expected values and p-values in real time. This approach permits a flexible cluster definition that can be tailored to identify non-random or non-uniform sequence motif distributions most effectively. As an example, we demonstrate the effectiveness and reliability of our mathematical approach on the distribution of clinical retroviral vector integration sites.
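To illustrate what a closed-form calculation of this kind can look like, the following Python sketch computes the expected number of motif pairs that lie within a given distance of each other when positions are uniformly random, together with an approximate p-value. It is not the authors' formulas or software: the single linear genome, the pairwise cluster definition, the Poisson approximation for the tail probability, and all function and parameter names are assumptions made for illustration only.

```python
import math


def pair_within_prob(genome_size, window):
    """P(|X - Y| <= window) for two independent uniform positions on
    [0, genome_size]. Assumes one linear genome (no chromosome boundaries)."""
    if window >= genome_size:
        return 1.0
    r = window / genome_size
    return 2.0 * r - r * r


def expected_close_pairs(n_sites, genome_size, window):
    """Expected number of site pairs no more than `window` apart,
    by linearity of expectation over all n*(n-1)/2 pairs."""
    n_pairs = n_sites * (n_sites - 1) / 2
    return n_pairs * pair_within_prob(genome_size, window)


def poisson_upper_tail(k_observed, mean):
    """P(N >= k_observed) for N ~ Poisson(mean).
    Treating the pair count as Poisson is an approximation (an assumption
    of this sketch), reasonable only when close pairs are rare."""
    if k_observed <= 0:
        return 1.0
    term = math.exp(-mean)  # P(N = 0)
    cdf = 0.0
    for i in range(k_observed):
        cdf += term              # accumulate P(N = i)
        term *= mean / (i + 1)   # advance to P(N = i + 1)
    return max(0.0, 1.0 - cdf)


# Hypothetical inputs: 100 sampled sites, a 3.1e9 bp genome, a 30 kb window.
expected = expected_close_pairs(100, 3.1e9, 30_000)
p_value = poisson_upper_tail(5, expected)  # 5 observed pairs is made up
```

Because the calculation uses only the genome size, the number of sites, and a distance parameter, it runs instantly, whereas a simulation would have to place the sites at random many times to estimate the same quantities.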

Published in PLoS ONE

ISSN: 1932-6203 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine; Science
Website: https://journals.plos.org/plosone/
