'For statisticians with a little background in biology, this book delivers a very readable presentation on the analysis of DNA sequences to determine whether a motif is of statistical significance due to its overabundance (or under-abundance) in terms of frequencies or location. This book is concise but sufficiently detailed. Biologists without a background in mathematical statistics may find the learning curve a little steep but tractable. The authors' continuous use of practical examples will be greatly appreciate by biologists and statisticians alike. This book is one of a kind, and I recommend it to any statistician interested in learning about DNA sequences and motifs.' Journal of the American Statistical Association
One of the most important problems in computational biology is whether a particular gene sequence (mathematically, a 'word') is simply random or is of biological significance, for example, because of its frequency or location. This book introduces the mathematical and statistical ideas used in solving this so-called exceptional word problem.