PLoS ONE (Jan 2010)

A novel preprocessing method using Hilbert Huang Transform for MALDI-TOF and SELDI-TOF mass spectrometry data.

  • Li-Ching Wu,
  • Hsin-Hao Chen,
  • Jorng-Tzong Horng,
  • Chen Lin,
  • Norden E Huang,
  • Yu-Che Cheng,
  • Kuang-Fu Cheng

DOI
https://doi.org/10.1371/journal.pone.0012493
Journal volume & issue
Vol. 5, no. 8
p. e12493

Abstract

Read online

MOTIVATION: Mass spectrometry is a high throughput, fast, and accurate method of protein analysis. Using the peaks detected in spectra, we can compare a normal group with a disease group. However, the spectrum is complicated by scale shifting and is also full of noise. Such shifting makes the spectra non-stationary and need to align before comparison. Consequently, the preprocessing of the mass data plays an important role during the analysis process. Noises in mass spectrometry data come in lots of different aspects and frequencies. A powerful data preprocessing method is needed for removing large amount of noises in mass spectrometry data. RESULTS: Hilbert-Huang Transformation is a non-stationary transformation used in signal processing. We provide a novel algorithm for preprocessing that can deal with MALDI and SELDI spectra. We use the Hilbert-Huang Transformation to decompose the spectrum and filter-out the very high frequencies and very low frequencies signal. We think the noise in mass spectrometry comes from many sources and some of the noises can be removed by analysis of signal frequency domain. Since the protein in the spectrum is expected to be a unique peak, its frequency domain should be in the middle part of frequency domain and will not be removed. The results show that HHT, when used for preprocessing, is generally better than other preprocessing methods. The approach not only is able to detect peaks successfully, but HHT has the advantage of denoising spectra efficiently, especially when the data is complex. The drawback of HHT is that this approach takes much longer for the processing than the wavlet and traditional methods. However, the processing time is still manageable and is worth the wait to obtain high quality data.