Application of Discrete Fourier Inter-Coefficient Difference for Assessing Genetic Sequence Similarity
Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence that is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity.
EURASIP Journal on Bioinformatics & Systems Biology
Computer Science & Engineering
Link to Published Version
King, Brian R.; Aburdene, Maurice F.; Thompson, Alex; and Warres, Zach. "Application of Discrete Fourier Inter-Coefficient Difference for Assessing Genetic Sequence Similarity." EURASIP Journal on Bioinformatics & Systems Biology (2014) : 8.