Ultra-fast global homology detection with Discrete Cosine Transform and Dynamic Time Warping

Raimondi, Daniele; Orlando, Gabriele; Moreau, Yves; Vranken, Wim

doi:10.1093/bioinformatics/bty309

Reference: Bioinformatics, 34, 18, pages 3118-3125
Publication: Published, 2018

Peer-reviewed article

Abstract:

Motivation: Evolutionary information is crucial for the annotation of proteins in bioinformatics. The number of retrieved homologs often correlates with the quality of predicted protein annotations related to structure or function. With a growing number of sequences available, fast and reliable methods for homology detection are essential, as they have a direct impact on predicted protein annotations.

Results: We developed a discriminative, alignment-free algorithm for homology detection with quasi-linear complexity, enabling theoretically much faster homology searches. To reach this goal, we convert the protein sequence into numeric biophysical representations. These are shrunk to a fixed length using a novel vector quantization method that uses Discrete Cosine Transform compression. We then compute, for each compressed representation, similarity scores between proteins with the Dynamic Time Warping algorithm and feed them into a Random Forest. The performance of WARP is comparable with that of state-of-the-art methods.
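The pipeline summarized in the abstract (numeric encoding, DCT-based shrinking to a fixed length, DTW scoring) can be illustrated with a minimal sketch. It is not the authors' WARP implementation. Several pieces are assumptions made for illustration only: the Kyte-Doolittle hydropathy scale stands in for the unspecified biophysical representations, the function names `encode`, `dct_compress` and `dtw_distance` are hypothetical, the compression step simply keeps the lowest-frequency DCT-II coefficients and re-evaluates the truncated cosine series at `k` points, and the Random Forest stage is omitted.

```python
import math

# Assumption: Kyte-Doolittle hydropathy is used here as a stand-in for the
# biophysical representations, which the abstract does not specify.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def encode(sequence):
    """Map a protein sequence to a numeric profile (one value per residue)."""
    return [HYDROPATHY[aa] for aa in sequence]


def dct_compress(signal, k):
    """Shrink a signal of any length to exactly k values (hypothetical scheme).

    Computes the first min(k, n) unnormalized DCT-II coefficients, then
    evaluates the truncated inverse cosine series on a grid of k points.
    """
    n = len(signal)
    if n == 0 or k <= 0:
        raise ValueError("signal must be non-empty and k must be positive")
    n_coef = min(k, n)
    coeffs = [
        sum(x * math.cos(math.pi / n * (i + 0.5) * j)
            for i, x in enumerate(signal))
        for j in range(n_coef)
    ]
    compressed = []
    for m in range(k):
        value = coeffs[0]
        for j in range(1, n_coef):
            value += 2.0 * coeffs[j] * math.cos(math.pi / k * (m + 0.5) * j)
        compressed.append(value / n)  # 1/n undoes the DCT-II scaling
    return compressed


def dtw_distance(a, b):
    """Classic dynamic-programming DTW with absolute difference as local cost."""
    n, m = len(a), len(b)
    table = [[math.inf] * (m + 1) for _ in range(n + 1)]
    table[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            table[i][j] = cost + min(
                table[i - 1][j],      # advance in a only
                table[i][j - 1],      # advance in b only
                table[i - 1][j - 1],  # advance in both
            )
    return table[n][m]


# Two sequences of different lengths become comparable fixed-length profiles.
profile_a = dct_compress(encode("MKTAYIAKQRQISFVKSHFSRQ"), 8)
profile_b = dct_compress(encode("MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"), 8)
print(dtw_distance(profile_a, profile_b))
```

Because every profile is reduced to the same length `k`, each DTW comparison has a fixed cost regardless of the original sequence lengths. In a setup like the one the abstract describes, one such score per representation would presumably form the feature vector passed to the classifier.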

