CODA: a combo-Seq data analysis workflow

Nazzari, Marta; Hauser, Duncan; van Herwijnen, Marcel; Romitti, Mirian; Carvalho, Daniel J; Kip, Anna M; Caiment, Florian

doi:10.1093/bib/bbac582

Reference: Briefings in Bioinformatics, 24, 1
Publication: Published, 2023-01-01

Peer-reviewed article

Abstract:

The analysis of the combined mRNA and miRNA content of a biological sample can be of interest for answering several research questions, such as biomarker discovery or the study of mRNA–miRNA interactions. However, the process is costly and time-consuming: separate libraries need to be prepared and sequenced on different flowcells. Combo-Seq is a library prep kit that allows combined mRNA–miRNA libraries to be prepared starting from very low amounts of total RNA. To date, no dedicated bioinformatics method exists for the processing of Combo-Seq data. In this paper, we describe CODA (Combo-seq Data Analysis), a workflow specifically developed for the processing of Combo-Seq data that employs existing free-to-use tools. We compare CODA with exceRpt, the pipeline suggested by the kit manufacturer for this purpose. We also evaluate how Combo-Seq libraries analysed with CODA perform compared with conventional poly(A) and small RNA libraries prepared from the same samples. We show that CODA recovers more successfully trimmed reads than exceRpt, and that the difference is more pronounced with short sequencing reads. We demonstrate that Combo-Seq identifies as many genes as the standard libraries but fewer miRNAs, and that miRNA validation favours conventional small RNA libraries over Combo-Seq. The CODA code is available at https://github.com/marta-nazzari/CODA.
