Batch effect removal methods for microarray gene expression data integration: A survey

Lazar, Cosmin; Taminau, Jonatan; Nowe, Ann; Meganck, Stijn; Steenhoff, David; Coletta, Alain; Molter, Colin; Weiss, David; Duque, Robin; Bersini, Hugues

doi:doi/10.1093/bib/bbs037

Citer

Batch effect removal methods for microarray gene expression data integration: A survey

par Lazar, Cosmin ;Taminau, Jonatan ;Nowe, Ann

;Meganck, Stijn ;Steenhoff, David ;Coletta, Alain

Référence Briefings in bioinformatics, 14, 4, page (469-490), bbs037
Publication Publié, 2013-07

Article révisé par les pairs

Résumé :

Genomic data integration is a key goal to be achieved towards large-scale genomic data analysis. This process is very challenging due to the diverse sources of information resulting from genomics experiments. In this work, we review methods designed to combine genomic data recorded from microarray gene expression (MAGE) experiments. It has been acknowledged that the main source of variation between different MAGE datasets is due to the so-called 'batch effects'. The methods reviewed here perform data integration by removing (or more precisely attempting to remove) the unwanted variation associated with batch effects. They are presented in a unified framework together with a wide range of evaluation tools, which are mandatory in assessing the efficiency and the quality of the data integration process. We provide a systematic description of the MAGE data integration methodology together with some basic recommendation to help the users in choosing the appropriate tools to integrate MAGE data for large-scale analysis; and also how to evaluate them from different perspectives in order to quantify their efficiency. All genomic data used in this study for illustration purposes were retrieved from InSilicoDB . http://insilico.ulb.ac.be. © The Author 2012. Published by Oxford University Press.

Référencement	Visibilité	Pérennité	Facilité
Les publications encodées constituent la bibliographie académique de l'Université.	Les documents déposés sont indexés par les moteurs de recherche (Google Scholar,…).	Les documents déposés en open-access sont archivés au sein du réseau de préservation SAFE-PLN (www.safepln.org).	Les listes de publications sont compatibles avec le CV-ULB, le FNRS et accessibles sur le web.

Batch effect removal methods for microarray gene expression data integration: A survey

Documents en relation

DI-fusion