TY - JOUR
T1 - Network tuned multiple rank aggregation and applications to gene ranking
AU - Wang, Wenhui
AU - Zhou, Xianghong Jasmine
AU - Liu, Zhenqiu
AU - Sun, Fengzhu
N1 - Publisher Copyright:
© 2015 Wang et al.; licensee BioMed Central Ltd.
PY - 2015/1/21
Y1 - 2015/1/21
N2 - With the development of various high throughput technologies and analysis methods, researchers can study different aspects of a biological phenomenon simultaneously or one aspect repeatedly with different experimental techniques and analysis methods. The output from each study is a rank list of components of interest. Aggregation of the rank lists of components, such as proteins, genes and single nucleotide variants (SNV), produced by these experiments has been proven to be helpful in both filtering the noise and bringing forth a more complete understanding of the biological problems. Current available rank aggregation methods do not consider the network information that has been observed to provide vital contributions in many data integration studies. We developed network tuned rank aggregation methods incorporating network information and demonstrated its superior performance over aggregation methods without network information. The methods are tested on predicting the Gene Ontology function of yeast proteins. We validate the methods using combinations of three gene expression data sets and three protein interaction networks as well as an integrated network by combining the three networks. Results show that the aggregated rank lists are more meaningful if protein interaction network is incorporated. Among the methods compared, CGI_RRA and CGI_Endeavour, which integrate rank lists with networks using CGI [1] followed by rank aggregation using either robust rank aggregation (RRA) [2] or Endeavour [3] perform the best. Finally, we use the methods to locate target genes of transcription factors.
AB - With the development of various high throughput technologies and analysis methods, researchers can study different aspects of a biological phenomenon simultaneously or one aspect repeatedly with different experimental techniques and analysis methods. The output from each study is a rank list of components of interest. Aggregation of the rank lists of components, such as proteins, genes and single nucleotide variants (SNV), produced by these experiments has been proven to be helpful in both filtering the noise and bringing forth a more complete understanding of the biological problems. Current available rank aggregation methods do not consider the network information that has been observed to provide vital contributions in many data integration studies. We developed network tuned rank aggregation methods incorporating network information and demonstrated its superior performance over aggregation methods without network information. The methods are tested on predicting the Gene Ontology function of yeast proteins. We validate the methods using combinations of three gene expression data sets and three protein interaction networks as well as an integrated network by combining the three networks. Results show that the aggregated rank lists are more meaningful if protein interaction network is incorporated. Among the methods compared, CGI_RRA and CGI_Endeavour, which integrate rank lists with networks using CGI [1] followed by rank aggregation using either robust rank aggregation (RRA) [2] or Endeavour [3] perform the best. Finally, we use the methods to locate target genes of transcription factors.
UR - http://www.scopus.com/inward/record.url?scp=84961589347&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84961589347&partnerID=8YFLogxK
U2 - 10.1186/1471-2105-16-S1-S6
DO - 10.1186/1471-2105-16-S1-S6
M3 - Article
C2 - 25708095
AN - SCOPUS:84961589347
SN - 1471-2105
VL - 16
JO - BMC bioinformatics
JF - BMC bioinformatics
IS - 1
M1 - S6
ER -