TY - GEN
T1 - Shape distributions and protein similarity
AU - Canzar, Stefan
AU - Remy, Jan
PY - 2006
Y1 - 2006
N2 - In this paper we describe a similarity model that provides the objective basis for clustering proteins of similar structure. More specifically, we consider the following variant of the protein-protein similarity problem: We want to find proteins in a large database D that are very similar to a given query protein in terms of geometric shape. We give experimental evidence, that the shape similarity model of Osada, Funkhouser, Chazelle and Dobkin [OFCD02] can be transferred to the context of protein structure comparison. This model is very simple and leads to algorithms that have attractive space requirements and running times. For example, it took 0.39 seconds to retrieve the eight members of the seryl family out of 26,600 domains. Furthermore, a very high agreement with one of the most popular classification schemes proved the significance of our simplified representation of complex proteins structure by a distribution of Cα-Cα distances.
AB - In this paper we describe a similarity model that provides the objective basis for clustering proteins of similar structure. More specifically, we consider the following variant of the protein-protein similarity problem: We want to find proteins in a large database D that are very similar to a given query protein in terms of geometric shape. We give experimental evidence, that the shape similarity model of Osada, Funkhouser, Chazelle and Dobkin [OFCD02] can be transferred to the context of protein structure comparison. This model is very simple and leads to algorithms that have attractive space requirements and running times. For example, it took 0.39 seconds to retrieve the eight members of the seryl family out of 26,600 domains. Furthermore, a very high agreement with one of the most popular classification schemes proved the significance of our simplified representation of complex proteins structure by a distribution of Cα-Cα distances.
UR - http://www.scopus.com/inward/record.url?scp=84863409328&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863409328&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84863409328
SN - 9783885791775
T3 - German Conference on Bioinformatics, GCB 2006
SP - 1
EP - 10
BT - German Conference on Bioinformatics, GCB 2006
T2 - German Conference on Bioinformatics, GCB 2006
Y2 - 19 September 2006 through 22 September 2006
ER -