Shape distributions and protein similarity

Stefan Canzar, Jan Remy

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

In this paper we describe a similarity model that provides the objective basis for clustering proteins of similar structure. More specifically, we consider the following variant of the protein-protein similarity problem: We want to find proteins in a large database D that are very similar to a given query protein in terms of geometric shape. We give experimental evidence, that the shape similarity model of Osada, Funkhouser, Chazelle and Dobkin [OFCD02] can be transferred to the context of protein structure comparison. This model is very simple and leads to algorithms that have attractive space requirements and running times. For example, it took 0.39 seconds to retrieve the eight members of the seryl family out of 26,600 domains. Furthermore, a very high agreement with one of the most popular classification schemes proved the significance of our simplified representation of complex proteins structure by a distribution of Cα-Cα distances.

Original languageEnglish (US)
Title of host publicationGerman Conference on Bioinformatics, GCB 2006
Pages1-10
Number of pages10
StatePublished - 2006
EventGerman Conference on Bioinformatics, GCB 2006 - Tubingen, Germany
Duration: Sep 19 2006Sep 22 2006

Publication series

NameGerman Conference on Bioinformatics, GCB 2006

Conference

ConferenceGerman Conference on Bioinformatics, GCB 2006
Country/TerritoryGermany
CityTubingen
Period9/19/069/22/06

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Biomedical Engineering

Fingerprint

Dive into the research topics of 'Shape distributions and protein similarity'. Together they form a unique fingerprint.

Cite this