Skip to Main Content
In this paper we present a method for protein structure comparison that is based on indexing. Unlike most methods using indexing, ours does not use invariants of the Cα atoms of the proteins, rather it relies on geometric properties of the secondary structures. Given a set of protein structures, we compute the angles and distances of all the triplets of linear segments associated to the secondary structures of the proteins and use them to build hash tables. The hash tables can be used for fast retrieval of hypotheses of matches of a query protein against the database. We present and analyze the tables obtained for two separate sets of proteins that are representatives of all the folds in the PDB. The tables show an interesting distribution of the triplets of elements, especially if one takes into account that the elements of a triplet are generally not close in space. The majority of the elements are found to belong to two planar regions in the three-dimensional tables. The planar regions can be characterized as those whose corresponding triplets of structures lie on almost parallel planes.