Determining relevant features to recognize electron density patterns in x-ray protein crystallography.

abstract

High-throughput computational methods in X-ray protein crystallography are indispensable to meet the goals of structural genomics. In particular, automated interpretation of electron density maps, especially those at mediocre resolution, can significantly speed up the protein structure determination process. TEXTAL™ is a software application that uses pattern recognition, case-based reasoning and nearest neighbor learning to produce reasonably refined molecular models, even with average quality data. In this work, we discuss a key issue to enable fast and accurate interpretation of typically noisy electron density data: what features should be used to characterize the density patterns, and how relevant are they? We discuss the challenges of constructing features in this domain, and describe SLIDER, an algorithm to determine the weights of these features. SLIDER searches a space of weights using ranking of matching patterns (relative to mismatching ones) as its evaluation function. Exhaustive search being intractable, SLIDER adopts a greedy approach that judiciously restricts the search space only to weight values that cause the ranking of good matches to change. We show that SLIDER contributes significantly in finding the similarity between density patterns, and discuss the sensitivity of feature relevance to the underlying similarity metric.
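
The weight search described in the abstract can be illustrated with a small sketch. This is not the SLIDER implementation; it only shows the general idea of a greedy, one-feature-at-a-time search that tries only weight values at which a known match and a mismatch would swap rank. The weighted Manhattan distance, the [0, 1] weight range, the fixed number of sweeps, and all function names are assumptions made for the example.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]
# Each case is (query, known match, list of known mismatches); hypothetical layout.
Case = Tuple[Vector, Vector, List[Vector]]


def distance(w: Sequence[float], a: Vector, b: Vector) -> float:
    """Weighted Manhattan distance (an assumed metric, chosen for simplicity)."""
    return sum(wi * abs(x - y) for wi, x, y in zip(w, a, b))


def rank_cost(w: Sequence[float], cases: List[Case]) -> int:
    """Count mismatches that rank at or ahead of the true match (lower is better)."""
    cost = 0
    for query, match, mismatches in cases:
        d_match = distance(w, query, match)
        cost += sum(1 for x in mismatches if distance(w, query, x) <= d_match)
    return cost


def crossover_weights(w: List[float], cases: List[Case], i: int) -> List[float]:
    """Values of w[i] in [0, 1] at which a match and a mismatch are equidistant."""
    rest = list(w)
    rest[i] = 0.0  # distance contributed by every feature except i
    points = set()
    for query, match, mismatches in cases:
        base_m = distance(rest, query, match)
        slope_m = abs(query[i] - match[i])
        for x in mismatches:
            base_x = distance(rest, query, x)
            slope_x = abs(query[i] - x[i])
            if slope_m != slope_x:  # parallel lines never cross
                t = (base_x - base_m) / (slope_m - slope_x)
                if 0.0 <= t <= 1.0:
                    points.add(t)
    return sorted(points)


def greedy_weights(cases: List[Case], n_features: int, sweeps: int = 3) -> List[float]:
    """Greedy coordinate search over weights, starting from all weights equal to 1."""
    w = [1.0] * n_features
    for _ in range(sweeps):
        for i in range(n_features):
            cuts = [0.0] + crossover_weights(w, cases, i) + [1.0]
            # The ranking is constant between consecutive crossovers,
            # so one midpoint per interval is enough to test.
            candidates = [(a + b) / 2 for a, b in zip(cuts, cuts[1:])]
            # The current weight is listed first, so ties keep it unchanged.
            w[i] = min(
                [w[i]] + candidates,
                key=lambda t: rank_cost(w[:i] + [t] + w[i + 1:], cases),
            )
    return w


# Toy data: feature 0 separates the match from the mismatch, feature 1 misleads.
cases = [((0.0, 0.0), (0.1, 0.9), [(0.8, 0.1)])]
weights = greedy_weights(cases, n_features=2)
```

In the toy data the search lowers the weight of the misleading second feature until the known match ranks ahead of the mismatch. A real feature set would also need normalization and a held-out evaluation, which this sketch leaves out.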

published proceedings

J Bioinform Comput Biol

author list (cited authors)

Gopal, K., Romo, T. D., Sacchettini, J. C., & Ioerger, T. R.

citation count

3

complete list of authors

Gopal, Kreshna||Romo, Tod D||Sacchettini, James C||Ioerger, Thomas R

publication date

June 2005

publisher

World Scientific Publishing

published in

Journal of Bioinformatics and Computational Biology

keywords

Absorptiometry, Photon
Algorithms
Artificial Intelligence
Computer Simulation
Crystallography, X-Ray
Electrons
Models, Molecular
Pattern Recognition, Automated
Protein Conformation
Proteins
Software

PubMed ID

16108088

Digital Object Identifier (DOI)

10.1142/s0219720005001272

start page

645

end page

676

volume

3

issue

3

URL

http://dx.doi.org/10.1142/s0219720005001272
