Abstract
In protein crystallography, much time and effort are often required to trace an initial model from an interpretable electron density map and to refine it until it best agrees with the crystallographic data. Here, we present a method to build and refine a protein model automatically and without user intervention, starting from diffraction data extending to resolution higher than 2.3 Å and reasonable estimates of crystallographic phases. The method is based on an iterative procedure that describes the electron density map as a set of unconnected atoms and then searches for protein-like patterns. Automatic pattern recognition (model building) combined with refinement, allows a structural model to be obtained reliably within a few CPU hours. We demonstrate the power of the method with examples of a few recently solved structures.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$189.00 per year
only $15.75 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
References
Bernstein, F.C. et al. The Protein Data Bank: a computer-based archival file for macromolecular structures. J. Mol. Biol. 112, 535–542 (1977).
Helliwell, J.R. Synchrotron radiation facilities. Nature Struct. Biol. 5, 614–617 (1998).
Ogata, C.M. MAD phasing grows up. Nature Struct. Biol. 5, 638–640 (1998).
Furey, W. & Swaminathan, S. PHASES-95: a program package for the processing and analyzing diffraction data from macromolecules. Methods Enzymol. 277, 590–620 (1997).
Otwinowski, Z, in Isomorphous replacement and anomalous scattering. Proceedings of the CCP4 study weekend, January 25–26, 1991 (eds Wolf, W., Evans, P.R. & Leslie, A.G.W.) 80–85 (Daresbury Laboratory, Warrington, UK; 1991).
Fortelle de La, E. & Bricogne, G. Maximum-likelihood heavy-atom parameter refinement for multiple isomorphous replacement and multiwavelength anomalous diffraction methods. Methods Enzymol. 276 , 590–620 (1997).
Terwilliger, T.C. & Berendzen, J. Automated structure solution for MIR and MAD. Acta Crystallogr. D 55, 849–861 (1999).
Brünger, A.T. et al. Crystallography & NMR System: a new software suite for macromolecular structure determination. Acta Crystallogr. D 54, 905–921 ( 1998).
Navaza, J. AMoRe: an automated package for molecular replacement. Acta Crystallogr. A 50, 157–163 (1994).
Miller, R., Gallo, S.M., DeTitta, G.T., Khalak, H.G. & Weeks, C.M. SnB, crystal structure determination via shake-and-bake. J. Appl. Crystallogr. 27, 613–621 (1994).
Sheldrick, G.M. Patterson superposition and ab initio phasing. Methods Enzymol. 276, 307–326 ( 1997).
Lunin, V.Y., Lunina, N.L., Petrova, T.E., Urzhumtsev, A.G. & Podjarny, A.D. On the ab initio solution of the phase problem for macromolecules at very low resolution. II. Generalized likelihood based approach to cluster discrimination. Acta Crystallogr. D 54, 726–734 ( 1998).
Abrahams, J.P. & de Graaf, R.A.G. New developments in phase refinement Curr. Opin. Struct. Biol. 8, 601–605 (1998).
Jones, T.A., Zou, J.-Y., Cowan, S.W. & Kjeldgaard, M. Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr. A 47, 110–119 (1991).
Oldfield, T.J. A Semi-Automated map fitting procedure. in Proceedings from the 1996 meeting of the International Union Of Crystallography Macromolecular Computing School (1996).
Kleywegt, G.J. & Jones, T.A. Template convolution to enhance or detect structural features in macromolecular electron-density maps. Acta Crystallogr. D 53, 179– 185 (1997).
Schlagenhauf, E., Etges, R. & Metcalf, P. The crystal structure of Leishmania major surface proteinase leishmanolysin (gp63). Structure 6, 1035–1046 (1998).
Perrakis, A., Sixma, T.K., Wilson, K.S. & Lamzin, V.S. wARP: improvement and extension of crystallographic phases by weighted averaging of multiple refined dummy atomic models. Acta Crystallogr. D 53, 448–455 (1997).
Collaborative Computational Project Number 4. The CCP4 suite: programs for protein crystallography. Acta Crystallogr. D 50, 760–763 ( 1994).
Cowtan, K. & Main, P. Miscellaneous algorithms for density modification. Acta Crystallogr. D 54, 487 –493 (1998).
Lamzin, V.S. & Wilson, K.S. Automated refinement of protein models. Acta Crystallogr. D 49, 129– 147 (1993).
Murshudov, G.N., Vagin, A.A. & Dodson, E.J. Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr. D 53, 240– 255 (1997).
Bricogne, G. Direct phase determination by entropy maximization and likelihood ranking: status report and perspectives. Acta Crystallogr. D 49, 37–60 (1993).
Read, R.J. Improved Fourier coefficients for maps using phases from partial structures with errors. Acta Crystallogr. A 42, 140 –149 (1986).
Valegard, K. et al. Structure of a cephalosporin synthase. Nature 394, 805–809 (1998).
Hilge, M. et al. High resolution native and complex structures of thermostable β-mannanase from Thermomonospora fusca—substrate specificity in glycosyl hydrolase family 5. Structure, 6, 1433– 1444 (1998).
Perrakis, A. et al. Structure of a bacterial chitinase at 2.3 Å resolution. Structure 2, 1169–1180 (1994).
Brünger, A.T. Assessment of phase accuracy by cross validation: the free R value. Methods and application. Acta Crystallogr. D 49, 24–36 (1993).
Lamzin, V.S. & Wilson, K.S. Automated refinement for protein crystallography. Methods Enzymol. 277, 269 –305 (1997).
Sussman, J.L., Holbrook, S.R., Church, G.M. & Kim, S.-H. A structure factor least-squares refinement procedure for macromolecular structures using constrained and restrained parameters. Acta Crystallogr. A 33, 800–804 ( 1977).
Hendricksoon, W.A. & Konnert, J.H. in Computing in crystallography (eds Diamond, R., Ramasechan, S. & Venkatesan, K.), 13.01–13.25 (Indian Institute of Science, Bangalore, India; 1980).
Zou, J.-Y. & Jones, A. Towards the automatic interpretation of macromolecular electron density maps: qualitative and quantitative matching of protein sequence to map. Acta Crystallogr. D 52, 833–841 (1996).
Deacon, A.M., Weeks, C.M., Miller, R. & Ealick, S.E. The shake-and bake structure determination of troclinic lysozyme. Proc. Natl. Acad. Sci. USA 16, 9284–9289 (1998).
Dauter, Z., Sieker, L.C. & Wilson, K.S. Refinement of rubredoxin from Desulfovibrio vulgaris at 1.0 Å with and without restraints. Acta Crystallogr. B 48, 42–59 ( 1992).
van Asselt, E.J., Perrakis, A., Kalk, K.H., Lamzin, V.S. & Dijkstra, B.W. Accelerated X-ray structure elucidation of a 36 kDa muramidase/transglycosylase using wARP. Acta Crystallogr. D 54, 58–73 (1998).
Schmidt, A., Schlacher, A., Steiner, W., Schwab, H. & Kratky, K. Structure of the xylanase from Penicillium simplicissimum. Protein Sci. 7, 2081– 2088 (1998).
Becker, S., Groner, B. & Müller, C.W. Three dimensional structure of the Stat3β homodimer bound to DNA Nature 394, 145– 151 (1998).
Kraulis, P.J. MOLSCRIPT: a program to produce both detailed and schematic plots of protein structures. J. Appl. Crystallogr. 24, 946 –950 (1991).
Merrit, E.A. & Bacon, D.J. Raster3D photorealistic molecular graphics. Methods Enzymol. 277, 505– 524 (1997).
Acknowledgements
The authors are extremely grateful to K. Wilson and Z. Dauter for many useful comments and discussions. We also acknowledge the numerous test users for their efforts in using (and tuning) the ARP/wARP software and for making available brief results (presented in Table 1). Many thanks to our colleagues for critical reading of the manuscript. 'warpNtrace' is a part of the ARP/wARP suite, http://www.embl-hamburg.de/ARP (thanks to C. Blessing and T. Eriksson for the www setup and useful suggestions). The authors and the European Molecular Biology Laboratory copyright the software, which is available to the community.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Perrakis, A., Morris, R. & Lamzin, V. Automated protein model building combined with iterative structure refinement . Nat Struct Mol Biol 6, 458–463 (1999). https://doi.org/10.1038/8263
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1038/8263
This article is cited by
-
Small, solubilized platinum nanocrystals consist of an ordered core surrounded by mobile surface atoms
Communications Chemistry (2024)
-
Structural and biochemical analysis of family 92 carbohydrate-binding modules uncovers multivalent binding to β-glucans
Nature Communications (2024)
-
Structural and functional analyses of Burkholderia pseudomallei BPSL1038 reveal a Cas-2/VapD nuclease sub-family
Communications Biology (2023)
-
Structure of a putative immature form of a Rieske-type iron-sulfur protein in complex with zinc chloride
Communications Chemistry (2023)
-
Structure analysis suggests Ess1 isomerizes the carboxy-terminal domain of RNA polymerase II via a bivalent anchoring mechanism
Communications Biology (2021)