Key Points
-
Due to the escalating downstream costs in the development phase, objective quality assessment of lead series long before entering clinical trials is an increasing necessity within pharmaceutical research.
-
Moving away from the linear process of compound optimization towards a parallel strategy in which the profile of chemical entities is shaped in a multidimensional manner allows the properties of a molecule to be appropriately balanced in a rapid, iterative fashion.
-
The initiating point in a medicinal chemistry programme can arise from a variety of sources. Depending on the target and further information available, they can range from brute-force, serendipity search-based methods to information-rich design approaches for identifying novel chemical entities for further optimization.
-
High-throughput screening campaigns currently provide the main source for chemistry initiation in pharmaceutical research. Assay development time, logistical hurdles and issues concerning compound acquisition increasingly demand alternative approaches to complement this lead discovery pathway.
-
The design of combinatorial compound libraries on the basis of predicted molecular properties is now widely applied, increasing the quality of the product compounds generated. In addition focused libraries can be generated on the basis of ligand or biostructural information most effectively enhanced by support from modern integrated computational and synthetic methods.
-
Computational algorithms allow the annotation and grouping of biological targets as well as chemical structures. 'Chemogenomics' is the interface between disciplines where chemical topology space is married with biological target space. Chemogenomics databases will in future allow existing target-ligand information to be used prospectively to identify drugable targets and design tailored new ligand motifs thus creating valuable knowledge.
Abstract
The identification of small-molecule modulators of protein function, and the process of transforming these into high-content lead series, are key activities in modern drug discovery. The decisions taken during this process have far-reaching consequences for success later in lead optimization and even more crucially in clinical development. Recently, there has been an increased focus on these activities due to escalating downstream costs resulting from high clinical failure rates. In addition, the vast emerging opportunities from efforts in functional genomics and proteomics demands a departure from the linear process of identification, evaluation and refinement activities towards a more integrated parallel process. This calls for flexible, fast and cost-effective strategies to meet the demands of producing high-content lead series with improved prospects for clinical success.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Drews, J. Drug discovery: A historical perspective. Science 287, 1960–1964 (2000).
Lander, E. S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
Knowles, J. & Gromo, G. Target selection in drug discovery. Nature Rev. Drug Discov. 2, 63–69 (2003).
Hopkins, A. L. & Groom, C. R. The druggable genome. Nature Rev. Drug Discov. 1, 727–730 (2002).
Lenz, G. R., Nash, H. M. & Jindal, A. Chemical ligands, genomics and drug discovery. Drug Discov. Today 5, 145–156 (2000).
Hodgson, J. ADMET — turning chemicals into drugs. Nature Biotechnol. 19, 722–726 (2001).
Proudfoot, J. R. Drugs, leads, and drug-likeness: An analysis of some recently launched drugs. Bioorg. Med. Chem. Lett. 12, 1647–1650 (2002).
Alanine, A., Nettekoven, M., Roberts, E. & Thomas, A. Lead generation — enhancing the success of drug discovery by investing into the hit to lead process. Combin. Chem. High Throughput Screen. 6, 51–66 (2003).
Boguslavsky, J. Minimizing risk in 'Hits to Leads'. Drug Discov. & Develop. 4, 26–30 (2001).
Bleicher, K. H. Chemogenomics: bridging a drug discovery gap. Curr. Med. Chem. 9, 2077–2084 (2002).
Bajorath, J. Integration of virtual and high-throughput screening. Nature Rev. Drug Discov. 1, 882–894 (2002). This review article covers the current concepts of integrating both virtual and high-throughput screening.
Teague, J. S., Davis, A. M., Leeson, P. D. & Oprea, T. The design of leadlike combinatorial libraries. Angew. Chem. Int. Ed. Engl. 38, 3743–3748 (1999).
Walters, P. & Murcko, M. A. Prediction of 'drug-likeness' Adv. Drug Deliv. Rev. 54, 255–271 (2002).
Martin, E. J. & Critchlow, R. E. Beyond mere diversity: tailoring combinatorial libraries for drug discovery. J. Comb. Chem. 1, 32–45 (1999).
Menard, P. R., Mason, J. S., Morize I. & Bauerschmidt, S. Chemistry space metrics in diversity analysis, library design and compound selection. J. Chem. Inf. Comput. Sci. 38, 1204–1213 (1998).
Roche, O. et al. Development of a virtual screening method for identification of 'Frequent Hitters' in compound libraries. J. Med. Chem. 45, 137–142 (2002).
Balkenhohl, F., von dem Busche-Hünnefeld, C., Lansky, A. & Zechel, C. Combinatorial synthesis of small organic molecules. Angew. Chem. Int. Ed. Engl. 35, 2288–2337 (1996).
Böhm, H. -J. & Schneider, G. (eds). Virtual Screening for Bioactive Molecules (Wiley–VCH, Weinheim, 2000). An excellent compendium of current virtual screening methods.
Hann, M. M., Leach, A. R. & Harper, G. Molecular complexity and its impact on the probability of finding leads for drug discovery. J. Chem. Inf. Comput. Sci. 41, 856–864 (2001).
Crossley, R. From hits to leads, focusing the eyes of medicinal chemistry. Modern Drug Discov. 5, 18–22 (2002).
Van Dogen, M., Weigelt, J., Uppenberg, J., Schultz, J. & Wikström, M. Structure-based screening and design in drug discovery. Drug Discov. Today 7, 471–477 (2002).
Carr, R. & Jhoti, H. Structure-based screening of low affinity compounds. Drug Discov. Today 7, 522–527 (2002).
Huang, L., Lee, A. & Ellman, J. A. Identification of potent and selective mechanism-based inhibitors of the cysteine protease cruzain using solid-phase parallel synthesis. J. Med. Chem. 45, 676–684 (2002).
Patchett, A. A. & Nargund, R. P. Privileged structures — an update. Annu. Rep. Med. Chem. 35, 289–298 (2000).
Bleicher, K. H., Wütherich, Y., Adam, G., Hoffmann, T. & Sleight, A. J. Parallel solution- and solid-phase synthesis of spiropyrrolo-pyrroles as novel NK-1 receptor ligands. Bioorg. Med. Chem. Lett. 12, 3073–3076 (2002).
Stahl, M. et al. A validation study on the practical use of automated de novo design. J. Comput.-Aided Mol. Des. 16, 459–478 (2002).
Schneider, G. et al. Virtual screening for bioactive molecules by de novo design. Angew. Chem Int. Ed. Engl. 39, 4130–4133 (2000).
Schneider, G. & Böhm, H. -J. Virtual screening and fast automated docking methods. Drug Discov. Today 7, 64–70 (2002).
Lipinski, C., Lombardo, F., Dominy, B. & Feeney, P. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv. Drug Deliv. Rev. 23, 3–25 (1997). A landmark publication based on retrospective data analysis for bioavailability resulting in the 'rule-of-five'.
Cariello, N. F. et al. Comparison of the computer programs DEREK and TOPKAT to predict bacterial mutagenicity. Mutagenesis 17, 321–329 (2002).
Sadowski, J. & Kubinyi, H. A scoring scheme for discriminating between drugs and nondrugs. J. Med. Chem. 41, 3325–3329 (1998).
Zuegge, J. et al. A fast virtual screening filter for cytochrome P450 3A4 inhibition liability of compound libraries. Quant. Struct.-Act. Relat. 21, 249–256 (2002).
Roche, O. et al. A virtual screening method for prediction of the hERG potassium channel liability of compound libraries. Chembiochem 3, 455–459 (2002).
Schneider, G., Neidhart, W., Giller, T. & Schmid, S. 'Scaffold hopping' by topological pharmacophore search: a contribution to virtual screening. Angew. Chem Int. Ed. Engl. 38, 2894–2896 (1999).
Mason, J. S., Good, A. C. & Martin, E. J. 3-D Pharmacophores in drug discovery. Curr. Pharm. Des. 7, 567–597 (2001).
Bissantz, C., Folkers, G. & Rognan, D. Protein-based virtual screening of chemical databases. 1. Evaluation of different docking/scoring combinations. J. Med. Chem. 43, 4759–4767 (2000).
Duckworth, D. M. & Sanseau, P. In silico identification of novel therapeutic targets. Drug Discov. Today 7, 64–69 (2002).
Lee, D. K. et al. Identification of four human G-protein-coupled receptors expressed in the brain. Mol. Brain Res. 86, 13–22 (2001). This paper describes the successful identification of orphan G-protein-coupled receptors initiated by bioinformatic approaches.
Alaimo, P. J., Shogren-Knaak, M. A. & Shokat, K. M. Chemical genetic approaches for the elucidation of signaling pathways. Curr. Opin. Chem. Biol. 5, 360–367 (2001).
McGregor, M. J. & Pallai, P. V. Clustering of large databases of compounds: using the MDL “keys” as structural descriptors. J. Chem. Inf. Comp. Sci. 37, 443–448 (1997).
Stanton, D. T. Evaluation and use of BCUT descriptors in QSAR and QSPR studies. J. Chem. Inf. Com. Sci. 39, 11–20 (1999).
Sprague, P. W. Automated chemical hypothesis generation and database searching with CATALYST. Perspect. Drug Discov. Design 3, 1–20 (1995).
Liebeschuetz, J. W. et al. PRO_SELECT: combining structure-based drug design and array-based chemistry for rapid lead discovery. 2. The development of a series of highly potent and selective Factor Xa inhibitors. J. Med. Chem. 45, 1221–1232 (2002).
Boehm, H. -J. Prediction of binding constants of protein ligands: a fast method for the prioritization of hits obtained from de novo design or 3D database search programs. J. Comput.-Aided Mol. Des. 12, 309–323 (1998).
Acknowledgements
Dr. Simona Ceccarelli is cordially thanked for providing the cartoons 'Don't panic....' and 'Where there's a will, there's a way...'.
Author information
Authors and Affiliations
Corresponding author
Related links
Glossary
- DRUGABILITY
-
The feasibility of a target to be effectively modulated by a small molecule ligand that has appropriate bio-physicochemical and absorption, distribution, metabolism and excretion properties to be developed into a drug candidate with appropriate properties for the desired therapeutic use.
- HIT
-
A primary active compound(s), with non-promiscuous binding behaviour, exceeding a certain threshold value in a given assay(s). The 'active' is followed up with an identity and purity evaluation, an authentic sample is then obtained or re-synthesized and activity confirmed in a multi-point activity determination to establish the validity of the hit (validated hit).
- LEAD
-
A prototypical chemical structure or series of structures that demonstrate activity and selectivity in a pharmacological or biochemically relevant screen. This forms the basis for a focused medicinal chemistry effort for lead optimization and development with the goal of identifying a clinical candidate. A distinct lead series has a unique core structure and the ability to be patented separately.
- HIGH-THROUGHPUT SCREENING
-
Screening (of a compound collection) to identify hits in an in vitro assay, usually performed robotically in 384-well microtitre plates.
- HIGH-CONTENT LEAD SERIES
-
A lead series in which representatives have been extensively refined in not only their structure–activity relationship and selectivity, but also in their physicochemical and early absorption, distribution, metabolism and excretion properties, and safety measures, such as metabolic stability, permeation and hERG liabilities. Correlations have been elucidated and all crucial parameters have shown themselves to be modulated in the series.
- STRUCTURE–ACTIVITY RELATIONSHIP
-
The consistent correlation of structural features or groups with the biological activity of compounds in a given biological assay.
- PHYSICOCHEMICAL PROPERTIES
-
Physical molecular properties of a compound. Typical properties are solubility, acidity, lipophilicity, polar surface area, shape, flexibility and so on.
- VALIDATED HIT SERIES
-
A set of hits clustered into sub-structurally related families, representatives of which have been evaluated for their specificity, selectivity, physicochemical and in vitro ADME properties to characterize the series.
- LEAD SERIES IDENTIFIED
-
A peer-reviewed milestone, the requirements to be fulfilled are closely linked to the clinical candidate profile. Initial criteria are defined when hits are first identified; they include activity, selectivity and pertinent physicochemical properties, plus an evaluation of ADME and certain safety attributes. In vivo activity is not a mandatory requirement, provided the obstacles are appreciated and considered to be surmountable based on evidence.
- CYTOCHROME P450
-
A family of promiscuous iron-haem-containing enzymes involved in oxidative metabolism of a broad variety of xenobiotics and drug compounds.
- PHARMACOPHORE
-
The spatial orientation of various functional groups or features necessary for activity at a biomolecular target.
- MULTI-DIMENSIONAL OPTIMIZATION
-
The process of parallel optimization of several relevant drug-property parameters in concert with activity, to produce a drug candidate with balanced property profiles suitable for clinical development.
- NUCLEAR MAGNETIC RESONANCE
-
A spectroscopy tool used for the assignment and confirmation of chemical structure of a compound or biological macromolecule. Sophisticated multi-dimensional methods are used to characterize larger and more complex biomolecules.
- COMBINATORIAL CHEMISTRY
-
Synthesis technologies to generate compound libraries rather than single products. Robotic instruments for solid- and solution-phase chemistry, as well as high-throughput purification equipment, are applied.
- DRUG-LIKENESS
-
A scoring metric (computational) for the similarity of a given structure to a representative reference set of marketed drugs.
- DIVERSITY
-
A property–distance metric reflecting the dissimilarity of objects (molecules). Various molecular descriptors (indices) are used to define compounds in a numerical fashion so that they can be readily compared. Such measures must be considered within an appropriate context to be meaningful.
- 'FREQUENT-HITTER' LIABILITIES
-
An empirically derived metric by which compounds are assigned a probability to produce (false) positive results (hits) frequently in diverse screening assays.
- MOLECULAR TOPOLOGY
-
A graph-based method of describing molecular structure using atom connectivity through the molecular framework and assigning atoms or substructural domains with various property types: lipophilic, H-bond acceptor/donor, positively/ negatively charged and so on.
- DESCRIPTORS
-
Metrics used to numerically describe a structure or certain molecular attributes of a compound (for example, Tanimoto, Ghose and Crippen, BCUT and so on).
- PARALLEL SYNTHESIS
-
The process by which a set of individual compounds is made simultaneously using common chemical building blocks and homologous reagents.
- PRIVILEGED STRUCTURE
-
A specific core or scaffolding structure that imparts a generic activity towards a protein family or limited set of its members independently of the specific substituents attached to it.
- CONFORMERS
-
Distinct three-dimensional forms of a molecular structure of a given atomic connectivity, which results from internal rotations about single bonds between atoms.
- DOCKING AND SCORING
-
The process of computationally placing a virtual molecular structure into a binding site of a biological macromolecule (docking) and flexibly or rigidly relaxing the respective structures then ranking (scoring) the complementarity of fit.
- SMILES
-
A character-based line notation for chemical structures.
- BIT STRINGS
-
A contiguous set of characters that consists entirely of 1s and 0s, which can be used to encode, for example, the presence or not of structural elements in a compound.
- RAPID FEEDBACK SCREENING
-
Rapid feedback provided by assaying small compound sets (< 1,000) through a medium- throughput assay to guide the SAR for rapid iterative design and synthesis cycles.
Rights and permissions
About this article
Cite this article
Bleicher, K., Böhm, HJ., Müller, K. et al. Hit and lead generation: beyond high-throughput screening. Nat Rev Drug Discov 2, 369–378 (2003). https://doi.org/10.1038/nrd1086
Issue Date:
DOI: https://doi.org/10.1038/nrd1086
This article is cited by
-
Performance evaluation of drug synergy datasets using computational intelligence approaches
Multimedia Tools and Applications (2024)
-
Artificial intelligence for prediction of biological activities and generation of molecular hits using stereochemical information
Journal of Computer-Aided Molecular Design (2023)
-
Towards Exploring Large Molecular Space: An Efficient Chemical Genetic Algorithm
Journal of Computer Science and Technology (2022)
-
Synthetic Lethal Interactions Prediction Based on Multiple Similarity Measures Fusion
Journal of Computer Science and Technology (2021)
-
Evaluation of deep and shallow learning methods in chemogenomics for the prediction of drugs specificity
Journal of Cheminformatics (2020)