Tags

Type your tag names separated by a space and hit enter

A simple clustering technique to improve QSAR model selection and predictivity: application to a receptor independent 4D-QSAR analysis of cyclic urea derived inhibitors of HIV-1 protease.
J Chem Inf Comput Sci. 2003 Nov-Dec; 43(6):2180-93.JC

Abstract

A training set of 50 tetrahydropyrimidine-2-one based inhibitors of HIV-1 protease, for which the -log K(i) values were measured, was used to construct receptor independent 4D-QSAR models. A novel clustering technique was employed to facilitate and improve model selection as well as test set predictions. Following the manifold model theory, five unique models were chosen by the clustering algorithm (q(2) = 0.81-0.84). The models were used to map the atom type morphology of the inhibitor binding site of HIV-1 protease as well as to predict the potencies (-log K(i)) of 10 test set compounds. The rank-difference correlation coefficient was used to evaluate the quality of the test set predictions, which was improved from 0.39 to 0.68 when the clustering technique was applied. The set of five models, collectively, identify the important binding characteristics of the HIV protease receptor site. This study demonstrates that the selected simple clustering technique provides a discrete algorithm for model selection, as well as improving the quality of test set, or unknown, compound prediction as determined by the rank-difference correlation coefficient.

Authors+Show Affiliations

Laboratory of Molecular Modeling and Design (M/C-781), University of Illinois at Chicago, College of Pharmacy, 833 South Wood Street, Chicago, Illinois 60612-7231, USA.No affiliation info available

Pub Type(s)

Journal Article
Research Support, Non-U.S. Gov't

Language

eng

PubMed ID

14632470

Citation

Senese, Craig L., and A J. Hopfinger. "A Simple Clustering Technique to Improve QSAR Model Selection and Predictivity: Application to a Receptor Independent 4D-QSAR Analysis of Cyclic Urea Derived Inhibitors of HIV-1 Protease." Journal of Chemical Information and Computer Sciences, vol. 43, no. 6, 2003, pp. 2180-93.
Senese CL, Hopfinger AJ. A simple clustering technique to improve QSAR model selection and predictivity: application to a receptor independent 4D-QSAR analysis of cyclic urea derived inhibitors of HIV-1 protease. J Chem Inf Comput Sci. 2003;43(6):2180-93.
Senese, C. L., & Hopfinger, A. J. (2003). A simple clustering technique to improve QSAR model selection and predictivity: application to a receptor independent 4D-QSAR analysis of cyclic urea derived inhibitors of HIV-1 protease. Journal of Chemical Information and Computer Sciences, 43(6), 2180-93.
Senese CL, Hopfinger AJ. A Simple Clustering Technique to Improve QSAR Model Selection and Predictivity: Application to a Receptor Independent 4D-QSAR Analysis of Cyclic Urea Derived Inhibitors of HIV-1 Protease. J Chem Inf Comput Sci. 2003 Nov-Dec;43(6):2180-93. PubMed PMID: 14632470.
* Article titles in AMA citation format should be in sentence-case
TY - JOUR T1 - A simple clustering technique to improve QSAR model selection and predictivity: application to a receptor independent 4D-QSAR analysis of cyclic urea derived inhibitors of HIV-1 protease. AU - Senese,Craig L, AU - Hopfinger,A J, PY - 2003/11/25/pubmed PY - 2004/10/19/medline PY - 2003/11/25/entrez SP - 2180 EP - 93 JF - Journal of chemical information and computer sciences JO - J Chem Inf Comput Sci VL - 43 IS - 6 N2 - A training set of 50 tetrahydropyrimidine-2-one based inhibitors of HIV-1 protease, for which the -log K(i) values were measured, was used to construct receptor independent 4D-QSAR models. A novel clustering technique was employed to facilitate and improve model selection as well as test set predictions. Following the manifold model theory, five unique models were chosen by the clustering algorithm (q(2) = 0.81-0.84). The models were used to map the atom type morphology of the inhibitor binding site of HIV-1 protease as well as to predict the potencies (-log K(i)) of 10 test set compounds. The rank-difference correlation coefficient was used to evaluate the quality of the test set predictions, which was improved from 0.39 to 0.68 when the clustering technique was applied. The set of five models, collectively, identify the important binding characteristics of the HIV protease receptor site. This study demonstrates that the selected simple clustering technique provides a discrete algorithm for model selection, as well as improving the quality of test set, or unknown, compound prediction as determined by the rank-difference correlation coefficient. SN - 0095-2338 UR - https://www.unboundmedicine.com/medline/citation/14632470/A_simple_clustering_technique_to_improve_QSAR_model_selection_and_predictivity:_application_to_a_receptor_independent_4D_QSAR_analysis_of_cyclic_urea_derived_inhibitors_of_HIV_1_protease_ DB - PRIME DP - Unbound Medicine ER -