Abstract

We present a new algorithm that enumerates molecular structures matching a predefined extended valence sequence or signature. The algorithm can construct molecular structures composed of about 50 non-hydrogen atoms in CPU seconds time scale. The algorithm is run to produce all molecular structures matching the binding affinities (IC(50)) of some HIV-1 protease inhibitors. The algorithm is also used to compute the degeneracy, or the number of molecular structures, corresponding to a given signature. Signature degeneracy is systematically studied for varying signature heights on four molecular series, alkanes, alcohols, fullerene-type structures, and peptides. Signature degeneracy is compared with similar results obtained with popular topological indices (TIs). As a general rule, we find that signature degeneracy decreases as the signature height increases. We also find that alkanes, alcohols, and fullerene-type structures comprising n non-hydrogen atoms are uniquely characterized by signatures of height n/4, while peptides up to 4000 amino acids can be singled out with signatures of heights as small as 2 and 3.

Keywords

Degeneracy (biology)Signature (topology)Valence (chemistry)ChemistryMoleculeComputational chemistryAlgorithmBioinformaticsMathematicsBiologyGeometry

Affiliated Institutions

Related Publications

Publication Info

Year
2003
Type
article
Volume
43
Issue
3
Pages
721-734
Citations
156
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

156
OpenAlex

Cite This

Jean‐Loup Faulon, Carla J. Churchwell, Donald P. Visco (2003). The Signature Molecular Descriptor. 2. Enumerating Molecules from Their Extended Valence Sequences. Journal of Chemical Information and Computer Sciences , 43 (3) , 721-734. https://doi.org/10.1021/ci020346o

Identifiers

DOI
10.1021/ci020346o