The signature molecular descriptor. 2. Enumerating molecules from their extended valence sequences.

Jean-Loup Faulon, Carla J Churchwell, Donald P Visco

    Research output: Contribution to journal › Article › peer-review

    Abstract

    We present a new algorithm that enumerates molecular structures matching a predefined extended valence sequence, or signature. The algorithm can construct molecular structures composed of about 50 non-hydrogen atoms within CPU seconds. It is run to produce all molecular structures matching the binding affinities (IC50) of some HIV-1 protease inhibitors. The algorithm is also used to compute the degeneracy of a given signature, that is, the number of molecular structures corresponding to it. Signature degeneracy is systematically studied for varying signature heights on four molecular series: alkanes, alcohols, fullerene-type structures, and peptides. The results are compared with the degeneracy obtained with popular topological indices (TIs). As a general rule, we find that signature degeneracy decreases as the signature height increases. We also find that alkanes, alcohols, and fullerene-type structures comprising n non-hydrogen atoms are uniquely characterized by signatures of height n/4, while peptides of up to 4000 amino acids can be singled out with signatures of heights as small as 2 and 3.
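
The notion of degeneracy in the abstract can be illustrated with a minimal sketch. It is not the authors' enumeration algorithm, and it uses a deliberately simplified stand-in for the signature: each atom is described by the tree of its neighbors up to a given height (without immediately stepping back to the parent atom), and a molecule is described by the multiset of these atom strings. The function names, the graph encoding, and the two C4 example molecules are assumptions made for illustration only.

```python
from collections import Counter

def atom_signature(graph, labels, atom, height, parent=None):
    """Simplified atom descriptor (an assumption, not the published definition):
    a canonical string for the neighborhood tree of `atom` down to `height`."""
    if height == 0:
        return labels[atom]
    children = sorted(
        atom_signature(graph, labels, nbr, height - 1, atom)
        for nbr in graph[atom]
        if nbr != parent  # do not step straight back to the parent atom
    )
    if not children:
        return labels[atom]
    return labels[atom] + "(" + "".join(children) + ")"

def molecular_signature(graph, labels, height):
    """Multiset of atom descriptors, stored as a sorted, hashable tuple."""
    counts = Counter(atom_signature(graph, labels, a, height) for a in graph)
    return tuple(sorted(counts.items()))

def degeneracy(molecules, height):
    """Map each signature to the number of molecules that share it."""
    return Counter(molecular_signature(g, lab, height) for g, lab in molecules)

# Hypothetical hydrogen-suppressed graphs for the two C4 alkanes.
n_butane = ({0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}, {i: "C" for i in range(4)})
isobutane = ({0: [1, 2, 3], 1: [0], 2: [0], 3: [0]}, {i: "C" for i in range(4)})
molecules = [n_butane, isobutane]

deg_h0 = degeneracy(molecules, height=0)  # only atom types are visible
deg_h1 = degeneracy(molecules, height=1)  # branching becomes visible
print(max(deg_h0.values()), max(deg_h1.values()))
```

In this toy setting both isomers share one height-0 descriptor (four carbon atoms), while at height 1 the branched and linear skeletons separate, which mirrors the general trend stated in the abstract that degeneracy decreases as the height increases.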
    Original language: English
    Journal: Journal of Chemical Information and Computer Sciences
    Volume: 43
    Issue number: 3
    Publication status: Published - 2003
