The PRINTS database: a resource for identification of protein families.

    Research output: Contribution to journal › Article › peer-review


    The PRINTS database houses a collection of protein fingerprints, which may be used to assign family and functional attributes to uncharacterised sequences, such as those currently emanating from the various genome-sequencing projects. The April 2002 release includes 1,700 family fingerprints, encoding approximately 10,500 motifs, covering a range of globular and membrane proteins, modular polypeptides and so on. Fingerprints are groups of conserved motifs that, taken together, provide diagnostic protein family signatures. They derive much of their potency from the biological context afforded by matching motif neighbours; this makes them at once more flexible and powerful than single-motif approaches. The technique further departs from other pattern-matching methods by readily allowing the creation of fingerprints at superfamily-, family- and subfamily-specific levels, thereby allowing more fine-grained diagnoses. Here, we provide an overview of the method of protein fingerprinting and how the results of fingerprint analyses are used to build PRINTS and its relational cousin, PRINTS-S.
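The idea that several motifs matched together are more diagnostic than any single motif can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the family names, the motif patterns, the sequence, the `min_fraction` threshold and the function names are hypothetical, and regular expressions merely stand in for motifs. This is not the scanning method used to build PRINTS.

```python
import re

# Hypothetical toy fingerprints: each family is an ordered list of motifs.
# Regular expressions are a stand-in chosen for brevity.
TOY_FINGERPRINTS = {
    "TOY_FAMILY_A": ["GDS[AG]", "W..P", "HRD"],
    "TOY_FAMILY_B": ["CC..H", "KKE"],
}

# Hypothetical query sequence.
SEQUENCE = "MAGDSGLLWTAPKKHRDLL"


def match_fingerprint(sequence, motifs):
    """Return start positions of the motifs found in order along the sequence.

    A motif only counts if it occurs after the previous matched motif,
    so the neighbouring motifs supply context for each individual hit.
    Missing motifs are skipped, which permits partial matches.
    """
    positions = []
    cursor = 0
    for pattern in motifs:
        hit = re.compile(pattern).search(sequence, cursor)
        if hit is None:
            continue
        positions.append(hit.start())
        cursor = hit.end()
    return positions


def best_family(sequence, fingerprints, min_fraction=0.6):
    """Return the family with the largest fraction of motifs matched.

    Returns None when no family reaches min_fraction (an assumed cutoff).
    """
    best_name, best_fraction = None, 0.0
    for name, motifs in fingerprints.items():
        fraction = len(match_fingerprint(sequence, motifs)) / len(motifs)
        if fraction >= min_fraction and fraction > best_fraction:
            best_name, best_fraction = name, fraction
    return best_name


if __name__ == "__main__":
    print(match_fingerprint(SEQUENCE, TOY_FINGERPRINTS["TOY_FAMILY_A"]))
    print(best_family(SEQUENCE, TOY_FINGERPRINTS))
```

Requiring motifs to appear in order, and reporting the fraction matched rather than a yes/no answer, is one simple way to model the flexibility the abstract attributes to multi-motif signatures. A single-motif approach would correspond to a fingerprint of length one.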
    Original language: English
    Pages (from-to): 252-263
    Number of pages: 11
    Journal: Briefings in Bioinformatics
    Issue number: 3
    Publication status: Published - Sept 2002

