Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape

Christopher G. Knight, Mark Platt, William Rowe, David C. Wedge, Farid Khan, Philip J R Day, Andy Mcshea, Joshua Knowles, Douglas B. Kell

    Research output: Contribution to journal > Article > peer-review


    Mapping the landscape of possible macromolecular polymer sequences to their fitness in performing biological functions is a challenge across the biosciences. A paradigm is the case of aptamers, nucleic acids that can be selected to bind particular target molecules. We have characterized the sequence-fitness landscape for aptamers binding allophycocyanin (APC) protein via a novel Closed Loop Aptameric Directed Evolution (CLADE) approach. In contrast to the conventional SELEX methodology, selection and mutation of aptamer sequences were carried out in silico, with explicit fitness assays for 44 131 aptamers of known sequence using DNA microarrays in vitro. We capture the landscape using a predictive machine learning model linking sequence features and function and validate this model using 5500 entirely separate test sequences, which give a very high observed versus predicted correlation of 0.87. This approach reveals a complex sequence-fitness mapping, and hypotheses for the physical basis of aptameric binding; it also enables rapid design of novel aptamers with desired binding properties. We demonstrate an extension to the approach by incorporating prior knowledge into CLADE, resulting in some of the tightest binding sequences. © 2008 The Author(s).
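
The closed-loop idea in the abstract (select and mutate sequences in silico, measure each one explicitly, and keep every sequence-fitness pair) can be illustrated with a generic evolutionary loop. This is a minimal sketch under stated assumptions, not the authors' CLADE implementation: the function names, population size, sequence length, mutation rate and selection scheme are all hypothetical, and `toy_assay` is only a stand-in for the in vitro microarray measurement.

```python
import random

BASES = "ACGT"


def random_sequence(length, rng):
    """Draw a random DNA sequence of the given length."""
    return "".join(rng.choice(BASES) for _ in range(length))


def mutate(seq, rate, rng):
    """Replace each base with a different base with probability `rate`."""
    out = []
    for base in seq:
        if rng.random() < rate:
            out.append(rng.choice([b for b in BASES if b != base]))
        else:
            out.append(base)
    return "".join(out)


def toy_assay(seq):
    """Hypothetical stand-in for the array measurement: GC fraction.

    In the real workflow this value would come from a microarray
    binding assay, not from a formula.
    """
    return sum(base in "GC" for base in seq) / len(seq)


def closed_loop(assay, generations=5, pop_size=50, length=30,
                keep=10, rate=0.05, seed=0):
    """Run a simple select-and-mutate loop and return all measurements.

    Returns a dict mapping every assayed sequence to its fitness, i.e.
    an explicit (if tiny) sample of the sequence-fitness landscape.
    All parameter defaults are illustrative assumptions.
    """
    rng = random.Random(seed)
    population = [random_sequence(length, rng) for _ in range(pop_size)]
    measured = {}
    for generation in range(generations):
        # "In vitro" step: assay every sequence not yet measured.
        for seq in population:
            if seq not in measured:
                measured[seq] = assay(seq)
        if generation == generations - 1:
            break
        # "In silico" step: keep the fittest distinct sequences ...
        unique = list(dict.fromkeys(population))
        parents = sorted(unique, key=measured.get, reverse=True)[:keep]
        # ... and fill the next generation with their mutants.
        offspring = [mutate(rng.choice(parents), rate, rng)
                     for _ in range(pop_size - len(parents))]
        population = parents + offspring
    return measured


if __name__ == "__main__":
    landscape = closed_loop(toy_assay)
    best = max(landscape, key=landscape.get)
    print(len(landscape), "sequences assayed; best:", best, landscape[best])
```

Because every assayed sequence is stored with its measured fitness, the returned dictionary is the kind of explicit data set on which a predictive sequence-to-fitness model could later be trained and validated against held-out sequences.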
    Original language: English
    Article number: e6
    Journal: Nucleic Acids Research
    Issue number: 1
    Publication status: Published - 2009


    • chemistry: Aptamers, Nucleotide
    • Artificial Intelligence
    • methods: Directed Molecular Evolution
    • Models, Statistical
    • Oligonucleotide Array Sequence Analysis
    • metabolism: Phycocyanin
    • Regression Analysis
    • Sequence Analysis, DNA

