Reinforcement Learning for Batch-to-Batch Bioprocess Optimisation

Panagiotis Petsagkourakis, I. Orson Sandoval, Eric Bradford, Dongda Zhang, E.A. del Rio-Chanona

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Bioprocesses have received great attention from the scientific community as an alternative to fossil-based products by microorganisms-synthesised counterparts. However, bioprocesses are generally operated at unsteady-state conditions and are stochastic from a macro-scale perspective, making their optimisation a challenging task. Furthermore, as biological systems are highly complex, plant-model mismatch is usually present. To address the aforementioned challenges, in this work, we propose a reinforcement learning based online optimisation strategy. We first use reinforcement learning to learn an optimal policy given a preliminary process model. This means that we compute diverse trajectories and feed them into a recurrent neural network, resulting in a policy network which takes the states as input and gives the next optimal control action as output. Through this procedure, we are able to capture the previously believed behaviour of the biosystem. Subsequently, we adopted this network as an initial policy for the “real” system (the plant) and apply a batch-to-batch reinforcement learning strategy to update the network’s accuracy. This is computed by using a more complex process model (representing the real plant) embedded with adequate stochasticity to account for the perturbations in a real dynamic bioprocess. We demonstrate the effectiveness and advantages of the proposed approach in a case study by computing the optimal policy in a realistic number of batch runs.
    Original languageEnglish
    Title of host publicationComputer Aided Chemical Engineering
    Pages919-924
    Volume46
    DOIs
    Publication statusPublished - 2019
    Event29th European Symposium on Computer Aided Process Engineering: 29th European Symposium on Computer-Aided Process Engineering - Evoluon, Eindhoven, Netherlands
    Duration: 16 Jun 201919 Jun 2019
    http://www.escape29.nl

    Publication series

    NameComputer Aided Chemical Engineering
    PublisherElsevier
    ISSN (Electronic)1570-7946

    Conference

    Conference29th European Symposium on Computer Aided Process Engineering
    Abbreviated titleESCAPE-29
    Country/TerritoryNetherlands
    CityEindhoven
    Period16/06/1919/06/19
    Internet address

    Keywords

    • reinforcement learning
    • Batch process
    • recurrent neural networks
    • bio-processes

    Fingerprint

    Dive into the research topics of 'Reinforcement Learning for Batch-to-Batch Bioprocess Optimisation'. Together they form a unique fingerprint.

    Cite this