Reinforcement learning for batch bioprocess optimization

Research output: Contribution to journalArticlepeer-review

Abstract

Bioprocesses have received a lot of attention to produce clean and sustainable alternatives to fossil-based materials. However, they are generally difficult to optimize due to their unsteady-state operation modes and stochastic behaviours. Furthermore, biological systems are highly complex, therefore plant-model mismatch is often present. To address the aforementioned challenges we propose a Reinforcement learning based optimization strategy for batch processes.

In this work we applied the Policy Gradient method from batch-to-batch to update a control policy parametrized by a recurrent neural network. We assume that a preliminary process model is available, which is exploited to obtain a preliminary optimal control policy. Subsequently, this policy is updated based on measurements from the true plant. The capabilities of our proposed approach were tested on three case studies (one of which is nonsmooth) using a more complex process model for the true system embedded with adequate process disturbance. Lastly, we discussed advantages and disadvantages of this strategy compared against current existing approaches such as nonlinear model predictive control.
Original languageEnglish
Article number106649
JournalCOMPUTERS & CHEMICAL ENGINEERING
Volume133
Early online date18 Nov 2019
DOIs
Publication statusPublished - 18 Nov 2019

Keywords

  • Batch optimization
  • Bioprocesses
  • Machine learning
  • Nonsmooth
  • Policy gradient
  • Recurrent neural networks
  • Uncertain dynamic systems

Fingerprint

Dive into the research topics of 'Reinforcement learning for batch bioprocess optimization'. Together they form a unique fingerprint.

Cite this