Vis enkel innførsel

dc.contributor.authorPetsagkourakis, Panagiotis
dc.contributor.authorSandoval, Ilya Orson
dc.contributor.authorBradford, Eric
dc.contributor.authorZhang, Dongda
dc.contributor.authordel Rio-Chanona, Ehecatl Antonio
dc.date.accessioned2019-12-17T12:45:59Z
dc.date.available2019-12-17T12:45:59Z
dc.date.created2019-07-27T12:50:13Z
dc.date.issued2019
dc.identifier.citationComputer-aided chemical engineering. 2019, 46 919-924.nb_NO
dc.identifier.issn1570-7946
dc.identifier.urihttp://hdl.handle.net/11250/2633672
dc.description.abstractBioprocesses have received great attention from the scientific community as an alternative to fossil-based products by microorganisms-synthesised counterparts. However, bioprocesses are generally operated at unsteady-state conditions and are stochastic from a macro-scale perspective, making their optimisation a challenging task. Furthermore, as biological systems are highly complex, plant-model mismatch is usually present. To address the aforementioned challenges, in this work, we propose a reinforcement learning based online optimisation strategy. We first use reinforcement learning to learn an optimal policy given a preliminary process model. This means that we compute diverse trajectories and feed them into a recurrent neural network, resulting in a policy network which takes the states as input and gives the next optimal control action as output. Through this procedure, we are able to capture the previously believed behaviour of the biosystem. Subsequently, we adopted this network as an initial policy for the “real” system (the plant) and apply a batch-to-batch reinforcement learning strategy to update the network’s accuracy. This is computed by using a more complex process model (representing the real plant) embedded with adequate stochasticity to account for the perturbations in a real dynamic bioprocess. We demonstrate the effectiveness and advantages of the proposed approach in a case study by computing the optimal policy in a realistic number of batch runs.nb_NO
dc.language.isoengnb_NO
dc.publisherElseviernb_NO
dc.titleReinforcement Learning for Batch-to-Batch Bioprocess Optimisationnb_NO
dc.typeChapternb_NO
dc.description.versionpublishedVersionnb_NO
dc.source.pagenumber919-924nb_NO
dc.source.volume46nb_NO
dc.source.journalComputer-aided chemical engineeringnb_NO
dc.identifier.doi10.1016/B978-0-12-818634-3.50154-5
dc.identifier.cristin1712944
dc.description.localcodeThis chapter will not be available due to copyright restrictions (c) 2019 by European Association of Geoscientists and Engineersnb_NO
cristin.unitcode194,63,25,0
cristin.unitnameInstitutt for teknisk kybernetikk
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel