Weakly-Supervised Multi-Action Offline Reinforcement Learning for Intelligent Dosing of Epilepsy in Children

Zhou Li, Yifei Shen, Ruiqing Xu, Yu Yang, Jiannong Cao, Linchun Wu, Qing Wu

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

Epilepsy in childhood is a common neurological disorder in children. Most cases are benign childhood epilepsy, which can be controlled with medication by adaptive adjustment of the dosage of antiepileptic drugs (AEDs). Recently, reinforcement learning-based intelligent dosing has attracted increasing attention. In clinical practice, patients usually take more than one drug at a time, however, conventional reinforcement learning algorithms do not sufficiently address the combination of two or more active drugs. In this paper, we propose the multi-action offline reinforcement learning (MA-ORL) model to solve this problem. Concretely, MA-ORL inherits the basic framework of the actor-critic network. For the patient’s health and safety concerns, MA-ORL abandons the intrusive and high-risk “trial-and-error” interactions with the environment but directly learns from the offline clinical dataset in the initial phase until the model is sufficiently trained and ready to use. Besides, to choose multiple actions simultaneously, we replace the actor’s output in standard reinforcement learning with a 2D matrix indicating the mixed feature representation of all different actions, then multiply it with separate masks to obtain separated actions. In addition to reaching an optimal return (i.e., the reduction of seizure frequency), MA-ORL also emphasizes the accuracy of the recommended dosage. Therefore, we introduce weak supervision to the learning objective to restrict the range of the learning outcome. It guarantees the recommended dosage by MA-ORL is as close as the one prescribed by experienced physicians. We conduct extensive experiments on clinical medical records containing 245 cases of epilepsy in childhood. Experimental results show MA-ORL reaches the highest cumulative return among all baseline models. Moreover, the suggested amount of medication taken a day by MA-ORL is more accurate than any other benchmark.
Original languageEnglish
Title of host publicationDatabase Systems for Advanced Applications: 28th International Conference, DASFAA 2023, Tianjin, China, April 17–20, 2023, Proceedings, Part IV
Pages208–223
Number of pages16
Publication statusPublished - 17 Apr 2023
EventThe 28th International Conference on Database Systems for Advanced Applications - Tianjin, China
Duration: 17 Apr 202320 Apr 2023
http://www.tjudb.cn/dasfaa2023/

Conference

ConferenceThe 28th International Conference on Database Systems for Advanced Applications
Abbreviated titleDASFAA-2023
Country/TerritoryChina
CityTianjin
Period17/04/2320/04/23
Internet address

Keywords

  • Multi-Action Reinforcement Learning
  • Offline Reinforcement Learning
  • Weakly-Supervised Learning
  • Epilepsy in Children

Fingerprint

Dive into the research topics of 'Weakly-Supervised Multi-Action Offline Reinforcement Learning for Intelligent Dosing of Epilepsy in Children'. Together they form a unique fingerprint.

Cite this