Abstract
Epilepsy in childhood is a common neurological disorder in children. Most cases are benign childhood epilepsy, which can be controlled with medication by adaptive adjustment of the dosage of antiepileptic drugs (AEDs). Recently, reinforcement learning-based intelligent dosing has attracted increasing attention. In clinical practice, patients usually take more than one drug at a time, however, conventional reinforcement learning algorithms do not sufficiently address the combination of two or more active drugs. In this paper, we propose the multi-action offline reinforcement learning (MA-ORL) model to solve this problem. Concretely, MA-ORL inherits the basic framework of the actor-critic network. For the patient’s health and safety concerns, MA-ORL abandons the intrusive and high-risk “trial-and-error” interactions with the environment but directly learns from the offline clinical dataset in the initial phase until the model is sufficiently trained and ready to use. Besides, to choose multiple actions simultaneously, we replace the actor’s output in standard reinforcement learning with a 2D matrix indicating the mixed feature representation of all different actions, then multiply it with separate masks to obtain separated actions. In addition to reaching an optimal return (i.e., the reduction of seizure frequency), MA-ORL also emphasizes the accuracy of the recommended dosage. Therefore, we introduce weak supervision to the learning objective to restrict the range of the learning outcome. It guarantees the recommended dosage by MA-ORL is as close as the one prescribed by experienced physicians. We conduct extensive experiments on clinical medical records containing 245 cases of epilepsy in childhood. Experimental results show MA-ORL reaches the highest cumulative return among all baseline models. Moreover, the suggested amount of medication taken a day by MA-ORL is more accurate than any other benchmark.
Original language | English |
---|---|
Title of host publication | Database Systems for Advanced Applications: 28th International Conference, DASFAA 2023, Tianjin, China, April 17–20, 2023, Proceedings, Part IV |
Pages | 208–223 |
Number of pages | 16 |
Publication status | Published - 17 Apr 2023 |
Event | The 28th International Conference on Database Systems for Advanced Applications - Tianjin, China Duration: 17 Apr 2023 → 20 Apr 2023 http://www.tjudb.cn/dasfaa2023/ |
Conference
Conference | The 28th International Conference on Database Systems for Advanced Applications |
---|---|
Abbreviated title | DASFAA-2023 |
Country/Territory | China |
City | Tianjin |
Period | 17/04/23 → 20/04/23 |
Internet address |
Keywords
- Multi-Action Reinforcement Learning
- Offline Reinforcement Learning
- Weakly-Supervised Learning
- Epilepsy in Children