Abstract: This study investigates the design of reward functions for deep reinforcement learning-based source term estimation (STE). Estimating the properties of unknown hazardous gas leakage using a ...
Abstract: In complex and dynamic environments, achieving autonomous decision-making and control of agent remains a challenging task. Traditional reinforcement learning algorithms often struggle to ...