Wang, Yanze. 2025. “Using Reward Machines for Offline Reinforcement Learning With Non-Markovian Reward Functions”. Scientific Insights 1 (1). Hong Kong:005. https://doi.org/10.64897/si.2025v1i1.005.