WANG, Yanze. Using Reward Machines for Offline Reinforcement Learning With Non-Markovian Reward Functions. Scientific Insights, Hong Kong, v. 1, n. 1, p. 5, 2025. Disponível em: https://ojs.achpub.com/SI/article/view/24. Acesso em: 10 may. 2025.