WANG, Yanze. Using Reward Machines for Offline Reinforcement Learning With Non-Markovian Reward Functions. Scientific Insights, Hong Kong, v. 1, n. 1, p. 005, 2025. DOI: 10.64897/si.2025v1i1.005. Disponível em: https://ojs.achpub.com/SI/article/view/24. Acesso em: 3 oct. 2025.