Hery, Hery, and Ariel Christopher Wawolangi. “Decision Policy Optimization for Human–AI Collaboration Using Off-Policy Reinforcement Learning from Logged Interaction Data”. International Journal for Applied Information Management 6, no. 2 (June 17, 2026): 272–289. Accessed June 18, 2026. http://ijaim.net/journal/index.php/ijaim/article/view/121.