(1)
Hery, H.; Wawolangi, A. C. . Decision Policy Optimization for Human–AI Collaboration Using Off-Policy Reinforcement Learning from Logged Interaction Data. Int. J. Appl. Inf. Manag. 2026, 6, 272-289.