I research various topics, mostly in the field of reinforcement learning. I am currently a Ph.D. student in the Reinforcement Learning and Artificial Intelligence lab, part of the Alberta Machine Intelligence Institute and the Department of Computing Science at the University of Alberta. My supervisor is Professor Rich Sutton.
An Off-policy Policy Gradient Theorem Using Emphatic Weightings. Ehsan Imani*, Eric Graves*, and Martha White. Advances in Neural Information Processing Systems 31 (NeurIPS), 2018. [pdf]