Recently I came across the Gram matrix and tried to understand what it means intuitively. As with most of my attempts to understand mathematical concepts intuitively, the process quickly became recursive, because the definitions relied on other terms I didn’t fully understand.
I research various topics, mostly in the field of reinforcement learning. I am currently a Ph.D. student in the Reinforcement Learning and Artificial Intelligence lab, part of the Alberta Machine Intelligence Institute and the Department of Computing Science at the University of Alberta. My supervisor is Professor Rich Sutton.
An Off-policy Policy Gradient Theorem Using Emphatic Weightings. Ehsan Imani*, Eric Graves*, and Martha White. Advances in Neural Information Processing Systems 31 (NeurIPS), 2018. (*joint first author) [pdf]