publications

2022

  1. Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning
    Runze LiuFengshuo BaiYali Du, and Yaodong Yang
    In Advances in Neural Information Processing Systems (NeurIPS), 2022