Paper Group AWR 235
October 20, 2019
Learning the Reward Function for a Misspecified Model. Fix your classifier: the marginal value of training the last weight layer. Neural Best-Buddies: Sparse Cross-Domain Correspondence. Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations. Factorised spatial representation learning: application in semi-supervised …