Paper Group ANR 481
May 5, 2019
State Duration and Interval Modeling in Hidden Semi-Markov Model for Sequential Data Analysis. Fairness in Reinforcement Learning. Backprop KF: Learning Discriminative Deterministic State Estimators. Optimal and Adaptive Off-policy Evaluation in Contextual Bandits. Binomial Checkpointing for Arbitrary Programs with No User Annotation. A scaled Breg …