[PAPER]@Telematika | Distributional Reinforcement Learning

Distributional Reinforcement Learning

Paper Group AWR 101

February 1, 2020

FreeLabel: A Publicly Available Annotation Tool based on Freehand Traces. Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation. Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification. A neural network oracle for quantum nonlocality problems in networks. ROI Pooled Cor …

Paper Group AWR 275

February 1, 2020

Beyond Product Quantization: Deep Progressive Quantization for Image Retrieval. A Model to Search for Synthesizable Molecules. Binarized Knowledge Graph Embeddings. OpenBioLink: A benchmarking framework for large-scale biomedical link prediction. FastFusionNet: New State-of-the-Art for DAWNBench SQuAD. A Hitchhiker’s Guide to Statistical Comparison …

Paper Group ANR 453

January 30, 2020

Using association rule mining and ontologies to generate metadata recommendations from multiple biomedical databases. Constrained Generative Adversarial Networks for Interactive Image Generation. Multi-Gradient Descent for Multi-Objective Recommender Systems. Reinforcement Learning of Minimalist Numeral Grammars. Detecting Multiple Speech Disfluenc …

Paper Group ANR 652

January 29, 2020

Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts. Distributional Reinforcement Learning for Energy-Based Sequential Models. GAN-powered Deep Distributional Reinforcement Learning for Resource Management in Network Slicing. Improved Path-length Regret Bounds for Bandits. Recognition of Images of Korean Ch …

Paper Group ANR 897

January 28, 2020

Stochastically Dominant Distributional Reinforcement Learning. Distributed Parameter Estimation in Randomized One-hidden-layer Neural Networks. The Knowledge Within: Methods for Data-Free Model Compression. Understanding LSTM – a tutorial into Long Short-Term Memory Recurrent Neural Networks. Communication-Efficient Distributed Online Learning wit …

Paper Group ANR 898

January 28, 2020

Batch-Shaping for Learning Conditional Channel Gated Networks. A Simple Dynamic Learning Rate Tuning Algorithm For Automated Training of DNNs. Unsupervised Learning through Temporal Smoothing and Entropy Maximization. Distributional reinforcement learning with linear function approximation. Nonstationary Multivariate Gaussian Processes for Electron …

Paper Group ANR 1262

January 27, 2020

Parallel Restarted SPIDER – Communication Efficient Distributed Nonconvex Optimization with Optimal Computation Complexity. From Personalization to Privatization: Meta Matrix Factorization for Private Rating Predictions. Learning Transferable Self-attentive Representations for Action Recognition in Untrimmed Videos with Weak Supervision. A Compara …