Asynchronous Stochastic Gradient Descent with Delay Compensation. Sifting Common Information from Many Variables. Dependency Sensitive Convolutional Neural Networks for Modeling Sentences and Documents. Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning. Language Models with Pre-Trained (GloVe) Word Embeddings. Preco …