May 4, 2019

1215 words 6 mins read

Paper Group NANR 199

Paper Group NANR 199

Variance Reduction in Stochastic Gradient Langevin Dynamics. A Comparison of Event Representations in DEFT. Arabic Dialect Identification in Speech Transcripts. Proximal Stochastic Methods for Nonsmooth Nonconvex Finite-Sum Optimization. Attention-based LSTM Network for Cross-Lingual Sentiment Classification. 基於深層類神經網路之音訊事件偵測系統(Deep Neural Networks …

Variance Reduction in Stochastic Gradient Langevin Dynamics

Title Variance Reduction in Stochastic Gradient Langevin Dynamics
Authors Kumar Avinava Dubey, Sashank J. Reddi, Sinead A. Williamson, Barnabas Poczos, Alexander J. Smola, Eric P. Xing
Abstract Stochastic gradient-based Monte Carlo methods such as stochastic gradient Langevin dynamics are useful tools for posterior inference on large scale datasets in many machine learning applications. These methods scale to large datasets by using noisy gradients calculated using a mini-batch or subset of the dataset. However, the high variance inherent in these noisy gradients degrades performance and leads to slower mixing. In this paper, we present techniques for reducing variance in stochastic gradient Langevin dynamics, yielding novel stochastic Monte Carlo methods that improve performance by reducing the variance in the stochastic gradient. We show that our proposed method has better theoretical guarantees on convergence rate than stochastic Langevin dynamics. This is complemented by impressive empirical results obtained on a variety of real world datasets, and on four different machine learning tasks (regression, classification, independent component analysis and mixture modeling). These theoretical and empirical contributions combine to make a compelling case for using variance reduction in stochastic Monte Carlo methods.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6293-variance-reduction-in-stochastic-gradient-langevin-dynamics
PDF http://papers.nips.cc/paper/6293-variance-reduction-in-stochastic-gradient-langevin-dynamics.pdf
PWC https://paperswithcode.com/paper/variance-reduction-in-stochastic-gradient
Repo
Framework

A Comparison of Event Representations in DEFT

Title A Comparison of Event Representations in DEFT
Authors Ann Bies, Zhiyi Song, Jeremy Getman, Joe Ellis, Justin Mott, Stephanie Strassel, Martha Palmer, Teruko Mitamura, Marjorie Freedman, Heng Ji, Tim O{'}Gorman
Abstract
Tasks Anomaly Detection
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-1004/
PDF https://www.aclweb.org/anthology/W16-1004
PWC https://paperswithcode.com/paper/a-comparison-of-event-representations-in-deft
Repo
Framework

Arabic Dialect Identification in Speech Transcripts

Title Arabic Dialect Identification in Speech Transcripts
Authors Shervin Malmasi, Marcos Zampieri
Abstract In this paper we describe a system developed to identify a set of four regional Arabic dialects (Egyptian, Gulf, Levantine, North African) and Modern Standard Arabic (MSA) in a transcribed speech corpus. We competed under the team name MAZA in the Arabic Dialect Identification sub-task of the 2016 Discriminating between Similar Languages (DSL) shared task. Our system achieved an F1-score of 0.51 in the closed training track, ranking first among the 18 teams that participated in the sub-task. Our system utilizes a classifier ensemble with a set of linear models as base classifiers. We experimented with three different ensemble fusion strategies, with the mean probability approach providing the best performance.
Tasks Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4814/
PDF https://www.aclweb.org/anthology/W16-4814
PWC https://paperswithcode.com/paper/arabic-dialect-identification-in-speech
Repo
Framework

Proximal Stochastic Methods for Nonsmooth Nonconvex Finite-Sum Optimization

Title Proximal Stochastic Methods for Nonsmooth Nonconvex Finite-Sum Optimization
Authors Sashank J. Reddi, Suvrit Sra, Barnabas Poczos, Alexander J. Smola
Abstract We analyze stochastic algorithms for optimizing nonconvex, nonsmooth finite-sum problems, where the nonsmooth part is convex. Surprisingly, unlike the smooth case, our knowledge of this fundamental problem is very limited. For example, it is not known whether the proximal stochastic gradient method with constant minibatch converges to a stationary point. To tackle this issue, we develop fast stochastic algorithms that provably converge to a stationary point for constant minibatches. Furthermore, using a variant of these algorithms, we obtain provably faster convergence than batch proximal gradient descent. Our results are based on the recent variance reduction techniques for convex optimization but with a novel analysis for handling nonconvex and nonsmooth functions. We also prove global linear convergence rate for an interesting subclass of nonsmooth nonconvex functions, which subsumes several recent works.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6116-proximal-stochastic-methods-for-nonsmooth-nonconvex-finite-sum-optimization
PDF http://papers.nips.cc/paper/6116-proximal-stochastic-methods-for-nonsmooth-nonconvex-finite-sum-optimization.pdf
PWC https://paperswithcode.com/paper/proximal-stochastic-methods-for-nonsmooth
Repo
Framework

Attention-based LSTM Network for Cross-Lingual Sentiment Classification

Title Attention-based LSTM Network for Cross-Lingual Sentiment Classification
Authors Xinjie Zhou, Xiaojun Wan, Jianguo Xiao
Abstract
Tasks Machine Translation, Representation Learning, Sentiment Analysis, Text Classification
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1024/
PDF https://www.aclweb.org/anthology/D16-1024
PWC https://paperswithcode.com/paper/attention-based-lstm-network-for-cross
Repo
Framework

基於深層類神經網路之音訊事件偵測系統(Deep Neural Networks for Audio Event Detection)[In Chinese]

Title 基於深層類神經網路之音訊事件偵測系統(Deep Neural Networks for Audio Event Detection)[In Chinese]
Authors Jhih-wei Chen, Chia-Hsin Liu, Yuan-Fu Liao
Abstract
Tasks
Published 2016-10-01
URL https://www.aclweb.org/anthology/O16-1028/
PDF https://www.aclweb.org/anthology/O16-1028
PWC https://paperswithcode.com/paper/ao14aeccc2e-a1e3e-aoaa-c3cdeep-neural
Repo
Framework

A Generalized Framework for Hierarchical Word Sequence Language Model

Title A Generalized Framework for Hierarchical Word Sequence Language Model
Authors Xiaoyi Wu, Kevin Duh, Yuji Matsumoto
Abstract
Tasks Language Modelling, Machine Translation, Speech Recognition, Spelling Correction
Published 2016-10-01
URL https://www.aclweb.org/anthology/Y16-2004/
PDF https://www.aclweb.org/anthology/Y16-2004
PWC https://paperswithcode.com/paper/a-generalized-framework-for-hierarchical-word
Repo
Framework

Variational Bayes on Monte Carlo Steroids

Title Variational Bayes on Monte Carlo Steroids
Authors Aditya Grover, Stefano Ermon
Abstract Variational approaches are often used to approximate intractable posteriors or normalization constants in hierarchical latent variable models. While often effective in practice, it is known that the approximation error can be arbitrarily large. We propose a new class of bounds on the marginal log-likelihood of directed latent variable models. Our approach relies on random projections to simplify the posterior. In contrast to standard variational methods, our bounds are guaranteed to be tight with high probability. We provide a new approach for learning latent variable models based on optimizing our new bounds on the log-likelihood. We demonstrate empirical improvements on benchmark datasets in vision and language for sigmoid belief networks, where a neural network is used to approximate the posterior.
Tasks Latent Variable Models
Published 2016-12-01
URL http://papers.nips.cc/paper/6259-variational-bayes-on-monte-carlo-steroids
PDF http://papers.nips.cc/paper/6259-variational-bayes-on-monte-carlo-steroids.pdf
PWC https://paperswithcode.com/paper/variational-bayes-on-monte-carlo-steroids
Repo
Framework

Computing Sentiment Scores of Verb Phrases for Vietnamese

Title Computing Sentiment Scores of Verb Phrases for Vietnamese
Authors Thien Khai Tran, Tuoi Thi Phan
Abstract
Tasks Common Sense Reasoning, Machine Translation, Opinion Mining, Sentiment Analysis
Published 2016-10-01
URL https://www.aclweb.org/anthology/O16-1020/
PDF https://www.aclweb.org/anthology/O16-1020
PWC https://paperswithcode.com/paper/computing-sentiment-scores-of-verb-phrases
Repo
Framework

A Machine Learning Approach to Clinical Terms Normalization

Title A Machine Learning Approach to Clinical Terms Normalization
Authors Jos{'e} Casta{~n}o, Mar{'\i}a Laura Gambarte, Hee Joon Park, Maria del Pilar Avila Williams, David P{'e}rez, Fern Campos, o, Daniel Luna, Sonia Ben{'\i}tez, Hern{'a}n Berinsky, Sof{'\i}a Zanetti
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2901/
PDF https://www.aclweb.org/anthology/W16-2901
PWC https://paperswithcode.com/paper/a-machine-learning-approach-to-clinical-terms
Repo
Framework

Feature Derivation for Exploitation of Distant Annotation via Pattern Induction against Dependency Parses

Title Feature Derivation for Exploitation of Distant Annotation via Pattern Induction against Dependency Parses
Authors Dayne Freitag, John Niekrasz
Abstract
Tasks Relation Extraction
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2905/
PDF https://www.aclweb.org/anthology/W16-2905
PWC https://paperswithcode.com/paper/feature-derivation-for-exploitation-of
Repo
Framework

A Practical Guide to Sentiment Annotation: Challenges and Solutions

Title A Practical Guide to Sentiment Annotation: Challenges and Solutions
Authors Saif Mohammad
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0429/
PDF https://www.aclweb.org/anthology/W16-0429
PWC https://paperswithcode.com/paper/a-practical-guide-to-sentiment-annotation
Repo
Framework

How to Train good Word Embeddings for Biomedical NLP

Title How to Train good Word Embeddings for Biomedical NLP
Authors Billy Chiu, Gamal Crichton, Anna Korhonen, Sampo Pyysalo
Abstract
Tasks Learning Word Embeddings, Named Entity Recognition, Sentiment Analysis, Word Embeddings
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2922/
PDF https://www.aclweb.org/anthology/W16-2922
PWC https://paperswithcode.com/paper/how-to-train-good-word-embeddings-for
Repo
Framework

Construction of a Personal Experience Tweet Corpus for Health Surveillance

Title Construction of a Personal Experience Tweet Corpus for Health Surveillance
Authors Keyuan Jiang, Ricardo Calix, Matrika Gupta
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2917/
PDF https://www.aclweb.org/anthology/W16-2917
PWC https://paperswithcode.com/paper/construction-of-a-personal-experience-tweet
Repo
Framework

GTI at SemEval-2016 Task 4: Training a Naive Bayes Classifier using Features of an Unsupervised System

Title GTI at SemEval-2016 Task 4: Training a Naive Bayes Classifier using Features of an Unsupervised System
Authors Jonathan Juncal-Mart{'\i}nez, Tamara {'A}lvarez-L{'o}pez, Milagros Fern{'a}ndez-Gavilanes, Enrique Costa-Montenegro, Francisco Javier Gonz{'a}lez-Casta{~n}o
Abstract
Tasks Dependency Parsing, Sentiment Analysis
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1016/
PDF https://www.aclweb.org/anthology/S16-1016
PWC https://paperswithcode.com/paper/gti-at-semeval-2016-task-4-training-a-naive
Repo
Framework
comments powered by Disqus