October 15, 2019

Paper Group NANR 255

Scalable Bilinear Pi Learning Using State and Action Features

Title Scalable Bilinear Pi Learning Using State and Action Features
Authors Yichen Chen, Lihong Li, Mengdi Wang
Abstract Approximate linear programming (ALP) represents one of the major algorithmic families to solve large-scale Markov decision processes (MDP). In this work, we study a primal-dual formulation of the ALP, and develop a scalable, model-free algorithm called bilinear $\pi$ learning for reinforcement learning when a sampling oracle is provided. This algorithm enjoys a number of advantages. First, it adopts linear and bilinear models to represent the high-dimensional value function and state-action distributions, respectively, using given state and action features. Its run-time complexity depends on the number of features, not the size of the underlying MDPs. Second, it operates in a fully online fashion without having to store any sample, thus having minimal memory footprint. Third, we prove that it is sample-efficient, solving for the optimal policy to high precision with a sample complexity linear in the dimension of the parameter space.
Tasks
Published 2018-07-01
URL https://icml.cc/Conferences/2018/Schedule?showEvent=2330
PDF http://proceedings.mlr.press/v80/chen18e/chen18e.pdf
PWC https://paperswithcode.com/paper/scalable-bilinear-pi-learning-using-state-and
Repo
Framework

Semi-Supervised Learning on Data Streams via Temporal Label Propagation

Title Semi-Supervised Learning on Data Streams via Temporal Label Propagation
Authors Tal Wagner, Sudipto Guha, Shiva Kasiviswanathan, Nina Mishra
Abstract We consider the problem of labeling points on a fast-moving data stream when only a small number of labeled examples are available. In our setting, incoming points must be processed efficiently and the stream is too large to store in its entirety. We present a semi-supervised learning algorithm for this task. The algorithm maintains a small synopsis of the stream which can be quickly updated as new points arrive, and labels every incoming point by provably learning from the full history of the stream. Experiments on real datasets validate that the algorithm can quickly and accurately classify points on a stream with a small quantity of labeled examples.
Tasks
Published 2018-07-01
URL https://icml.cc/Conferences/2018/Schedule?showEvent=1879
PDF http://proceedings.mlr.press/v80/wagner18a/wagner18a.pdf
PWC https://paperswithcode.com/paper/semi-supervised-learning-on-data-streams-via
Repo
Framework

Personalized Microblog Sentiment Classification via Adversarial Cross-lingual Multi-task Learning

Title Personalized Microblog Sentiment Classification via Adversarial Cross-lingual Multi-task Learning
Authors Weichao Wang, Shi Feng, Wei Gao, Daling Wang, Yifei Zhang
Abstract Sentiment expression in microblog posts can be affected by user's personal character, opinion bias, political stance and so on. Most of existing personalized microblog sentiment classification methods suffer from the insufficiency of discriminative tweets for personalization learning. We observed that microblog users have consistent individuality and opinion bias in different languages. Based on this observation, in this paper we propose a novel user-attention-based Convolutional Neural Network (CNN) model with adversarial cross-lingual learning framework. The user attention mechanism is leveraged in CNN model to capture user's language-specific individuality from the posts. Then the attention-based CNN model is incorporated into a novel adversarial cross-lingual learning framework, in which with the help of user properties as bridge between languages, we can extract the language-specific features and language-independent features to enrich the user post representation so as to alleviate the data insufficiency problem. Results on English and Chinese microblog datasets confirm that our method outperforms state-of-the-art baseline algorithms with large margins.
Tasks Multi-Task Learning, Sentiment Analysis
Published 2018-10-01
URL https://www.aclweb.org/anthology/D18-1031/
PDF https://www.aclweb.org/anthology/D18-1031
PWC https://paperswithcode.com/paper/personalized-microblog-sentiment
Repo
Framework

Neural Network based Extreme Classification and Similarity Models for Product Matching

Title Neural Network based Extreme Classification and Similarity Models for Product Matching
Authors Kashif Shah, Selcuk Kopru, Jean-David Ruvini
Abstract Matching a seller listed item to an appropriate product has become a fundamental and one of the most significant step for e-commerce platforms for product based experience. It has a huge impact on making the search effective, search engine optimization, providing product reviews and product price estimation etc. along with many other advantages for a better user experience. As significant and vital it has become, the challenge to tackle the complexity has become huge with the exponential growth of individual and business sellers trading millions of products everyday. We explored two approaches; classification based on shallow neural network and similarity based on deep siamese network. These models outperform the baseline by more than 5% in term of accuracy and are capable of extremely efficient training and inference.
Tasks
Published 2018-06-01
URL https://www.aclweb.org/anthology/N18-3002/
PDF https://www.aclweb.org/anthology/N18-3002
PWC https://paperswithcode.com/paper/neural-network-based-extreme-classification
Repo
Framework

NextGen AML: Distributed Deep Learning based Language Technologies to Augment Anti Money Laundering Investigation

Title NextGen AML: Distributed Deep Learning based Language Technologies to Augment Anti Money Laundering Investigation
Authors Jingguang Han, Utsab Barman, Jeremiah Hayes, Jinhua Du, Edward Burgin, Dadong Wan
Abstract Most of the current anti money laundering (AML) systems, using handcrafted rules, are heavily reliant on existing structured databases, which are not capable of effectively and efficiently identifying hidden and complex ML activities, especially those with dynamic and time-varying characteristics, resulting in a high percentage of false positives. Therefore, analysts are engaged for further investigation which significantly increases human capital cost and processing time. To alleviate these issues, this paper presents a novel framework for the next generation AML by applying and visualizing deep learning-driven natural language processing (NLP) technologies in a distributed and scalable manner to augment AML monitoring and investigation. The proposed distributed framework performs news and tweet sentiment analysis, entity recognition, relation extraction, entity linking and link analysis on different data sources (e.g. news articles and tweets) to provide additional evidence to human investigators for final decision-making. Each NLP module is evaluated on a task-specific data set, and the overall experiments are performed on synthetic and real-world datasets. Feedback from AML practitioners suggests that our system can reduce approximately 30% time and cost compared to their previous manual approaches of AML investigation.
Tasks Decision Making, Entity Linking, Relation Extraction, Sentiment Analysis
Published 2018-07-01
URL https://www.aclweb.org/anthology/P18-4007/
PDF https://www.aclweb.org/anthology/P18-4007
PWC https://paperswithcode.com/paper/nextgen-aml-distributed-deep-learning-based
Repo
Framework

Data-Driven Text Simplification

Title Data-Driven Text Simplification
Authors Sanja Štajner, Horacio Saggion
Abstract
Tasks Lexical Simplification, Machine Translation, Semantic Role Labeling, Text Simplification
Published 2018-08-01
URL https://www.aclweb.org/anthology/C18-3005/
PDF https://www.aclweb.org/anthology/C18-3005
PWC https://paperswithcode.com/paper/data-driven-text-simplification
Repo
Framework

Identifying Speakers and Addressees in Dialogues Extracted from Literary Fiction

Title Identifying Speakers and Addressees in Dialogues Extracted from Literary Fiction
Authors Adam Ek, Mats Wirén, Robert Östling, Kristina N. Björkenstam, Gintarė Grigonytė, Sofia Gustafson Capková
Abstract
Tasks Speaker Identification
Published 2018-05-01
URL https://www.aclweb.org/anthology/L18-1131/
PDF https://www.aclweb.org/anthology/L18-1131
PWC https://paperswithcode.com/paper/identifying-speakers-and-addressees-in
Repo
Framework

Unsupervised Learning of Distributional Relation Vectors

Title Unsupervised Learning of Distributional Relation Vectors
Authors Shoaib Jameel, Zied Bouraoui, Steven Schockaert
Abstract Word embedding models such as GloVe rely on co-occurrence statistics to learn vector representations of word meaning. While we may similarly expect that co-occurrence statistics can be used to capture rich information about the relationships between different words, existing approaches for modeling such relationships are based on manipulating pre-trained word vectors. In this paper, we introduce a novel method which directly learns relation vectors from co-occurrence statistics. To this end, we first introduce a variant of GloVe, in which there is an explicit connection between word vectors and PMI weighted co-occurrence vectors. We then show how relation vectors can be naturally embedded into the resulting vector space.
Tasks Relation Extraction, Word Embeddings
Published 2018-07-01
URL https://www.aclweb.org/anthology/P18-1003/
PDF https://www.aclweb.org/anthology/P18-1003
PWC https://paperswithcode.com/paper/unsupervised-learning-of-distributional
Repo
Framework
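
The abstract above refers to PMI-weighted co-occurrence statistics. As a rough illustration only, the sketch below computes the standard pointwise mutual information, PMI(w, c) = log(p(w, c) / (p(w) p(c))), from raw pair counts. It is not the paper's GloVe variant or its relation-vector model; the function name `pmi_table` and the toy pairs are hypothetical.

```python
import math
from collections import Counter

def pmi_table(pairs):
    """Return PMI for every observed (word, context) pair.

    Probabilities are simple relative frequencies over the list of pairs.
    """
    pair_counts = Counter(pairs)
    word_counts = Counter(w for w, _ in pairs)
    ctx_counts = Counter(c for _, c in pairs)
    total = len(pairs)
    pmi = {}
    for (w, c), n_wc in pair_counts.items():
        p_wc = n_wc / total
        p_w = word_counts[w] / total
        p_c = ctx_counts[c] / total
        pmi[(w, c)] = math.log(p_wc / (p_w * p_c))
    return pmi

# Hypothetical toy co-occurrence pairs (not data from the paper).
pairs = [
    ("paris", "france"), ("paris", "france"),
    ("rome", "italy"), ("rome", "italy"),
    ("paris", "italy"),
]
table = pmi_table(pairs)
```

Pairs that co-occur more often than chance predicts, such as ("paris", "france") here, get a positive score, while ("paris", "italy") gets a negative one.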

Feasible Arm Identification

Title Feasible Arm Identification
Authors Julian Katz-Samuels, Clay Scott
Abstract We introduce the feasible arm identification problem, a pure exploration multi-armed bandit problem where the agent is given a set of $D$-dimensional arms and a polyhedron $P = \{x : A x \leq b\} \subset \mathbb{R}^D$. Pulling an arm gives a random vector and the goal is to determine, using a fixed budget of $T$ pulls, which of the arms have means belonging to $P$. We propose three algorithms MD-UCBE, MD-SAR, and MD-APT and provide a unified analysis establishing upper bounds for each of them. We also establish a lower bound that matches up to constants the upper bounds of MD-UCBE and MD-APT. Finally, we demonstrate the effectiveness of our algorithms on synthetic and real-world datasets.
Tasks
Published 2018-07-01
URL https://icml.cc/Conferences/2018/Schedule?showEvent=2064
PDF http://proceedings.mlr.press/v80/katz-samuels18a/katz-samuels18a.pdf
PWC https://paperswithcode.com/paper/feasible-arm-identification
Repo
Framework
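
To make the problem statement above concrete, here is a minimal sketch of the setting: arms return random vectors, a fixed budget of pulls is spent, and each empirical mean is tested against Ax ≤ b. The allocation is plain round-robin, which is a naive baseline and not MD-UCBE, MD-SAR, or MD-APT. The arm means, noise level, polyhedron, and all function names are assumptions chosen for illustration.

```python
import random

def in_polyhedron(x, A, b):
    """Return True if A x <= b holds row by row."""
    return all(
        sum(a_ij * x_j for a_ij, x_j in zip(row, x)) <= b_i
        for row, b_i in zip(A, b)
    )

def uniform_feasibility_estimate(pull, n_arms, A, b, budget):
    """Pull arms round-robin, then test each empirical mean against A x <= b."""
    if budget < n_arms:
        raise ValueError("budget must allow at least one pull per arm")
    dim = len(A[0])
    sums = [[0.0] * dim for _ in range(n_arms)]
    counts = [0] * n_arms
    for t in range(budget):
        arm = t % n_arms
        sample = pull(arm)
        for d in range(dim):
            sums[arm][d] += sample[d]
        counts[arm] += 1
    means = [[s / counts[i] for s in sums[i]] for i in range(n_arms)]
    return [in_polyhedron(m, A, b) for m in means]

# Hypothetical setup: two 2-dimensional arms with Gaussian noise.
rng = random.Random(0)
true_means = [(0.2, 0.2), (2.0, 2.0)]
# Polyhedron: x1 + x2 <= 1, x1 >= 0, x2 >= 0.
A = [[1.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
b = [1.0, 0.0, 0.0]

def pull(arm):
    """Simulated arm pull: true mean plus Gaussian noise."""
    return [rng.gauss(mu, 0.1) for mu in true_means[arm]]

feasible = uniform_feasibility_estimate(pull, len(true_means), A, b, budget=2000)
```

In this toy setup the first arm's mean lies well inside the polyhedron and the second lies well outside, so even uniform allocation separates them. Adaptive allocation matters when means sit close to the boundary.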

CayleyNets: Spectral Graph CNNs with Complex Rational Filters

Title CayleyNets: Spectral Graph CNNs with Complex Rational Filters
Authors Ron Levie, Federico Monti, Xavier Bresson, Michael M. Bronstein
Abstract The rise of graph-structured data such as social networks, regulatory networks, citation graphs, and functional brain networks, in combination with resounding success of deep learning in various applications, has brought the interest in generalizing deep learning models to non-Euclidean domains. In this paper, we introduce a new spectral domain convolutional architecture for deep learning on graphs. The core ingredient of our model is a new class of parametric rational complex functions (Cayley polynomials) allowing to efficiently compute spectral filters on graphs that specialize on frequency bands of interest. Our model generates rich spectral filters that are localized in space, scales linearly with the size of the input data for sparsely-connected graphs, and can handle different constructions of Laplacian operators. Extensive experimental results show the superior performance of our approach on spectral image classification, community detection, vertex classification and matrix completion tasks.
Tasks Community Detection, Image Classification, Matrix Completion
Published 2018-01-01
URL https://openreview.net/forum?id=S1680_1Rb
PDF https://openreview.net/pdf?id=S1680_1Rb
PWC https://paperswithcode.com/paper/cayleynets-spectral-graph-cnns-with-complex
Repo
Framework

ChatEval: A Tool for the Systematic Evaluation of Chatbots

Title ChatEval: A Tool for the Systematic Evaluation of Chatbots
Authors João Sedoc, Daphne Ippolito, Arun Kirubarajan, Jai Thirani, Lyle Ungar, Chris Callison-Burch
Abstract
Tasks Chatbot, Text Generation
Published 2018-11-01
URL https://www.aclweb.org/anthology/W18-6709/
PDF https://www.aclweb.org/anthology/W18-6709
PWC https://paperswithcode.com/paper/chateval-a-tool-for-the-systematic-evaluation
Repo
Framework

Predicting accuracy on large datasets from smaller pilot data

Title Predicting accuracy on large datasets from smaller pilot data
Authors Mark Johnson, Peter Anderson, Mark Dras, Mark Steedman
Abstract Because obtaining training data is often the most difficult part of an NLP or ML project, we develop methods for predicting how much data is required to achieve a desired test accuracy by extrapolating results from models trained on a small pilot training dataset. We model how accuracy varies as a function of training size on subsets of the pilot data, and use that model to predict how much training data would be required to achieve the desired accuracy. We introduce a new performance extrapolation task to evaluate how well different extrapolations predict accuracy on larger training sets. We show that details of hyperparameter optimisation and the extrapolation models can have dramatic effects in a document classification task. We believe this is an important first step in developing methods for estimating the resources required to meet specific engineering performance targets.
Tasks Document Classification
Published 2018-07-01
URL https://www.aclweb.org/anthology/P18-2072/
PDF https://www.aclweb.org/anthology/P18-2072
PWC https://paperswithcode.com/paper/predicting-accuracy-on-large-datasets-from
Repo
Framework
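
The extrapolation idea in the abstract above can be sketched as follows, under an assumption the abstract does not state: that test error (1 minus accuracy) follows a power law, error = a * n^(-b), in the training size n. The code fits a and b by least squares in log-log space on pilot measurements, then inverts the curve to estimate the data needed for a target error. The functional form, function names, and pilot numbers are all hypothetical and are not the paper's models or results.

```python
import math

def fit_power_law(sizes, errors):
    """Least-squares fit of log(error) = log(a) - b * log(n); returns (a, b)."""
    xs = [math.log(n) for n in sizes]
    ys = [math.log(e) for e in errors]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    slope = (
        sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        / sum((x - mean_x) ** 2 for x in xs)
    )
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope

def required_size(a, b, target_error):
    """Invert error = a * n**(-b) to get the training size for a target error."""
    return (a / target_error) ** (1.0 / b)

# Synthetic pilot measurements generated from error = 2 * n**(-0.5).
pilot_sizes = [100, 400, 1600]
pilot_errors = [0.2, 0.1, 0.05]

a, b = fit_power_law(pilot_sizes, pilot_errors)
needed = required_size(a, b, target_error=0.025)
```

Because the synthetic points lie exactly on the assumed curve, the fit recovers a = 2 and b = 0.5 up to rounding, and the estimated requirement is about 6400 examples. Real pilot data would be noisy, and as the abstract notes, the choice of extrapolation model can change the prediction substantially.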

Collection and Analysis of Code-switch Egyptian Arabic-English Speech Corpus

Title Collection and Analysis of Code-switch Egyptian Arabic-English Speech Corpus
Authors Injy Hamed, Mohamed Elmahdy, Slim Abdennadher
Abstract
Tasks Speech Recognition
Published 2018-05-01
URL https://www.aclweb.org/anthology/L18-1601/
PDF https://www.aclweb.org/anthology/L18-1601
PWC https://paperswithcode.com/paper/collection-and-analysis-of-code-switch
Repo
Framework

A Delay-tolerant Proximal-Gradient Algorithm for Distributed Learning

Title A Delay-tolerant Proximal-Gradient Algorithm for Distributed Learning
Authors Konstantin Mishchenko, Franck Iutzeler, Jérôme Malick, Massih-Reza Amini
Abstract Distributed learning aims at computing high-quality models by training over scattered data. This covers a diversity of scenarios, including computer clusters or mobile agents. One of the main challenges is then to deal with heterogeneous machines and unreliable communications. In this setting, we propose and analyze a flexible asynchronous optimization algorithm for solving nonsmooth learning problems. Unlike most existing methods, our algorithm is adjustable to various levels of communication costs, machines computational powers, and data distribution evenness. We prove that the algorithm converges linearly with a fixed learning rate that does not depend on communication delays nor on the number of machines. Although long delays in communication may slow down performance, no delay can break convergence.
Tasks
Published 2018-07-01
URL https://icml.cc/Conferences/2018/Schedule?showEvent=2143
PDF http://proceedings.mlr.press/v80/mishchenko18a/mishchenko18a.pdf
PWC https://paperswithcode.com/paper/a-delay-tolerant-proximal-gradient-algorithm
Repo
Framework
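
For readers unfamiliar with proximal-gradient methods, the sketch below shows the basic synchronous, single-machine step on an L1-regularised least-squares problem: a gradient step on the smooth part followed by the proximal operator of the nonsmooth part (soft-thresholding). It is not the asynchronous, delay-tolerant algorithm of the paper; the problem instance, step size, and function names are assumptions for illustration.

```python
def soft_threshold(v, tau):
    """Proximal operator of tau * |.| applied to a scalar."""
    if v > tau:
        return v - tau
    if v < -tau:
        return v + tau
    return 0.0

def proximal_gradient_lasso(X, y, lam, step, iters=500):
    """Minimise (1 / 2n) * ||X w - y||^2 + lam * ||w||_1 by proximal gradient."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    for _ in range(iters):
        # Gradient of the smooth least-squares term.
        residual = [sum(X[i][j] * w[j] for j in range(d)) - y[i] for i in range(n)]
        grad = [sum(X[i][j] * residual[i] for i in range(n)) / n for j in range(d)]
        # Gradient step, then the prox of the L1 penalty scaled by the step size.
        w = [soft_threshold(w[j] - step * grad[j], step * lam) for j in range(d)]
    return w

# Hypothetical toy problem: the first feature explains y, the second does not.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
y = [1.0, 0.0, 1.0, 0.0]
w = proximal_gradient_lasso(X, y, lam=0.1, step=1.0)
```

On this toy problem the objective separates by coordinate, so the minimiser can be checked by hand: the first weight shrinks from 1 to 1 - 2 * 0.1 = 0.8 and the second stays at exactly zero.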

ECNU at SemEval-2018 Task 2: Leverage Traditional NLP Features and Neural Networks Methods to Address Twitter Emoji Prediction Task

Title ECNU at SemEval-2018 Task 2: Leverage Traditional NLP Features and Neural Networks Methods to Address Twitter Emoji Prediction Task
Authors Xingwu Lu, Xin Mao, Man Lan, Yuanbin Wu
Abstract This paper describes our submissions to Task 2 in SemEval 2018, i.e., Multilingual Emoji Prediction. We first investigate several traditional Natural Language Processing (NLP) features, and then design several deep learning models. For subtask 1: Emoji Prediction in English, we combine two different methods to represent tweet, i.e., supervised model using traditional features and deep learning model. For subtask 2: Emoji Prediction in Spanish, we only use deep learning model.
Tasks
Published 2018-06-01
URL https://www.aclweb.org/anthology/S18-1068/
PDF https://www.aclweb.org/anthology/S18-1068
PWC https://paperswithcode.com/paper/ecnu-at-semeval-2018-task-2-leverage
Repo
Framework