May 4, 2019

1274 words 6 mins read

Paper Group NANR 167

Adverse Drug Reaction Classification With Deep Neural Networks. INESC-ID at SemEval-2016 Task 4-A: Reducing the Problem of Out-of-Embedding Words. Convergence of Syntactic Complexity in Conversation. A Novel Measure for Coherence in Statistical Topic Models. Weakly Supervised Part-of-speech Tagging Using Eye-tracking Data. Efficient Data Selection …

Adverse Drug Reaction Classification With Deep Neural Networks


Title	Adverse Drug Reaction Classification With Deep Neural Networks
Authors	Trung Huynh, Yulan He, Alistair Willis, Stefan Rueger
Abstract	We study the problem of detecting sentences describing adverse drug reactions (ADRs) and frame the problem as binary classification. We investigate different neural network (NN) architectures for ADR classification. In particular, we propose two new neural network models, Convolutional Recurrent Neural Network (CRNN) by concatenating convolutional neural networks with recurrent neural networks, and Convolutional Neural Network with Attention (CNNA) by adding attention weights into convolutional neural networks. We evaluate various NN architectures on a Twitter dataset containing informal language and an Adverse Drug Effects (ADE) dataset constructed by sampling from MEDLINE case reports. Experimental results show that all the NN architectures outperform the traditional maximum entropy classifiers trained from n-grams with different weighting strategies considerably on both datasets. On the Twitter dataset, all the NN architectures perform similarly. But on the ADE dataset, CNN performs better than other more complex CNN variants. Nevertheless, CNNA allows the visualisation of attention weights of words when making classification decisions and hence is more appropriate for the extraction of word subsequences describing ADRs.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1084/
PDF	https://www.aclweb.org/anthology/C16-1084
PWC	https://paperswithcode.com/paper/adverse-drug-reaction-classification-with
Repo
Framework

INESC-ID at SemEval-2016 Task 4-A: Reducing the Problem of Out-of-Embedding Words


Title	INESC-ID at SemEval-2016 Task 4-A: Reducing the Problem of Out-of-Embedding Words
Authors	Silvio Amir, Ramon Astudillo, Wang Ling, M{'a}rio J. Silva, Isabel Trancoso
Abstract
Tasks	Feature Engineering, Sentiment Analysis, Twitter Sentiment Analysis, Word Embeddings
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1036/
PDF	https://www.aclweb.org/anthology/S16-1036
PWC	https://paperswithcode.com/paper/inesc-id-at-semeval-2016-task-4-a-reducing
Repo
Framework

Convergence of Syntactic Complexity in Conversation


Title	Convergence of Syntactic Complexity in Conversation
Authors	Yang Xu, David Reitter
Abstract
Tasks	Language Acquisition
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-2072/
PDF	https://www.aclweb.org/anthology/P16-2072
PWC	https://paperswithcode.com/paper/convergence-of-syntactic-complexity-in
Repo
Framework

A Novel Measure for Coherence in Statistical Topic Models


Title	A Novel Measure for Coherence in Statistical Topic Models
Authors	Fred Morstatter, Huan Liu
Abstract
Tasks	Topic Models
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-2088/
PDF	https://www.aclweb.org/anthology/P16-2088
PWC	https://paperswithcode.com/paper/a-novel-measure-for-coherence-in-statistical
Repo
Framework

Weakly Supervised Part-of-speech Tagging Using Eye-tracking Data


Title	Weakly Supervised Part-of-speech Tagging Using Eye-tracking Data
Authors	Maria Barrett, Joachim Bingel, Frank Keller, Anders S{\o}gaard
Abstract
Tasks	Eye Tracking, Part-Of-Speech Tagging
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-2094/
PDF	https://www.aclweb.org/anthology/P16-2094
PWC	https://paperswithcode.com/paper/weakly-supervised-part-of-speech-tagging
Repo
Framework

Efficient Data Selection for Bilingual Terminology Extraction from Comparable Corpora


Title	Efficient Data Selection for Bilingual Terminology Extraction from Comparable Corpora
Authors	Amir Hazem, Emmanuel Morin
Abstract	Comparable corpora are the main alternative to the use of parallel corpora to extract bilingual lexicons. Although it is easier to build comparable corpora, specialized comparable corpora are often of modest size in comparison with corpora issued from the general domain. Consequently, the observations of word co-occurrences which are the basis of context-based methods are unreliable. We propose in this article to improve word co-occurrences of specialized comparable corpora and thus context representation by using general-domain data. This idea, which has been already used in machine translation task for more than a decade, is not straightforward for the task of bilingual lexicon extraction from specific-domain comparable corpora. We go against the mainstream of this task where many studies support the idea that adding out-of-domain documents decreases the quality of lexicons. Our empirical evaluation shows the advantages of this approach which induces a significant gain in the accuracy of extracted lexicons.
Tasks	Machine Translation, Topic Models, Word Embeddings
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1321/
PDF	https://www.aclweb.org/anthology/C16-1321
PWC	https://paperswithcode.com/paper/efficient-data-selection-for-bilingual
Repo
Framework

Black Holes and White Rabbits: Metaphor Identification with Visual Features


Title	Black Holes and White Rabbits: Metaphor Identification with Visual Features
Authors	Ekaterina Shutova, Douwe Kiela, Jean Maillard
Abstract
Tasks
Published	2016-06-01
URL	https://www.aclweb.org/anthology/N16-1020/
PDF	https://www.aclweb.org/anthology/N16-1020
PWC	https://paperswithcode.com/paper/black-holes-and-white-rabbits-metaphor
Repo
Framework

Singleton Detection using Word Embeddings and Neural Networks


Title	Singleton Detection using Word Embeddings and Neural Networks
Authors	Hessel Haagsma
Abstract
Tasks	Coreference Resolution, Word Embeddings
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-3010/
PDF	https://www.aclweb.org/anthology/P16-3010
PWC	https://paperswithcode.com/paper/singleton-detection-using-word-embeddings-and
Repo
Framework

Identifying Potential Adverse Drug Events in Tweets Using Bootstrapped Lexicons


Title	Identifying Potential Adverse Drug Events in Tweets Using Bootstrapped Lexicons
Authors	Eric Benzschawel
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-3003/
PDF	https://www.aclweb.org/anthology/P16-3003
PWC	https://paperswithcode.com/paper/identifying-potential-adverse-drug-events-in
Repo
Framework

English-to-Japanese Translation vs. Dictation vs. Post-editing: Comparing Translation Modes in a Multilingual Setting


Title	English-to-Japanese Translation vs. Dictation vs. Post-editing: Comparing Translation Modes in a Multilingual Setting
Authors	Michael Carl, Akiko Aizawa, Masaru Yamada
Abstract	Speech-enabled interfaces have the potential to become one of the most efficient and ergonomic environments for human-computer interaction and for text production. However, not much research has been carried out to investigate in detail the processes and strategies involved in the different modes of text production. This paper introduces and evaluates a corpus of more than 55 hours of English-to-Japanese user activity data that were collected within the ENJA15 project, in which translators were observed while writing and speaking translations (translation dictation) and during machine translation post-editing. The transcription of the spoken data, keyboard logging and eye-tracking data were recorded with Translog-II, post-processed and integrated into the CRITT Translation Process Research-DB (TPR-DB), which is publicly available under a creative commons license. The paper presents the ENJA15 data as part of a large multilingual Chinese, Danish, German, Hindi and Spanish translation process data collection of more than 760 translation sessions. It compares the ENJA15 data with the other language pairs and reviews some of its particularities.
Tasks	Eye Tracking, Machine Translation
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1635/
PDF	https://www.aclweb.org/anthology/L16-1635
PWC	https://paperswithcode.com/paper/english-to-japanese-translation-vs-dictation
Repo
Framework

Near-Optimal Smoothing of Structured Conditional Probability Matrices


Title	Near-Optimal Smoothing of Structured Conditional Probability Matrices
Authors	Moein Falahatgar, Mesrob I. Ohannessian, Alon Orlitsky
Abstract	Utilizing the structure of a probabilistic model can significantly increase its learning speed. Motivated by several recent applications, in particular bigram models in language processing, we consider learning low-rank conditional probability matrices under expected KL-risk. This choice makes smoothing, that is the careful handling of low-probability elements, paramount. We derive an iterative algorithm that extends classical non-negative matrix factorization to naturally incorporate additive smoothing and prove that it converges to the stationary points of a penalized empirical risk. We then derive sample-complexity bounds for the global minimizer of the penalized risk and show that it is within a small factor of the optimal sample complexity. This framework generalizes to more sophisticated smoothing techniques, including absolute-discounting.
Tasks
Published	2016-12-01
URL	http://papers.nips.cc/paper/6199-near-optimal-smoothing-of-structured-conditional-probability-matrices
PDF	http://papers.nips.cc/paper/6199-near-optimal-smoothing-of-structured-conditional-probability-matrices.pdf
PWC	https://paperswithcode.com/paper/near-optimal-smoothing-of-structured
Repo
Framework

Putting Sarcasm Detection into Context: The Effects of Class Imbalance and Manual Labelling on Supervised Machine Classification of Twitter Conversations


Title	Putting Sarcasm Detection into Context: The Effects of Class Imbalance and Manual Labelling on Supervised Machine Classification of Twitter Conversations
Authors	Gavin Abercrombie, Dirk Hovy
Abstract
Tasks	Anomaly Detection, Sarcasm Detection, Sentiment Analysis
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-3016/
PDF	https://www.aclweb.org/anthology/P16-3016
PWC	https://paperswithcode.com/paper/putting-sarcasm-detection-into-context-the
Repo
Framework

An Efficient Cross-lingual Model for Sentence Classification Using Convolutional Neural Network


Title	An Efficient Cross-lingual Model for Sentence Classification Using Convolutional Neural Network
Authors	Y Xia, i, Zhongyu Wei, Yang Liu
Abstract
Tasks	Machine Translation, Representation Learning, Sentence Classification, Word Embeddings
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-3019/
PDF	https://www.aclweb.org/anthology/P16-3019
PWC	https://paperswithcode.com/paper/an-efficient-cross-lingual-model-for-sentence
Repo
Framework

POLYGLOT: Multilingual Semantic Role Labeling with Unified Labels


Title	POLYGLOT: Multilingual Semantic Role Labeling with Unified Labels
Authors	Alan Akbik, Yunyao Li
Abstract
Tasks	Question Answering, Semantic Role Labeling
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-4001/
PDF	https://www.aclweb.org/anthology/P16-4001
PWC	https://paperswithcode.com/paper/polyglot-multilingual-semantic-role-labeling
Repo
Framework

A Web-framework for ODIN Annotation


Title	A Web-framework for ODIN Annotation
Authors	Ryan Georgi, Michael Wayne Goodman, Fei Xia
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-4006/
PDF	https://www.aclweb.org/anthology/P16-4006
PWC	https://paperswithcode.com/paper/a-web-framework-for-odin-annotation
Repo
Framework