May 4, 2019

1274 words 6 mins read

Paper Group NANR 167

Paper Group NANR 167

Adverse Drug Reaction Classification With Deep Neural Networks. INESC-ID at SemEval-2016 Task 4-A: Reducing the Problem of Out-of-Embedding Words. Convergence of Syntactic Complexity in Conversation. A Novel Measure for Coherence in Statistical Topic Models. Weakly Supervised Part-of-speech Tagging Using Eye-tracking Data. Efficient Data Selection …

Adverse Drug Reaction Classification With Deep Neural Networks

Title Adverse Drug Reaction Classification With Deep Neural Networks
Authors Trung Huynh, Yulan He, Alistair Willis, Stefan Rueger
Abstract We study the problem of detecting sentences describing adverse drug reactions (ADRs) and frame the problem as binary classification. We investigate different neural network (NN) architectures for ADR classification. In particular, we propose two new neural network models, Convolutional Recurrent Neural Network (CRNN) by concatenating convolutional neural networks with recurrent neural networks, and Convolutional Neural Network with Attention (CNNA) by adding attention weights into convolutional neural networks. We evaluate various NN architectures on a Twitter dataset containing informal language and an Adverse Drug Effects (ADE) dataset constructed by sampling from MEDLINE case reports. Experimental results show that all the NN architectures outperform the traditional maximum entropy classifiers trained from n-grams with different weighting strategies considerably on both datasets. On the Twitter dataset, all the NN architectures perform similarly. But on the ADE dataset, CNN performs better than other more complex CNN variants. Nevertheless, CNNA allows the visualisation of attention weights of words when making classification decisions and hence is more appropriate for the extraction of word subsequences describing ADRs.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1084/
PDF https://www.aclweb.org/anthology/C16-1084
PWC https://paperswithcode.com/paper/adverse-drug-reaction-classification-with
Repo
Framework

INESC-ID at SemEval-2016 Task 4-A: Reducing the Problem of Out-of-Embedding Words

Title INESC-ID at SemEval-2016 Task 4-A: Reducing the Problem of Out-of-Embedding Words
Authors Silvio Amir, Ramon Astudillo, Wang Ling, M{'a}rio J. Silva, Isabel Trancoso
Abstract
Tasks Feature Engineering, Sentiment Analysis, Twitter Sentiment Analysis, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1036/
PDF https://www.aclweb.org/anthology/S16-1036
PWC https://paperswithcode.com/paper/inesc-id-at-semeval-2016-task-4-a-reducing
Repo
Framework

Convergence of Syntactic Complexity in Conversation

Title Convergence of Syntactic Complexity in Conversation
Authors Yang Xu, David Reitter
Abstract
Tasks Language Acquisition
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-2072/
PDF https://www.aclweb.org/anthology/P16-2072
PWC https://paperswithcode.com/paper/convergence-of-syntactic-complexity-in
Repo
Framework

A Novel Measure for Coherence in Statistical Topic Models

Title A Novel Measure for Coherence in Statistical Topic Models
Authors Fred Morstatter, Huan Liu
Abstract
Tasks Topic Models
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-2088/
PDF https://www.aclweb.org/anthology/P16-2088
PWC https://paperswithcode.com/paper/a-novel-measure-for-coherence-in-statistical
Repo
Framework

Weakly Supervised Part-of-speech Tagging Using Eye-tracking Data

Title Weakly Supervised Part-of-speech Tagging Using Eye-tracking Data
Authors Maria Barrett, Joachim Bingel, Frank Keller, Anders S{\o}gaard
Abstract
Tasks Eye Tracking, Part-Of-Speech Tagging
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-2094/
PDF https://www.aclweb.org/anthology/P16-2094
PWC https://paperswithcode.com/paper/weakly-supervised-part-of-speech-tagging
Repo
Framework

Efficient Data Selection for Bilingual Terminology Extraction from Comparable Corpora

Title Efficient Data Selection for Bilingual Terminology Extraction from Comparable Corpora
Authors Amir Hazem, Emmanuel Morin
Abstract Comparable corpora are the main alternative to the use of parallel corpora to extract bilingual lexicons. Although it is easier to build comparable corpora, specialized comparable corpora are often of modest size in comparison with corpora issued from the general domain. Consequently, the observations of word co-occurrences which are the basis of context-based methods are unreliable. We propose in this article to improve word co-occurrences of specialized comparable corpora and thus context representation by using general-domain data. This idea, which has been already used in machine translation task for more than a decade, is not straightforward for the task of bilingual lexicon extraction from specific-domain comparable corpora. We go against the mainstream of this task where many studies support the idea that adding out-of-domain documents decreases the quality of lexicons. Our empirical evaluation shows the advantages of this approach which induces a significant gain in the accuracy of extracted lexicons.
Tasks Machine Translation, Topic Models, Word Embeddings
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1321/
PDF https://www.aclweb.org/anthology/C16-1321
PWC https://paperswithcode.com/paper/efficient-data-selection-for-bilingual
Repo
Framework

Black Holes and White Rabbits: Metaphor Identification with Visual Features

Title Black Holes and White Rabbits: Metaphor Identification with Visual Features
Authors Ekaterina Shutova, Douwe Kiela, Jean Maillard
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1020/
PDF https://www.aclweb.org/anthology/N16-1020
PWC https://paperswithcode.com/paper/black-holes-and-white-rabbits-metaphor
Repo
Framework

Singleton Detection using Word Embeddings and Neural Networks

Title Singleton Detection using Word Embeddings and Neural Networks
Authors Hessel Haagsma
Abstract
Tasks Coreference Resolution, Word Embeddings
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-3010/
PDF https://www.aclweb.org/anthology/P16-3010
PWC https://paperswithcode.com/paper/singleton-detection-using-word-embeddings-and
Repo
Framework

Identifying Potential Adverse Drug Events in Tweets Using Bootstrapped Lexicons

Title Identifying Potential Adverse Drug Events in Tweets Using Bootstrapped Lexicons
Authors Eric Benzschawel
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-3003/
PDF https://www.aclweb.org/anthology/P16-3003
PWC https://paperswithcode.com/paper/identifying-potential-adverse-drug-events-in
Repo
Framework

English-to-Japanese Translation vs. Dictation vs. Post-editing: Comparing Translation Modes in a Multilingual Setting

Title English-to-Japanese Translation vs. Dictation vs. Post-editing: Comparing Translation Modes in a Multilingual Setting
Authors Michael Carl, Akiko Aizawa, Masaru Yamada
Abstract Speech-enabled interfaces have the potential to become one of the most efficient and ergonomic environments for human-computer interaction and for text production. However, not much research has been carried out to investigate in detail the processes and strategies involved in the different modes of text production. This paper introduces and evaluates a corpus of more than 55 hours of English-to-Japanese user activity data that were collected within the ENJA15 project, in which translators were observed while writing and speaking translations (translation dictation) and during machine translation post-editing. The transcription of the spoken data, keyboard logging and eye-tracking data were recorded with Translog-II, post-processed and integrated into the CRITT Translation Process Research-DB (TPR-DB), which is publicly available under a creative commons license. The paper presents the ENJA15 data as part of a large multilingual Chinese, Danish, German, Hindi and Spanish translation process data collection of more than 760 translation sessions. It compares the ENJA15 data with the other language pairs and reviews some of its particularities.
Tasks Eye Tracking, Machine Translation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1635/
PDF https://www.aclweb.org/anthology/L16-1635
PWC https://paperswithcode.com/paper/english-to-japanese-translation-vs-dictation
Repo
Framework

Near-Optimal Smoothing of Structured Conditional Probability Matrices

Title Near-Optimal Smoothing of Structured Conditional Probability Matrices
Authors Moein Falahatgar, Mesrob I. Ohannessian, Alon Orlitsky
Abstract Utilizing the structure of a probabilistic model can significantly increase its learning speed. Motivated by several recent applications, in particular bigram models in language processing, we consider learning low-rank conditional probability matrices under expected KL-risk. This choice makes smoothing, that is the careful handling of low-probability elements, paramount. We derive an iterative algorithm that extends classical non-negative matrix factorization to naturally incorporate additive smoothing and prove that it converges to the stationary points of a penalized empirical risk. We then derive sample-complexity bounds for the global minimizer of the penalized risk and show that it is within a small factor of the optimal sample complexity. This framework generalizes to more sophisticated smoothing techniques, including absolute-discounting.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6199-near-optimal-smoothing-of-structured-conditional-probability-matrices
PDF http://papers.nips.cc/paper/6199-near-optimal-smoothing-of-structured-conditional-probability-matrices.pdf
PWC https://paperswithcode.com/paper/near-optimal-smoothing-of-structured
Repo
Framework

Putting Sarcasm Detection into Context: The Effects of Class Imbalance and Manual Labelling on Supervised Machine Classification of Twitter Conversations

Title Putting Sarcasm Detection into Context: The Effects of Class Imbalance and Manual Labelling on Supervised Machine Classification of Twitter Conversations
Authors Gavin Abercrombie, Dirk Hovy
Abstract
Tasks Anomaly Detection, Sarcasm Detection, Sentiment Analysis
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-3016/
PDF https://www.aclweb.org/anthology/P16-3016
PWC https://paperswithcode.com/paper/putting-sarcasm-detection-into-context-the
Repo
Framework

An Efficient Cross-lingual Model for Sentence Classification Using Convolutional Neural Network

Title An Efficient Cross-lingual Model for Sentence Classification Using Convolutional Neural Network
Authors Y Xia, i, Zhongyu Wei, Yang Liu
Abstract
Tasks Machine Translation, Representation Learning, Sentence Classification, Word Embeddings
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-3019/
PDF https://www.aclweb.org/anthology/P16-3019
PWC https://paperswithcode.com/paper/an-efficient-cross-lingual-model-for-sentence
Repo
Framework

POLYGLOT: Multilingual Semantic Role Labeling with Unified Labels

Title POLYGLOT: Multilingual Semantic Role Labeling with Unified Labels
Authors Alan Akbik, Yunyao Li
Abstract
Tasks Question Answering, Semantic Role Labeling
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-4001/
PDF https://www.aclweb.org/anthology/P16-4001
PWC https://paperswithcode.com/paper/polyglot-multilingual-semantic-role-labeling
Repo
Framework

A Web-framework for ODIN Annotation

Title A Web-framework for ODIN Annotation
Authors Ryan Georgi, Michael Wayne Goodman, Fei Xia
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-4006/
PDF https://www.aclweb.org/anthology/P16-4006
PWC https://paperswithcode.com/paper/a-web-framework-for-odin-annotation
Repo
Framework
comments powered by Disqus