May 4, 2019

1456 words 7 mins read

Paper Group NANR 222

Paper Group NANR 222

The red one!: On learning to refer to things based on discriminative properties. Two-View Label Propagation to Semi-supervised Reader Emotion Classification. An Unsupervised Method for Automatic Translation Memory Cleaning. Towards Generalizable Sentence Embeddings. Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text. …

The red one!: On learning to refer to things based on discriminative properties

Title The red one!: On learning to refer to things based on discriminative properties
Authors Angeliki Lazaridou, Nghia The Pham, Marco Baroni
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-2035/
PDF https://www.aclweb.org/anthology/P16-2035
PWC https://paperswithcode.com/paper/the-red-one-on-learning-to-refer-to-things
Repo
Framework

Two-View Label Propagation to Semi-supervised Reader Emotion Classification

Title Two-View Label Propagation to Semi-supervised Reader Emotion Classification
Authors Shoushan Li, Jian Xu, Dong Zhang, Guodong Zhou
Abstract In the literature, various supervised learning approaches have been adopted to address the task of reader emotion classification. However, the classification performance greatly suffers when the size of the labeled data is limited. In this paper, we propose a two-view label propagation approach to semi-supervised reader emotion classification by exploiting two views, namely source text and response text in a label propagation algorithm. Specifically, our approach depends on two word-document bipartite graphs to model the relationship among the samples in the two views respectively. Besides, the two bipartite graphs are integrated by linking each source text sample with its corresponding response text sample via a length-sensitive transition probability. In this way, our two-view label propagation approach to semi-supervised reader emotion classification largely alleviates the reliance on the strong sufficiency and independence assumptions of the two views, as required in co-training. Empirical evaluation demonstrates the effectiveness of our two-view label propagation approach to semi-supervised reader emotion classification.
Tasks Emotion Classification
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1249/
PDF https://www.aclweb.org/anthology/C16-1249
PWC https://paperswithcode.com/paper/two-view-label-propagation-to-semi-supervised
Repo
Framework

An Unsupervised Method for Automatic Translation Memory Cleaning

Title An Unsupervised Method for Automatic Translation Memory Cleaning
Authors Masoud Jalili Sabet, Matteo Negri, Marco Turchi, Eduard Barbu
Abstract
Tasks Machine Translation
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-2047/
PDF https://www.aclweb.org/anthology/P16-2047
PWC https://paperswithcode.com/paper/an-unsupervised-method-for-automatic
Repo
Framework

Towards Generalizable Sentence Embeddings

Title Towards Generalizable Sentence Embeddings
Authors Eleni Triantafillou, Jamie Ryan Kiros, Raquel Urtasun, Richard Zemel
Abstract
Tasks Information Retrieval, Natural Language Inference, One-Shot Learning, Representation Learning, Sentence Embeddings, Text Summarization, Transfer Learning
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-1628/
PDF https://www.aclweb.org/anthology/W16-1628
PWC https://paperswithcode.com/paper/towards-generalizable-sentence-embeddings
Repo
Framework

Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text

Title Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text
Authors Kristina Toutanova, Victoria Lin, Wen-tau Yih, Hoifung Poon, Chris Quirk
Abstract
Tasks Open-Domain Question Answering, Question Answering, Relational Reasoning
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1136/
PDF https://www.aclweb.org/anthology/P16-1136
PWC https://paperswithcode.com/paper/compositional-learning-of-embeddings-for
Repo
Framework

SAMER: A Semi-Automatically Created Lexical Resource for Arabic Verbal Multiword Expressions Tokens Paradigm and their Morphosyntactic Features

Title SAMER: A Semi-Automatically Created Lexical Resource for Arabic Verbal Multiword Expressions Tokens Paradigm and their Morphosyntactic Features
Authors Mohamed Al-Badrashiny, Abdelati Hawwari, Mahmoud Ghoneim, Mona Diab
Abstract Although MWE are relatively morphologically and syntactically fixed expressions, several types of flexibility can be observed in MWE, verbal MWE in particular. Identifying the degree of morphological and syntactic flexibility of MWE is very important for many Lexicographic and NLP tasks. Adding MWE variants/tokens to a dictionary resource requires characterizing the flexibility among other morphosyntactic features. Carrying out the task manually faces several challenges since it is a very laborious task time and effort wise, as well as it will suffer from coverage limitation. The problem is exacerbated in rich morphological languages where the average word in Arabic could have 12 possible inflection forms. Accordingly, in this paper we introduce a semi-automatic Arabic multiwords expressions resource (SAMER). We propose an automated method that identifies the morphological and syntactic flexibility of Arabic Verbal Multiword Expressions (AVMWE). All observed morphological variants and syntactic pattern alternations of an AVMWE are automatically acquired using large scale corpora. We look for three morphosyntactic aspects of AVMWE types investigating derivational and inflectional variations and syntactic templates, namely: 1) inflectional variation (inflectional paradigm) and calculating degree of flexibility; 2) derivational productivity; and 3) identifying and classifying the different syntactic types. We build a comprehensive list of AVMWE. Every token in the AVMWE list is lemmatized and tagged with POS information. We then search Arabic Gigaword and All ATBs for all possible flexible matches. For each AVMWE type we generate: a) a statistically ranked list of MWE-lexeme inflections and syntactic pattern alternations; b) An abstract syntactic template; and c) The most frequent form. Our technique is validated using a Golden MWE annotated list. The results shows that the quality of the generated resource is 80.04{%}.
Tasks Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-5414/
PDF https://www.aclweb.org/anthology/W16-5414
PWC https://paperswithcode.com/paper/samer-a-semi-automatically-created-lexical
Repo
Framework

Automatic Text Generation by Learning from Literary Structures

Title Automatic Text Generation by Learning from Literary Structures
Authors Angel Daza, Hiram Calvo, Jes{'u}s Figueroa-Nazuno
Abstract
Tasks Common Sense Reasoning, Text Generation
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0202/
PDF https://www.aclweb.org/anthology/W16-0202
PWC https://paperswithcode.com/paper/automatic-text-generation-by-learning-from
Repo
Framework

Learning Translations for Tagged Words: Extending the Translation Lexicon of an ITG for Low Resource Languages

Title Learning Translations for Tagged Words: Extending the Translation Lexicon of an ITG for Low Resource Languages
Authors Markus Saers, Dekai Wu
Abstract
Tasks Machine Translation
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-1207/
PDF https://www.aclweb.org/anthology/W16-1207
PWC https://paperswithcode.com/paper/learning-translations-for-tagged-words
Repo
Framework

OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles

Title OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles
Authors Pierre Lison, J{"o}rg Tiedemann
Abstract We present a new major release of the OpenSubtitles collection of parallel corpora. The release is compiled from a large database of movie and TV subtitles and includes a total of 1689 bitexts spanning 2.6 billion sentences across 60 languages. The release also incorporates a number of enhancements in the preprocessing and alignment of the subtitles, such as the automatic correction of OCR errors and the use of meta-data to estimate the quality of each subtitle and score subtitle pairs.
Tasks Optical Character Recognition
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1147/
PDF https://www.aclweb.org/anthology/L16-1147
PWC https://paperswithcode.com/paper/opensubtitles2016-extracting-large-parallel
Repo
Framework

Author Name Disambiguation in MEDLINE Based on Journal Descriptors and Semantic Types

Title Author Name Disambiguation in MEDLINE Based on Journal Descriptors and Semantic Types
Authors Dina Vishnyakova, Raul Rodriguez-Esteban, Khan Ozol, Fabio Rinaldi
Abstract Author name disambiguation (AND) in publication and citation resources is a well-known problem. Often, information about email address and other details in the affiliation is missing. In cases where such information is not available, identifying the authorship of publications becomes very challenging. Consequently, there have been attempts to resolve such cases by utilizing external resources as references. However, such external resources are heterogeneous and are not always reliable regarding the correctness of information. To solve the AND task, especially when information about an author is not complete we suggest the use of new features such as journal descriptors (JD) and semantic types (ST). The evaluation of different feature models shows that their inclusion has an impact equivalent to that of other important features such as email address. Using such features we show that our system outperforms the state of the art.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-5115/
PDF https://www.aclweb.org/anthology/W16-5115
PWC https://paperswithcode.com/paper/author-name-disambiguation-in-medline-based
Repo
Framework

Pynini: A Python library for weighted finite-state grammar compilation

Title Pynini: A Python library for weighted finite-state grammar compilation
Authors Kyle Gorman
Abstract
Tasks Optical Character Recognition, Speech Recognition
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2409/
PDF https://www.aclweb.org/anthology/W16-2409
PWC https://paperswithcode.com/paper/pynini-a-python-library-for-weighted-finite
Repo
Framework

DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter

Title DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter
Authors Julian Hough, Ye Tian, Laura de Ruiter, Simon Betz, Spyros Kousidis, David Schlangen, Jonathan Ginzburg
Abstract We present the DUEL corpus, consisting of 24 hours of natural, face-to-face, loosely task-directed dialogue in German, French and Mandarin Chinese. The corpus is uniquely positioned as a cross-linguistic, multimodal dialogue resource controlled for domain. DUEL includes audio, video and body tracking data and is transcribed and annotated for disfluency, laughter and exclamations.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1281/
PDF https://www.aclweb.org/anthology/L16-1281
PWC https://paperswithcode.com/paper/duel-a-multi-lingual-multimodal-dialogue
Repo
Framework

From Extractive to Abstractive Summarization: A Journey

Title From Extractive to Abstractive Summarization: A Journey
Authors Parth Mehta
Abstract
Tasks Abstractive Text Summarization, Information Retrieval, Machine Translation, Text Generation, Text Summarization
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-3015/
PDF https://www.aclweb.org/anthology/P16-3015
PWC https://paperswithcode.com/paper/from-extractive-to-abstractive-summarization
Repo
Framework

Unsupervised Authorial Clustering Based on Syntactic Structure

Title Unsupervised Authorial Clustering Based on Syntactic Structure
Authors Alon Daks, Aidan Clark
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-3017/
PDF https://www.aclweb.org/anthology/P16-3017
PWC https://paperswithcode.com/paper/unsupervised-authorial-clustering-based-on
Repo
Framework

Real-Time Discovery and Geospatial Visualization of Mobility and Industry Events from Large-Scale, Heterogeneous Data Streams

Title Real-Time Discovery and Geospatial Visualization of Mobility and Industry Events from Large-Scale, Heterogeneous Data Streams
Authors Leonhard Hennig, Philippe Thomas, Renlong Ai, Johannes Kirschnick, He Wang, Jakob Pannier, Nora Zimmermann, Sven Schmeier, Feiyu Xu, Jan Ostwald, Hans Uszkoreit
Abstract
Tasks Decision Making, Domain Adaptation, Named Entity Recognition
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-4007/
PDF https://www.aclweb.org/anthology/P16-4007
PWC https://paperswithcode.com/paper/real-time-discovery-and-geospatial
Repo
Framework
comments powered by Disqus