May 4, 2019

1456 words 7 mins read

Paper Group NANR 222

The red one!: On learning to refer to things based on discriminative properties. Two-View Label Propagation to Semi-supervised Reader Emotion Classification. An Unsupervised Method for Automatic Translation Memory Cleaning. Towards Generalizable Sentence Embeddings. Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text. …

The red one!: On learning to refer to things based on discriminative properties


Title	The red one!: On learning to refer to things based on discriminative properties
Authors	Angeliki Lazaridou, Nghia The Pham, Marco Baroni
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-2035/
PDF	https://www.aclweb.org/anthology/P16-2035
PWC	https://paperswithcode.com/paper/the-red-one-on-learning-to-refer-to-things
Repo
Framework

Two-View Label Propagation to Semi-supervised Reader Emotion Classification


Title	Two-View Label Propagation to Semi-supervised Reader Emotion Classification
Authors	Shoushan Li, Jian Xu, Dong Zhang, Guodong Zhou
Abstract	In the literature, various supervised learning approaches have been adopted to address the task of reader emotion classification. However, the classification performance greatly suffers when the size of the labeled data is limited. In this paper, we propose a two-view label propagation approach to semi-supervised reader emotion classification by exploiting two views, namely source text and response text in a label propagation algorithm. Specifically, our approach depends on two word-document bipartite graphs to model the relationship among the samples in the two views respectively. Besides, the two bipartite graphs are integrated by linking each source text sample with its corresponding response text sample via a length-sensitive transition probability. In this way, our two-view label propagation approach to semi-supervised reader emotion classification largely alleviates the reliance on the strong sufficiency and independence assumptions of the two views, as required in co-training. Empirical evaluation demonstrates the effectiveness of our two-view label propagation approach to semi-supervised reader emotion classification.
Tasks	Emotion Classification
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1249/
PDF	https://www.aclweb.org/anthology/C16-1249
PWC	https://paperswithcode.com/paper/two-view-label-propagation-to-semi-supervised
Repo
Framework

An Unsupervised Method for Automatic Translation Memory Cleaning


Title	An Unsupervised Method for Automatic Translation Memory Cleaning
Authors	Masoud Jalili Sabet, Matteo Negri, Marco Turchi, Eduard Barbu
Abstract
Tasks	Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-2047/
PDF	https://www.aclweb.org/anthology/P16-2047
PWC	https://paperswithcode.com/paper/an-unsupervised-method-for-automatic
Repo
Framework

Towards Generalizable Sentence Embeddings


Title	Towards Generalizable Sentence Embeddings
Authors	Eleni Triantafillou, Jamie Ryan Kiros, Raquel Urtasun, Richard Zemel
Abstract
Tasks	Information Retrieval, Natural Language Inference, One-Shot Learning, Representation Learning, Sentence Embeddings, Text Summarization, Transfer Learning
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-1628/
PDF	https://www.aclweb.org/anthology/W16-1628
PWC	https://paperswithcode.com/paper/towards-generalizable-sentence-embeddings
Repo
Framework

Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text


Title	Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text
Authors	Kristina Toutanova, Victoria Lin, Wen-tau Yih, Hoifung Poon, Chris Quirk
Abstract
Tasks	Open-Domain Question Answering, Question Answering, Relational Reasoning
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1136/
PDF	https://www.aclweb.org/anthology/P16-1136
PWC	https://paperswithcode.com/paper/compositional-learning-of-embeddings-for
Repo
Framework

SAMER: A Semi-Automatically Created Lexical Resource for Arabic Verbal Multiword Expressions Tokens Paradigm and their Morphosyntactic Features


Title	SAMER: A Semi-Automatically Created Lexical Resource for Arabic Verbal Multiword Expressions Tokens Paradigm and their Morphosyntactic Features
Authors	Mohamed Al-Badrashiny, Abdelati Hawwari, Mahmoud Ghoneim, Mona Diab
Abstract	Although MWE are relatively morphologically and syntactically fixed expressions, several types of flexibility can be observed in MWE, verbal MWE in particular. Identifying the degree of morphological and syntactic flexibility of MWE is very important for many Lexicographic and NLP tasks. Adding MWE variants/tokens to a dictionary resource requires characterizing the flexibility among other morphosyntactic features. Carrying out the task manually faces several challenges since it is a very laborious task time and effort wise, as well as it will suffer from coverage limitation. The problem is exacerbated in rich morphological languages where the average word in Arabic could have 12 possible inflection forms. Accordingly, in this paper we introduce a semi-automatic Arabic multiwords expressions resource (SAMER). We propose an automated method that identifies the morphological and syntactic flexibility of Arabic Verbal Multiword Expressions (AVMWE). All observed morphological variants and syntactic pattern alternations of an AVMWE are automatically acquired using large scale corpora. We look for three morphosyntactic aspects of AVMWE types investigating derivational and inflectional variations and syntactic templates, namely: 1) inflectional variation (inflectional paradigm) and calculating degree of flexibility; 2) derivational productivity; and 3) identifying and classifying the different syntactic types. We build a comprehensive list of AVMWE. Every token in the AVMWE list is lemmatized and tagged with POS information. We then search Arabic Gigaword and All ATBs for all possible flexible matches. For each AVMWE type we generate: a) a statistically ranked list of MWE-lexeme inflections and syntactic pattern alternations; b) An abstract syntactic template; and c) The most frequent form. Our technique is validated using a Golden MWE annotated list. The results shows that the quality of the generated resource is 80.04{%}.
Tasks	Machine Translation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-5414/
PDF	https://www.aclweb.org/anthology/W16-5414
PWC	https://paperswithcode.com/paper/samer-a-semi-automatically-created-lexical
Repo
Framework

Automatic Text Generation by Learning from Literary Structures


Title	Automatic Text Generation by Learning from Literary Structures
Authors	Angel Daza, Hiram Calvo, Jes{'u}s Figueroa-Nazuno
Abstract
Tasks	Common Sense Reasoning, Text Generation
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0202/
PDF	https://www.aclweb.org/anthology/W16-0202
PWC	https://paperswithcode.com/paper/automatic-text-generation-by-learning-from
Repo
Framework

Learning Translations for Tagged Words: Extending the Translation Lexicon of an ITG for Low Resource Languages


Title	Learning Translations for Tagged Words: Extending the Translation Lexicon of an ITG for Low Resource Languages
Authors	Markus Saers, Dekai Wu
Abstract
Tasks	Machine Translation
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-1207/
PDF	https://www.aclweb.org/anthology/W16-1207
PWC	https://paperswithcode.com/paper/learning-translations-for-tagged-words
Repo
Framework

OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles


Title	OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles
Authors	Pierre Lison, J{"o}rg Tiedemann
Abstract	We present a new major release of the OpenSubtitles collection of parallel corpora. The release is compiled from a large database of movie and TV subtitles and includes a total of 1689 bitexts spanning 2.6 billion sentences across 60 languages. The release also incorporates a number of enhancements in the preprocessing and alignment of the subtitles, such as the automatic correction of OCR errors and the use of meta-data to estimate the quality of each subtitle and score subtitle pairs.
Tasks	Optical Character Recognition
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1147/
PDF	https://www.aclweb.org/anthology/L16-1147
PWC	https://paperswithcode.com/paper/opensubtitles2016-extracting-large-parallel
Repo
Framework

Author Name Disambiguation in MEDLINE Based on Journal Descriptors and Semantic Types


Title	Author Name Disambiguation in MEDLINE Based on Journal Descriptors and Semantic Types
Authors	Dina Vishnyakova, Raul Rodriguez-Esteban, Khan Ozol, Fabio Rinaldi
Abstract	Author name disambiguation (AND) in publication and citation resources is a well-known problem. Often, information about email address and other details in the affiliation is missing. In cases where such information is not available, identifying the authorship of publications becomes very challenging. Consequently, there have been attempts to resolve such cases by utilizing external resources as references. However, such external resources are heterogeneous and are not always reliable regarding the correctness of information. To solve the AND task, especially when information about an author is not complete we suggest the use of new features such as journal descriptors (JD) and semantic types (ST). The evaluation of different feature models shows that their inclusion has an impact equivalent to that of other important features such as email address. Using such features we show that our system outperforms the state of the art.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-5115/
PDF	https://www.aclweb.org/anthology/W16-5115
PWC	https://paperswithcode.com/paper/author-name-disambiguation-in-medline-based
Repo
Framework

Pynini: A Python library for weighted finite-state grammar compilation


Title	Pynini: A Python library for weighted finite-state grammar compilation
Authors	Kyle Gorman
Abstract
Tasks	Optical Character Recognition, Speech Recognition
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2409/
PDF	https://www.aclweb.org/anthology/W16-2409
PWC	https://paperswithcode.com/paper/pynini-a-python-library-for-weighted-finite
Repo
Framework

DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter


Title	DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter
Authors	Julian Hough, Ye Tian, Laura de Ruiter, Simon Betz, Spyros Kousidis, David Schlangen, Jonathan Ginzburg
Abstract	We present the DUEL corpus, consisting of 24 hours of natural, face-to-face, loosely task-directed dialogue in German, French and Mandarin Chinese. The corpus is uniquely positioned as a cross-linguistic, multimodal dialogue resource controlled for domain. DUEL includes audio, video and body tracking data and is transcribed and annotated for disfluency, laughter and exclamations.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1281/
PDF	https://www.aclweb.org/anthology/L16-1281
PWC	https://paperswithcode.com/paper/duel-a-multi-lingual-multimodal-dialogue
Repo
Framework

From Extractive to Abstractive Summarization: A Journey


Title	From Extractive to Abstractive Summarization: A Journey
Authors	Parth Mehta
Abstract
Tasks	Abstractive Text Summarization, Information Retrieval, Machine Translation, Text Generation, Text Summarization
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-3015/
PDF	https://www.aclweb.org/anthology/P16-3015
PWC	https://paperswithcode.com/paper/from-extractive-to-abstractive-summarization
Repo
Framework

Unsupervised Authorial Clustering Based on Syntactic Structure


Title	Unsupervised Authorial Clustering Based on Syntactic Structure
Authors	Alon Daks, Aidan Clark
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-3017/
PDF	https://www.aclweb.org/anthology/P16-3017
PWC	https://paperswithcode.com/paper/unsupervised-authorial-clustering-based-on
Repo
Framework

Real-Time Discovery and Geospatial Visualization of Mobility and Industry Events from Large-Scale, Heterogeneous Data Streams


Title	Real-Time Discovery and Geospatial Visualization of Mobility and Industry Events from Large-Scale, Heterogeneous Data Streams
Authors	Leonhard Hennig, Philippe Thomas, Renlong Ai, Johannes Kirschnick, He Wang, Jakob Pannier, Nora Zimmermann, Sven Schmeier, Feiyu Xu, Jan Ostwald, Hans Uszkoreit
Abstract
Tasks	Decision Making, Domain Adaptation, Named Entity Recognition
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-4007/
PDF	https://www.aclweb.org/anthology/P16-4007
PWC	https://paperswithcode.com/paper/real-time-discovery-and-geospatial
Repo
Framework