May 4, 2019

1312 words 7 mins read

Paper Group NANR 195

Interactively Learning Visually Grounded Word Meanings from a Human Tutor. Faking Intelligent CALL: The Irish context and the road ahead. Cross-lingual Pronoun Prediction for English, French and German with Maximum Entropy Classification. Japanese Text Normalization with Encoder-Decoder Model. ASU: An Experimental Study on Applying Deep Learning in …

Interactively Learning Visually Grounded Word Meanings from a Human Tutor


Title	Interactively Learning Visually Grounded Word Meanings from a Human Tutor
Authors	Yanchao Yu, Arash Eshghi, Oliver Lemon
Abstract
Tasks	Object Classification, Semantic Parsing
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-3206/
PDF	https://www.aclweb.org/anthology/W16-3206
PWC	https://paperswithcode.com/paper/interactively-learning-visually-grounded-word
Repo
Framework

Faking Intelligent CALL: The Irish context and the road ahead


Title	Faking Intelligent CALL: The Irish context and the road ahead
Authors	Neasa N{'\i} Chiar{'a}in, Ailbhe N{'\i} Chasaide
Abstract
Tasks
Published	2016-11-01
URL	https://www.aclweb.org/anthology/W16-6508/
PDF	https://www.aclweb.org/anthology/W16-6508
PWC	https://paperswithcode.com/paper/faking-intelligent-call-the-irish-context-and
Repo
Framework

Cross-lingual Pronoun Prediction for English, French and German with Maximum Entropy Classification


Title	Cross-lingual Pronoun Prediction for English, French and German with Maximum Entropy Classification
Authors	Dominikus Wetzel
Abstract
Tasks	Language Modelling, Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2357/
PDF	https://www.aclweb.org/anthology/W16-2357
PWC	https://paperswithcode.com/paper/cross-lingual-pronoun-prediction-for-english
Repo
Framework

Japanese Text Normalization with Encoder-Decoder Model


Title	Japanese Text Normalization with Encoder-Decoder Model
Authors	Taishi Ikeda, Hiroyuki Shindo, Yuji Matsumoto
Abstract	Text normalization is the task of transforming lexical variants to their canonical forms. We model the problem of text normalization as a character-level sequence to sequence learning problem and present a neural encoder-decoder model for solving it. To train the encoder-decoder model, many sentences pairs are generally required. However, Japanese non-standard canonical pairs are scarce in the form of parallel corpora. To address this issue, we propose a method of data augmentation to increase data size by converting existing resources into synthesized non-standard forms using handcrafted rules. We conducted an experiment to demonstrate that the synthesized corpus contributes to stably train an encoder-decoder model and improve the performance of Japanese text normalization.
Tasks	Data Augmentation, Machine Translation, Morphological Analysis, Part-Of-Speech Tagging
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3918/
PDF	https://www.aclweb.org/anthology/W16-3918
PWC	https://paperswithcode.com/paper/japanese-text-normalization-with-encoder
Repo
Framework

ASU: An Experimental Study on Applying Deep Learning in Twitter Named Entity Recognition.


Title	ASU: An Experimental Study on Applying Deep Learning in Twitter Named Entity Recognition.
Authors	Michel Naim Gerguis, Cherif Salama, M. Watheq El-Kharashi
Abstract	This paper describes the ASU system submitted in the COLING W-NUT 2016 Twitter Named Entity Recognition (NER) task. We present an experimental study on applying deep learning to extracting named entities (NEs) from tweets. We built two Long Short-Term Memory (LSTM) models for the task. The first model was built to extract named entities without types while the second model was built to extract and then classify them into 10 fine-grained entity classes. In this effort, we show detailed experimentation results on the effectiveness of word embeddings, brown clusters, part-of-speech (POS) tags, shape features, gazetteers, and local context for the tweet input vector representation to the LSTM model. Also, we present a set of experiments, to better design the network parameters for the Twitter NER task. Our system was ranked the fifth out of ten participants with a final f1-score for the typed classes of 39{%} and 55{%} for the non typed ones.
Tasks	Entity Extraction, Named Entity Recognition, Opinion Mining, Word Embeddings
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3925/
PDF	https://www.aclweb.org/anthology/W16-3925
PWC	https://paperswithcode.com/paper/asu-an-experimental-study-on-applying-deep
Repo
Framework

Cross-Lingual Image Caption Generation


Title	Cross-Lingual Image Caption Generation
Authors	Takashi Miyazaki, Nobuyuki Shimizu
Abstract
Tasks	Dependency Parsing, Image Captioning, Information Retrieval, Named Entity Recognition, Sentiment Analysis, Transfer Learning
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1168/
PDF	https://www.aclweb.org/anthology/P16-1168
PWC	https://paperswithcode.com/paper/cross-lingual-image-caption-generation
Repo
Framework


Title	Identifying Content Types of Messages Related to Open Source Software Projects
Authors	Yannis Korkontzelos, Paul Thompson, Sophia Ananiadou
Abstract	Assessing the suitability of an Open Source Software project for adoption requires not only an analysis of aspects related to the code, such as code quality, frequency of updates and new version releases, but also an evaluation of the quality of support offered in related online forums and issue trackers. Understanding the content types of forum messages and issue trackers can provide information about the extent to which requests are being addressed and issues are being resolved, the percentage of issues that are not being fixed, the cases where the user acknowledged that the issue was successfully resolved, etc. These indicators can provide potential adopters of the OSS with estimates about the level of available support. We present a detailed hierarchy of content types of online forum messages and issue tracker comments and a corpus of messages annotated accordingly. We discuss our experiments to classify forum messages and issue tracker comments into content-related classes, i.e.{\textasciitilde}to assign them to nodes of the hierarchy. The results are very encouraging.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1290/
PDF	https://www.aclweb.org/anthology/L16-1290
PWC	https://paperswithcode.com/paper/identifying-content-types-of-messages-related
Repo
Framework

The SemDaX Corpus â€• Sense Annotations with Scalable Sense Inventories


Title	The SemDaX Corpus â€• Sense Annotations with Scalable Sense Inventories
Authors	Bolette Pedersen, Anna Braasch, Anders Johannsen, H{'e}ctor Mart{'\i}nez Alonso, Sanni Nimb, Sussi Olsen, Anders S{\o}gaard, Nicolai Hartvig S{\o}rensen
Abstract	We launch the SemDaX corpus which is a recently completed Danish human-annotated corpus available through a CLARIN academic license. The corpus includes approx. 90,000 words, comprises six textual domains, and is annotated with sense inventories of different granularity. The aim of the developed corpus is twofold: i) to assess the reliability of the different sense annotation schemes for Danish measured by qualitative analyses and annotation agreement scores, and ii) to serve as training and test data for machine learning algorithms with the practical purpose of developing sense taggers for Danish. To these aims, we take a new approach to human-annotated corpus resources by double annotating a much larger part of the corpus than what is normally seen: for the all-words task we double annotated 60{%} of the material and for the lexical sample task 100{%}. We include in the corpus not only the adjucated files, but also the diverging annotations. In other words, we consider not all disagreement to be noise, but rather to contain valuable linguistic information that can help us improve our annotation schemes and our learning algorithms.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1136/
PDF	https://www.aclweb.org/anthology/L16-1136
PWC	https://paperswithcode.com/paper/the-semdax-corpus-a-sense-annotations-with
Repo
Framework

DOCAL - Vicomtech’s Participation in the WMT16 Shared Task on Bilingual Document Alignment


Title	DOCAL - Vicomtech’s Participation in the WMT16 Shared Task on Bilingual Document Alignment
Authors	Andoni Azpeitia, Thierry Etchegoyhen
Abstract
Tasks	Machine Translation, Semantic Textual Similarity
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2364/
PDF	https://www.aclweb.org/anthology/W16-2364
PWC	https://paperswithcode.com/paper/docal-vicomtechs-participation-in-the-wmt16
Repo
Framework

Antecedent Prediction Without a Pipeline


Title	Antecedent Prediction Without a Pipeline
Authors	Sam Wiseman, Alex Rush, er M., Stuart Shieber
Abstract
Tasks	Coreference Resolution
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0708/
PDF	https://www.aclweb.org/anthology/W16-0708
PWC	https://paperswithcode.com/paper/antecedent-prediction-without-a-pipeline
Repo
Framework

You and me… in a vector space: modelling individual speakers with distributional semantics


Title	You and me… in a vector space: modelling individual speakers with distributional semantics
Authors	Aur{'e}lie Herbelot, Behrang QasemiZadeh
Abstract
Tasks	Information Retrieval, Language Acquisition
Published	2016-08-01
URL	https://www.aclweb.org/anthology/S16-2023/
PDF	https://www.aclweb.org/anthology/S16-2023
PWC	https://paperswithcode.com/paper/you-and-me-in-a-vector-space-modelling
Repo
Framework

First Steps Towards Coverage-Based Document Alignment


Title	First Steps Towards Coverage-Based Document Alignment
Authors	Lu{'\i}s Gomes, Gabriel Pereira Lopes
Abstract
Tasks	Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2369/
PDF	https://www.aclweb.org/anthology/W16-2369
PWC	https://paperswithcode.com/paper/first-steps-towards-coverage-based-document
Repo
Framework

Network Motifs May Improve Quality Assessment of Text Documents


Title	Network Motifs May Improve Quality Assessment of Text Documents
Authors	Thomas Arnold, Karsten Weihe
Abstract
Tasks
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-1404/
PDF	https://www.aclweb.org/anthology/W16-1404
PWC	https://paperswithcode.com/paper/network-motifs-may-improve-quality-assessment
Repo
Framework

Finding Non-Arbitrary Form-Meaning Systematicity Using String-Metric Learning for Kernel Regression


Title	Finding Non-Arbitrary Form-Meaning Systematicity Using String-Metric Learning for Kernel Regression
Authors	E.Dario Guti{'e}rrez, Roger Levy, Benjamin Bergen
Abstract
Tasks	Metric Learning
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1225/
PDF	https://www.aclweb.org/anthology/P16-1225
PWC	https://paperswithcode.com/paper/finding-non-arbitrary-form-meaning
Repo
Framework

Proceedings of the Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL)


Title	Proceedings of the Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL)
Authors
Abstract
Tasks	Information Retrieval, Question Answering
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-1500/
PDF	https://www.aclweb.org/anthology/W16-1500
PWC	https://paperswithcode.com/paper/proceedings-of-the-joint-workshop-on-3
Repo
Framework