May 5, 2019

1599 words 8 mins read

Paper Group NANR 28

Paper Group NANR 28

Towards Building A Domain Agnostic Natural Language Interface to Real-World Relational Databases. Proceedings of the CoNLL-16 shared task. Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study. Multi-prototype Chinese Character Embedding. A Two-stage Approach for Extending Event Detection to New Types via Neural N …

Towards Building A Domain Agnostic Natural Language Interface to Real-World Relational Databases

Title Towards Building A Domain Agnostic Natural Language Interface to Real-World Relational Databases
Authors Sree Harsha Ramesh, Jayant Jain, Sarath K S, Krishna R Sundaresan
Abstract
Tasks Decision Making, Question Answering, Tokenization
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-6338/
PDF https://www.aclweb.org/anthology/W16-6338
PWC https://paperswithcode.com/paper/towards-building-a-domain-agnostic-natural
Repo
Framework

Proceedings of the CoNLL-16 shared task

Title Proceedings of the CoNLL-16 shared task
Authors
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/K16-2000/
PDF https://www.aclweb.org/anthology/K16-2000
PWC https://paperswithcode.com/paper/proceedings-of-the-conll-16-shared-task
Repo
Framework

Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study

Title Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
Authors Silvie Cinkov{'a}, Ema Krej{\v{c}}ov{'a}, Anna Vernerov{'a}, V{'\i}t Baisa
Abstract We present a pilot analysis of a new linguistic resource, VPS-GradeUp (available at http://hdl.handle.net/11234/1-1585). The resource contains 11,400 graded human decisions on usage patterns of 29 English lexical verbs, randomly selected from the Pattern Dictionary of English Verbs (Hanks, 2000 2014) based on their frequency and the number of senses their lemmas have in PDEV. This data set has been created to observe the interannotator agreement on PDEV patterns produced using the Corpus Pattern Analysis (Hanks, 2013). Apart from the graded decisions, the data set also contains traditional Word-Sense-Disambiguation (WSD) labels. We analyze the associations between the graded annotation and WSD annotation. The results of the respective annotations do not correlate with the size of the usage pattern inventory for the respective verbs lemmas, which makes the data set worth further linguistic analysis.
Tasks Word Sense Disambiguation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1137/
PDF https://www.aclweb.org/anthology/L16-1137
PWC https://paperswithcode.com/paper/graded-and-word-sense-disambiguation
Repo
Framework

Multi-prototype Chinese Character Embedding

Title Multi-prototype Chinese Character Embedding
Authors Yanan Lu, Yue Zhang, Donghong Ji
Abstract Chinese sentences are written as sequences of characters, which are elementary units of syntax and semantics. Characters are highly polysemous in forming words. We present a position-sensitive skip-gram model to learn multi-prototype Chinese character embeddings, and explore the usefulness of such character embeddings to Chinese NLP tasks. Evaluation on character similarity shows that multi-prototype embeddings are significantly better than a single-prototype baseline. In addition, used as features in the Chinese NER task, the embeddings result in a 1.74{%} F-score improvement over a state-of-the-art baseline.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1138/
PDF https://www.aclweb.org/anthology/L16-1138
PWC https://paperswithcode.com/paper/multi-prototype-chinese-character-embedding
Repo
Framework

A Two-stage Approach for Extending Event Detection to New Types via Neural Networks

Title A Two-stage Approach for Extending Event Detection to New Types via Neural Networks
Authors Thien Huu Nguyen, Lisheng Fu, Kyunghyun Cho, Ralph Grishman
Abstract
Tasks Domain Adaptation, Representation Learning, Transfer Learning
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-1618/
PDF https://www.aclweb.org/anthology/W16-1618
PWC https://paperswithcode.com/paper/a-two-stage-approach-for-extending-event
Repo
Framework

Measuring the Similarity of Sentential Arguments in Dialogue

Title Measuring the Similarity of Sentential Arguments in Dialogue
Authors Amita Misra, Brian Ecker, Marilyn Walker
Abstract
Tasks
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3636/
PDF https://www.aclweb.org/anthology/W16-3636
PWC https://paperswithcode.com/paper/measuring-the-similarity-of-sentential-1
Repo
Framework

SHEF-LIUM-NN: Sentence level Quality Estimation with Neural Network Features

Title SHEF-LIUM-NN: Sentence level Quality Estimation with Neural Network Features
Authors Kashif Shah, Fethi Bougares, Lo{"\i}c Barrault, Lucia Specia
Abstract
Tasks Feature Engineering, Language Modelling, Machine Translation, Sentence Embeddings, Speech Recognition, Word Embeddings
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2392/
PDF https://www.aclweb.org/anthology/W16-2392
PWC https://paperswithcode.com/paper/shef-lium-nn-sentence-level-quality
Repo
Framework

Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs

Title Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs
Authors Daniel Couto-Vale, Stella Neumann, Paula Niemietz
Abstract This paper introduces a toolkit used for the purpose of detecting replacements of different grammatical and semantic structures in ongoing text production logged as a chronological series of computer interaction events (so-called keystroke logs). The specific case we use involves human translations where replacements can be indicative of translator behaviour that leads to specific features of translations that distinguish them from non-translated texts. The toolkit uses a novel CCG chart parser customised so as to recognise grammatical words independently of space and punctuation boundaries. On the basis of the linguistic analysis, structures in different versions of the target text are compared and classified as potential equivalents of the same source text segment by {`}equivalence judges{'}. In that way, replacements of grammatical and semantic structures can be detected. Beyond the specific task at hand the approach will also be useful for the analysis of other types of spaceless text such as Twitter hashtags and texts in agglutinative or spaceless languages like Finnish or Chinese. |
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1574/
PDF https://www.aclweb.org/anthology/L16-1574
PWC https://paperswithcode.com/paper/automatic-recognition-of-linguistic
Repo
Framework

TweetMT: A Parallel Microblog Corpus

Title TweetMT: A Parallel Microblog Corpus
Authors I{~n}aki San Vicente, I{~n}aki Alegr{'\i}a, Cristina Espa{~n}a-Bonet, Pablo Gamallo, Hugo Gon{\c{c}}alo Oliveira, Eva Mart{'\i}nez Garcia, Antonio Toral, Arkaitz Zubiaga, Nora Aranberri
Abstract We introduce TweetMT, a parallel corpus of tweets in four language pairs that combine five languages (Spanish from/to Basque, Catalan, Galician and Portuguese), all of which have an official status in the Iberian Peninsula. The corpus has been created by combining automatic collection and crowdsourcing approaches, and it is publicly available. It is intended for the development and testing of microtext machine translation systems. In this paper we describe the methodology followed to build the corpus, and present the results of the shared task in which it was tested.
Tasks Machine Translation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1469/
PDF https://www.aclweb.org/anthology/L16-1469
PWC https://paperswithcode.com/paper/tweetmt-a-parallel-microblog-corpus
Repo
Framework

A Probabilistic Model of Social Decision Making based on Reward Maximization

Title A Probabilistic Model of Social Decision Making based on Reward Maximization
Authors Koosha Khalvati, Seongmin A. Park, Jean-Claude Dreher, Rajesh P. Rao
Abstract A fundamental problem in cognitive neuroscience is how humans make decisions, act, and behave in relation to other humans. Here we adopt the hypothesis that when we are in an interactive social setting, our brains perform Bayesian inference of the intentions and cooperativeness of others using probabilistic representations. We employ the framework of partially observable Markov decision processes (POMDPs) to model human decision making in a social context, focusing specifically on the volunteer’s dilemma in a version of the classic Public Goods Game. We show that the POMDP model explains both the behavior of subjects as well as neural activity recorded using fMRI during the game. The decisions of subjects can be modeled across all trials using two interpretable parameters. Furthermore, the expected reward predicted by the model for each subject was correlated with the activation of brain areas related to reward expectation in social interactions. Our results suggest a probabilistic basis for human social decision making within the framework of expected reward maximization.
Tasks Bayesian Inference, Decision Making
Published 2016-12-01
URL http://papers.nips.cc/paper/6537-a-probabilistic-model-of-social-decision-making-based-on-reward-maximization
PDF http://papers.nips.cc/paper/6537-a-probabilistic-model-of-social-decision-making-based-on-reward-maximization.pdf
PWC https://paperswithcode.com/paper/a-probabilistic-model-of-social-decision
Repo
Framework

Measuring the Effect of Conversational Aspects on Machine Translation Quality

Title Measuring the Effect of Conversational Aspects on Machine Translation Quality
Authors Marlies van der Wees, Arianna Bisazza, Christof Monz
Abstract Research in statistical machine translation (SMT) is largely driven by formal translation tasks, while translating informal text is much more challenging. In this paper we focus on SMT for the informal genre of dialogues, which has rarely been addressed to date. Concretely, we investigate the effect of dialogue acts, speakers, gender, and text register on SMT quality when translating fictional dialogues. We first create and release a corpus of multilingual movie dialogues annotated with these four dialogue-specific aspects. When measuring translation performance for each of these variables, we find that BLEU fluctuations between their categories are often significantly larger than randomly expected. Following this finding, we hypothesize and show that SMT of fictional dialogues benefits from adaptation towards dialogue acts and registers. Finally, we find that male speakers are harder to translate and use more vulgar language than female speakers, and that vulgarity is often not preserved during translation.
Tasks Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1242/
PDF https://www.aclweb.org/anthology/C16-1242
PWC https://paperswithcode.com/paper/measuring-the-effect-of-conversational
Repo
Framework

Relation Extraction with Multi-instance Multi-label Convolutional Neural Networks

Title Relation Extraction with Multi-instance Multi-label Convolutional Neural Networks
Authors Xiaotian Jiang, Quan Wang, Peng Li, Bin Wang
Abstract Distant supervision is an efficient approach that automatically generates labeled data for relation extraction (RE). Traditional distantly supervised RE systems rely heavily on handcrafted features, and hence suffer from error propagation. Recently, a neural network architecture has been proposed to automatically extract features for relation classification. However, this approach follows the traditional expressed-at-least-once assumption, and fails to make full use of information across different sentences. Moreover, it ignores the fact that there can be multiple relations holding between the same entity pair. In this paper, we propose a multi-instance multi-label convolutional neural network for distantly supervised RE. It first relaxes the expressed-at-least-once assumption, and employs cross-sentence max-pooling so as to enable information sharing across different sentences. Then it handles overlapping relations by multi-label learning with a neural network classifier. Experimental results show that our approach performs significantly and consistently better than state-of-the-art methods.
Tasks Multi-Label Learning, Relation Classification, Relation Extraction
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1139/
PDF https://www.aclweb.org/anthology/C16-1139
PWC https://paperswithcode.com/paper/relation-extraction-with-multi-instance-multi
Repo
Framework

Survey: Computational Sociolinguistics: A Survey

Title Survey: Computational Sociolinguistics: A Survey
Authors Dong Nguyen, A. Seza Do{\u{g}}ru{"o}z, Carolyn P. Ros{'e}, Franciska de Jong
Abstract
Tasks
Published 2016-09-01
URL https://www.aclweb.org/anthology/J16-3007/
PDF https://www.aclweb.org/anthology/J16-3007
PWC https://paperswithcode.com/paper/survey-computational-sociolinguistics-a
Repo
Framework

Squib: Expressive Power of Abstract Meaning Representations

Title Squib: Expressive Power of Abstract Meaning Representations
Authors Johan Bos
Abstract
Tasks
Published 2016-09-01
URL https://www.aclweb.org/anthology/J16-3006/
PDF https://www.aclweb.org/anthology/J16-3006
PWC https://paperswithcode.com/paper/squib-expressive-power-of-abstract-meaning
Repo
Framework

Formal Distributional Semantics: Introduction to the Special Issue

Title Formal Distributional Semantics: Introduction to the Special Issue
Authors Gemma Boleda, Aur{'e}lie Herbelot
Abstract
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/J16-4002/
PDF https://www.aclweb.org/anthology/J16-4002
PWC https://paperswithcode.com/paper/formal-distributional-semantics-introduction
Repo
Framework
comments powered by Disqus