May 5, 2019

1599 words 8 mins read

Paper Group NANR 28

Towards Building A Domain Agnostic Natural Language Interface to Real-World Relational Databases. Proceedings of the CoNLL-16 shared task. Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study. Multi-prototype Chinese Character Embedding. A Two-stage Approach for Extending Event Detection to New Types via Neural N …

Towards Building A Domain Agnostic Natural Language Interface to Real-World Relational Databases


Title	Towards Building A Domain Agnostic Natural Language Interface to Real-World Relational Databases
Authors	Sree Harsha Ramesh, Jayant Jain, Sarath K S, Krishna R Sundaresan
Abstract
Tasks	Decision Making, Question Answering, Tokenization
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-6338/
PDF	https://www.aclweb.org/anthology/W16-6338
PWC	https://paperswithcode.com/paper/towards-building-a-domain-agnostic-natural
Repo
Framework

Proceedings of the CoNLL-16 shared task


Title	Proceedings of the CoNLL-16 shared task
Authors
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/K16-2000/
PDF	https://www.aclweb.org/anthology/K16-2000
PWC	https://paperswithcode.com/paper/proceedings-of-the-conll-16-shared-task
Repo
Framework

Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study


Title	Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
Authors	Silvie Cinkov{'a}, Ema Krej{\v{c}}ov{'a}, Anna Vernerov{'a}, V{'\i}t Baisa
Abstract	We present a pilot analysis of a new linguistic resource, VPS-GradeUp (available at http://hdl.handle.net/11234/1-1585). The resource contains 11,400 graded human decisions on usage patterns of 29 English lexical verbs, randomly selected from the Pattern Dictionary of English Verbs (Hanks, 2000 2014) based on their frequency and the number of senses their lemmas have in PDEV. This data set has been created to observe the interannotator agreement on PDEV patterns produced using the Corpus Pattern Analysis (Hanks, 2013). Apart from the graded decisions, the data set also contains traditional Word-Sense-Disambiguation (WSD) labels. We analyze the associations between the graded annotation and WSD annotation. The results of the respective annotations do not correlate with the size of the usage pattern inventory for the respective verbs lemmas, which makes the data set worth further linguistic analysis.
Tasks	Word Sense Disambiguation
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1137/
PDF	https://www.aclweb.org/anthology/L16-1137
PWC	https://paperswithcode.com/paper/graded-and-word-sense-disambiguation
Repo
Framework

Multi-prototype Chinese Character Embedding


Title	Multi-prototype Chinese Character Embedding
Authors	Yanan Lu, Yue Zhang, Donghong Ji
Abstract	Chinese sentences are written as sequences of characters, which are elementary units of syntax and semantics. Characters are highly polysemous in forming words. We present a position-sensitive skip-gram model to learn multi-prototype Chinese character embeddings, and explore the usefulness of such character embeddings to Chinese NLP tasks. Evaluation on character similarity shows that multi-prototype embeddings are significantly better than a single-prototype baseline. In addition, used as features in the Chinese NER task, the embeddings result in a 1.74{%} F-score improvement over a state-of-the-art baseline.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1138/
PDF	https://www.aclweb.org/anthology/L16-1138
PWC	https://paperswithcode.com/paper/multi-prototype-chinese-character-embedding
Repo
Framework

A Two-stage Approach for Extending Event Detection to New Types via Neural Networks


Title	A Two-stage Approach for Extending Event Detection to New Types via Neural Networks
Authors	Thien Huu Nguyen, Lisheng Fu, Kyunghyun Cho, Ralph Grishman
Abstract
Tasks	Domain Adaptation, Representation Learning, Transfer Learning
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-1618/
PDF	https://www.aclweb.org/anthology/W16-1618
PWC	https://paperswithcode.com/paper/a-two-stage-approach-for-extending-event
Repo
Framework

Measuring the Similarity of Sentential Arguments in Dialogue


Title	Measuring the Similarity of Sentential Arguments in Dialogue
Authors	Amita Misra, Brian Ecker, Marilyn Walker
Abstract
Tasks
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-3636/
PDF	https://www.aclweb.org/anthology/W16-3636
PWC	https://paperswithcode.com/paper/measuring-the-similarity-of-sentential-1
Repo
Framework

SHEF-LIUM-NN: Sentence level Quality Estimation with Neural Network Features


Title	SHEF-LIUM-NN: Sentence level Quality Estimation with Neural Network Features
Authors	Kashif Shah, Fethi Bougares, Lo{"\i}c Barrault, Lucia Specia
Abstract
Tasks	Feature Engineering, Language Modelling, Machine Translation, Sentence Embeddings, Speech Recognition, Word Embeddings
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2392/
PDF	https://www.aclweb.org/anthology/W16-2392
PWC	https://paperswithcode.com/paper/shef-lium-nn-sentence-level-quality
Repo
Framework

Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs


Title	Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs
Authors	Daniel Couto-Vale, Stella Neumann, Paula Niemietz
Abstract	This paper introduces a toolkit used for the purpose of detecting replacements of different grammatical and semantic structures in ongoing text production logged as a chronological series of computer interaction events (so-called keystroke logs). The specific case we use involves human translations where replacements can be indicative of translator behaviour that leads to specific features of translations that distinguish them from non-translated texts. The toolkit uses a novel CCG chart parser customised so as to recognise grammatical words independently of space and punctuation boundaries. On the basis of the linguistic analysis, structures in different versions of the target text are compared and classified as potential equivalents of the same source text segment by {`}equivalence judges{'}. In that way, replacements of grammatical and semantic structures can be detected. Beyond the specific task at hand the approach will also be useful for the analysis of other types of spaceless text such as Twitter hashtags and texts in agglutinative or spaceless languages like Finnish or Chinese. \|
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1574/
PDF	https://www.aclweb.org/anthology/L16-1574
PWC	https://paperswithcode.com/paper/automatic-recognition-of-linguistic
Repo
Framework

TweetMT: A Parallel Microblog Corpus


Title	TweetMT: A Parallel Microblog Corpus
Authors	I{~n}aki San Vicente, I{~n}aki Alegr{'\i}a, Cristina Espa{~n}a-Bonet, Pablo Gamallo, Hugo Gon{\c{c}}alo Oliveira, Eva Mart{'\i}nez Garcia, Antonio Toral, Arkaitz Zubiaga, Nora Aranberri
Abstract	We introduce TweetMT, a parallel corpus of tweets in four language pairs that combine five languages (Spanish from/to Basque, Catalan, Galician and Portuguese), all of which have an official status in the Iberian Peninsula. The corpus has been created by combining automatic collection and crowdsourcing approaches, and it is publicly available. It is intended for the development and testing of microtext machine translation systems. In this paper we describe the methodology followed to build the corpus, and present the results of the shared task in which it was tested.
Tasks	Machine Translation
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1469/
PDF	https://www.aclweb.org/anthology/L16-1469
PWC	https://paperswithcode.com/paper/tweetmt-a-parallel-microblog-corpus
Repo
Framework


Title	A Probabilistic Model of Social Decision Making based on Reward Maximization
Authors	Koosha Khalvati, Seongmin A. Park, Jean-Claude Dreher, Rajesh P. Rao
Abstract	A fundamental problem in cognitive neuroscience is how humans make decisions, act, and behave in relation to other humans. Here we adopt the hypothesis that when we are in an interactive social setting, our brains perform Bayesian inference of the intentions and cooperativeness of others using probabilistic representations. We employ the framework of partially observable Markov decision processes (POMDPs) to model human decision making in a social context, focusing specifically on the volunteer’s dilemma in a version of the classic Public Goods Game. We show that the POMDP model explains both the behavior of subjects as well as neural activity recorded using fMRI during the game. The decisions of subjects can be modeled across all trials using two interpretable parameters. Furthermore, the expected reward predicted by the model for each subject was correlated with the activation of brain areas related to reward expectation in social interactions. Our results suggest a probabilistic basis for human social decision making within the framework of expected reward maximization.
Tasks	Bayesian Inference, Decision Making
Published	2016-12-01
URL	http://papers.nips.cc/paper/6537-a-probabilistic-model-of-social-decision-making-based-on-reward-maximization
PDF	http://papers.nips.cc/paper/6537-a-probabilistic-model-of-social-decision-making-based-on-reward-maximization.pdf
PWC	https://paperswithcode.com/paper/a-probabilistic-model-of-social-decision
Repo
Framework

Measuring the Effect of Conversational Aspects on Machine Translation Quality


Title	Measuring the Effect of Conversational Aspects on Machine Translation Quality
Authors	Marlies van der Wees, Arianna Bisazza, Christof Monz
Abstract	Research in statistical machine translation (SMT) is largely driven by formal translation tasks, while translating informal text is much more challenging. In this paper we focus on SMT for the informal genre of dialogues, which has rarely been addressed to date. Concretely, we investigate the effect of dialogue acts, speakers, gender, and text register on SMT quality when translating fictional dialogues. We first create and release a corpus of multilingual movie dialogues annotated with these four dialogue-specific aspects. When measuring translation performance for each of these variables, we find that BLEU fluctuations between their categories are often significantly larger than randomly expected. Following this finding, we hypothesize and show that SMT of fictional dialogues benefits from adaptation towards dialogue acts and registers. Finally, we find that male speakers are harder to translate and use more vulgar language than female speakers, and that vulgarity is often not preserved during translation.
Tasks	Machine Translation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1242/
PDF	https://www.aclweb.org/anthology/C16-1242
PWC	https://paperswithcode.com/paper/measuring-the-effect-of-conversational
Repo
Framework

Relation Extraction with Multi-instance Multi-label Convolutional Neural Networks


Title	Relation Extraction with Multi-instance Multi-label Convolutional Neural Networks
Authors	Xiaotian Jiang, Quan Wang, Peng Li, Bin Wang
Abstract	Distant supervision is an efficient approach that automatically generates labeled data for relation extraction (RE). Traditional distantly supervised RE systems rely heavily on handcrafted features, and hence suffer from error propagation. Recently, a neural network architecture has been proposed to automatically extract features for relation classification. However, this approach follows the traditional expressed-at-least-once assumption, and fails to make full use of information across different sentences. Moreover, it ignores the fact that there can be multiple relations holding between the same entity pair. In this paper, we propose a multi-instance multi-label convolutional neural network for distantly supervised RE. It first relaxes the expressed-at-least-once assumption, and employs cross-sentence max-pooling so as to enable information sharing across different sentences. Then it handles overlapping relations by multi-label learning with a neural network classifier. Experimental results show that our approach performs significantly and consistently better than state-of-the-art methods.
Tasks	Multi-Label Learning, Relation Classification, Relation Extraction
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1139/
PDF	https://www.aclweb.org/anthology/C16-1139
PWC	https://paperswithcode.com/paper/relation-extraction-with-multi-instance-multi
Repo
Framework

Survey: Computational Sociolinguistics: A Survey


Title	Survey: Computational Sociolinguistics: A Survey
Authors	Dong Nguyen, A. Seza Do{\u{g}}ru{"o}z, Carolyn P. Ros{'e}, Franciska de Jong
Abstract
Tasks
Published	2016-09-01
URL	https://www.aclweb.org/anthology/J16-3007/
PDF	https://www.aclweb.org/anthology/J16-3007
PWC	https://paperswithcode.com/paper/survey-computational-sociolinguistics-a
Repo
Framework

Squib: Expressive Power of Abstract Meaning Representations


Title	Squib: Expressive Power of Abstract Meaning Representations
Authors	Johan Bos
Abstract
Tasks
Published	2016-09-01
URL	https://www.aclweb.org/anthology/J16-3006/
PDF	https://www.aclweb.org/anthology/J16-3006
PWC	https://paperswithcode.com/paper/squib-expressive-power-of-abstract-meaning
Repo
Framework

Formal Distributional Semantics: Introduction to the Special Issue


Title	Formal Distributional Semantics: Introduction to the Special Issue
Authors	Gemma Boleda, Aur{'e}lie Herbelot
Abstract
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/J16-4002/
PDF	https://www.aclweb.org/anthology/J16-4002
PWC	https://paperswithcode.com/paper/formal-distributional-semantics-introduction
Repo
Framework