Paper Group NANR 28
Towards Building A Domain Agnostic Natural Language Interface to Real-World Relational Databases. Proceedings of the CoNLL-16 shared task. Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study. Multi-prototype Chinese Character Embedding. A Two-stage Approach for Extending Event Detection to New Types via Neural N …
Towards Building A Domain Agnostic Natural Language Interface to Real-World Relational Databases
Title | Towards Building A Domain Agnostic Natural Language Interface to Real-World Relational Databases |
Authors | Sree Harsha Ramesh, Jayant Jain, Sarath K S, Krishna R Sundaresan |
Abstract | |
Tasks | Decision Making, Question Answering, Tokenization |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-6338/ |
https://www.aclweb.org/anthology/W16-6338 | |
PWC | https://paperswithcode.com/paper/towards-building-a-domain-agnostic-natural |
Repo | |
Framework | |
Proceedings of the CoNLL-16 shared task
Title | Proceedings of the CoNLL-16 shared task |
Authors | |
Abstract | |
Tasks | |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/K16-2000/ |
https://www.aclweb.org/anthology/K16-2000 | |
PWC | https://paperswithcode.com/paper/proceedings-of-the-conll-16-shared-task |
Repo | |
Framework | |
Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
Title | Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study |
Authors | Silvie Cinkov{'a}, Ema Krej{\v{c}}ov{'a}, Anna Vernerov{'a}, V{'\i}t Baisa |
Abstract | We present a pilot analysis of a new linguistic resource, VPS-GradeUp (available at http://hdl.handle.net/11234/1-1585). The resource contains 11,400 graded human decisions on usage patterns of 29 English lexical verbs, randomly selected from the Pattern Dictionary of English Verbs (Hanks, 2000 2014) based on their frequency and the number of senses their lemmas have in PDEV. This data set has been created to observe the interannotator agreement on PDEV patterns produced using the Corpus Pattern Analysis (Hanks, 2013). Apart from the graded decisions, the data set also contains traditional Word-Sense-Disambiguation (WSD) labels. We analyze the associations between the graded annotation and WSD annotation. The results of the respective annotations do not correlate with the size of the usage pattern inventory for the respective verbs lemmas, which makes the data set worth further linguistic analysis. |
Tasks | Word Sense Disambiguation |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1137/ |
https://www.aclweb.org/anthology/L16-1137 | |
PWC | https://paperswithcode.com/paper/graded-and-word-sense-disambiguation |
Repo | |
Framework | |
Multi-prototype Chinese Character Embedding
Title | Multi-prototype Chinese Character Embedding |
Authors | Yanan Lu, Yue Zhang, Donghong Ji |
Abstract | Chinese sentences are written as sequences of characters, which are elementary units of syntax and semantics. Characters are highly polysemous in forming words. We present a position-sensitive skip-gram model to learn multi-prototype Chinese character embeddings, and explore the usefulness of such character embeddings to Chinese NLP tasks. Evaluation on character similarity shows that multi-prototype embeddings are significantly better than a single-prototype baseline. In addition, used as features in the Chinese NER task, the embeddings result in a 1.74{%} F-score improvement over a state-of-the-art baseline. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1138/ |
https://www.aclweb.org/anthology/L16-1138 | |
PWC | https://paperswithcode.com/paper/multi-prototype-chinese-character-embedding |
Repo | |
Framework | |
A Two-stage Approach for Extending Event Detection to New Types via Neural Networks
Title | A Two-stage Approach for Extending Event Detection to New Types via Neural Networks |
Authors | Thien Huu Nguyen, Lisheng Fu, Kyunghyun Cho, Ralph Grishman |
Abstract | |
Tasks | Domain Adaptation, Representation Learning, Transfer Learning |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-1618/ |
https://www.aclweb.org/anthology/W16-1618 | |
PWC | https://paperswithcode.com/paper/a-two-stage-approach-for-extending-event |
Repo | |
Framework | |
Measuring the Similarity of Sentential Arguments in Dialogue
Title | Measuring the Similarity of Sentential Arguments in Dialogue |
Authors | Amita Misra, Brian Ecker, Marilyn Walker |
Abstract | |
Tasks | |
Published | 2016-09-01 |
URL | https://www.aclweb.org/anthology/W16-3636/ |
https://www.aclweb.org/anthology/W16-3636 | |
PWC | https://paperswithcode.com/paper/measuring-the-similarity-of-sentential-1 |
Repo | |
Framework | |
SHEF-LIUM-NN: Sentence level Quality Estimation with Neural Network Features
Title | SHEF-LIUM-NN: Sentence level Quality Estimation with Neural Network Features |
Authors | Kashif Shah, Fethi Bougares, Lo{"\i}c Barrault, Lucia Specia |
Abstract | |
Tasks | Feature Engineering, Language Modelling, Machine Translation, Sentence Embeddings, Speech Recognition, Word Embeddings |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2392/ |
https://www.aclweb.org/anthology/W16-2392 | |
PWC | https://paperswithcode.com/paper/shef-lium-nn-sentence-level-quality |
Repo | |
Framework | |
Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs
Title | Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs |
Authors | Daniel Couto-Vale, Stella Neumann, Paula Niemietz |
Abstract | This paper introduces a toolkit used for the purpose of detecting replacements of different grammatical and semantic structures in ongoing text production logged as a chronological series of computer interaction events (so-called keystroke logs). The specific case we use involves human translations where replacements can be indicative of translator behaviour that leads to specific features of translations that distinguish them from non-translated texts. The toolkit uses a novel CCG chart parser customised so as to recognise grammatical words independently of space and punctuation boundaries. On the basis of the linguistic analysis, structures in different versions of the target text are compared and classified as potential equivalents of the same source text segment by {`}equivalence judges{'}. In that way, replacements of grammatical and semantic structures can be detected. Beyond the specific task at hand the approach will also be useful for the analysis of other types of spaceless text such as Twitter hashtags and texts in agglutinative or spaceless languages like Finnish or Chinese. | |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1574/ |
https://www.aclweb.org/anthology/L16-1574 | |
PWC | https://paperswithcode.com/paper/automatic-recognition-of-linguistic |
Repo | |
Framework | |
TweetMT: A Parallel Microblog Corpus
Title | TweetMT: A Parallel Microblog Corpus |
Authors | I{~n}aki San Vicente, I{~n}aki Alegr{'\i}a, Cristina Espa{~n}a-Bonet, Pablo Gamallo, Hugo Gon{\c{c}}alo Oliveira, Eva Mart{'\i}nez Garcia, Antonio Toral, Arkaitz Zubiaga, Nora Aranberri |
Abstract | We introduce TweetMT, a parallel corpus of tweets in four language pairs that combine five languages (Spanish from/to Basque, Catalan, Galician and Portuguese), all of which have an official status in the Iberian Peninsula. The corpus has been created by combining automatic collection and crowdsourcing approaches, and it is publicly available. It is intended for the development and testing of microtext machine translation systems. In this paper we describe the methodology followed to build the corpus, and present the results of the shared task in which it was tested. |
Tasks | Machine Translation |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1469/ |
https://www.aclweb.org/anthology/L16-1469 | |
PWC | https://paperswithcode.com/paper/tweetmt-a-parallel-microblog-corpus |
Repo | |
Framework | |
A Probabilistic Model of Social Decision Making based on Reward Maximization
Title | A Probabilistic Model of Social Decision Making based on Reward Maximization |
Authors | Koosha Khalvati, Seongmin A. Park, Jean-Claude Dreher, Rajesh P. Rao |
Abstract | A fundamental problem in cognitive neuroscience is how humans make decisions, act, and behave in relation to other humans. Here we adopt the hypothesis that when we are in an interactive social setting, our brains perform Bayesian inference of the intentions and cooperativeness of others using probabilistic representations. We employ the framework of partially observable Markov decision processes (POMDPs) to model human decision making in a social context, focusing specifically on the volunteer’s dilemma in a version of the classic Public Goods Game. We show that the POMDP model explains both the behavior of subjects as well as neural activity recorded using fMRI during the game. The decisions of subjects can be modeled across all trials using two interpretable parameters. Furthermore, the expected reward predicted by the model for each subject was correlated with the activation of brain areas related to reward expectation in social interactions. Our results suggest a probabilistic basis for human social decision making within the framework of expected reward maximization. |
Tasks | Bayesian Inference, Decision Making |
Published | 2016-12-01 |
URL | http://papers.nips.cc/paper/6537-a-probabilistic-model-of-social-decision-making-based-on-reward-maximization |
http://papers.nips.cc/paper/6537-a-probabilistic-model-of-social-decision-making-based-on-reward-maximization.pdf | |
PWC | https://paperswithcode.com/paper/a-probabilistic-model-of-social-decision |
Repo | |
Framework | |
Measuring the Effect of Conversational Aspects on Machine Translation Quality
Title | Measuring the Effect of Conversational Aspects on Machine Translation Quality |
Authors | Marlies van der Wees, Arianna Bisazza, Christof Monz |
Abstract | Research in statistical machine translation (SMT) is largely driven by formal translation tasks, while translating informal text is much more challenging. In this paper we focus on SMT for the informal genre of dialogues, which has rarely been addressed to date. Concretely, we investigate the effect of dialogue acts, speakers, gender, and text register on SMT quality when translating fictional dialogues. We first create and release a corpus of multilingual movie dialogues annotated with these four dialogue-specific aspects. When measuring translation performance for each of these variables, we find that BLEU fluctuations between their categories are often significantly larger than randomly expected. Following this finding, we hypothesize and show that SMT of fictional dialogues benefits from adaptation towards dialogue acts and registers. Finally, we find that male speakers are harder to translate and use more vulgar language than female speakers, and that vulgarity is often not preserved during translation. |
Tasks | Machine Translation |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-1242/ |
https://www.aclweb.org/anthology/C16-1242 | |
PWC | https://paperswithcode.com/paper/measuring-the-effect-of-conversational |
Repo | |
Framework | |
Relation Extraction with Multi-instance Multi-label Convolutional Neural Networks
Title | Relation Extraction with Multi-instance Multi-label Convolutional Neural Networks |
Authors | Xiaotian Jiang, Quan Wang, Peng Li, Bin Wang |
Abstract | Distant supervision is an efficient approach that automatically generates labeled data for relation extraction (RE). Traditional distantly supervised RE systems rely heavily on handcrafted features, and hence suffer from error propagation. Recently, a neural network architecture has been proposed to automatically extract features for relation classification. However, this approach follows the traditional expressed-at-least-once assumption, and fails to make full use of information across different sentences. Moreover, it ignores the fact that there can be multiple relations holding between the same entity pair. In this paper, we propose a multi-instance multi-label convolutional neural network for distantly supervised RE. It first relaxes the expressed-at-least-once assumption, and employs cross-sentence max-pooling so as to enable information sharing across different sentences. Then it handles overlapping relations by multi-label learning with a neural network classifier. Experimental results show that our approach performs significantly and consistently better than state-of-the-art methods. |
Tasks | Multi-Label Learning, Relation Classification, Relation Extraction |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-1139/ |
https://www.aclweb.org/anthology/C16-1139 | |
PWC | https://paperswithcode.com/paper/relation-extraction-with-multi-instance-multi |
Repo | |
Framework | |
Survey: Computational Sociolinguistics: A Survey
Title | Survey: Computational Sociolinguistics: A Survey |
Authors | Dong Nguyen, A. Seza Do{\u{g}}ru{"o}z, Carolyn P. Ros{'e}, Franciska de Jong |
Abstract | |
Tasks | |
Published | 2016-09-01 |
URL | https://www.aclweb.org/anthology/J16-3007/ |
https://www.aclweb.org/anthology/J16-3007 | |
PWC | https://paperswithcode.com/paper/survey-computational-sociolinguistics-a |
Repo | |
Framework | |
Squib: Expressive Power of Abstract Meaning Representations
Title | Squib: Expressive Power of Abstract Meaning Representations |
Authors | Johan Bos |
Abstract | |
Tasks | |
Published | 2016-09-01 |
URL | https://www.aclweb.org/anthology/J16-3006/ |
https://www.aclweb.org/anthology/J16-3006 | |
PWC | https://paperswithcode.com/paper/squib-expressive-power-of-abstract-meaning |
Repo | |
Framework | |
Formal Distributional Semantics: Introduction to the Special Issue
Title | Formal Distributional Semantics: Introduction to the Special Issue |
Authors | Gemma Boleda, Aur{'e}lie Herbelot |
Abstract | |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/J16-4002/ |
https://www.aclweb.org/anthology/J16-4002 | |
PWC | https://paperswithcode.com/paper/formal-distributional-semantics-introduction |
Repo | |
Framework | |