Paper Group NANR 197
An Improved Crowdsourcing Based Evaluation Technique for Word Embedding Methods. Implicit Discourse Relation Recognition with Context-aware Character-enhanced Embeddings. What Papers Should I Cite from my Reading List? User Evaluation of a Manuscript Preparatory Assistive Task. UWB at SemEval-2016 Task 11: Exploring Features for Complex Word Identi …
An Improved Crowdsourcing Based Evaluation Technique for Word Embedding Methods
Title | An Improved Crowdsourcing Based Evaluation Technique for Word Embedding Methods |
Authors | Farhana Ferdousi Liza, Marek Grze{'s} |
Abstract | |
Tasks | Word Embeddings, Word Sense Disambiguation |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2510/ |
https://www.aclweb.org/anthology/W16-2510 | |
PWC | https://paperswithcode.com/paper/an-improved-crowdsourcing-based-evaluation |
Repo | |
Framework | |
Implicit Discourse Relation Recognition with Context-aware Character-enhanced Embeddings
Title | Implicit Discourse Relation Recognition with Context-aware Character-enhanced Embeddings |
Authors | Lianhui Qin, Zhisong Zhang, Hai Zhao |
Abstract | For the task of implicit discourse relation recognition, traditional models utilizing manual features can suffer from data sparsity problem. Neural models provide a solution with distributed representations, which could encode the latent semantic information, and are suitable for recognizing semantic relations between argument pairs. However, conventional vector representations usually adopt embeddings at the word level and cannot well handle the rare word problem without carefully considering morphological information at character level. Moreover, embeddings are assigned to individual words independently, which lacks of the crucial contextual information. This paper proposes a neural model utilizing context-aware character-enhanced embeddings to alleviate the drawbacks of the current word level representation. Our experiments show that the enhanced embeddings work well and the proposed model obtains state-of-the-art results. |
Tasks | Machine Translation, Question Answering, Sentiment Analysis |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-1180/ |
https://www.aclweb.org/anthology/C16-1180 | |
PWC | https://paperswithcode.com/paper/implicit-discourse-relation-recognition-with |
Repo | |
Framework | |
What Papers Should I Cite from my Reading List? User Evaluation of a Manuscript Preparatory Assistive Task
Title | What Papers Should I Cite from my Reading List? User Evaluation of a Manuscript Preparatory Assistive Task |
Authors | Aravind Sesagiri Raamkumar, Schubert Foo, Natalie Pang |
Abstract | |
Tasks | Information Retrieval, Recommendation Systems |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-1507/ |
https://www.aclweb.org/anthology/W16-1507 | |
PWC | https://paperswithcode.com/paper/what-papers-should-i-cite-from-my-reading |
Repo | |
Framework | |
UWB at SemEval-2016 Task 11: Exploring Features for Complex Word Identification
Title | UWB at SemEval-2016 Task 11: Exploring Features for Complex Word Identification |
Authors | Michal Konkol |
Abstract | |
Tasks | Complex Word Identification, Text Simplification |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/S16-1162/ |
https://www.aclweb.org/anthology/S16-1162 | |
PWC | https://paperswithcode.com/paper/uwb-at-semeval-2016-task-11-exploring |
Repo | |
Framework | |
Learning Treewidth-Bounded Bayesian Networks with Thousands of Variables
Title | Learning Treewidth-Bounded Bayesian Networks with Thousands of Variables |
Authors | Mauro Scanagatta, Giorgio Corani, Cassio P. De Campos, Marco Zaffalon |
Abstract | We present a method for learning treewidth-bounded Bayesian networks from data sets containing thousands of variables. Bounding the treewidth of a Bayesian network greatly reduces the complexity of inferences. Yet, being a global property of the graph, it considerably increases the difficulty of the learning process. Our novel algorithm accomplishes this task, scaling both to large domains and to large treewidths. Our novel approach consistently outperforms the state of the art on experiments with up to thousands of variables. |
Tasks | |
Published | 2016-12-01 |
URL | http://papers.nips.cc/paper/6232-learning-treewidth-bounded-bayesian-networks-with-thousands-of-variables |
http://papers.nips.cc/paper/6232-learning-treewidth-bounded-bayesian-networks-with-thousands-of-variables.pdf | |
PWC | https://paperswithcode.com/paper/learning-treewidth-bounded-bayesian-networks |
Repo | |
Framework | |
D(H)ante: A New Set of Tools for XIII Century Italian
Title | D(H)ante: A New Set of Tools for XIII Century Italian |
Authors | Angelo Basile, Federico Sangati |
Abstract | In this paper we describe 1) the process of converting a corpus of Dante Alighieri from a TEI XML format in to a pseudo-CoNLL format; 2) how a pos-tagger trained on modern Italian performs on Dante{'}s Italian 3) the performances of two different pos-taggers trained on the given corpus. We are making our conversion scripts and models available to the community. The two other models trained on the corpus performs reasonably well. The tool used for the conversion process might turn useful for bridging the gap between traditional digital humanities and modern NLP applications since the TEI original format is not usually suitable for being processed with standard NLP tools. We believe our work will serve both communities: the DH community will be able to tag new documents and the NLP world will have an easier way in converting existing documents to a standardized machine-readable format. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1450/ |
https://www.aclweb.org/anthology/L16-1450 | |
PWC | https://paperswithcode.com/paper/dhante-a-new-set-of-tools-for-xiii-century |
Repo | |
Framework | |
Using the Web and Social Media as Corpora for Monitoring the Spread of Neologisms. The case of rapefugee',
rapeugee’, and `rapugee’.
Title | Using the Web and Social Media as Corpora for Monitoring the Spread of Neologisms. The case of rapefugee', rapeugee’, and `rapugee’. | |
Authors | Quirin W{"u}rschinger, Mohammad Fazleh Elahi, Desislava Zhekova, Hans-J{"o}rg Schmid |
Abstract | |
Tasks | |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2605/ |
https://www.aclweb.org/anthology/W16-2605 | |
PWC | https://paperswithcode.com/paper/using-the-web-and-social-media-as-corpora-for |
Repo | |
Framework | |
Detecting Uncertainty Cues in Hungarian Social Media Texts
Title | Detecting Uncertainty Cues in Hungarian Social Media Texts |
Authors | Veronika Vincze |
Abstract | In this paper, we aim at identifying uncertainty cues in Hungarian social media texts. We present our machine learning based uncertainty detector which is based on a rich features set including lexical, morphological, syntactic, semantic and discourse-based features, and we evaluate our system on a small set of manually annotated social media texts. We also carry out cross-domain and domain adaptation experiments using an annotated corpus of standard Hungarian texts and show that domain differences significantly affect machine learning. Furthermore, we argue that differences among uncertainty cue types may also affect the efficiency of uncertainty detection. |
Tasks | Domain Adaptation, Instance Search |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-5002/ |
https://www.aclweb.org/anthology/W16-5002 | |
PWC | https://paperswithcode.com/paper/detecting-uncertainty-cues-in-hungarian |
Repo | |
Framework | |
Automatic Identification of Suicide Notes from Linguistic and Sentiment Features
Title | Automatic Identification of Suicide Notes from Linguistic and Sentiment Features |
Authors | Annika Marie Schoene, Nina Dethlefs |
Abstract | |
Tasks | |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2116/ |
https://www.aclweb.org/anthology/W16-2116 | |
PWC | https://paperswithcode.com/paper/automatic-identification-of-suicide-notes |
Repo | |
Framework | |
Proceedings of the First Conference on Machine Translation: Volume 1, Research Papers
Title | Proceedings of the First Conference on Machine Translation: Volume 1, Research Papers |
Authors | |
Abstract | |
Tasks | Machine Translation |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2200/ |
https://www.aclweb.org/anthology/W16-2200 | |
PWC | https://paperswithcode.com/paper/proceedings-of-the-first-conference-on-2 |
Repo | |
Framework | |
Automatic discovery of Latin syntactic changes
Title | Automatic discovery of Latin syntactic changes |
Authors | Micha Elsner, Emily Lane |
Abstract | |
Tasks | Language Identification |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2120/ |
https://www.aclweb.org/anthology/W16-2120 | |
PWC | https://paperswithcode.com/paper/automatic-discovery-of-latin-syntactic |
Repo | |
Framework | |
The IMAGACT4ALL Ontology of Animated Images: Implications for Theoretical and Machine Translation of Action Verbs from English-Indian Languages
Title | The IMAGACT4ALL Ontology of Animated Images: Implications for Theoretical and Machine Translation of Action Verbs from English-Indian Languages |
Authors | Pitambar Behera, Sharmin Muzaffar, Atul Ku. Ojha, Girish Jha |
Abstract | Action verbs are one of the frequently occurring linguistic elements in any given natural language as the speakers use them during every linguistic intercourse. However, each language expresses action verbs in its own inherently unique manner by categorization. One verb can refer to several interpretations of actions and one action can be expressed by more than one verb. The inter-language and intra-language variations create ambiguity for the translation of languages from the source language to target language with respect to action verbs. IMAGACT is a corpus-based ontological platform of action verbs translated from prototypic animated images explained in English and Italian as meta-languages. In this paper, we are presenting the issues and challenges in translating action verbs of Indian languages as target and English as source language by observing the animated images. Among the ten Indian languages which have been annotated so far on the platform are Sanskrit, Hindi, Urdu, Odia (Oriya), Bengali, Manipuri, Tamil, Assamese, Magahi and Marathi. Out of them, Manipuri belongs to the Sino-Tibetan, Tamil comes off the Dravidian and the rest owe their genesis to the Indo-Aryan language family. One of the issues is that the one-word morphological English verbs are translated into most of the Indian languages as verbs having more than one-word form; for instance as in the case of conjunct, compound, serial verbs and so on. We are further presenting a cross-lingual comparison of action verbs among Indian languages. In addition, we are also dealing with the issues in disambiguating animated images by the L1 native speakers using competence-based judgements and the theoretical and machine translation implications they bear. |
Tasks | Machine Translation |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-3707/ |
https://www.aclweb.org/anthology/W16-3707 | |
PWC | https://paperswithcode.com/paper/the-imagact4all-ontology-of-animated-images |
Repo | |
Framework | |
Whodunit… and to Whom? Subjects, Objects, and Actions in Research Articles on American Labor Unions
Title | Whodunit… and to Whom? Subjects, Objects, and Actions in Research Articles on American Labor Unions |
Authors | Vilja Hulden |
Abstract | |
Tasks | Semantic Role Labeling |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2118/ |
https://www.aclweb.org/anthology/W16-2118 | |
PWC | https://paperswithcode.com/paper/whodunit-and-to-whom-subjects-objects-and |
Repo | |
Framework | |
Inferring Psycholinguistic Properties of Words
Title | Inferring Psycholinguistic Properties of Words |
Authors | Gustavo Paetzold, Lucia Specia |
Abstract | |
Tasks | Lexical Simplification, Reading Comprehension, Semantic Composition |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/N16-1050/ |
https://www.aclweb.org/anthology/N16-1050 | |
PWC | https://paperswithcode.com/paper/inferring-psycholinguistic-properties-of |
Repo | |
Framework | |
German NER with a Multilingual Rule Based Information Extraction System: Analysis and Issues
Title | German NER with a Multilingual Rule Based Information Extraction System: Analysis and Issues |
Authors | Anna Druzhkina, Alexey Leontyev, Maria Stepanova |
Abstract | |
Tasks | Named Entity Recognition |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2704/ |
https://www.aclweb.org/anthology/W16-2704 | |
PWC | https://paperswithcode.com/paper/german-ner-with-a-multilingual-rule-based |
Repo | |
Framework | |