Paper Group NANR 162
NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval. Kotonush: Understanding Concepts Based on Values behind Social Media. IDI@NTNU at SemEval-2016 Task 6: Detecting Stance in Tweets Using Shallow Features and GloVe Vectors for Word Representation. Compositional Distributional Mod …
NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval
Title | NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval |
Authors | Sheng-Lun Wei, Yen-Pin Chiu, Hen-Hsen Huang, Hsin-Hsi Chen |
Abstract | Words to express relations in natural language (NL) statements may be different from those to represent properties in knowledge bases (KB). The vocabulary gap becomes barriers for knowledge base construction and retrieval. With the demo system called NL2KB in this paper, users can browse which properties in KB side may be mapped to for a given relational pattern in NL side. Besides, they can retrieve the sets of relational patterns in NL side for a given property in KB side. We describe how the mapping is established in detail. Although the mined patterns are used for Chinese knowledge base applications, the methodology can be extended to other languages. |
Tasks | Question Answering, Relation Extraction |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-2059/ |
https://www.aclweb.org/anthology/C16-2059 | |
PWC | https://paperswithcode.com/paper/nl2kb-resolving-vocabulary-gap-between |
Repo | |
Framework | |
Kotonush: Understanding Concepts Based on Values behind Social Media
Title | Kotonush: Understanding Concepts Based on Values behind Social Media |
Authors | Tatsuya Iwanari, Kohei Ohara, Naoki Yoshinaga, Nobuhiro Kaji, Masashi Toyoda, Masaru Kitsuregawa |
Abstract | Kotonush, a system that clarifies people{'}s values on various concepts on the basis of what they write about on social media, is presented. The values are represented by ordering sets of concepts (e.g., London, Berlin, and Rome) in accordance with a common attribute intensity expressed by an adjective (e.g., entertaining). We exploit social media text written by different demographics and at different times in order to induce specific orderings for comparison. The system combines a text-to-ordering module with an interactive querying interface enabled by massive hyponymy relations and provides mechanisms to compare the induced orderings from various viewpoints. We empirically evaluate Kotonush and present some case studies, featuring real-world concept orderings with different domains on Twitter, to demonstrate the usefulness of our system. |
Tasks | Aspect-Based Sentiment Analysis, Sentiment Analysis |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-2061/ |
https://www.aclweb.org/anthology/C16-2061 | |
PWC | https://paperswithcode.com/paper/kotonush-understanding-concepts-based-on |
Repo | |
Framework | |
IDI@NTNU at SemEval-2016 Task 6: Detecting Stance in Tweets Using Shallow Features and GloVe Vectors for Word Representation
Title | IDI@NTNU at SemEval-2016 Task 6: Detecting Stance in Tweets Using Shallow Features and GloVe Vectors for Word Representation |
Authors | Henrik B{\o}hler, Petter Asla, Erwin Marsi, Rune S{\ae}tre |
Abstract | |
Tasks | Sentiment Analysis, Stance Detection |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/S16-1072/ |
https://www.aclweb.org/anthology/S16-1072 | |
PWC | https://paperswithcode.com/paper/idintnu-at-semeval-2016-task-6-detecting |
Repo | |
Framework | |
Compositional Distributional Models of Meaning
Title | Compositional Distributional Models of Meaning |
Authors | Mehrnoosh Sadrzadeh, Dimitri Kartsaklis |
Abstract | Compositional distributional models of meaning (CDMs) provide a function that produces a vectorial representation for a phrase or a sentence by composing the vectors of its words. Being the natural evolution of the traditional and well-studied distributional models at the word level, CDMs are steadily evolving to a popular and active area of NLP. This COLING 2016 tutorial aims at providing a concise introduction to this emerging field, presenting the different classes of CDMs and the various issues related to them in sufficient detail. |
Tasks | Machine Translation, Natural Language Inference, Sentiment Analysis |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-3001/ |
https://www.aclweb.org/anthology/C16-3001 | |
PWC | https://paperswithcode.com/paper/compositional-distributional-models-of |
Repo | |
Framework | |
Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives
Title | Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives |
Authors | Jens Edlund, Joakim Gustafson |
Abstract | In 2014, the Swedish government tasked a Swedish agency, The Swedish Post and Telecom Authority (PTS), with investigating how to best create and populate an infrastructure for spoken language resources (Ref N2014/2840/ITP). As a part of this work, the department of Speech, Music and Hearing at KTH Royal Institute of Technology have taken inventory of existing potential spoken language resources, mainly in Swedish national archives and other governmental or public institutions. In this position paper, key priorities, perspectives, and strategies that may be of general, rather than Swedish, interest are presented. We discuss broad types of potential spoken language resources available; to what extent these resources are free to use; and thirdly the main contribution: strategies to ensure the continuous acquisition of spoken language resources in a manner that facilitates speech and speech technology research. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1717/ |
https://www.aclweb.org/anthology/L16-1717 | |
PWC | https://paperswithcode.com/paper/hidden-resources-a-strategies-to-acquire-and |
Repo | |
Framework | |
Universal Morphology for Old Hungarian
Title | Universal Morphology for Old Hungarian |
Authors | Eszter Simon, Veronika Vincze |
Abstract | |
Tasks | |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2115/ |
https://www.aclweb.org/anthology/W16-2115 | |
PWC | https://paperswithcode.com/paper/universal-morphology-for-old-hungarian |
Repo | |
Framework | |
Translationese: Between Human and Machine Translation
Title | Translationese: Between Human and Machine Translation |
Authors | Shuly Wintner |
Abstract | Translated texts, in any language, have unique characteristics that set them apart from texts originally written in the same language. Translation Studies is a research field that focuses on investigating these characteristics. Until recently, research in machine translation (MT) has been entirely divorced from translation studies. The main goal of this tutorial is to introduce some of the findings of translation studies to researchers interested mainly in machine translation, and to demonstrate that awareness to these findings can result in better, more accurate MT systems. |
Tasks | Language Identification, Machine Translation, Text Classification |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-3005/ |
https://www.aclweb.org/anthology/C16-3005 | |
PWC | https://paperswithcode.com/paper/translationese-between-human-and-machine |
Repo | |
Framework | |
Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts
Title | Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts |
Authors | Yong Xu, Fran{\c{c}}ois Yvon |
Abstract | Resources for evaluating sentence-level and word-level alignment algorithms are unsatisfactory. Regarding sentence alignments, the existing data is too scarce, especially when it comes to difficult bitexts, containing instances of non-literal translations. Regarding word-level alignments, most available hand-aligned data provide a complete annotation at the level of words that is difficult to exploit, for lack of a clear semantics for alignment links. In this study, we propose new methodologies for collecting human judgements on alignment links, which have been used to annotate 4 new data sets, at the sentence and at the word level. These will be released online, with the hope that they will prove useful to evaluate alignment software and quality estimation tools for automatic alignment. Keywords: Parallel corpora, Sentence Alignments, Word Alignments, Confidence Estimation |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1099/ |
https://www.aclweb.org/anthology/L16-1099 | |
PWC | https://paperswithcode.com/paper/novel-elicitation-and-annotation-schemes-for |
Repo | |
Framework | |
On the Non-canonical Valency Filling
Title | On the Non-canonical Valency Filling |
Authors | Igor Boguslavsky |
Abstract | Valency slot filling is a semantic glue, which brings together the meanings of words. As regards the position of an argument in the dependency structure with respect to its predicate, there exist three types of valency filling: active (canonical), passive, and discontinuous. Of these, the first type is studied much better than the other two. As a rule, canonical actants are unambiguously marked in the syntactic structure, and each actant corresponds to a unique syntactic position. Linguistic information on which syntactic function an actant might have (subject, direct or indirect object), what its morphological form should be and which prepositions or conjunctions it requires, can be given in the lexicon in the form of government patterns, subcategorization frames, or similar data structures. We concentrate on non-canonical cases of valency filling in Russian, which are characteristic of non-verbal parts of speech, such as adverbs, adjectives, and particles, in the first place. They are more difficult to handle than canonical ones, because the position of the actant in the tree is governed by more complicated rules. A valency may be filled by expressions occupying different syntactic positions, and a syntactic position may accept expressions filling different valencies of the same word. We show how these phenomena can be processed in a semantic analyzer. |
Tasks | Slot Filling |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-3808/ |
https://www.aclweb.org/anthology/W16-3808 | |
PWC | https://paperswithcode.com/paper/on-the-non-canonical-valency-filling |
Repo | |
Framework | |
Succinct Data Structures for NLP-at-Scale
Title | Succinct Data Structures for NLP-at-Scale |
Authors | Matthias Petri, Trevor Cohn |
Abstract | Succinct data structures involve the use of novel data structures, compression technologies, and other mechanisms to allow data to be stored in extremely small memory or disk footprints, while still allowing for efficient access to the underlying data. They have successfully been applied in areas such as Information Retrieval and Bioinformatics to create highly compressible in-memory search indexes which provide efficient search functionality over datasets which traditionally could only be processed using external memory data structures. Modern technologies in this space are not well known within the NLP community, but have the potential to revolutionise NLP, particularly the application to {`}big data{'} in the form of terabyte and larger corpora. This tutorial will present a practical introduction to the most important succinct data structures, tools, and applications with the intent of providing the researchers with a jump-start into this domain. The focus of this tutorial will be efficient text processing utilising space efficient representations of suffix arrays, suffix trees and searchable integer compression schemes with specific applications of succinct data structures to common NLP tasks such as $n$-gram language modelling. | |
Tasks | Information Retrieval, Language Modelling |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-3006/ |
https://www.aclweb.org/anthology/C16-3006 | |
PWC | https://paperswithcode.com/paper/succinct-data-structures-for-nlp-at-scale |
Repo | |
Framework | |
Monolingual Social Media Datasets for Detecting Contradiction and Entailment
Title | Monolingual Social Media Datasets for Detecting Contradiction and Entailment |
Authors | Piroska Lendvai, Isabelle Augenstein, Kalina Bontcheva, Thierry Declerck |
Abstract | Entailment recognition approaches are useful for application domains such as information extraction, question answering or summarisation, for which evidence from multiple sentences needs to be combined. We report on a new 3-way judgement Recognizing Textual Entailment (RTE) resource that originates in the Social Media domain, and explain our semi-automatic creation method for the special purpose of information verification, which draws on manually established rumourous claims reported during crisis events. From about 500 English tweets related to 70 unique claims we compile and evaluate 5.4k RTE pairs, while continue automatizing the workflow to generate similar-sized datasets in other languages. |
Tasks | Natural Language Inference, Question Answering |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1729/ |
https://www.aclweb.org/anthology/L16-1729 | |
PWC | https://paperswithcode.com/paper/monolingual-social-media-datasets-for |
Repo | |
Framework | |
An Advanced Press Review System Combining Deep News Analysis and Machine Learning Algorithms
Title | An Advanced Press Review System Combining Deep News Analysis and Machine Learning Algorithms |
Authors | Danuta Ploch, Andreas Lommatzsch, Florian Schultze |
Abstract | |
Tasks | Sentiment Analysis |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/P16-4019/ |
https://www.aclweb.org/anthology/P16-4019 | |
PWC | https://paperswithcode.com/paper/an-advanced-press-review-system-combining |
Repo | |
Framework | |
MAGES: A Multilingual Angle-integrated Grouping-based Entity Summarization System
Title | MAGES: A Multilingual Angle-integrated Grouping-based Entity Summarization System |
Authors | Eun-kyung Kim, Key-Sun Choi |
Abstract | This demo presents MAGES (multilingual angle-integrated grouping-based entity summarization), an entity summarization system for a large knowledge base such as DBpedia based on a entity-group-bound ranking in a single integrated entity space across multiple language-specific editions. MAGES offers a multilingual angle-integrated space model, which has the advantage of overcoming missing semantic tags (i.e., categories) caused by biases in different language communities, and can contribute to the creation of entity groups that are well-formed and more stable than the monolingual condition within it. MAGES can help people quickly identify the essential points of the entities when they search or browse a large volume of entity-centric data. Evaluation results on the same experimental data demonstrate that our system produces a better summary compared with other representative DBpedia entity summarization methods. |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-2043/ |
https://www.aclweb.org/anthology/C16-2043 | |
PWC | https://paperswithcode.com/paper/mages-a-multilingual-angle-integrated |
Repo | |
Framework | |
Model Architectures for Quotation Detection
Title | Model Architectures for Quotation Detection |
Authors | Christian Scheible, Roman Klinger, Sebastian Pad{'o} |
Abstract | |
Tasks | Named Entity Recognition |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/P16-1164/ |
https://www.aclweb.org/anthology/P16-1164 | |
PWC | https://paperswithcode.com/paper/model-architectures-for-quotation-detection |
Repo | |
Framework | |
Phrase Structure Annotation and Parsing for Learner English
Title | Phrase Structure Annotation and Parsing for Learner English |
Authors | Ryo Nagata, Keisuke Sakaguchi |
Abstract | |
Tasks | Grammatical Error Correction, Part-Of-Speech Tagging |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/P16-1173/ |
https://www.aclweb.org/anthology/P16-1173 | |
PWC | https://paperswithcode.com/paper/phrase-structure-annotation-and-parsing-for |
Repo | |
Framework | |