May 4, 2019

1941 words 10 mins read

Paper Group NANR 162

Paper Group NANR 162

NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval. Kotonush: Understanding Concepts Based on Values behind Social Media. IDI@NTNU at SemEval-2016 Task 6: Detecting Stance in Tweets Using Shallow Features and GloVe Vectors for Word Representation. Compositional Distributional Mod …

NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval

Title NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval
Authors Sheng-Lun Wei, Yen-Pin Chiu, Hen-Hsen Huang, Hsin-Hsi Chen
Abstract Words to express relations in natural language (NL) statements may be different from those to represent properties in knowledge bases (KB). The vocabulary gap becomes barriers for knowledge base construction and retrieval. With the demo system called NL2KB in this paper, users can browse which properties in KB side may be mapped to for a given relational pattern in NL side. Besides, they can retrieve the sets of relational patterns in NL side for a given property in KB side. We describe how the mapping is established in detail. Although the mined patterns are used for Chinese knowledge base applications, the methodology can be extended to other languages.
Tasks Question Answering, Relation Extraction
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2059/
PDF https://www.aclweb.org/anthology/C16-2059
PWC https://paperswithcode.com/paper/nl2kb-resolving-vocabulary-gap-between
Repo
Framework

Kotonush: Understanding Concepts Based on Values behind Social Media

Title Kotonush: Understanding Concepts Based on Values behind Social Media
Authors Tatsuya Iwanari, Kohei Ohara, Naoki Yoshinaga, Nobuhiro Kaji, Masashi Toyoda, Masaru Kitsuregawa
Abstract Kotonush, a system that clarifies people{'}s values on various concepts on the basis of what they write about on social media, is presented. The values are represented by ordering sets of concepts (e.g., London, Berlin, and Rome) in accordance with a common attribute intensity expressed by an adjective (e.g., entertaining). We exploit social media text written by different demographics and at different times in order to induce specific orderings for comparison. The system combines a text-to-ordering module with an interactive querying interface enabled by massive hyponymy relations and provides mechanisms to compare the induced orderings from various viewpoints. We empirically evaluate Kotonush and present some case studies, featuring real-world concept orderings with different domains on Twitter, to demonstrate the usefulness of our system.
Tasks Aspect-Based Sentiment Analysis, Sentiment Analysis
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2061/
PDF https://www.aclweb.org/anthology/C16-2061
PWC https://paperswithcode.com/paper/kotonush-understanding-concepts-based-on
Repo
Framework

IDI@NTNU at SemEval-2016 Task 6: Detecting Stance in Tweets Using Shallow Features and GloVe Vectors for Word Representation

Title IDI@NTNU at SemEval-2016 Task 6: Detecting Stance in Tweets Using Shallow Features and GloVe Vectors for Word Representation
Authors Henrik B{\o}hler, Petter Asla, Erwin Marsi, Rune S{\ae}tre
Abstract
Tasks Sentiment Analysis, Stance Detection
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1072/
PDF https://www.aclweb.org/anthology/S16-1072
PWC https://paperswithcode.com/paper/idintnu-at-semeval-2016-task-6-detecting
Repo
Framework

Compositional Distributional Models of Meaning

Title Compositional Distributional Models of Meaning
Authors Mehrnoosh Sadrzadeh, Dimitri Kartsaklis
Abstract Compositional distributional models of meaning (CDMs) provide a function that produces a vectorial representation for a phrase or a sentence by composing the vectors of its words. Being the natural evolution of the traditional and well-studied distributional models at the word level, CDMs are steadily evolving to a popular and active area of NLP. This COLING 2016 tutorial aims at providing a concise introduction to this emerging field, presenting the different classes of CDMs and the various issues related to them in sufficient detail.
Tasks Machine Translation, Natural Language Inference, Sentiment Analysis
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-3001/
PDF https://www.aclweb.org/anthology/C16-3001
PWC https://paperswithcode.com/paper/compositional-distributional-models-of
Repo
Framework

Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives

Title Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives
Authors Jens Edlund, Joakim Gustafson
Abstract In 2014, the Swedish government tasked a Swedish agency, The Swedish Post and Telecom Authority (PTS), with investigating how to best create and populate an infrastructure for spoken language resources (Ref N2014/2840/ITP). As a part of this work, the department of Speech, Music and Hearing at KTH Royal Institute of Technology have taken inventory of existing potential spoken language resources, mainly in Swedish national archives and other governmental or public institutions. In this position paper, key priorities, perspectives, and strategies that may be of general, rather than Swedish, interest are presented. We discuss broad types of potential spoken language resources available; to what extent these resources are free to use; and thirdly the main contribution: strategies to ensure the continuous acquisition of spoken language resources in a manner that facilitates speech and speech technology research.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1717/
PDF https://www.aclweb.org/anthology/L16-1717
PWC https://paperswithcode.com/paper/hidden-resources-a-strategies-to-acquire-and
Repo
Framework

Universal Morphology for Old Hungarian

Title Universal Morphology for Old Hungarian
Authors Eszter Simon, Veronika Vincze
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2115/
PDF https://www.aclweb.org/anthology/W16-2115
PWC https://paperswithcode.com/paper/universal-morphology-for-old-hungarian
Repo
Framework

Translationese: Between Human and Machine Translation

Title Translationese: Between Human and Machine Translation
Authors Shuly Wintner
Abstract Translated texts, in any language, have unique characteristics that set them apart from texts originally written in the same language. Translation Studies is a research field that focuses on investigating these characteristics. Until recently, research in machine translation (MT) has been entirely divorced from translation studies. The main goal of this tutorial is to introduce some of the findings of translation studies to researchers interested mainly in machine translation, and to demonstrate that awareness to these findings can result in better, more accurate MT systems.
Tasks Language Identification, Machine Translation, Text Classification
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-3005/
PDF https://www.aclweb.org/anthology/C16-3005
PWC https://paperswithcode.com/paper/translationese-between-human-and-machine
Repo
Framework

Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts

Title Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts
Authors Yong Xu, Fran{\c{c}}ois Yvon
Abstract Resources for evaluating sentence-level and word-level alignment algorithms are unsatisfactory. Regarding sentence alignments, the existing data is too scarce, especially when it comes to difficult bitexts, containing instances of non-literal translations. Regarding word-level alignments, most available hand-aligned data provide a complete annotation at the level of words that is difficult to exploit, for lack of a clear semantics for alignment links. In this study, we propose new methodologies for collecting human judgements on alignment links, which have been used to annotate 4 new data sets, at the sentence and at the word level. These will be released online, with the hope that they will prove useful to evaluate alignment software and quality estimation tools for automatic alignment. Keywords: Parallel corpora, Sentence Alignments, Word Alignments, Confidence Estimation
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1099/
PDF https://www.aclweb.org/anthology/L16-1099
PWC https://paperswithcode.com/paper/novel-elicitation-and-annotation-schemes-for
Repo
Framework

On the Non-canonical Valency Filling

Title On the Non-canonical Valency Filling
Authors Igor Boguslavsky
Abstract Valency slot filling is a semantic glue, which brings together the meanings of words. As regards the position of an argument in the dependency structure with respect to its predicate, there exist three types of valency filling: active (canonical), passive, and discontinuous. Of these, the first type is studied much better than the other two. As a rule, canonical actants are unambiguously marked in the syntactic structure, and each actant corresponds to a unique syntactic position. Linguistic information on which syntactic function an actant might have (subject, direct or indirect object), what its morphological form should be and which prepositions or conjunctions it requires, can be given in the lexicon in the form of government patterns, subcategorization frames, or similar data structures. We concentrate on non-canonical cases of valency filling in Russian, which are characteristic of non-verbal parts of speech, such as adverbs, adjectives, and particles, in the first place. They are more difficult to handle than canonical ones, because the position of the actant in the tree is governed by more complicated rules. A valency may be filled by expressions occupying different syntactic positions, and a syntactic position may accept expressions filling different valencies of the same word. We show how these phenomena can be processed in a semantic analyzer.
Tasks Slot Filling
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3808/
PDF https://www.aclweb.org/anthology/W16-3808
PWC https://paperswithcode.com/paper/on-the-non-canonical-valency-filling
Repo
Framework

Succinct Data Structures for NLP-at-Scale

Title Succinct Data Structures for NLP-at-Scale
Authors Matthias Petri, Trevor Cohn
Abstract Succinct data structures involve the use of novel data structures, compression technologies, and other mechanisms to allow data to be stored in extremely small memory or disk footprints, while still allowing for efficient access to the underlying data. They have successfully been applied in areas such as Information Retrieval and Bioinformatics to create highly compressible in-memory search indexes which provide efficient search functionality over datasets which traditionally could only be processed using external memory data structures. Modern technologies in this space are not well known within the NLP community, but have the potential to revolutionise NLP, particularly the application to {`}big data{'} in the form of terabyte and larger corpora. This tutorial will present a practical introduction to the most important succinct data structures, tools, and applications with the intent of providing the researchers with a jump-start into this domain. The focus of this tutorial will be efficient text processing utilising space efficient representations of suffix arrays, suffix trees and searchable integer compression schemes with specific applications of succinct data structures to common NLP tasks such as $n$-gram language modelling. |
Tasks Information Retrieval, Language Modelling
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-3006/
PDF https://www.aclweb.org/anthology/C16-3006
PWC https://paperswithcode.com/paper/succinct-data-structures-for-nlp-at-scale
Repo
Framework

Monolingual Social Media Datasets for Detecting Contradiction and Entailment

Title Monolingual Social Media Datasets for Detecting Contradiction and Entailment
Authors Piroska Lendvai, Isabelle Augenstein, Kalina Bontcheva, Thierry Declerck
Abstract Entailment recognition approaches are useful for application domains such as information extraction, question answering or summarisation, for which evidence from multiple sentences needs to be combined. We report on a new 3-way judgement Recognizing Textual Entailment (RTE) resource that originates in the Social Media domain, and explain our semi-automatic creation method for the special purpose of information verification, which draws on manually established rumourous claims reported during crisis events. From about 500 English tweets related to 70 unique claims we compile and evaluate 5.4k RTE pairs, while continue automatizing the workflow to generate similar-sized datasets in other languages.
Tasks Natural Language Inference, Question Answering
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1729/
PDF https://www.aclweb.org/anthology/L16-1729
PWC https://paperswithcode.com/paper/monolingual-social-media-datasets-for
Repo
Framework

An Advanced Press Review System Combining Deep News Analysis and Machine Learning Algorithms

Title An Advanced Press Review System Combining Deep News Analysis and Machine Learning Algorithms
Authors Danuta Ploch, Andreas Lommatzsch, Florian Schultze
Abstract
Tasks Sentiment Analysis
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-4019/
PDF https://www.aclweb.org/anthology/P16-4019
PWC https://paperswithcode.com/paper/an-advanced-press-review-system-combining
Repo
Framework

MAGES: A Multilingual Angle-integrated Grouping-based Entity Summarization System

Title MAGES: A Multilingual Angle-integrated Grouping-based Entity Summarization System
Authors Eun-kyung Kim, Key-Sun Choi
Abstract This demo presents MAGES (multilingual angle-integrated grouping-based entity summarization), an entity summarization system for a large knowledge base such as DBpedia based on a entity-group-bound ranking in a single integrated entity space across multiple language-specific editions. MAGES offers a multilingual angle-integrated space model, which has the advantage of overcoming missing semantic tags (i.e., categories) caused by biases in different language communities, and can contribute to the creation of entity groups that are well-formed and more stable than the monolingual condition within it. MAGES can help people quickly identify the essential points of the entities when they search or browse a large volume of entity-centric data. Evaluation results on the same experimental data demonstrate that our system produces a better summary compared with other representative DBpedia entity summarization methods.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2043/
PDF https://www.aclweb.org/anthology/C16-2043
PWC https://paperswithcode.com/paper/mages-a-multilingual-angle-integrated
Repo
Framework

Model Architectures for Quotation Detection

Title Model Architectures for Quotation Detection
Authors Christian Scheible, Roman Klinger, Sebastian Pad{'o}
Abstract
Tasks Named Entity Recognition
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1164/
PDF https://www.aclweb.org/anthology/P16-1164
PWC https://paperswithcode.com/paper/model-architectures-for-quotation-detection
Repo
Framework

Phrase Structure Annotation and Parsing for Learner English

Title Phrase Structure Annotation and Parsing for Learner English
Authors Ryo Nagata, Keisuke Sakaguchi
Abstract
Tasks Grammatical Error Correction, Part-Of-Speech Tagging
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1173/
PDF https://www.aclweb.org/anthology/P16-1173
PWC https://paperswithcode.com/paper/phrase-structure-annotation-and-parsing-for
Repo
Framework
comments powered by Disqus