May 4, 2019

1941 words 10 mins read

Paper Group NANR 162

NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval. Kotonush: Understanding Concepts Based on Values behind Social Media. IDI@NTNU at SemEval-2016 Task 6: Detecting Stance in Tweets Using Shallow Features and GloVe Vectors for Word Representation. Compositional Distributional Mod …

NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval


Title	NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval
Authors	Sheng-Lun Wei, Yen-Pin Chiu, Hen-Hsen Huang, Hsin-Hsi Chen
Abstract	Words to express relations in natural language (NL) statements may be different from those to represent properties in knowledge bases (KB). The vocabulary gap becomes barriers for knowledge base construction and retrieval. With the demo system called NL2KB in this paper, users can browse which properties in KB side may be mapped to for a given relational pattern in NL side. Besides, they can retrieve the sets of relational patterns in NL side for a given property in KB side. We describe how the mapping is established in detail. Although the mined patterns are used for Chinese knowledge base applications, the methodology can be extended to other languages.
Tasks	Question Answering, Relation Extraction
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-2059/
PDF	https://www.aclweb.org/anthology/C16-2059
PWC	https://paperswithcode.com/paper/nl2kb-resolving-vocabulary-gap-between
Repo
Framework


Title	Kotonush: Understanding Concepts Based on Values behind Social Media
Authors	Tatsuya Iwanari, Kohei Ohara, Naoki Yoshinaga, Nobuhiro Kaji, Masashi Toyoda, Masaru Kitsuregawa
Abstract	Kotonush, a system that clarifies people{'}s values on various concepts on the basis of what they write about on social media, is presented. The values are represented by ordering sets of concepts (e.g., London, Berlin, and Rome) in accordance with a common attribute intensity expressed by an adjective (e.g., entertaining). We exploit social media text written by different demographics and at different times in order to induce specific orderings for comparison. The system combines a text-to-ordering module with an interactive querying interface enabled by massive hyponymy relations and provides mechanisms to compare the induced orderings from various viewpoints. We empirically evaluate Kotonush and present some case studies, featuring real-world concept orderings with different domains on Twitter, to demonstrate the usefulness of our system.
Tasks	Aspect-Based Sentiment Analysis, Sentiment Analysis
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-2061/
PDF	https://www.aclweb.org/anthology/C16-2061
PWC	https://paperswithcode.com/paper/kotonush-understanding-concepts-based-on
Repo
Framework

IDI@NTNU at SemEval-2016 Task 6: Detecting Stance in Tweets Using Shallow Features and GloVe Vectors for Word Representation


Title	IDI@NTNU at SemEval-2016 Task 6: Detecting Stance in Tweets Using Shallow Features and GloVe Vectors for Word Representation
Authors	Henrik B{\o}hler, Petter Asla, Erwin Marsi, Rune S{\ae}tre
Abstract
Tasks	Sentiment Analysis, Stance Detection
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1072/
PDF	https://www.aclweb.org/anthology/S16-1072
PWC	https://paperswithcode.com/paper/idintnu-at-semeval-2016-task-6-detecting
Repo
Framework

Compositional Distributional Models of Meaning


Title	Compositional Distributional Models of Meaning
Authors	Mehrnoosh Sadrzadeh, Dimitri Kartsaklis
Abstract	Compositional distributional models of meaning (CDMs) provide a function that produces a vectorial representation for a phrase or a sentence by composing the vectors of its words. Being the natural evolution of the traditional and well-studied distributional models at the word level, CDMs are steadily evolving to a popular and active area of NLP. This COLING 2016 tutorial aims at providing a concise introduction to this emerging field, presenting the different classes of CDMs and the various issues related to them in sufficient detail.
Tasks	Machine Translation, Natural Language Inference, Sentiment Analysis
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-3001/
PDF	https://www.aclweb.org/anthology/C16-3001
PWC	https://paperswithcode.com/paper/compositional-distributional-models-of
Repo
Framework

Hidden Resources â€• Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives


Title	Hidden Resources â€• Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives
Authors	Jens Edlund, Joakim Gustafson
Abstract	In 2014, the Swedish government tasked a Swedish agency, The Swedish Post and Telecom Authority (PTS), with investigating how to best create and populate an infrastructure for spoken language resources (Ref N2014/2840/ITP). As a part of this work, the department of Speech, Music and Hearing at KTH Royal Institute of Technology have taken inventory of existing potential spoken language resources, mainly in Swedish national archives and other governmental or public institutions. In this position paper, key priorities, perspectives, and strategies that may be of general, rather than Swedish, interest are presented. We discuss broad types of potential spoken language resources available; to what extent these resources are free to use; and thirdly the main contribution: strategies to ensure the continuous acquisition of spoken language resources in a manner that facilitates speech and speech technology research.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1717/
PDF	https://www.aclweb.org/anthology/L16-1717
PWC	https://paperswithcode.com/paper/hidden-resources-a-strategies-to-acquire-and
Repo
Framework

Universal Morphology for Old Hungarian


Title	Universal Morphology for Old Hungarian
Authors	Eszter Simon, Veronika Vincze
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2115/
PDF	https://www.aclweb.org/anthology/W16-2115
PWC	https://paperswithcode.com/paper/universal-morphology-for-old-hungarian
Repo
Framework

Translationese: Between Human and Machine Translation


Title	Translationese: Between Human and Machine Translation
Authors	Shuly Wintner
Abstract	Translated texts, in any language, have unique characteristics that set them apart from texts originally written in the same language. Translation Studies is a research field that focuses on investigating these characteristics. Until recently, research in machine translation (MT) has been entirely divorced from translation studies. The main goal of this tutorial is to introduce some of the findings of translation studies to researchers interested mainly in machine translation, and to demonstrate that awareness to these findings can result in better, more accurate MT systems.
Tasks	Language Identification, Machine Translation, Text Classification
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-3005/
PDF	https://www.aclweb.org/anthology/C16-3005
PWC	https://paperswithcode.com/paper/translationese-between-human-and-machine
Repo
Framework

Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts


Title	Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts
Authors	Yong Xu, Fran{\c{c}}ois Yvon
Abstract	Resources for evaluating sentence-level and word-level alignment algorithms are unsatisfactory. Regarding sentence alignments, the existing data is too scarce, especially when it comes to difficult bitexts, containing instances of non-literal translations. Regarding word-level alignments, most available hand-aligned data provide a complete annotation at the level of words that is difficult to exploit, for lack of a clear semantics for alignment links. In this study, we propose new methodologies for collecting human judgements on alignment links, which have been used to annotate 4 new data sets, at the sentence and at the word level. These will be released online, with the hope that they will prove useful to evaluate alignment software and quality estimation tools for automatic alignment. Keywords: Parallel corpora, Sentence Alignments, Word Alignments, Confidence Estimation
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1099/
PDF	https://www.aclweb.org/anthology/L16-1099
PWC	https://paperswithcode.com/paper/novel-elicitation-and-annotation-schemes-for
Repo
Framework

On the Non-canonical Valency Filling


Title	On the Non-canonical Valency Filling
Authors	Igor Boguslavsky
Abstract	Valency slot filling is a semantic glue, which brings together the meanings of words. As regards the position of an argument in the dependency structure with respect to its predicate, there exist three types of valency filling: active (canonical), passive, and discontinuous. Of these, the first type is studied much better than the other two. As a rule, canonical actants are unambiguously marked in the syntactic structure, and each actant corresponds to a unique syntactic position. Linguistic information on which syntactic function an actant might have (subject, direct or indirect object), what its morphological form should be and which prepositions or conjunctions it requires, can be given in the lexicon in the form of government patterns, subcategorization frames, or similar data structures. We concentrate on non-canonical cases of valency filling in Russian, which are characteristic of non-verbal parts of speech, such as adverbs, adjectives, and particles, in the first place. They are more difficult to handle than canonical ones, because the position of the actant in the tree is governed by more complicated rules. A valency may be filled by expressions occupying different syntactic positions, and a syntactic position may accept expressions filling different valencies of the same word. We show how these phenomena can be processed in a semantic analyzer.
Tasks	Slot Filling
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3808/
PDF	https://www.aclweb.org/anthology/W16-3808
PWC	https://paperswithcode.com/paper/on-the-non-canonical-valency-filling
Repo
Framework

Succinct Data Structures for NLP-at-Scale


Title	Succinct Data Structures for NLP-at-Scale
Authors	Matthias Petri, Trevor Cohn
Abstract	Succinct data structures involve the use of novel data structures, compression technologies, and other mechanisms to allow data to be stored in extremely small memory or disk footprints, while still allowing for efficient access to the underlying data. They have successfully been applied in areas such as Information Retrieval and Bioinformatics to create highly compressible in-memory search indexes which provide efficient search functionality over datasets which traditionally could only be processed using external memory data structures. Modern technologies in this space are not well known within the NLP community, but have the potential to revolutionise NLP, particularly the application to {`}big data{'} in the form of terabyte and larger corpora. This tutorial will present a practical introduction to the most important succinct data structures, tools, and applications with the intent of providing the researchers with a jump-start into this domain. The focus of this tutorial will be efficient text processing utilising space efficient representations of suffix arrays, suffix trees and searchable integer compression schemes with specific applications of succinct data structures to common NLP tasks such as $n$-gram language modelling. \|
Tasks	Information Retrieval, Language Modelling
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-3006/
PDF	https://www.aclweb.org/anthology/C16-3006
PWC	https://paperswithcode.com/paper/succinct-data-structures-for-nlp-at-scale
Repo
Framework


Title	Monolingual Social Media Datasets for Detecting Contradiction and Entailment
Authors	Piroska Lendvai, Isabelle Augenstein, Kalina Bontcheva, Thierry Declerck
Abstract	Entailment recognition approaches are useful for application domains such as information extraction, question answering or summarisation, for which evidence from multiple sentences needs to be combined. We report on a new 3-way judgement Recognizing Textual Entailment (RTE) resource that originates in the Social Media domain, and explain our semi-automatic creation method for the special purpose of information verification, which draws on manually established rumourous claims reported during crisis events. From about 500 English tweets related to 70 unique claims we compile and evaluate 5.4k RTE pairs, while continue automatizing the workflow to generate similar-sized datasets in other languages.
Tasks	Natural Language Inference, Question Answering
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1729/
PDF	https://www.aclweb.org/anthology/L16-1729
PWC	https://paperswithcode.com/paper/monolingual-social-media-datasets-for
Repo
Framework

An Advanced Press Review System Combining Deep News Analysis and Machine Learning Algorithms


Title	An Advanced Press Review System Combining Deep News Analysis and Machine Learning Algorithms
Authors	Danuta Ploch, Andreas Lommatzsch, Florian Schultze
Abstract
Tasks	Sentiment Analysis
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-4019/
PDF	https://www.aclweb.org/anthology/P16-4019
PWC	https://paperswithcode.com/paper/an-advanced-press-review-system-combining
Repo
Framework

MAGES: A Multilingual Angle-integrated Grouping-based Entity Summarization System


Title	MAGES: A Multilingual Angle-integrated Grouping-based Entity Summarization System
Authors	Eun-kyung Kim, Key-Sun Choi
Abstract	This demo presents MAGES (multilingual angle-integrated grouping-based entity summarization), an entity summarization system for a large knowledge base such as DBpedia based on a entity-group-bound ranking in a single integrated entity space across multiple language-specific editions. MAGES offers a multilingual angle-integrated space model, which has the advantage of overcoming missing semantic tags (i.e., categories) caused by biases in different language communities, and can contribute to the creation of entity groups that are well-formed and more stable than the monolingual condition within it. MAGES can help people quickly identify the essential points of the entities when they search or browse a large volume of entity-centric data. Evaluation results on the same experimental data demonstrate that our system produces a better summary compared with other representative DBpedia entity summarization methods.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-2043/
PDF	https://www.aclweb.org/anthology/C16-2043
PWC	https://paperswithcode.com/paper/mages-a-multilingual-angle-integrated
Repo
Framework

Model Architectures for Quotation Detection


Title	Model Architectures for Quotation Detection
Authors	Christian Scheible, Roman Klinger, Sebastian Pad{'o}
Abstract
Tasks	Named Entity Recognition
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1164/
PDF	https://www.aclweb.org/anthology/P16-1164
PWC	https://paperswithcode.com/paper/model-architectures-for-quotation-detection
Repo
Framework

Phrase Structure Annotation and Parsing for Learner English


Title	Phrase Structure Annotation and Parsing for Learner English
Authors	Ryo Nagata, Keisuke Sakaguchi
Abstract
Tasks	Grammatical Error Correction, Part-Of-Speech Tagging
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1173/
PDF	https://www.aclweb.org/anthology/P16-1173
PWC	https://paperswithcode.com/paper/phrase-structure-annotation-and-parsing-for
Repo
Framework