Paper Group NANR 166
CRST: a Claim Retrieval System in Twitter. ArgumenText: Searching for Arguments in Heterogeneous Sources. DeepAlignment: Unsupervised Ontology Matching with Refined Word Vectors. Using the Nunavut Hansard Data for Experiments in Morphological Analysis and Machine Translation. Interactive Instance-based Evaluation of Knowledge Base Question Answerin …
CRST: a Claim Retrieval System in Twitter
Title | CRST: a Claim Retrieval System in Twitter |
Authors | Wenjia Ma, WenHan Chao, Zhunchen Luo, Xin Jiang |
Abstract | For controversial topics, collecting argumentation-containing tweets which tend to be more convincing will help researchers analyze public opinions. Meanwhile, claim is the heart of argumentation. Hence, we present the first real-time claim retrieval system CRST that retrieves tweets containing claims for a given topic from Twitter. We propose a claim-oriented ranking module which can be divided into the offline topic-independent learning to rank model and the online topic-dependent lexicon model. Our system outperforms previous claim retrieval system and argument mining system. Moreover, the claim-oriented ranking module can be easily adapted to new topics without any manual process or external information, guaranteeing the practicability of our system. |
Tasks | Argument Mining, Learning-To-Rank |
Published | 2018-08-01 |
URL | https://www.aclweb.org/anthology/C18-2010/ |
https://www.aclweb.org/anthology/C18-2010 | |
PWC | https://paperswithcode.com/paper/crst-a-claim-retrieval-system-in-twitter |
Repo | |
Framework | |
ArgumenText: Searching for Arguments in Heterogeneous Sources
Title | ArgumenText: Searching for Arguments in Heterogeneous Sources |
Authors | Christian Stab, Johannes Daxenberger, Chris Stahlhut, Tristan Miller, Benjamin Schiller, Christopher Tauchmann, Steffen Eger, Iryna Gurevych |
Abstract | Argument mining is a core technology for enabling argument search in large corpora. However, most current approaches fall short when applied to heterogeneous texts. In this paper, we present an argument retrieval system capable of retrieving sentential arguments for any given controversial topic. By analyzing the highest-ranked results extracted from Web sources, we found that our system covers 89{%} of arguments found in expert-curated lists of arguments from an online debate portal, and also identifies additional valid arguments. |
Tasks | Argument Mining, Information Retrieval, Question Answering |
Published | 2018-06-01 |
URL | https://www.aclweb.org/anthology/N18-5005/ |
https://www.aclweb.org/anthology/N18-5005 | |
PWC | https://paperswithcode.com/paper/argumentext-searching-for-arguments-in |
Repo | |
Framework | |
DeepAlignment: Unsupervised Ontology Matching with Refined Word Vectors
Title | DeepAlignment: Unsupervised Ontology Matching with Refined Word Vectors |
Authors | Prodromos Kolyvakis, Alex Kalousis, ros, Dimitris Kiritsis |
Abstract | Ontologies compartmentalize types and relations in a target domain and provide the semantic backbone needed for a plethora of practical applications. Very often different ontologies are developed independently for the same domain. Such {``}parallel{''} ontologies raise the need for a process that will establish alignments between their entities in order to unify and extend the existing knowledge. In this work, we present a novel entity alignment method which we dub DeepAlignment. DeepAlignment refines pre-trained word vectors aiming at deriving ontological entity descriptions which are tailored to the ontology matching task. The absence of explicit information relevant to the ontology matching task during the refinement process makes DeepAlignment completely unsupervised. We empirically evaluate our method using standard ontology matching benchmarks. We present significant performance improvements over the current state-of-the-art, demonstrating the advantages that representation learning techniques bring to ontology matching. | |
Tasks | Entity Alignment, Feature Engineering, Representation Learning |
Published | 2018-06-01 |
URL | https://www.aclweb.org/anthology/N18-1072/ |
https://www.aclweb.org/anthology/N18-1072 | |
PWC | https://paperswithcode.com/paper/deepalignment-unsupervised-ontology-matching |
Repo | |
Framework | |
Using the Nunavut Hansard Data for Experiments in Morphological Analysis and Machine Translation
Title | Using the Nunavut Hansard Data for Experiments in Morphological Analysis and Machine Translation |
Authors | Jeffrey Micher |
Abstract | Inuktitut is a polysynthetic language spoken in Northern Canada and is one of the official languages of the Canadian territory of Nunavut. As such, the Nunavut Legislature publishes all of its proceedings in parallel English and Inuktitut. Several parallel English-Inuktitut corpora from these proceedings have been created from these data and are publically available. The corpus used for current experiments is described. Morphological processing of one of these corpora was carried out and details about the processing are provided. Then, the processed corpus was used in morphological analysis and machine translation (MT) experiments. The morphological analysis experiments aimed to improve the coverage of morphological processing of the corpus, and compare an additional experimental condition to previously published results. The machine translation experiments made use of the additional morphologically analyzed word types in a statistical machine translation system designed to translate to and from Inuktitut morphemes. Results are reported and next steps are defined. |
Tasks | Machine Translation, Morphological Analysis |
Published | 2018-08-01 |
URL | https://www.aclweb.org/anthology/W18-4807/ |
https://www.aclweb.org/anthology/W18-4807 | |
PWC | https://paperswithcode.com/paper/using-the-nunavut-hansard-data-for |
Repo | |
Framework | |
Interactive Instance-based Evaluation of Knowledge Base Question Answering
Title | Interactive Instance-based Evaluation of Knowledge Base Question Answering |
Authors | Daniil Sorokin, Iryna Gurevych |
Abstract | Most approaches to Knowledge Base Question Answering are based on semantic parsing. In this paper, we present a tool that aids in debugging of question answering systems that construct a structured semantic representation for the input question. Previous work has largely focused on building question answering interfaces or evaluation frameworks that unify multiple data sets. The primary objective of our system is to enable interactive debugging of model predictions on individual instances (questions) and to simplify manual error analysis. Our interactive interface helps researchers to understand the shortcomings of a particular model, qualitatively analyze the complete pipeline and compare different models. A set of sit-by sessions was used to validate our interface design. |
Tasks | Entity Linking, Knowledge Base Question Answering, Question Answering, Semantic Parsing |
Published | 2018-11-01 |
URL | https://www.aclweb.org/anthology/D18-2020/ |
https://www.aclweb.org/anthology/D18-2020 | |
PWC | https://paperswithcode.com/paper/interactive-instance-based-evaluation-of |
Repo | |
Framework | |
Towards making NLG a voice for interpretable Machine Learning
Title | Towards making NLG a voice for interpretable Machine Learning |
Authors | James Forrest, Somayajulu Sripada, Wei Pang, George Coghill |
Abstract | This paper presents a study to understand the issues related to using NLG to humanise explanations from a popular interpretable machine learning framework called LIME. Our study shows that self-reported rating of NLG explanation was higher than that for a non-NLG explanation. However, when tested for comprehension, the results were not as clear-cut showing the need for performing more studies to uncover the factors responsible for high-quality NLG explanations. |
Tasks | Interpretable Machine Learning, Text Generation |
Published | 2018-11-01 |
URL | https://www.aclweb.org/anthology/W18-6522/ |
https://www.aclweb.org/anthology/W18-6522 | |
PWC | https://paperswithcode.com/paper/towards-making-nlg-a-voice-for-interpretable |
Repo | |
Framework | |
An Adaption of BIOASQ Question Answering dataset for Machine Reading systems by Manual Annotations of Answer Spans.
Title | An Adaption of BIOASQ Question Answering dataset for Machine Reading systems by Manual Annotations of Answer Spans. |
Authors | Sanjay Kamath, Brigitte Grau, Yue Ma |
Abstract | BIOASQ Task B Phase B challenge focuses on extracting answers from snippets for a given question. The dataset provided by the organizers contains answers, but not all their variants. Henceforth a manual annotation was performed to extract all forms of correct answers. This article shows the impact of using all occurrences of correct answers for training on the evaluation scores which are improved significantly. |
Tasks | Domain Adaptation, Question Answering, Reading Comprehension |
Published | 2018-11-01 |
URL | https://www.aclweb.org/anthology/W18-5309/ |
https://www.aclweb.org/anthology/W18-5309 | |
PWC | https://paperswithcode.com/paper/an-adaption-of-bioasq-question-answering |
Repo | |
Framework | |
Predicting Japanese Word Order in Double Object Constructions
Title | Predicting Japanese Word Order in Double Object Constructions |
Authors | Masayuki Asahara, Satoshi Nambu, Shin-Ichiro Sano |
Abstract | This paper presents a statistical model to predict Japanese word order in the double object constructions. We employed a Bayesian linear mixed model with manually annotated predicate-argument structure data. The findings from the refined corpus analysis confirmed the effects of information status of an NP as {}givennew ordering{'} in addition to the effects of { }long-before-short{'} as a tendency of the general Japanese word order. |
Tasks | |
Published | 2018-07-01 |
URL | https://www.aclweb.org/anthology/W18-2805/ |
https://www.aclweb.org/anthology/W18-2805 | |
PWC | https://paperswithcode.com/paper/predicting-japanese-word-order-in-double |
Repo | |
Framework | |
Automatic Detection of Cross-Disciplinary Knowledge Associations
Title | Automatic Detection of Cross-Disciplinary Knowledge Associations |
Authors | Menasha Thilakaratne, Katrina Falkner, Thushari Atapattu |
Abstract | Detecting interesting, cross-disciplinary knowledge associations hidden in scientific publications can greatly assist scientists to formulate and validate scientifically sensible novel research hypotheses. This will also introduce new areas of research that can be successfully linked with their research discipline. Currently, this process is mostly performed manually by exploring the scientific publications, requiring a substantial amount of time and effort. Due to the exponential growth of scientific literature, it has become almost impossible for a scientist to keep track of all research advances. As a result, scientists tend to deal with fragments of the literature according to their specialisation. Consequently, important and hidden associations among these fragmented knowledge that can be linked to produce significant scientific discoveries remain unnoticed. This doctoral work aims to develop a novel knowledge discovery approach that suggests most promising research pathways by analysing the existing scientific literature. |
Tasks | |
Published | 2018-07-01 |
URL | https://www.aclweb.org/anthology/P18-3007/ |
https://www.aclweb.org/anthology/P18-3007 | |
PWC | https://paperswithcode.com/paper/automatic-detection-of-cross-disciplinary |
Repo | |
Framework | |
Designing and testing the messages produced by a virtual dietitian
Title | Designing and testing the messages produced by a virtual dietitian |
Authors | Luca Anselma, Aless Mazzei, ro |
Abstract | This paper presents a project about the automatic generation of persuasive messages in the context of the diet management. In the first part of the paper we introduce the basic mechanisms related to data interpretation and content selection for a numerical data-to-text generation architecture. In the second part of the paper we discuss a number of factors influencing the design of the messages. In particular, we consider the design of the aggregation procedure. Finally, we present the results of a human-based evaluation concerning this design factor. |
Tasks | Data-to-Text Generation, Text Generation |
Published | 2018-11-01 |
URL | https://www.aclweb.org/anthology/W18-6531/ |
https://www.aclweb.org/anthology/W18-6531 | |
PWC | https://paperswithcode.com/paper/designing-and-testing-the-messages-produced |
Repo | |
Framework | |
Frustratingly Easy Model Ensemble for Abstractive Summarization
Title | Frustratingly Easy Model Ensemble for Abstractive Summarization |
Authors | Hayato Kobayashi |
Abstract | Ensemble methods, which combine multiple models at decoding time, are now widely known to be effective for text-generation tasks. However, they generally increase computational costs, and thus, there have been many studies on compressing or distilling ensemble models. In this paper, we propose an alternative, simple but effective unsupervised ensemble method, \textit{post-ensemble}, that combines multiple models by selecting a majority-like output in post-processing. We theoretically prove that our method is closely related to kernel density estimation based on the von Mises-Fisher kernel. Experimental results on a news-headline-generation task show that the proposed method performs better than the current ensemble methods. |
Tasks | Abstractive Text Summarization, Density Estimation, Model Compression, Question Answering, Text Generation |
Published | 2018-10-01 |
URL | https://www.aclweb.org/anthology/D18-1449/ |
https://www.aclweb.org/anthology/D18-1449 | |
PWC | https://paperswithcode.com/paper/frustratingly-easy-model-ensemble-for |
Repo | |
Framework | |
Enhancement of Encoder and Attention Using Target Monolingual Corpora in Neural Machine Translation
Title | Enhancement of Encoder and Attention Using Target Monolingual Corpora in Neural Machine Translation |
Authors | Kenji Imamura, Atsushi Fujita, Eiichiro Sumita |
Abstract | A large-scale parallel corpus is required to train encoder-decoder neural machine translation. The method of using synthetic parallel texts, in which target monolingual corpora are automatically translated into source sentences, is effective in improving the decoder, but is unreliable for enhancing the encoder. In this paper, we propose a method that enhances the encoder and attention using target monolingual corpora by generating multiple source sentences via sampling. By using multiple source sentences, diversity close to that of humans is achieved. Our experimental results show that the translation quality is improved by increasing the number of synthetic source sentences for each given target sentence, and quality close to that using a manually created parallel corpus was achieved. |
Tasks | Language Modelling, Machine Translation |
Published | 2018-07-01 |
URL | https://www.aclweb.org/anthology/W18-2707/ |
https://www.aclweb.org/anthology/W18-2707 | |
PWC | https://paperswithcode.com/paper/enhancement-of-encoder-and-attention-using |
Repo | |
Framework | |
Technology Showcase and Presentations
Title | Technology Showcase and Presentations |
Authors | Jennifer DeCamp |
Abstract | |
Tasks | |
Published | 2018-03-01 |
URL | https://www.aclweb.org/anthology/W18-1904/ |
https://www.aclweb.org/anthology/W18-1904 | |
PWC | https://paperswithcode.com/paper/technology-showcase-and-presentations |
Repo | |
Framework | |
A Master-Apprentice Approach to Automatic Creation of Culturally Satirical Movie Titles
Title | A Master-Apprentice Approach to Automatic Creation of Culturally Satirical Movie Titles |
Authors | Khalid Alnajjar, Mika H{"a}m{"a}l{"a}inen |
Abstract | Satire has played a role in indirectly expressing critique towards an authority or a person from time immemorial. We present an autonomously creative master-apprentice approach consisting of a genetic algorithm and an NMT model to produce humorous and culturally apt satire out of movie titles automatically. Furthermore, we evaluate the approach in terms of its creativity and its output. We provide a solid definition for creativity to maximize the objectiveness of the evaluation. |
Tasks | Text Generation |
Published | 2018-11-01 |
URL | https://www.aclweb.org/anthology/W18-6534/ |
https://www.aclweb.org/anthology/W18-6534 | |
PWC | https://paperswithcode.com/paper/a-master-apprentice-approach-to-automatic |
Repo | |
Framework | |
Training, feedback and productivity measurement with NMT and Adaptive MT
Title | Training, feedback and productivity measurement with NMT and Adaptive MT |
Authors | Jean-Luc Saillard |
Abstract | |
Tasks | |
Published | 2018-03-01 |
URL | https://www.aclweb.org/anthology/W18-1906/ |
https://www.aclweb.org/anthology/W18-1906 | |
PWC | https://paperswithcode.com/paper/training-feedback-and-productivity |
Repo | |
Framework | |