May 5, 2019

1661 words 8 mins read

Paper Group NANR 147

Paper Group NANR 147

Exploring Differential Topic Models for Comparative Summarization of Scientific Papers. Towards Semantic-based Hybrid Machine Translation between Bulgarian and English. Combining syntactic patterns and Wikipedia’s hierarchy of hyperlinks to extract meronym relations. Natural Language Generation through Character-based RNNs with Finite-state Prior K …

Exploring Differential Topic Models for Comparative Summarization of Scientific Papers

Title Exploring Differential Topic Models for Comparative Summarization of Scientific Papers
Authors Lei He, Wei Li, Hai Zhuge
Abstract This paper investigates differential topic models (dTM) for summarizing the differences among document groups. Starting from a simple probabilistic generative model, we propose dTM-SAGE that explicitly models the deviations on group-specific word distributions to indicate how words are used differen-tially across different document groups from a background word distribution. It is more effective to capture unique characteristics for comparing document groups. To generate dTM-based comparative summaries, we propose two sentence scoring methods for measuring the sentence discriminative capacity. Experimental results on scientific papers dataset show that our dTM-based comparative summari-zation methods significantly outperform the generic baselines and the state-of-the-art comparative summarization methods under ROUGE metrics.
Tasks Sentiment Analysis, Topic Models
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1098/
PDF https://www.aclweb.org/anthology/C16-1098
PWC https://paperswithcode.com/paper/exploring-differential-topic-models-for
Repo
Framework

Towards Semantic-based Hybrid Machine Translation between Bulgarian and English

Title Towards Semantic-based Hybrid Machine Translation between Bulgarian and English
Authors Kiril Simov, Petya Osenova, Alex Popov, er
Abstract
Tasks Common Sense Reasoning, Language Modelling, Machine Translation
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0604/
PDF https://www.aclweb.org/anthology/W16-0604
PWC https://paperswithcode.com/paper/towards-semantic-based-hybrid-machine
Repo
Framework
Title Combining syntactic patterns and Wikipedia’s hierarchy of hyperlinks to extract meronym relations
Authors Debela Tesfaye Gemechu, Michael Zock, Solomon Teferra
Abstract
Tasks Information Retrieval, Question Answering, Text Summarization
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-2005/
PDF https://www.aclweb.org/anthology/N16-2005
PWC https://paperswithcode.com/paper/combining-syntactic-patterns-and-wikipedias
Repo
Framework

Natural Language Generation through Character-based RNNs with Finite-state Prior Knowledge

Title Natural Language Generation through Character-based RNNs with Finite-state Prior Knowledge
Authors Raghav Goyal, Marc Dymetman, Eric Gaussier
Abstract Recently Wen et al. (2015) have proposed a Recurrent Neural Network (RNN) approach to the generation of utterances from dialog acts, and shown that although their model requires less effort to develop than a rule-based system, it is able to improve certain aspects of the utterances, in particular their naturalness. However their system employs generation at the word-level, which requires one to pre-process the data by substituting named entities with placeholders. This pre-processing prevents the model from handling some contextual effects and from managing multiple occurrences of the same attribute. Our approach uses a character-level model, which unlike the word-level model makes it possible to learn to {``}copy{''} information from the dialog act to the target without having to pre-process the input. In order to avoid generating non-words and inventing information not present in the input, we propose a method for incorporating prior knowledge into the RNN in the form of a weighted finite-state automaton over character sequences. Automatic and human evaluations show improved performance over baselines on several evaluation criteria. |
Tasks Language Modelling, Machine Translation, Named Entity Recognition, Text Generation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1103/
PDF https://www.aclweb.org/anthology/C16-1103
PWC https://paperswithcode.com/paper/natural-language-generation-through-character
Repo
Framework

The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input Conditions

Title The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input Conditions
Authors Joachim Daiber, Rob van der Goot
Abstract We introduce the Denoised Web Treebank: a treebank including a normalization layer and a corresponding evaluation metric for dependency parsing of noisy text, such as Tweets. This benchmark enables the evaluation of parser robustness as well as text normalization methods, including normalization as machine translation and unsupervised lexical normalization, directly on syntactic trees. Experiments show that text normalization together with a combination of domain-specific and generic part-of-speech taggers can lead to a significant improvement in parsing accuracy on this test set.
Tasks Dependency Parsing, Lexical Normalization, Machine Translation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1102/
PDF https://www.aclweb.org/anthology/L16-1102
PWC https://paperswithcode.com/paper/the-denoised-web-treebank-evaluating
Repo
Framework

Evaluation Strategies for Computational Construction Grammars

Title Evaluation Strategies for Computational Construction Grammars
Authors T{^a}nia Marques, Katrien Beuls
Abstract Despite the growing number of Computational Construction Grammar implementations, the field is still lacking evaluation methods to compare grammar fragments across different platforms. Moreover, the hand-crafted nature of most grammars requires profiling tools to understand the complex interactions between constructions of different types. This paper presents a number of evaluation measures, partially based on existing measures in the field of semantic parsing, that are especially relevant for reversible grammar formalisms. The measures are tested on a grammar fragment for European Portuguese clitic placement that is currently under development.
Tasks Semantic Parsing
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1108/
PDF https://www.aclweb.org/anthology/C16-1108
PWC https://paperswithcode.com/paper/evaluation-strategies-for-computational
Repo
Framework

Event Embeddings for Semantic Script Modeling

Title Event Embeddings for Semantic Script Modeling
Authors Ashutosh Modi
Abstract
Tasks Common Sense Reasoning
Published 2016-08-01
URL https://www.aclweb.org/anthology/K16-1008/
PDF https://www.aclweb.org/anthology/K16-1008
PWC https://paperswithcode.com/paper/event-embeddings-for-semantic-script-modeling
Repo
Framework

Semantic overfitting: what `world’ do we consider when evaluating disambiguation of text?

Title Semantic overfitting: what `world’ do we consider when evaluating disambiguation of text? |
Authors Filip Ilievski, Marten Postma, Piek Vossen
Abstract Semantic text processing faces the challenge of defining the relation between lexical expressions and the world to which they make reference within a period of time. It is unclear whether the current test sets used to evaluate disambiguation tasks are representative for the full complexity considering this time-anchored relation, resulting in semantic overfitting to a specific period and the frequent phenomena within. We conceptualize and formalize a set of metrics which evaluate this complexity of datasets. We provide evidence for their applicability on five different disambiguation tasks. To challenge semantic overfitting of disambiguation systems, we propose a time-based, metric-aware method for developing datasets in a systematic and semi-automated manner, as well as an event-based QA task.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1112/
PDF https://www.aclweb.org/anthology/C16-1112
PWC https://paperswithcode.com/paper/semantic-overfitting-what-world-do-we
Repo
Framework

Learning Linguistic Descriptors of User Roles in Online Communities

Title Learning Linguistic Descriptors of User Roles in Online Communities
Authors Alex Wang, William L. Hamilton, Jure Leskovec
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/W16-5610/
PDF https://www.aclweb.org/anthology/W16-5610
PWC https://paperswithcode.com/paper/learning-linguistic-descriptors-of-user-roles
Repo
Framework

NLDS-UCSC at SemEval-2016 Task 6: A Semi-Supervised Approach to Detecting Stance in Tweets

Title NLDS-UCSC at SemEval-2016 Task 6: A Semi-Supervised Approach to Detecting Stance in Tweets
Authors Amita Misra, Brian Ecker, H, Theodore leman, Nicolas Hahn, Marilyn Walker
Abstract
Tasks Sentiment Analysis
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1068/
PDF https://www.aclweb.org/anthology/S16-1068
PWC https://paperswithcode.com/paper/nlds-ucsc-at-semeval-2016-task-6-a-semi
Repo
Framework

Monday mornings are my fave :) #not Exploring the Automatic Recognition of Irony in English tweets

Title Monday mornings are my fave :) #not Exploring the Automatic Recognition of Irony in English tweets
Authors Cynthia Van Hee, Els Lefever, V{'e}ronique Hoste
Abstract Recognising and understanding irony is crucial for the improvement natural language processing tasks including sentiment analysis. In this study, we describe the construction of an English Twitter corpus and its annotation for irony based on a newly developed fine-grained annotation scheme. We also explore the feasibility of automatic irony recognition by exploiting a varied set of features including lexical, syntactic, sentiment and semantic (Word2Vec) information. Experiments on a held-out test set show that our irony classifier benefits from this combined information, yielding an F1-score of 67.66{%}. When explicit hashtag information like {#}irony is included in the data, the system even obtains an F1-score of 92.77{%}. A qualitative analysis of the output reveals that recognising irony that results from a polarity clash appears to be (much) more feasible than recognising other forms of ironic utterances (e.g., descriptions of situational irony).
Tasks Opinion Mining, Sentiment Analysis
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1257/
PDF https://www.aclweb.org/anthology/C16-1257
PWC https://paperswithcode.com/paper/monday-mornings-are-my-fave-not-exploring-the
Repo
Framework

High Accuracy Rule-based Question Classification using Question Syntax and Semantics

Title High Accuracy Rule-based Question Classification using Question Syntax and Semantics
Authors Harish Tayyar Madabushi, Mark Lee
Abstract We present in this paper a purely rule-based system for Question Classification which we divide into two parts: The first is the extraction of relevant words from a question by use of its structure, and the second is the classification of questions based on rules that associate these words to Concepts. We achieve an accuracy of 97.2{%}, close to a 6 point improvement over the previous State of the Art of 91.6{%}. Additionally, we believe that machine learning algorithms can be applied on top of this method to further improve accuracy.
Tasks Feature Selection, Question Answering, Text Classification
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1116/
PDF https://www.aclweb.org/anthology/C16-1116
PWC https://paperswithcode.com/paper/high-accuracy-rule-based-question
Repo
Framework

SemEval-2016 Task 1: Semantic Textual Similarity, Monolingual and Cross-Lingual Evaluation

Title SemEval-2016 Task 1: Semantic Textual Similarity, Monolingual and Cross-Lingual Evaluation
Authors Eneko Agirre, Carmen Banea, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Rada Mihalcea, German Rigau, Janyce Wiebe
Abstract
Tasks Machine Translation, Natural Language Inference, Question Answering, Semantic Textual Similarity
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1081/
PDF https://www.aclweb.org/anthology/S16-1081
PWC https://paperswithcode.com/paper/semeval-2016-task-1-semantic-textual
Repo
Framework

DLS@CU at SemEval-2016 Task 1: Supervised Models of Sentence Similarity

Title DLS@CU at SemEval-2016 Task 1: Supervised Models of Sentence Similarity
Authors Md Arafat Sultan, Steven Bethard, Tamara Sumner
Abstract
Tasks Machine Translation, Natural Language Inference, Question Answering, Semantic Textual Similarity, Text Summarization
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1099/
PDF https://www.aclweb.org/anthology/S16-1099
PWC https://paperswithcode.com/paper/dlscu-at-semeval-2016-task-1-supervised
Repo
Framework

Semantically Motivated Hebrew Verb-Noun Multi-Word Expressions Identification

Title Semantically Motivated Hebrew Verb-Noun Multi-Word Expressions Identification
Authors Chaya Liebeskind, Yaakov HaCohen-Kerner
Abstract Identification of Multi-Word Expressions (MWEs) lies at the heart of many natural language processing applications. In this research, we deal with a particular type of Hebrew MWEs, Verb-Noun MWEs (VN-MWEs), which combine a verb and a noun with or without other words. Most prior work on MWEs classification focused on linguistic and statistical information. In this paper, we claim that it is essential to utilize semantic information. To this end, we propose a semantically motivated indicator for classifying VN-MWE and define features that are related to various semantic spaces and combine them as features in a supervised classification framework. We empirically demonstrate that our semantic feature set yields better performance than the common linguistic and statistical feature sets and that combining semantic features contributes to the VN-MWEs identification task.
Tasks Information Retrieval, Machine Translation, Question Answering, Text Generation, Text Summarization, Word Sense Disambiguation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1118/
PDF https://www.aclweb.org/anthology/C16-1118
PWC https://paperswithcode.com/paper/semantically-motivated-hebrew-verb-noun-multi
Repo
Framework
comments powered by Disqus