May 4, 2019

1819 words 9 mins read

Paper Group NANR 219

Paper Group NANR 219

Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data. The ADAPT Bilingual Document Alignment system at WMT16. Yes, We Care! Results of the Ethics and Natural Language Processing Surveys. Improving Dependency Parsing Using Sentence Clause Charts. Enlarging Scarce In-domain …

Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data

Title Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data
Authors {'E}va M{'u}jdricza-Maydt, Silvana Hartmann, Iryna Gurevych, Anette Frank
Abstract We present a VerbNet-based annotation scheme for semantic roles that we explore in an annotation study on German language data that combines word sense and semantic role annotation. We reannotate a substantial portion of the SALSA corpus with GermaNet senses and a revised scheme of VerbNet roles. We provide a detailed evaluation of the interaction between sense and role annotation. The resulting corpus will allow us to compare VerbNet role annotation for German to FrameNet and PropBank annotation by mapping to existing role annotations on the SALSA corpus. We publish the annotated corpus and detailed guidelines for the new role annotation scheme.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1484/
PDF https://www.aclweb.org/anthology/L16-1484
PWC https://paperswithcode.com/paper/combining-semantic-annotation-of-word-sense
Repo
Framework

The ADAPT Bilingual Document Alignment system at WMT16

Title The ADAPT Bilingual Document Alignment system at WMT16
Authors Pintu Lohar, Haithem Afli, Chao-Hong Liu, Andy Way
Abstract
Tasks Domain Adaptation, Machine Translation
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2372/
PDF https://www.aclweb.org/anthology/W16-2372
PWC https://paperswithcode.com/paper/the-adapt-bilingual-document-alignment-system
Repo
Framework

Yes, We Care! Results of the Ethics and Natural Language Processing Surveys

Title Yes, We Care! Results of the Ethics and Natural Language Processing Surveys
Authors Kar{"e}n Fort, Alain Couillault
Abstract We present here the context and results of two surveys (a French one and an international one) concerning Ethics and NLP, which we designed and conducted between June and September 2015. These surveys follow other actions related to raising concern for ethics in our community, including a Journ{'e}e d{'}{'e}tudes, a workshop and the Ethics and Big Data Charter. The concern for ethics shows to be quite similar in both surveys, despite a few differences which we present and discuss. The surveys also lead to think there is a growing awareness in the field concerning ethical issues, which translates into a willingness to get involved in ethics-related actions, to debate about the topic and to see ethics be included in major conferences themes. We finally discuss the limits of the surveys and the means of action we consider for the future. The raw data from the two surveys are freely available online.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1252/
PDF https://www.aclweb.org/anthology/L16-1252
PWC https://paperswithcode.com/paper/yes-we-care-results-of-the-ethics-and-natural
Repo
Framework

Improving Dependency Parsing Using Sentence Clause Charts

Title Improving Dependency Parsing Using Sentence Clause Charts
Authors Vincent Kr{'\i}{\v{z}}, Barbora Hladk{'a}
Abstract
Tasks Dependency Parsing
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-3013/
PDF https://www.aclweb.org/anthology/P16-3013
PWC https://paperswithcode.com/paper/improving-dependency-parsing-using-sentence
Repo
Framework

Enlarging Scarce In-domain English-Croatian Corpus for SMT of MOOCs Using Serbian

Title Enlarging Scarce In-domain English-Croatian Corpus for SMT of MOOCs Using Serbian
Authors Maja Popovi{'c}, Kostadin Cholakov, Valia Kordoni, Nikola Ljube{\v{s}}i{'c}
Abstract Massive Open Online Courses have been growing rapidly in size and impact. Yet the language barrier constitutes a major growth impediment in reaching out all people and educating all citizens. A vast majority of educational material is available only in English, and state-of-the-art machine translation systems still have not been tailored for this peculiar genre. In addition, a mere collection of appropriate in-domain training material is a challenging task. In this work, we investigate statistical machine translation of lecture subtitles from English into Croatian, which is morphologically rich and generally weakly supported, especially for the educational domain. We show that results comparable with publicly available systems trained on much larger data can be achieved if a small in-domain training set is used in combination with additional in-domain corpus originating from the closely related Serbian language.
Tasks Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4813/
PDF https://www.aclweb.org/anthology/W16-4813
PWC https://paperswithcode.com/paper/enlarging-scarce-in-domain-english-croatian
Repo
Framework

Boosting for Efficient Model Selection for Syntactic Parsing

Title Boosting for Efficient Model Selection for Syntactic Parsing
Authors Rachel Bawden, Beno{^\i}t Crabb{'e}
Abstract We present an efficient model selection method using boosting for transition-based constituency parsing. It is designed for exploring a high-dimensional search space, defined by a large set of feature templates, as for example is typically the case when parsing morphologically rich languages. Our method removes the need to manually define heuristic constraints, which are often imposed in current state-of-the-art selection methods. Our experiments for French show that the method is more efficient and is also capable of producing compact, state-of-the-art models.
Tasks Constituency Parsing, Model Selection
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1001/
PDF https://www.aclweb.org/anthology/C16-1001
PWC https://paperswithcode.com/paper/boosting-for-efficient-model-selection-for
Repo
Framework

Advancing Linguistic Features and Insights by Label-informed Feature Grouping: An Exploration in the Context of Native Language Identification

Title Advancing Linguistic Features and Insights by Label-informed Feature Grouping: An Exploration in the Context of Native Language Identification
Authors Serhiy Bykh, Detmar Meurers
Abstract We propose a hierarchical clustering approach designed to group linguistic features for supervised machine learning that is inspired by variationist linguistics. The method makes it possible to abstract away from the individual feature occurrences by grouping features together that behave alike with respect to the target class, thus providing a new, more general perspective on the data. On the one hand, it reduces data sparsity, leading to quantitative performance gains. On the other, it supports the formation and evaluation of hypotheses about individual choices of linguistic structures. We explore the method using features based on verb subcategorization information and evaluate the approach in the context of the Native Language Identification (NLI) task.
Tasks Language Acquisition, Language Identification, Native Language Identification, Text Classification
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1071/
PDF https://www.aclweb.org/anthology/C16-1071
PWC https://paperswithcode.com/paper/advancing-linguistic-features-and-insights-by
Repo
Framework

Anecdote Recognition and Recommendation

Title Anecdote Recognition and Recommendation
Authors Wei Song, Ruiji Fu, Lizhen Liu, Hanshi Wang, Ting Liu
Abstract We introduce a novel task Anecdote Recognition and Recommendation. An anecdote is a story with a point revealing account of an individual person. Recommending proper anecdotes can be used as evidence to support argumentative writing or as a clue for further reading. We represent an anecdote as a structured tuple {—} {\textless} person, story, implication {\textgreater}. Anecdote recognition runs on archived argumentative essays. We extract narratives containing events of a person as the anecdote story. More importantly, we uncover the anecdote implication, which reveals the meaning and topic of an anecdote. Our approach depends on discourse role identification. Discourse roles such as thesis, main ideas and support help us locate stories and their implications in essays. The experiments show that informative and interpretable anecdotes can be recognized. These anecdotes are used for anecdote recommendation. The anecdote recommender can recommend proper anecdotes in response to given topics. The anecdote implication contributes most for bridging user interested topics and relevant anecdotes.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1244/
PDF https://www.aclweb.org/anthology/C16-1244
PWC https://paperswithcode.com/paper/anecdote-recognition-and-recommendation
Repo
Framework

Learning Connective-based Word Representations for Implicit Discourse Relation Identification

Title Learning Connective-based Word Representations for Implicit Discourse Relation Identification
Authors Chlo{'e} Braud, Pascal Denis
Abstract
Tasks Feature Engineering, Word Embeddings
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1020/
PDF https://www.aclweb.org/anthology/D16-1020
PWC https://paperswithcode.com/paper/learning-connective-based-word
Repo
Framework

Structured Aspect Extraction

Title Structured Aspect Extraction
Authors Omer Gunes, Tim Furche, Giorgio Orsi
Abstract Aspect extraction identifies relevant features from a textual description of an entity, e.g., a phone, and is typically targeted to product descriptions, reviews, and other short texts as an enabling task for, e.g., opinion mining and information retrieval. Current aspect extraction methods mostly focus on aspect terms and often neglect interesting modifiers of the term or embed them in the aspect term without proper distinction. Moreover, flat syntactic structures are often assumed, resulting in inaccurate extractions of complex aspects. This paper studies the problem of structured aspect extraction, a variant of traditional aspect extraction aiming at a fine-grained extraction of complex (i.e., hierarchical) aspects. We propose an unsupervised and scalable method for structured aspect extraction consisting of statistical noun phrase clustering, cPMI-based noun phrase segmentation, and hierarchical pattern induction. Our evaluation shows a substantial improvement over existing methods in terms of both quality and computational efficiency.
Tasks Aspect-Based Sentiment Analysis, Aspect Extraction, Information Retrieval, Opinion Mining, Sentiment Analysis
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1219/
PDF https://www.aclweb.org/anthology/C16-1219
PWC https://paperswithcode.com/paper/structured-aspect-extraction
Repo
Framework

Models and Inference for Prefix-Constrained Machine Translation

Title Models and Inference for Prefix-Constrained Machine Translation
Authors Joern Wuebker, Spence Green, John DeNero, Sa{\v{s}}a Hasan, Minh-Thang Luong
Abstract
Tasks Machine Translation
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1007/
PDF https://www.aclweb.org/anthology/P16-1007
PWC https://paperswithcode.com/paper/models-and-inference-for-prefix-constrained
Repo
Framework

Infinite Hidden Semi-Markov Modulated Interaction Point Process

Title Infinite Hidden Semi-Markov Modulated Interaction Point Process
Authors Matt Zhang, Peng Lin, Peng Lin, Ting Guo, Yang Wang, Yang Wang, Fang Chen
Abstract The correlation between events is ubiquitous and important for temporal events modelling. In many cases, the correlation exists between not only events’ emitted observations, but also their arrival times. State space models (e.g., hidden Markov model) and stochastic interaction point process models (e.g., Hawkes process) have been studied extensively yet separately for the two types of correlations in the past. In this paper, we propose a Bayesian nonparametric approach that considers both types of correlations via unifying and generalizing hidden semi-Markov model and interaction point process model. The proposed approach can simultaneously model both the observations and arrival times of temporal events, and determine the number of latent states from data. A Metropolis-within-particle-Gibbs sampler with ancestor resampling is developed for efficient posterior inference. The approach is tested on both synthetic and real-world data with promising outcomes.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6243-infinite-hidden-semi-markov-modulated-interaction-point-process
PDF http://papers.nips.cc/paper/6243-infinite-hidden-semi-markov-modulated-interaction-point-process.pdf
PWC https://paperswithcode.com/paper/infinite-hidden-semi-markov-modulated
Repo
Framework

Categorization of Semantic Roles for Dictionary Definitions

Title Categorization of Semantic Roles for Dictionary Definitions
Authors Vivian Silva, H, Siegfried schuh, Andr{'e} Freitas
Abstract Understanding the semantic relationships between terms is a fundamental task in natural language processing applications. While structured resources that can express those relationships in a formal way, such as ontologies, are still scarce, a large number of linguistic resources gathering dictionary definitions is becoming available, but understanding the semantic structure of natural language definitions is fundamental to make them useful in semantic interpretation tasks. Based on an analysis of a subset of WordNet{'}s glosses, we propose a set of semantic roles that compose the semantic structure of a dictionary definition, and show how they are related to the definition{'}s syntactic configuration, identifying patterns that can be used in the development of information extraction frameworks and semantic models.
Tasks Question Answering
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-5323/
PDF https://www.aclweb.org/anthology/W16-5323
PWC https://paperswithcode.com/paper/categorization-of-semantic-roles-for-1
Repo
Framework

A Transition-Based System for Joint Lexical and Syntactic Analysis

Title A Transition-Based System for Joint Lexical and Syntactic Analysis
Authors Matthieu Constant, Joakim Nivre
Abstract
Tasks Dependency Parsing
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1016/
PDF https://www.aclweb.org/anthology/P16-1016
PWC https://paperswithcode.com/paper/a-transition-based-system-for-joint-lexical
Repo
Framework

Sentiment Domain Adaptation with Multiple Sources

Title Sentiment Domain Adaptation with Multiple Sources
Authors Fangzhao Wu, Yongfeng Huang
Abstract
Tasks Domain Adaptation, Multi-Task Learning, Sentiment Analysis
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1029/
PDF https://www.aclweb.org/anthology/P16-1029
PWC https://paperswithcode.com/paper/sentiment-domain-adaptation-with-multiple
Repo
Framework
comments powered by Disqus