May 5, 2019

1736 words 9 mins read

Paper Group NANR 59

Dealing with Data Sparseness in SMT with Factured Models and Morphological Expansion: a Case Study on Croatian. Disfluent but effective? A quantitative study of disfluencies and conversational moves in team discourse. Real-Time Understanding of Complex Discriminative Scene Descriptions. Recognition of non-domain phrases in automatically extracted l …

Dealing with Data Sparseness in SMT with Factured Models and Morphological Expansion: a Case Study on Croatian


Title	Dealing with Data Sparseness in SMT with Factured Models and Morphological Expansion: a Case Study on Croatian
Authors	Victor M. S{'a}nchez-Cartagena, Nikola Ljube{\v{s}}i{'c}, Filip Klubi{\v{c}}ka
Abstract
Tasks	Machine Translation
Published	2016-01-01
URL	https://www.aclweb.org/anthology/W16-3421/
PDF	https://www.aclweb.org/anthology/W16-3421
PWC	https://paperswithcode.com/paper/dealing-with-data-sparseness-in-smt-with
Repo
Framework

Disfluent but effective? A quantitative study of disfluencies and conversational moves in team discourse


Title	Disfluent but effective? A quantitative study of disfluencies and conversational moves in team discourse
Authors	Felix Gervits, Kathleen Eberhard, Matthias Scheutz
Abstract	Situated dialogue systems that interact with humans as part of a team (e.g., robot teammates) need to be able to use information from communication channels to gauge the coordination level and effectiveness of the team. Currently, the feasibility of this end goal is limited by several gaps in both the empirical and computational literature. The purpose of this paper is to address those gaps in the following ways: (1) investigate which properties of task-oriented discourse correspond with effective performance in human teams, and (2) discuss how and to what extent these properties can be utilized in spoken dialogue systems. To this end, we analyzed natural language data from a unique corpus of spontaneous, task-oriented dialogue (CReST corpus), which was annotated for disfluencies and conversational moves. We found that effective teams made more self-repair disfluencies and used specific communication strategies to facilitate grounding and coordination. Our results indicate that truly robust and natural dialogue systems will need to interpret highly disfluent utterances and also utilize specific collaborative mechanisms to facilitate grounding. These data shed light on effective communication in performance scenarios and directly inform the development of robust dialogue systems for situated artificial agents.
Tasks	Decision Making, Spoken Dialogue Systems
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1317/
PDF	https://www.aclweb.org/anthology/C16-1317
PWC	https://paperswithcode.com/paper/disfluent-but-effective-a-quantitative-study
Repo
Framework

Real-Time Understanding of Complex Discriminative Scene Descriptions


Title	Real-Time Understanding of Complex Discriminative Scene Descriptions
Authors	Ramesh Manuvinakurike, Casey Kennington, David DeVault, David Schlangen
Abstract
Tasks	Spoken Dialogue Systems
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-3630/
PDF	https://www.aclweb.org/anthology/W16-3630
PWC	https://paperswithcode.com/paper/real-time-understanding-of-complex
Repo
Framework

Recognition of non-domain phrases in automatically extracted lists of terms


Title	Recognition of non-domain phrases in automatically extracted lists of terms
Authors	Agnieszka Mykowiecka, Malgorzata Marciniak, Piotr Rychlik
Abstract	In the paper, we address the problem of recognition of non-domain phrases in terminology lists obtained with an automatic term extraction tool. We focus on identification of multi-word phrases that are general terms and discourse function expressions. We tested several methods based on domain corpora comparison and a method based on contexts of phrases identified in a large corpus of general language. We compared the results of the methods to manual annotation. The results show that the task is quite hard as the inter-annotator agreement is low. Several tested methods achieved similar overall results, although the phrase ordering varied between methods. The most successful method with the precision about 0.75 at the half of the tested list was the context based method using a modified contextual diversity coefficient.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4703/
PDF	https://www.aclweb.org/anthology/W16-4703
PWC	https://paperswithcode.com/paper/recognition-of-non-domain-phrases-in
Repo
Framework

Analyzing Pre-processing Settings for Urdu Single-document Extractive Summarization


Title	Analyzing Pre-processing Settings for Urdu Single-document Extractive Summarization
Authors	Muhammad Humayoun, Hwanjo Yu
Abstract	Preprocessing is a preliminary step in many fields including IR and NLP. The effect of basic preprocessing settings on English for text summarization is well-studied. However, there is no such effort found for the Urdu language (with the best of our knowledge). In this study, we analyze the effect of basic preprocessing settings for single-document text summarization for Urdu, on a benchmark corpus using various experiments. The analysis is performed using the state-of-the-art algorithms for extractive summarization and the effect of stopword removal, lemmatization, and stemming is analyzed. Results showed that these pre-processing settings improve the results.
Tasks	Lemmatization, Text Summarization
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1585/
PDF	https://www.aclweb.org/anthology/L16-1585
PWC	https://paperswithcode.com/paper/analyzing-pre-processing-settings-for-urdu
Repo
Framework

Supervised classification of end-of-lines in clinical text with no manual annotation


Title	Supervised classification of end-of-lines in clinical text with no manual annotation
Authors	Pierre Zweigenbaum, Cyril Grouin, Thomas Lavergne
Abstract	In some plain text documents, end-of-line marks may or may not mark the boundary of a text unit (e.g., of a paragraph). This vexing problem is likely to impact subsequent natural language processing components, but is seldom addressed in the literature. We propose a method which uses no manual annotation to classify whether end-of-lines must actually be seen as simple spaces (soft line breaks) or as true text unit boundaries. This method, which includes self-training and co-training steps based on token and line length features, achieves 0.943 F-measure on a corpus of short e-books with controlled format, F=0.904 on a random sample of 24 clinical texts with soft line breaks, and F=0.898 on a larger set of mixed clinical texts which may or may not contain soft line breaks, a fairly high value for a method with no manual annotation.
Tasks	Language Modelling, Relation Extraction
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-5109/
PDF	https://www.aclweb.org/anthology/W16-5109
PWC	https://paperswithcode.com/paper/supervised-classification-of-end-of-lines-in
Repo
Framework

Parallel Chinese-English Entities, Relations and Events Corpora


Title	Parallel Chinese-English Entities, Relations and Events Corpora
Authors	Justin Mott, Ann Bies, Zhiyi Song, Stephanie Strassel
Abstract	This paper introduces the parallel Chinese-English Entities, Relations and Events (ERE) corpora developed by Linguistic Data Consortium under the DARPA Deep Exploration and Filtering of Text (DEFT) Program. Original Chinese newswire and discussion forum documents are annotated for two versions of the ERE task. The texts are manually translated into English and then annotated for the same ERE tasks on the English translation, resulting in a rich parallel resource that has utility for performers within the DEFT program, for participants in NIST{'}s Knowledge Base Population evaluations, and for cross-language projection research more generally.
Tasks	Knowledge Base Population
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1589/
PDF	https://www.aclweb.org/anthology/L16-1589
PWC	https://paperswithcode.com/paper/parallel-chinese-english-entities-relations
Repo
Framework

Detecting Implicit Expressions of Affect from Text using Semantic Knowledge on Common Concept Properties


Title	Detecting Implicit Expressions of Affect from Text using Semantic Knowledge on Common Concept Properties
Authors	Alex Balahur, ra, Hristo Tanev
Abstract	Emotions are an important part of the human experience. They are responsible for the adaptation and integration in the environment, offering, most of the time together with the cognitive system, the appropriate responses to stimuli in the environment. As such, they are an important component in decision-making processes. In today{'}s society, the avalanche of stimuli present in the environment (physical or virtual) makes people more prone to respond to stronger affective stimuli (i.e., those that are related to their basic needs and motivations â€• survival, food, shelter, etc.). In media reporting, this is translated in the use of arguments (factual data) that are known to trigger specific (strong, affective) behavioural reactions from the readers. This paper describes initial efforts to detect such arguments from text, based on the properties of concepts. The final system able to retrieve and label this type of data from the news in traditional and social platforms is intended to be integrated Europe Media Monitor family of applications to detect texts that trigger certain (especially negative) reactions from the public, with consequences on citizen safety and security.
Tasks	Decision Making
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1185/
PDF	https://www.aclweb.org/anthology/L16-1185
PWC	https://paperswithcode.com/paper/detecting-implicit-expressions-of-affect-from
Repo
Framework

mib at SemEval-2016 Task 4a: Exploiting lexicon based features for Sentiment Analysis in Twitter


Title	mib at SemEval-2016 Task 4a: Exploiting lexicon based features for Sentiment Analysis in Twitter
Authors	Vittoria Cozza, Marinella Petrocchi
Abstract
Tasks	Recommendation Systems, Sentiment Analysis
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1019/
PDF	https://www.aclweb.org/anthology/S16-1019
PWC	https://paperswithcode.com/paper/mib-at-semeval-2016-task-4a-exploiting
Repo
Framework

SoNLP-DP System for ConLL-2016 Chinese Shallow Discourse Parsing


Title	SoNLP-DP System for ConLL-2016 Chinese Shallow Discourse Parsing
Authors	Junhui Li, Fang Kong, Sheng Li, Muhua Zhu, Guodong Zhou
Abstract
Tasks	Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/K16-2011/
PDF	https://www.aclweb.org/anthology/K16-2011
PWC	https://paperswithcode.com/paper/sonlp-dp-system-for-conll-2016-chinese
Repo
Framework

The Query of Everything: Developing Open-Domain, Natural-Language Queries for BOLT Information Retrieval


Title	The Query of Everything: Developing Open-Domain, Natural-Language Queries for BOLT Information Retrieval
Authors	Kira Griffitt, Stephanie Strassel
Abstract	The DARPA BOLT Information Retrieval evaluations target open-domain natural-language queries over a large corpus of informal text in English, Chinese and Egyptian Arabic. We outline the goals of BOLT IR, comparing it with the prior GALE Distillation task. After discussing the properties of the BOLT IR corpus, we provide a detailed description of the query creation process, contrasting the summary query format presented to systems at run time with the full query format created by annotators. We describe the relevance criteria used to assess BOLT system responses, highlighting the evolution of the procedures used over the three evaluation phases. We provide a detailed review of the decision points model for relevance assessment introduced during Phase 2, and conclude with information about inter-assessor consistency achieved with the decision points assessment model.
Tasks	Information Retrieval
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1593/
PDF	https://www.aclweb.org/anthology/L16-1593
PWC	https://paperswithcode.com/paper/the-query-of-everything-developing-open
Repo
Framework

bunji at SemEval-2016 Task 5: Neural and Syntactic Models of Entity-Attribute Relationship for Aspect-based Sentiment Analysis


Title	bunji at SemEval-2016 Task 5: Neural and Syntactic Models of Entity-Attribute Relationship for Aspect-based Sentiment Analysis
Authors	Toshihiko Yanase, Kohsuke Yanai, Misa Sato, Toshinori Miyoshi, Yoshiki Niwa
Abstract
Tasks	Aspect-Based Sentiment Analysis, Multi-Label Classification, Sentiment Analysis
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1046/
PDF	https://www.aclweb.org/anthology/S16-1046
PWC	https://paperswithcode.com/paper/bunji-at-semeval-2016-task-5-neural-and
Repo
Framework

BUTknot at SemEval-2016 Task 5: Supervised Machine Learning with Term Substitution Approach in Aspect Category Detection


Title	BUTknot at SemEval-2016 Task 5: Supervised Machine Learning with Term Substitution Approach in Aspect Category Detection
Authors	Jakub Mach{'a}{\v{c}}ek
Abstract
Tasks	Aspect-Based Sentiment Analysis, Multi-Label Classification, Opinion Mining, Sentiment Analysis
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1048/
PDF	https://www.aclweb.org/anthology/S16-1048
PWC	https://paperswithcode.com/paper/butknot-at-semeval-2016-task-5-supervised
Repo
Framework

ECNU at SemEval 2016 Task 6: Relevant or Not? Supportive or Not? A Two-step Learning System for Automatic Detecting Stance in Tweets


Title	ECNU at SemEval 2016 Task 6: Relevant or Not? Supportive or Not? A Two-step Learning System for Automatic Detecting Stance in Tweets
Authors	Zhihua Zhang, Man Lan
Abstract
Tasks	Feature Engineering, Sentiment Analysis, Stance Detection
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1073/
PDF	https://www.aclweb.org/anthology/S16-1073
PWC	https://paperswithcode.com/paper/ecnu-at-semeval-2016-task-6-relevant-or-not
Repo
Framework

NileTMRG at SemEval-2016 Task 7: Deriving Prior Polarities for Arabic Sentiment Terms


Title	NileTMRG at SemEval-2016 Task 7: Deriving Prior Polarities for Arabic Sentiment Terms
Authors	Samhaa R. El-Beltagy
Abstract
Tasks	Sentiment Analysis
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1079/
PDF	https://www.aclweb.org/anthology/S16-1079
PWC	https://paperswithcode.com/paper/niletmrg-at-semeval-2016-task-7-deriving
Repo
Framework