July 26, 2019

2149 words 11 mins read

Paper Group NANR 74

Introducing EVALD – Software Applications for Automatic Evaluation of Discourse in Czech. How Well Can We Predict Hypernyms from Word Embeddings? A Dataset-Centric Analysis. Revisiting the ISO standard for dialogue act annotation. Improving NER for Clinical Texts by Ensemble Approach using Segment Representations. May I take your order? A Neural M …

Introducing EVALD – Software Applications for Automatic Evaluation of Discourse in Czech


Title	Introducing EVALD – Software Applications for Automatic Evaluation of Discourse in Czech
Authors	Kate{\v{r}}ina Rysov{'a}, Magdal{'e}na Rysov{'a}, Ji{\v{r}}{'\i} M{'\i}rovsk{'y}, Michal Nov{'a}k
Abstract	In the paper, we introduce two software applications for automatic evaluation of coherence in Czech texts called EVALD {–} Evaluator of Discourse. The first one {–} EVALD 1.0 {–} evaluates texts written by native speakers of Czech on a five-step scale commonly used at Czech schools (grade 1 is the best, grade 5 is the worst). The second application is EVALD 1.0 for Foreigners assessing texts by non-native speakers of Czech using six-step scale (A1{–}C2) according to CEFR. Both appli-cations are available online at \url{https://lindat.mff.cuni.cz/services/evald-foreign/}.
Tasks
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-1082/
PDF	https://doi.org/10.26615/978-954-452-049-6_082
PWC	https://paperswithcode.com/paper/introducing-evald-a-software-applications-for
Repo
Framework

How Well Can We Predict Hypernyms from Word Embeddings? A Dataset-Centric Analysis


Title	How Well Can We Predict Hypernyms from Word Embeddings? A Dataset-Centric Analysis
Authors	Ivan Sanchez, Sebastian Riedel
Abstract	One key property of word embeddings currently under study is their capacity to encode hypernymy. Previous works have used supervised models to recover hypernymy structures from embeddings. However, the overall results do not clearly show how well we can recover such structures. We conduct the first dataset-centric analysis that shows how only the Baroni dataset provides consistent results. We empirically show that a possible reason for its good performance is its alignment to dimensions specific of hypernymy: generality and similarity
Tasks	Natural Language Inference, Word Embeddings
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-2064/
PDF	https://www.aclweb.org/anthology/E17-2064
PWC	https://paperswithcode.com/paper/how-well-can-we-predict-hypernyms-from-word
Repo
Framework

Revisiting the ISO standard for dialogue act annotation


Title	Revisiting the ISO standard for dialogue act annotation
Authors	Harry Bunt, Volha Petukhova, Alex Chengyu Fang
Abstract
Tasks
Published	2017-01-01
URL	https://www.aclweb.org/anthology/W17-7404/
PDF	https://www.aclweb.org/anthology/W17-7404
PWC	https://paperswithcode.com/paper/revisiting-the-iso-standard-for-dialogue-act
Repo
Framework

Improving NER for Clinical Texts by Ensemble Approach using Segment Representations


Title	Improving NER for Clinical Texts by Ensemble Approach using Segment Representations
Authors	Hamada Nayel, H. L. Shashirekha
Abstract
Tasks
Published	2017-12-01
URL	https://www.aclweb.org/anthology/W17-7525/
PDF	https://www.aclweb.org/anthology/W17-7525
PWC	https://paperswithcode.com/paper/improving-ner-for-clinical-texts-by-ensemble
Repo
Framework

May I take your order? A Neural Model for Extracting Structured Information from Conversations


Title	May I take your order? A Neural Model for Extracting Structured Information from Conversations
Authors	Baolin Peng, Michael Seltzer, Y.C. Ju, Geoffrey Zweig, Kam-Fai Wong
Abstract	In this paper we tackle a unique and important problem of extracting a structured order from the conversation a customer has with an order taker at a restaurant. This is motivated by an actual system under development to assist in the order taking process. We develop a sequence-to-sequence model that is able to map from unstructured conversational input to the structured form that is conveyed to the kitchen and appears on the customer receipt. This problem is critically different from other tasks like machine translation where sequence-to-sequence models have been used: the input includes two sides of a conversation; the output is highly structured; and logical manipulations must be performed, for example when the customer changes his mind while ordering. We present a novel sequence-to-sequence model that incorporates a special attention-memory gating mechanism and conversational role markers. The proposed model improves performance over both a phrase-based machine translation approach and a standard sequence-to-sequence model.
Tasks	Machine Translation
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-1043/
PDF	https://www.aclweb.org/anthology/E17-1043
PWC	https://paperswithcode.com/paper/may-i-take-your-order-a-neural-model-for
Repo
Framework

Multilingual Metaphor Processing: Experiments with Semi-Supervised and Unsupervised Learning


Title	Multilingual Metaphor Processing: Experiments with Semi-Supervised and Unsupervised Learning
Authors	Ekaterina Shutova, Lin Sun, Elkin Dar{'\i}o Guti{'e}rrez, Patricia Lichtenstein, Srini Narayanan
Abstract	Highly frequent in language and communication, metaphor represents a significant challenge for Natural Language Processing (NLP) applications. Computational work on metaphor has traditionally evolved around the use of hand-coded knowledge, making the systems hard to scale. Recent years have witnessed a rise in statistical approaches to metaphor processing. However, these approaches often require extensive human annotation effort and are predominantly evaluated within a limited domain. In contrast, we experiment with weakly supervised and unsupervised techniques{—}with little or no annotation{—}to generalize higher-level mechanisms of metaphor from distributional properties of concepts. We investigate different levels and types of supervision (learning from linguistic examples vs. learning from a given set of metaphorical mappings vs. learning without annotation) in flat and hierarchical, unconstrained and constrained clustering settings. Our aim is to identify the optimal type of supervision for a learning algorithm that discovers patterns of metaphorical association from text. In order to investigate the scalability and adaptability of our models, we applied them to data in three languages from different language groups{—}English, Spanish, and Russian{—}achieving state-of-the-art results with little supervision. Finally, we demonstrate that statistical methods can facilitate and scale up cross-linguistic research on metaphor.
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/J17-1003/
PDF	https://www.aclweb.org/anthology/J17-1003
PWC	https://paperswithcode.com/paper/multilingual-metaphor-processing-experiments
Repo
Framework

EEG-GRAPH: A Factor-Graph-Based Model for Capturing Spatial, Temporal, and Observational Relationships in Electroencephalograms


Title	EEG-GRAPH: A Factor-Graph-Based Model for Capturing Spatial, Temporal, and Observational Relationships in Electroencephalograms
Authors	Yogatheesan Varatharajah, Min Jin Chong, Krishnakant Saboo, Brent Berry, Benjamin Brinkmann, Gregory Worrell, Ravishankar Iyer
Abstract	This paper presents a probabilistic-graphical model that can be used to infer characteristics of instantaneous brain activity by jointly analyzing spatial and temporal dependencies observed in electroencephalograms (EEG). Specifically, we describe a factor-graph-based model with customized factor-functions defined based on domain knowledge, to infer pathologic brain activity with the goal of identifying seizure-generating brain regions in epilepsy patients. We utilize an inference technique based on the graph-cut algorithm to exactly solve graph inference in polynomial time. We validate the model by using clinically collected intracranial EEG data from 29 epilepsy patients to show that the model correctly identifies seizure-generating brain regions. Our results indicate that our model outperforms two conventional approaches used for seizure-onset localization (5-7% better AUC: 0.72, 0.67, 0.65) and that the proposed inference technique provides 3-10% gain in AUC (0.72, 0.62, 0.69) compared to sampling-based alternatives.
Tasks	EEG
Published	2017-12-01
URL	http://papers.nips.cc/paper/7121-eeg-graph-a-factor-graph-based-model-for-capturing-spatial-temporal-and-observational-relationships-in-electroencephalograms
PDF	http://papers.nips.cc/paper/7121-eeg-graph-a-factor-graph-based-model-for-capturing-spatial-temporal-and-observational-relationships-in-electroencephalograms.pdf
PWC	https://paperswithcode.com/paper/eeg-graph-a-factor-graph-based-model-for
Repo
Framework

Facebook sentiment: Reactions and Emojis


Title	Facebook sentiment: Reactions and Emojis
Authors	Ye Tian, Thiago Galery, Giulio Dulcinati, Emilia Molimpakis, Chao Sun
Abstract	Emojis are used frequently in social media. A widely assumed view is that emojis express the emotional state of the user, which has led to research focusing on the expressiveness of emojis independent from the linguistic context. We argue that emojis and the linguistic texts can modify the meaning of each other. The overall communicated meaning is not a simple sum of the two channels. In order to study the meaning interplay, we need data indicating the overall sentiment of the entire message as well as the sentiment of the emojis stand-alone. We propose that Facebook Reactions are a good data source for such a purpose. FB reactions (e.g. {`}Love{''} and {`}Angry{''}) indicate the readers{'} overall sentiment, against which we can investigate the types of emojis used the comments under different reaction profiles. We present a data set of 21,000 FB posts (57 million reactions and 8 million comments) from public media pages across four countries.
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-1102/
PDF	https://www.aclweb.org/anthology/W17-1102
PWC	https://paperswithcode.com/paper/facebook-sentiment-reactions-and-emojis
Repo
Framework

Exploring an Efficient Handwritten Manipuri Meetei-Mayek Character Recognition Using Gradient Feature Extractor and Cosine Distance Based Multiclass k-Nearest Neighbor Classifier


Title	Exploring an Efficient Handwritten Manipuri Meetei-Mayek Character Recognition Using Gradient Feature Extractor and Cosine Distance Based Multiclass k-Nearest Neighbor Classifier
Authors	Kishorjit Nongmeikapam, Wahengbam Kumar, Mithlesh Prasad Singh
Abstract
Tasks
Published	2017-12-01
URL	https://www.aclweb.org/anthology/W17-7541/
PDF	https://www.aclweb.org/anthology/W17-7541
PWC	https://paperswithcode.com/paper/exploring-an-efficient-handwritten-manipuri
Repo
Framework

When a Red Herring in Not a Red Herring: Using Compositional Methods to Detect Non-Compositional Phrases


Title	When a Red Herring in Not a Red Herring: Using Compositional Methods to Detect Non-Compositional Phrases
Authors	Julie Weeds, Thomas Kober, Jeremy Reffin, David Weir
Abstract	Non-compositional phrases such as \textit{red herring} and weakly compositionalphrases such as \textit{spelling bee} are an integral part of natural language(Sag, 2002). They are also the phrases that are difficult, or evenimpossible, for good compositional distributional models of semantics.Compositionality detection therefore provides a good testbed for compositionalmethods. We compare an integrated compositional distributional approach, usingsparse high dimensional representations, with the ad-hoc compositional approachof applying simple composition operations to state-of-the-art neuralembeddings.
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-2085/
PDF	https://www.aclweb.org/anthology/E17-2085
PWC	https://paperswithcode.com/paper/when-a-red-herring-in-not-a-red-herring-using
Repo
Framework

Applying Multi-Sense Embeddings for German Verbs to Determine Semantic Relatedness and to Detect Non-Literal Language


Title	Applying Multi-Sense Embeddings for German Verbs to Determine Semantic Relatedness and to Detect Non-Literal Language
Authors	Maximilian K{"o}per, Sabine Schulte im Walde
Abstract	Up to date, the majority of computational models still determines the semantic relatedness between words (or larger linguistic units) on the type level. In this paper, we compare and extend multi-sense embeddings, in order to model and utilise word senses on the token level. We focus on the challenging class of complex verbs, and evaluate the model variants on various semantic tasks: semantic classification; predicting compositionality; and detecting non-literal language usage. While there is no overall best model, all models significantly outperform a word2vec single-sense skip baseline, thus demonstrating the need to distinguish between word senses in a distributional semantic model.
Tasks	Semantic Textual Similarity, Word Embeddings, Word Sense Disambiguation
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-2086/
PDF	https://www.aclweb.org/anthology/E17-2086
PWC	https://paperswithcode.com/paper/applying-multi-sense-embeddings-for-german
Repo
Framework

Investigating Redundancy in Emoji Use: Study on a Twitter Based Corpus


Title	Investigating Redundancy in Emoji Use: Study on a Twitter Based Corpus
Authors	Giulia Donato, Patrizia Paggio
Abstract	In this paper we present an annotated corpus created with the aim of analyzing the informative behaviour of emoji {–} an issue of importance for sentiment analysis and natural language processing. The corpus consists of 2475 tweets all containing at least one emoji, which has been annotated using one of the three possible classes: Redundant, Non Redundant, and Non Redundant + POS. We explain how the corpus was collected, describe the annotation procedure and the interface developed for the task. We provide an analysis of the corpus, considering also possible predictive features, discuss the problematic aspects of the annotation, and suggest future improvements.
Tasks	Sentiment Analysis
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-5216/
PDF	https://www.aclweb.org/anthology/W17-5216
PWC	https://paperswithcode.com/paper/investigating-redundancy-in-emoji-use-study
Repo
Framework


Title	Investigating Patient Attitudes Towards the use of Social Media Data to Augment Depression Diagnosis and Treatment: a Qualitative Study
Authors	Jude Mikal, Samantha Hurst, Mike Conway
Abstract	In this paper, we use qualitative research methods to investigate the attitudes of social media users towards the (opt-in) integration of social media data with routine mental health care and diagnosis. Our investigation was based on secondary analysis of a series of five focus groups with Twitter users, including three groups consisting of participants with a self-reported history of depression, and two groups consisting of participants without a self reported history of depression. Our results indicate that, overall, research participants were enthusiastic about the possibility of using social media (in conjunction with automated Natural Language Processing algorithms) for mood tracking under the supervision of a mental health practitioner. However, for at least some participants, there was skepticism related to how well social media represents the mental health of users, and hence its usefulness in the clinical context.
Tasks
Published	2017-08-01
URL	https://www.aclweb.org/anthology/W17-3105/
PDF	https://www.aclweb.org/anthology/W17-3105
PWC	https://paperswithcode.com/paper/investigating-patient-attitudes-towards-the
Repo
Framework

Proceedings of the Sixth Workshop on Vision and Language


Title	Proceedings of the Sixth Workshop on Vision and Language
Authors
Abstract
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-2000/
PDF	https://www.aclweb.org/anthology/W17-2000
PWC	https://paperswithcode.com/paper/proceedings-of-the-sixth-workshop-on-vision
Repo
Framework

A Calibration Method for Evaluation of Sentiment Analysis


Title	A Calibration Method for Evaluation of Sentiment Analysis
Authors	F.Sharmila Satthar, Roger Evans, Gulden Uchyigit
Abstract	Sentiment analysis is the computational task of extracting sentiment from a text document {–} for example whether it expresses a positive, negative or neutral opinion. Various approaches have been introduced in recent years, using a range of different techniques to extract sentiment information from a document. Measuring these methods against a gold standard dataset is a useful way to evaluate such systems. However, different sentiment analysis techniques represent sentiment values in different ways, such as discrete categorical classes or continuous numerical sentiment scores. This creates a challenge for evaluating and comparing such systems; in particular assessing numerical scores against datasets that use fixed classes is difficult, because the numerical outputs have to be mapped onto the ordered classes. This paper proposes a novel calibration technique that uses precision vs. recall curves to set class thresholds to optimize a continuous sentiment analyser{'}s performance against a discrete gold standard dataset. In experiments mapping a continuous score onto a three-class classification of movie reviews, we show that calibration results in a substantial increase in f-score when compared to a non-calibrated mapping.
Tasks	Calibration, Information Retrieval, Sentiment Analysis
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-1084/
PDF	https://doi.org/10.26615/978-954-452-049-6_084
PWC	https://paperswithcode.com/paper/a-calibration-method-for-evaluation-of
Repo
Framework