May 4, 2019

1995 words 10 mins read

Paper Group NANR 206

Paper Group NANR 206

Disaster Analysis using User-Generated Weather Report. Automated Discourse Analysis of Narrations by Adolescents with Autistic Spectrum Disorder. SUper Team at SemEval-2016 Task 3: Building a Feature-Rich System for Community Question Answering. Asynchronous Parallel Greedy Coordinate Descent. Whose Nickname is This? Recognizing Politicians from Th …

Disaster Analysis using User-Generated Weather Report

Title Disaster Analysis using User-Generated Weather Report
Authors Yasunobu Asakura, Masatsugu Hangyo, Mamoru Komachi
Abstract Information extraction from user-generated text has gained much attention with the growth of the Web.Disaster analysis using information from social media provides valuable, real-time, geolocation information for helping people caught up these in disasters. However, it is not convenient to analyze texts posted on social media because disaster keywords match any texts that contain words. For collecting posts about a disaster from social media, we need to develop a classifier to filter posts irrelevant to disasters. Moreover, because of the nature of social media, we can take advantage of posts that come with GPS information. However, a post does not always refer to an event occurring at the place where it has been posted. Therefore, we propose a new task of classifying whether a flood disaster occurred, in addition to predicting the geolocation of events from user-generated text. We report the annotation of the flood disaster corpus and develop a classifier to demonstrate the use of this corpus for disaster analysis.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3906/
PDF https://www.aclweb.org/anthology/W16-3906
PWC https://paperswithcode.com/paper/disaster-analysis-using-user-generated
Repo
Framework

Automated Discourse Analysis of Narrations by Adolescents with Autistic Spectrum Disorder

Title Automated Discourse Analysis of Narrations by Adolescents with Autistic Spectrum Disorder
Authors Michaela Regneri, Diane King
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-1901/
PDF https://www.aclweb.org/anthology/W16-1901
PWC https://paperswithcode.com/paper/automated-discourse-analysis-of-narrations-by
Repo
Framework

SUper Team at SemEval-2016 Task 3: Building a Feature-Rich System for Community Question Answering

Title SUper Team at SemEval-2016 Task 3: Building a Feature-Rich System for Community Question Answering
Authors Tsvetomila Mihaylova, Pepa Gencheva, Martin Boyanov, Ivana Yovcheva, Todor Mihaylov, Momchil Hardalov, Yasen Kiprov, Daniel Balchev, Ivan Koychev, Preslav Nakov, Ivelina Nikolova, Galia Angelova
Abstract
Tasks Answer Selection, Community Question Answering, Question Answering, Question Similarity
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1129/
PDF https://www.aclweb.org/anthology/S16-1129
PWC https://paperswithcode.com/paper/super-team-at-semeval-2016-task-3-building-a
Repo
Framework

Asynchronous Parallel Greedy Coordinate Descent

Title Asynchronous Parallel Greedy Coordinate Descent
Authors Yang You, Xiangru Lian, Ji Liu, Hsiang-Fu Yu, Inderjit S. Dhillon, James Demmel, Cho-Jui Hsieh
Abstract n this paper, we propose and study an Asynchronous parallel Greedy Coordinate Descent (Asy-GCD) algorithm for minimizing a smooth function with bounded constraints. At each iteration, workers asynchronously conduct greedy coordinate descent updates on a block of variables. In the first part of the paper, we analyze the theoretical behavior of Asy-GCD and prove a linear convergence rate. In the second part, we develop an efficient kernel SVM solver based on Asy-GCD in the shared memory multi-core setting. Since our algorithm is fully asynchronous—each core does not need to idle and wait for the other cores—the resulting algorithm enjoys good speedup and outperforms existing multi-core kernel SVM solvers including asynchronous stochastic coordinate descent and multi-core LIBSVM.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6070-asynchronous-parallel-greedy-coordinate-descent
PDF http://papers.nips.cc/paper/6070-asynchronous-parallel-greedy-coordinate-descent.pdf
PWC https://paperswithcode.com/paper/asynchronous-parallel-greedy-coordinate
Repo
Framework

Whose Nickname is This? Recognizing Politicians from Their Aliases

Title Whose Nickname is This? Recognizing Politicians from Their Aliases
Authors Wei-Chung Wang, Hung-Chen Chen, Zhi-Kai Ji, Hui-I Hsiao, Yu-Shian Chiu, Lun-Wei Ku
Abstract Using aliases to refer to public figures is one way to make fun of people, to express sarcasm, or even to sidestep legal issues when expressing opinions on social media. However, linking an alias back to the real name is difficult, as it entails phonemic, graphemic, and semantic challenges. In this paper, we propose a phonemic-based approach and inject semantic information to align aliases with politicians{'} Chinese formal names. The proposed approach creates an HMM model for each name to model its phonemes and takes into account document-level pairwise mutual information to capture the semantic relations to the alias. In this work we also introduce two new datasets consisting of 167 phonemic pairs and 279 mixed pairs of aliases and formal names. Experimental results show that the proposed approach models both phonemic and semantic information and outperforms previous work on both the phonemic and mixed datasets with the best top-1 accuracies of 0.78 and 0.59 respectively.
Tasks Entity Linking, Named Entity Recognition, Transliteration
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3910/
PDF https://www.aclweb.org/anthology/W16-3910
PWC https://paperswithcode.com/paper/whose-nickname-is-this-recognizing
Repo
Framework

Towards Accurate Event Detection in Social Media: A Weakly Supervised Approach for Learning Implicit Event Indicators

Title Towards Accurate Event Detection in Social Media: A Weakly Supervised Approach for Learning Implicit Event Indicators
Authors Ajit Jain, Girish Kasiviswanathan, Ruihong Huang
Abstract Accurate event detection in social media is very challenging because user generated contents are extremely noisy and sparse in content. Event indicators are generally words or phrases that act as a trigger that help us understand the semantics of the context they occur in. We present a weakly supervised approach that relies on using a single strong event indicator phrase as a seed to acquire a variety of additional event cues. We propose to leverage various types of implicit event indicators, such as props, actors and precursor events, to achieve precise event detection. We experimented with civil unrest events and show that the automatically learnt event indicators are effective in identifying specific types of events.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3911/
PDF https://www.aclweb.org/anthology/W16-3911
PWC https://paperswithcode.com/paper/towards-accurate-event-detection-in-social
Repo
Framework

Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA2016)

Title Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA2016)
Authors
Abstract
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4900/
PDF https://www.aclweb.org/anthology/W16-4900
PWC https://paperswithcode.com/paper/proceedings-of-the-3rd-workshop-on-natural
Repo
Framework

Extract Domain-specific Paraphrase from Monolingual Corpus for Automatic Evaluation of Machine Translation

Title Extract Domain-specific Paraphrase from Monolingual Corpus for Automatic Evaluation of Machine Translation
Authors Lilin Zhang, Zhen Weng, Wenyan Xiao, Jianyi Wan, Zhiming Chen, Yiming Tan, Maoxi Li, Mingwen Wang
Abstract
Tasks Domain Adaptation, Machine Translation
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2343/
PDF https://www.aclweb.org/anthology/W16-2343
PWC https://paperswithcode.com/paper/extract-domain-specific-paraphrase-from
Repo
Framework

Extracting Structured Scholarly Information from the Machine Translation Literature

Title Extracting Structured Scholarly Information from the Machine Translation Literature
Authors Eunsol Choi, Matic Horvat, Jonathan May, Kevin Knight, Daniel Marcu
Abstract Understanding the experimental results of a scientific paper is crucial to understanding its contribution and to comparing it with related work. We introduce a structured, queryable representation for experimental results and a baseline system that automatically populates this representation. The representation can answer compositional questions such as: {}Which are the best published results reported on the NIST 09 Chinese to English dataset?{''} and {}What are the most important methods for speeding up phrase-based decoding?{''} Answering such questions usually involves lengthy literature surveys. Current machine reading for academic papers does not usually consider the actual experiments, but mostly focuses on understanding abstracts. We describe annotation work to create an initial hscientific paper; experimental results representationi corpus. The corpus is composed of 67 papers which were manually annotated with a structured representation of experimental results by domain experts. Additionally, we present a baseline algorithm that characterizes the difficulty of the inference task.
Tasks Machine Translation, Reading Comprehension
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1067/
PDF https://www.aclweb.org/anthology/L16-1067
PWC https://paperswithcode.com/paper/extracting-structured-scholarly-information
Repo
Framework

Analysis of Twitter Data for Postmarketing Surveillance in Pharmacovigilance

Title Analysis of Twitter Data for Postmarketing Surveillance in Pharmacovigilance
Authors Julie Pain, Jessie Levacher, Adam Quinquenel, Anja Belz
Abstract Postmarketing surveillance (PMS) has the vital aim to monitor effects of drugs after release for use by the general population, but suffers from under-reporting and limited coverage. Automatic methods for detecting drug effect reports, especially for social media, could vastly increase the scope of PMS. Very few automatic PMS methods are currently available, in particular for the messy text types encountered on Twitter. In this paper we describe first results for developing PMS methods specifically for tweets. We describe the corpus of 125,669 tweets we have created and annotated to train and test the tools. We find that generic tools perform well for tweet-level language identification and tweet-level sentiment analysis (both 0.94 F1-Score). For detection of effect mentions we are able to achieve 0.87 F1-Score, while effect-level adverse-vs.-beneficial analysis proves harder with an F1-Score of 0.64. Among other things, our results indicate that MetaMap semantic types provide a very promising basis for identifying drug effect mentions in tweets.
Tasks Language Identification, Sentiment Analysis
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3914/
PDF https://www.aclweb.org/anthology/W16-3914
PWC https://paperswithcode.com/paper/analysis-of-twitter-data-for-postmarketing
Repo
Framework

Patterns of Terminological Variation in Post-editing and of Cognate Use in Machine Translation in Contrast to Human Translation

Title Patterns of Terminological Variation in Post-editing and of Cognate Use in Machine Translation in Contrast to Human Translation
Authors Oliver {\v{C}}ulo, Jean Nitzke
Abstract
Tasks Machine Translation
Published 2016-01-01
URL https://www.aclweb.org/anthology/W16-3401/
PDF https://www.aclweb.org/anthology/W16-3401
PWC https://paperswithcode.com/paper/patterns-of-terminological-variation-in-post
Repo
Framework

Normalized Spectral Map Synchronization

Title Normalized Spectral Map Synchronization
Authors Yanyao Shen, Qixing Huang, Nati Srebro, Sujay Sanghavi
Abstract The algorithmic advancement of synchronizing maps is important in order to solve a wide range of practice problems with possible large-scale dataset. In this paper, we provide theoretical justifications for spectral techniques for the map synchronization problem, i.e., it takes as input a collection of objects and noisy maps estimated between pairs of objects, and outputs clean maps between all pairs of objects. We show that a simple normalized spectral method that projects the blocks of the top eigenvectors of a data matrix to the map space leads to surprisingly good results. As the noise is modelled naturally as random permutation matrix, this algorithm NormSpecSync leads to competing theoretical guarantees as state-of-the-art convex optimization techniques, yet it is much more efficient. We demonstrate the usefulness of our algorithm in a couple of applications, where it is optimal in both complexity and exactness among existing methods.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6128-normalized-spectral-map-synchronization
PDF http://papers.nips.cc/paper/6128-normalized-spectral-map-synchronization.pdf
PWC https://paperswithcode.com/paper/normalized-spectral-map-synchronization
Repo
Framework

Named Entity Recognition and Hashtag Decomposition to Improve the Classification of Tweets

Title Named Entity Recognition and Hashtag Decomposition to Improve the Classification of Tweets
Authors Billal Belainine, Alexs Fonseca, ro, Fatiha Sadat
Abstract In social networks services like Twitter, users are overwhelmed with huge amount of social data, most of which are short, unstructured and highly noisy. Identifying accurate information from this huge amount of data is indeed a hard task. Classification of tweets into organized form will help the user to easily access these required information. Our first contribution relates to filtering parts of speech and preprocessing this kind of highly noisy and short data. Our second contribution concerns the named entity recognition (NER) in tweets. Thus, the adaptation of existing language tools for natural languages, noisy and not accurate language tweets, is necessary. Our third contribution involves segmentation of hashtags and a semantic enrichment using a combination of relations from WordNet, which helps the performance of our classification system, including disambiguation of named entities, abbreviations and acronyms. Graph theory is used to cluster the words extracted from WordNet and tweets, based on the idea of connected components. We test our automatic classification system with four categories: politics, economy, sports and the medical field. We evaluate and compare several automatic classification systems using part or all of the items described in our contributions and found that filtering by part of speech and named entity recognition dramatically increase the classification precision to 77.3 {%}. Moreover, a classification system incorporating segmentation of hashtags and semantic enrichment by two relations from WordNet, synonymy and hyperonymy, increase classification precision up to 83.4 {%}.
Tasks Named Entity Recognition
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3915/
PDF https://www.aclweb.org/anthology/W16-3915
PWC https://paperswithcode.com/paper/named-entity-recognition-and-hashtag
Repo
Framework

Learning to Jointly Predict Ellipsis and Comparison Structures

Title Learning to Jointly Predict Ellipsis and Comparison Structures
Authors Bakhsh, Omid eh, Alexis Cornelia Wellwood, James Allen
Abstract
Tasks Question Answering, Reading Comprehension, Semantic Parsing, Sentiment Analysis, Structured Prediction
Published 2016-08-01
URL https://www.aclweb.org/anthology/K16-1007/
PDF https://www.aclweb.org/anthology/K16-1007
PWC https://paperswithcode.com/paper/learning-to-jointly-predict-ellipsis-and
Repo
Framework

Extracting token-level signals of syntactic processing from fMRI - with an application to PoS induction

Title Extracting token-level signals of syntactic processing from fMRI - with an application to PoS induction
Authors Joachim Bingel, Maria Barrett, Anders S{\o}gaard
Abstract
Tasks Sentence Compression
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1071/
PDF https://www.aclweb.org/anthology/P16-1071
PWC https://paperswithcode.com/paper/extracting-token-level-signals-of-syntactic
Repo
Framework
comments powered by Disqus