May 4, 2019

1995 words 10 mins read

Paper Group NANR 206

Disaster Analysis using User-Generated Weather Report. Automated Discourse Analysis of Narrations by Adolescents with Autistic Spectrum Disorder. SUper Team at SemEval-2016 Task 3: Building a Feature-Rich System for Community Question Answering. Asynchronous Parallel Greedy Coordinate Descent. Whose Nickname is This? Recognizing Politicians from Th …

Disaster Analysis using User-Generated Weather Report


Title	Disaster Analysis using User-Generated Weather Report
Authors	Yasunobu Asakura, Masatsugu Hangyo, Mamoru Komachi
Abstract	Information extraction from user-generated text has gained much attention with the growth of the Web.Disaster analysis using information from social media provides valuable, real-time, geolocation information for helping people caught up these in disasters. However, it is not convenient to analyze texts posted on social media because disaster keywords match any texts that contain words. For collecting posts about a disaster from social media, we need to develop a classifier to filter posts irrelevant to disasters. Moreover, because of the nature of social media, we can take advantage of posts that come with GPS information. However, a post does not always refer to an event occurring at the place where it has been posted. Therefore, we propose a new task of classifying whether a flood disaster occurred, in addition to predicting the geolocation of events from user-generated text. We report the annotation of the flood disaster corpus and develop a classifier to demonstrate the use of this corpus for disaster analysis.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3906/
PDF	https://www.aclweb.org/anthology/W16-3906
PWC	https://paperswithcode.com/paper/disaster-analysis-using-user-generated
Repo
Framework

Automated Discourse Analysis of Narrations by Adolescents with Autistic Spectrum Disorder


Title	Automated Discourse Analysis of Narrations by Adolescents with Autistic Spectrum Disorder
Authors	Michaela Regneri, Diane King
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-1901/
PDF	https://www.aclweb.org/anthology/W16-1901
PWC	https://paperswithcode.com/paper/automated-discourse-analysis-of-narrations-by
Repo
Framework

SUper Team at SemEval-2016 Task 3: Building a Feature-Rich System for Community Question Answering


Title	SUper Team at SemEval-2016 Task 3: Building a Feature-Rich System for Community Question Answering
Authors	Tsvetomila Mihaylova, Pepa Gencheva, Martin Boyanov, Ivana Yovcheva, Todor Mihaylov, Momchil Hardalov, Yasen Kiprov, Daniel Balchev, Ivan Koychev, Preslav Nakov, Ivelina Nikolova, Galia Angelova
Abstract
Tasks	Answer Selection, Community Question Answering, Question Answering, Question Similarity
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1129/
PDF	https://www.aclweb.org/anthology/S16-1129
PWC	https://paperswithcode.com/paper/super-team-at-semeval-2016-task-3-building-a
Repo
Framework

Asynchronous Parallel Greedy Coordinate Descent


Title	Asynchronous Parallel Greedy Coordinate Descent
Authors	Yang You, Xiangru Lian, Ji Liu, Hsiang-Fu Yu, Inderjit S. Dhillon, James Demmel, Cho-Jui Hsieh
Abstract	n this paper, we propose and study an Asynchronous parallel Greedy Coordinate Descent (Asy-GCD) algorithm for minimizing a smooth function with bounded constraints. At each iteration, workers asynchronously conduct greedy coordinate descent updates on a block of variables. In the first part of the paper, we analyze the theoretical behavior of Asy-GCD and prove a linear convergence rate. In the second part, we develop an efficient kernel SVM solver based on Asy-GCD in the shared memory multi-core setting. Since our algorithm is fully asynchronous—each core does not need to idle and wait for the other cores—the resulting algorithm enjoys good speedup and outperforms existing multi-core kernel SVM solvers including asynchronous stochastic coordinate descent and multi-core LIBSVM.
Tasks
Published	2016-12-01
URL	http://papers.nips.cc/paper/6070-asynchronous-parallel-greedy-coordinate-descent
PDF	http://papers.nips.cc/paper/6070-asynchronous-parallel-greedy-coordinate-descent.pdf
PWC	https://paperswithcode.com/paper/asynchronous-parallel-greedy-coordinate
Repo
Framework

Whose Nickname is This? Recognizing Politicians from Their Aliases


Title	Whose Nickname is This? Recognizing Politicians from Their Aliases
Authors	Wei-Chung Wang, Hung-Chen Chen, Zhi-Kai Ji, Hui-I Hsiao, Yu-Shian Chiu, Lun-Wei Ku
Abstract	Using aliases to refer to public figures is one way to make fun of people, to express sarcasm, or even to sidestep legal issues when expressing opinions on social media. However, linking an alias back to the real name is difficult, as it entails phonemic, graphemic, and semantic challenges. In this paper, we propose a phonemic-based approach and inject semantic information to align aliases with politicians{'} Chinese formal names. The proposed approach creates an HMM model for each name to model its phonemes and takes into account document-level pairwise mutual information to capture the semantic relations to the alias. In this work we also introduce two new datasets consisting of 167 phonemic pairs and 279 mixed pairs of aliases and formal names. Experimental results show that the proposed approach models both phonemic and semantic information and outperforms previous work on both the phonemic and mixed datasets with the best top-1 accuracies of 0.78 and 0.59 respectively.
Tasks	Entity Linking, Named Entity Recognition, Transliteration
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3910/
PDF	https://www.aclweb.org/anthology/W16-3910
PWC	https://paperswithcode.com/paper/whose-nickname-is-this-recognizing
Repo
Framework


Title	Towards Accurate Event Detection in Social Media: A Weakly Supervised Approach for Learning Implicit Event Indicators
Authors	Ajit Jain, Girish Kasiviswanathan, Ruihong Huang
Abstract	Accurate event detection in social media is very challenging because user generated contents are extremely noisy and sparse in content. Event indicators are generally words or phrases that act as a trigger that help us understand the semantics of the context they occur in. We present a weakly supervised approach that relies on using a single strong event indicator phrase as a seed to acquire a variety of additional event cues. We propose to leverage various types of implicit event indicators, such as props, actors and precursor events, to achieve precise event detection. We experimented with civil unrest events and show that the automatically learnt event indicators are effective in identifying specific types of events.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3911/
PDF	https://www.aclweb.org/anthology/W16-3911
PWC	https://paperswithcode.com/paper/towards-accurate-event-detection-in-social
Repo
Framework

Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA2016)


Title	Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA2016)
Authors
Abstract
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4900/
PDF	https://www.aclweb.org/anthology/W16-4900
PWC	https://paperswithcode.com/paper/proceedings-of-the-3rd-workshop-on-natural
Repo
Framework

Extract Domain-specific Paraphrase from Monolingual Corpus for Automatic Evaluation of Machine Translation


Title	Extract Domain-specific Paraphrase from Monolingual Corpus for Automatic Evaluation of Machine Translation
Authors	Lilin Zhang, Zhen Weng, Wenyan Xiao, Jianyi Wan, Zhiming Chen, Yiming Tan, Maoxi Li, Mingwen Wang
Abstract
Tasks	Domain Adaptation, Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2343/
PDF	https://www.aclweb.org/anthology/W16-2343
PWC	https://paperswithcode.com/paper/extract-domain-specific-paraphrase-from
Repo
Framework

Extracting Structured Scholarly Information from the Machine Translation Literature


Title	Extracting Structured Scholarly Information from the Machine Translation Literature
Authors	Eunsol Choi, Matic Horvat, Jonathan May, Kevin Knight, Daniel Marcu
Abstract	Understanding the experimental results of a scientific paper is crucial to understanding its contribution and to comparing it with related work. We introduce a structured, queryable representation for experimental results and a baseline system that automatically populates this representation. The representation can answer compositional questions such as: {`}Which are the best published results reported on the NIST 09 Chinese to English dataset?{''} and {`}What are the most important methods for speeding up phrase-based decoding?{''} Answering such questions usually involves lengthy literature surveys. Current machine reading for academic papers does not usually consider the actual experiments, but mostly focuses on understanding abstracts. We describe annotation work to create an initial hscientific paper; experimental results representationi corpus. The corpus is composed of 67 papers which were manually annotated with a structured representation of experimental results by domain experts. Additionally, we present a baseline algorithm that characterizes the difficulty of the inference task.
Tasks	Machine Translation, Reading Comprehension
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1067/
PDF	https://www.aclweb.org/anthology/L16-1067
PWC	https://paperswithcode.com/paper/extracting-structured-scholarly-information
Repo
Framework

Analysis of Twitter Data for Postmarketing Surveillance in Pharmacovigilance


Title	Analysis of Twitter Data for Postmarketing Surveillance in Pharmacovigilance
Authors	Julie Pain, Jessie Levacher, Adam Quinquenel, Anja Belz
Abstract	Postmarketing surveillance (PMS) has the vital aim to monitor effects of drugs after release for use by the general population, but suffers from under-reporting and limited coverage. Automatic methods for detecting drug effect reports, especially for social media, could vastly increase the scope of PMS. Very few automatic PMS methods are currently available, in particular for the messy text types encountered on Twitter. In this paper we describe first results for developing PMS methods specifically for tweets. We describe the corpus of 125,669 tweets we have created and annotated to train and test the tools. We find that generic tools perform well for tweet-level language identification and tweet-level sentiment analysis (both 0.94 F1-Score). For detection of effect mentions we are able to achieve 0.87 F1-Score, while effect-level adverse-vs.-beneficial analysis proves harder with an F1-Score of 0.64. Among other things, our results indicate that MetaMap semantic types provide a very promising basis for identifying drug effect mentions in tweets.
Tasks	Language Identification, Sentiment Analysis
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3914/
PDF	https://www.aclweb.org/anthology/W16-3914
PWC	https://paperswithcode.com/paper/analysis-of-twitter-data-for-postmarketing
Repo
Framework

Patterns of Terminological Variation in Post-editing and of Cognate Use in Machine Translation in Contrast to Human Translation


Title	Patterns of Terminological Variation in Post-editing and of Cognate Use in Machine Translation in Contrast to Human Translation
Authors	Oliver {\v{C}}ulo, Jean Nitzke
Abstract
Tasks	Machine Translation
Published	2016-01-01
URL	https://www.aclweb.org/anthology/W16-3401/
PDF	https://www.aclweb.org/anthology/W16-3401
PWC	https://paperswithcode.com/paper/patterns-of-terminological-variation-in-post
Repo
Framework

Normalized Spectral Map Synchronization


Title	Normalized Spectral Map Synchronization
Authors	Yanyao Shen, Qixing Huang, Nati Srebro, Sujay Sanghavi
Abstract	The algorithmic advancement of synchronizing maps is important in order to solve a wide range of practice problems with possible large-scale dataset. In this paper, we provide theoretical justifications for spectral techniques for the map synchronization problem, i.e., it takes as input a collection of objects and noisy maps estimated between pairs of objects, and outputs clean maps between all pairs of objects. We show that a simple normalized spectral method that projects the blocks of the top eigenvectors of a data matrix to the map space leads to surprisingly good results. As the noise is modelled naturally as random permutation matrix, this algorithm NormSpecSync leads to competing theoretical guarantees as state-of-the-art convex optimization techniques, yet it is much more efficient. We demonstrate the usefulness of our algorithm in a couple of applications, where it is optimal in both complexity and exactness among existing methods.
Tasks
Published	2016-12-01
URL	http://papers.nips.cc/paper/6128-normalized-spectral-map-synchronization
PDF	http://papers.nips.cc/paper/6128-normalized-spectral-map-synchronization.pdf
PWC	https://paperswithcode.com/paper/normalized-spectral-map-synchronization
Repo
Framework

Named Entity Recognition and Hashtag Decomposition to Improve the Classification of Tweets


Title	Named Entity Recognition and Hashtag Decomposition to Improve the Classification of Tweets
Authors	Billal Belainine, Alexs Fonseca, ro, Fatiha Sadat
Abstract	In social networks services like Twitter, users are overwhelmed with huge amount of social data, most of which are short, unstructured and highly noisy. Identifying accurate information from this huge amount of data is indeed a hard task. Classification of tweets into organized form will help the user to easily access these required information. Our first contribution relates to filtering parts of speech and preprocessing this kind of highly noisy and short data. Our second contribution concerns the named entity recognition (NER) in tweets. Thus, the adaptation of existing language tools for natural languages, noisy and not accurate language tweets, is necessary. Our third contribution involves segmentation of hashtags and a semantic enrichment using a combination of relations from WordNet, which helps the performance of our classification system, including disambiguation of named entities, abbreviations and acronyms. Graph theory is used to cluster the words extracted from WordNet and tweets, based on the idea of connected components. We test our automatic classification system with four categories: politics, economy, sports and the medical field. We evaluate and compare several automatic classification systems using part or all of the items described in our contributions and found that filtering by part of speech and named entity recognition dramatically increase the classification precision to 77.3 {%}. Moreover, a classification system incorporating segmentation of hashtags and semantic enrichment by two relations from WordNet, synonymy and hyperonymy, increase classification precision up to 83.4 {%}.
Tasks	Named Entity Recognition
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3915/
PDF	https://www.aclweb.org/anthology/W16-3915
PWC	https://paperswithcode.com/paper/named-entity-recognition-and-hashtag
Repo
Framework

Learning to Jointly Predict Ellipsis and Comparison Structures


Title	Learning to Jointly Predict Ellipsis and Comparison Structures
Authors	Bakhsh, Omid eh, Alexis Cornelia Wellwood, James Allen
Abstract
Tasks	Question Answering, Reading Comprehension, Semantic Parsing, Sentiment Analysis, Structured Prediction
Published	2016-08-01
URL	https://www.aclweb.org/anthology/K16-1007/
PDF	https://www.aclweb.org/anthology/K16-1007
PWC	https://paperswithcode.com/paper/learning-to-jointly-predict-ellipsis-and
Repo
Framework

Extracting token-level signals of syntactic processing from fMRI - with an application to PoS induction


Title	Extracting token-level signals of syntactic processing from fMRI - with an application to PoS induction
Authors	Joachim Bingel, Maria Barrett, Anders S{\o}gaard
Abstract
Tasks	Sentence Compression
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1071/
PDF	https://www.aclweb.org/anthology/P16-1071
PWC	https://paperswithcode.com/paper/extracting-token-level-signals-of-syntactic
Repo
Framework