October 15, 2019

1942 words 10 mins read

Paper Group NANR 155

Task-driven Webpage Saliency. Tutorial: De-mystifying Neural MT. Towards an argumentative content search engine using weak supervision. Learning from Measurements in Crowdsourcing Models: Inferring Ground Truth from Diverse Annotation Types. The Metalogue Debate Trainee Corpus: Data Collection and Annotations. Argumentation Synthesis following Rhet …

Task-driven Webpage Saliency


Title	Task-driven Webpage Saliency
Authors	Quanlong Zheng, Jianbo Jiao, Ying Cao, Rynson W.H. Lau
Abstract	In this paper, we present an end-to-end learning framework for predicting task-driven visual saliency on webpages. Given a webpage, we propose a convolutional neural network to predict where people look at it under different task conditions. Inspired by the observation that given a specific task, human attention is strongly correlated with certain semantic components on a webpage (e.g., images, buttons and input boxes), our network explicitly disentangles saliency prediction into two independent sub-tasks: task-specific attention shift prediction and task-free saliency prediction. The task-specific branch estimates task-driven attention shift over a webpage from its semantic components, while the task-free branch infers visual saliency induced by visual features of the webpage. The outputs of the two branches are combined to produce the final prediction. Such a task decomposition framework allows us to efficiently learn our model from a small-scale task-driven saliency dataset with sparse labels (captured under a single task condition). Experimental results show that our method outperforms the baselines and prior works, achieving state-of-the-art performance on a newly collected benchmark dataset for task-driven webpage saliency detection.
Tasks	Saliency Detection, Saliency Prediction
Published	2018-09-01
URL	http://openaccess.thecvf.com/content_ECCV_2018/html/Quanlong_Zheng_Task-driven_Webpage_Saliency_ECCV_2018_paper.html
PDF	http://openaccess.thecvf.com/content_ECCV_2018/papers/Quanlong_Zheng_Task-driven_Webpage_Saliency_ECCV_2018_paper.pdf
PWC	https://paperswithcode.com/paper/task-driven-webpage-saliency
Repo
Framework

Tutorial: De-mystifying Neural MT


Title	Tutorial: De-mystifying Neural MT
Authors	Dragos Munteanu, Ling Tsou
Abstract
Tasks	Machine Translation
Published	2018-03-01
URL	https://www.aclweb.org/anthology/W18-1924/
PDF	https://www.aclweb.org/anthology/W18-1924
PWC	https://paperswithcode.com/paper/tutorial-de-mystifying-neural-mt
Repo
Framework

Towards an argumentative content search engine using weak supervision


Title	Towards an argumentative content search engine using weak supervision
Authors	Ran Levy, Ben Bogin, Shai Gretz, Ranit Aharonov, Noam Slonim
Abstract	Searching for sentences containing claims in a large text corpus is a key component in developing an argumentative content search engine. Previous works focused on detecting claims in a small set of documents or within documents enriched with argumentative content. However, pinpointing relevant claims in massive unstructured corpora, received little attention. A step in this direction was taken in (Levy et al. 2017), where the authors suggested using a weak signal to develop a relatively strict query for claim{–}sentence detection. Here, we leverage this work to define weak signals for training DNNs to obtain significantly greater performance. This approach allows to relax the query and increase the potential coverage. Our results clearly indicate that the system is able to successfully generalize from the weak signal, outperforming previously reported results in terms of both precision and coverage. Finally, we adapt our system to solve a recent argument mining task of identifying argumentative sentences in Web texts retrieved from heterogeneous sources, and obtain F1 scores comparable to the supervised baseline.
Tasks	Argument Mining, Decision Making
Published	2018-08-01
URL	https://www.aclweb.org/anthology/C18-1176/
PDF	https://www.aclweb.org/anthology/C18-1176
PWC	https://paperswithcode.com/paper/towards-an-argumentative-content-search
Repo
Framework

Learning from Measurements in Crowdsourcing Models: Inferring Ground Truth from Diverse Annotation Types


Title	Learning from Measurements in Crowdsourcing Models: Inferring Ground Truth from Diverse Annotation Types
Authors	Paul Felt, Eric Ringger, Jordan Boyd-Graber, Kevin Seppi
Abstract	Annotated corpora enable supervised machine learning and data analysis. To reduce the cost of manual annotation, tasks are often assigned to internet workers whose judgments are reconciled by crowdsourcing models. We approach the problem of crowdsourcing using a framework for learning from rich prior knowledge, and we identify a family of crowdsourcing models with the novel ability to combine annotations with differing structures: e.g., document labels and word labels. Annotator judgments are given in the form of the predicted expected value of measurement functions computed over annotations and the data, unifying annotation models. Our model, a specific instance of this framework, compares favorably with previous work. Furthermore, it enables active sample selection, jointly selecting annotator, data item, and annotation structure to reduce annotation effort.
Tasks
Published	2018-08-01
URL	https://www.aclweb.org/anthology/C18-1144/
PDF	https://www.aclweb.org/anthology/C18-1144
PWC	https://paperswithcode.com/paper/learning-from-measurements-in-crowdsourcing
Repo
Framework

The Metalogue Debate Trainee Corpus: Data Collection and Annotations


Title	The Metalogue Debate Trainee Corpus: Data Collection and Annotations
Authors	Volha Petukhova, Andrei Malchanau, Youssef Oualil, Dietrich Klakow, Saturnino Luz, Fasih Haider, Nick Campbell, Dimitris Koryzis, Dimitris Spiliotopoulos, Pierre Albert, Nicklas Linz, Alex, Jan ersson
Abstract
Tasks
Published	2018-05-01
URL	https://www.aclweb.org/anthology/L18-1120/
PDF	https://www.aclweb.org/anthology/L18-1120
PWC	https://paperswithcode.com/paper/the-metalogue-debate-trainee-corpus-data
Repo
Framework

Argumentation Synthesis following Rhetorical Strategies


Title	Argumentation Synthesis following Rhetorical Strategies
Authors	Henning Wachsmuth, Manfred Stede, Roxanne El Baff, Khalid Al-Khatib, Maria Skeppstedt, Benno Stein
Abstract	Persuasion is rarely achieved through a loose set of arguments alone. Rather, an effective delivery of arguments follows a rhetorical strategy, combining logical reasoning with appeals to ethics and emotion. We argue that such a strategy means to select, arrange, and phrase a set of argumentative discourse units. In this paper, we model rhetorical strategies for the computational synthesis of effective argumentation. In a study, we let 26 experts synthesize argumentative texts with different strategies for 10 topics. We find that the experts agree in the selection significantly more when following the same strategy. While the texts notably vary for different strategies, especially their arrangement remains stable. The results suggest that our model enables a strategical synthesis.
Tasks	Argument Mining
Published	2018-08-01
URL	https://www.aclweb.org/anthology/C18-1318/
PDF	https://www.aclweb.org/anthology/C18-1318
PWC	https://paperswithcode.com/paper/argumentation-synthesis-following-rhetorical
Repo
Framework

Detecting Simultaneously Chinese Grammar Errors Based on a BiLSTM-CRF Model


Title	Detecting Simultaneously Chinese Grammar Errors Based on a BiLSTM-CRF Model
Authors	Yajun Liu, Hongying Zan, Mengjie Zhong, Hongchao Ma
Abstract	In the process of learning and using Chinese, many learners of Chinese as foreign language(CFL) may have grammar errors due to negative migration of their native languages. This paper introduces our system that can simultaneously diagnose four types of grammatical errors including redundant (R), missing (M), selection (S), disorder (W) in NLPTEA-5 shared task. We proposed a Bidirectional LSTM CRF neural network (BiLSTM-CRF) that combines BiLSTM and CRF without hand-craft features for Chinese Grammatical Error Diagnosis (CGED). Evaluation includes three levels, which are detection level, identification level and position level. At the detection level and identification level, our system got the third recall scores, and achieved good F1 values.
Tasks
Published	2018-07-01
URL	https://www.aclweb.org/anthology/W18-3727/
PDF	https://www.aclweb.org/anthology/W18-3727
PWC	https://paperswithcode.com/paper/detecting-simultaneously-chinese-grammar
Repo
Framework

HGSGNLP at IEST 2018: An Ensemble of Machine Learning and Deep Neural Architectures for Implicit Emotion Classification in Tweets


Title	HGSGNLP at IEST 2018: An Ensemble of Machine Learning and Deep Neural Architectures for Implicit Emotion Classification in Tweets
Authors	Wenting Wang
Abstract	This paper describes our system designed for the WASSA-2018 Implicit Emotion Shared Task (IEST). The task is to predict the emotion category expressed in a tweet by removing the terms \textit{angry}, \textit{afraid}, \textit{happy}, \textit{sad}, \textit{surprised}, \textit{disgusted} and their synonyms. Our final submission is an ensemble of one supervised learning model and three deep neural network based models, where each model approaches the problem from essentially different directions. Our system achieves the macro F1 score of 65.8{%}, which is a 5.9{%} performance improvement over the baseline and is ranked 12 out of 30 participating teams.
Tasks	Common Sense Reasoning, Emotion Classification, Emotion Recognition, Word Embeddings
Published	2018-10-01
URL	https://www.aclweb.org/anthology/W18-6229/
PDF	https://www.aclweb.org/anthology/W18-6229
PWC	https://paperswithcode.com/paper/hgsgnlp-at-iest-2018-an-ensemble-of-machine
Repo
Framework

Decomposing phonological transformations in serial derivations


Title	Decomposing phonological transformations in serial derivations
Authors	Andrew Lamont
Abstract
Tasks
Published	2018-01-01
URL	https://www.aclweb.org/anthology/W18-0310/
PDF	https://www.aclweb.org/anthology/W18-0310
PWC	https://paperswithcode.com/paper/decomposing-phonological-transformations-in
Repo
Framework

Using Rhetorical Topics for Automatic Summarization


Title	Using Rhetorical Topics for Automatic Summarization
Authors	Natalie M. Schrimpf
Abstract
Tasks
Published	2018-01-01
URL	https://www.aclweb.org/anthology/W18-0313/
PDF	https://www.aclweb.org/anthology/W18-0313
PWC	https://paperswithcode.com/paper/using-rhetorical-topics-for-automatic
Repo
Framework

Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes


Title	Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes
Authors	Sk Muralidhar, a, Laurent Nguyen, Daniel Gatica-Perez
Abstract	Automatic hirability prediction from video resumes is gaining increasing attention in both psychology and computing. Most existing works have investigated hirability from the perspective of nonverbal behavior, with verbal content receiving little interest. In this study, we leverage the advances in deep-learning based text representation techniques (like word embedding) in natural language processing to investigate the relationship between verbal content and perceived hirability ratings. To this end, we use 292 conversational video resumes from YouTube, develop a computational framework to automatically extract various representations of verbal content, and evaluate them in a regression task. We obtain a best performance of R{\mbox{$^2$}} = 0.23 using GloVe, and R{\mbox{$^2$}} = 0.22 using Word2Vec representations for manual and automatically transcribed texts respectively. Our inference results indicate the feasibility of using deep learning based verbal content representation in inferring hirability scores from online conversational video resumes.
Tasks	Speech Recognition
Published	2018-10-01
URL	https://www.aclweb.org/anthology/W18-6247/
PDF	https://www.aclweb.org/anthology/W18-6247
PWC	https://paperswithcode.com/paper/words-worth-verbal-content-and-hirability
Repo
Framework

Semi-automatic Korean FrameNet Annotation over KAIST Treebank


Title	Semi-automatic Korean FrameNet Annotation over KAIST Treebank
Authors	Younggyun Hahm, Jiseong Kim, Sunggoo Kwon, Key-Sun Choi
Abstract
Tasks	Dependency Parsing, Morphological Analysis, Part-Of-Speech Tagging, Question Answering, Semantic Role Labeling
Published	2018-05-01
URL	https://www.aclweb.org/anthology/L18-1013/
PDF	https://www.aclweb.org/anthology/L18-1013
PWC	https://paperswithcode.com/paper/semi-automatic-korean-framenet-annotation
Repo
Framework

Identification of Differences between Dutch Language Varieties with the VarDial2018 Dutch-Flemish Subtitle Data


Title	Identification of Differences between Dutch Language Varieties with the VarDial2018 Dutch-Flemish Subtitle Data
Authors	Hans van Halteren, Nelleke Oostdijk
Abstract	With the goal of discovering differences between Belgian and Netherlandic Dutch, we participated as Team Taurus in the Dutch-Flemish Subtitles task of VarDial2018. We used a rather simple marker-based method, but a wide range of features, including lexical, lexico-syntactic and syntactic ones, and achieved a second position in the ranking. Inspection of highly distin-guishing features did point towards differences between the two language varieties, but because of the nature of the experimental data, we have to treat our observations as very tentative and in need of further investigation.
Tasks	Text Classification
Published	2018-08-01
URL	https://www.aclweb.org/anthology/W18-3923/
PDF	https://www.aclweb.org/anthology/W18-3923
PWC	https://paperswithcode.com/paper/identification-of-differences-between-dutch
Repo
Framework

Scalable Deletion-Robust Submodular Maximization: Data Summarization with Privacy and Fairness Constraints


Title	Scalable Deletion-Robust Submodular Maximization: Data Summarization with Privacy and Fairness Constraints
Authors	Ehsan Kazemi, Morteza Zadimoghaddam, Amin Karbasi
Abstract	Can we efficiently extract useful information from a large user-generated dataset while protecting the privacy of the users and/or ensuring fairness in representation? We cast this problem as an instance of a deletion-robust submodular maximization where part of the data may be deleted or masked due to privacy concerns or fairness criteria. We propose the first memory-efficient centralized, streaming, and distributed methods with constant-factor approximation guarantees against any number of adversarial deletions. We extensively evaluate the performance of our algorithms on real-world applications, including (i) Uber-pick up locations with location privacy constraints; (ii) feature selection with fairness constraints for income prediction and crime rate prediction; and (iii) robust to deletion summarization of census data, consisting of 2,458,285 feature vectors. Our experiments show that our solution is robust against even $80%$ of data deletion.
Tasks	Data Summarization, Feature Selection
Published	2018-07-01
URL	https://icml.cc/Conferences/2018/Schedule?showEvent=1927
PDF	http://proceedings.mlr.press/v80/kazemi18a/kazemi18a.pdf
PWC	https://paperswithcode.com/paper/scalable-deletion-robust-submodular
Repo
Framework

Twist Bytes - German Dialect Identification with Data Mining Optimization


Title	Twist Bytes - German Dialect Identification with Data Mining Optimization
Authors	Fern Benites, o, Ralf Grubenmann, Pius von D{"a}niken, Dirk von Gr{"u}nigen, Jan Deriu, Mark Cieliebak
Abstract	We describe our approaches used in the German Dialect Identification (GDI) task at the VarDial Evaluation Campaign 2018. The goal was to identify to which out of four dialects spoken in German speaking part of Switzerland a sentence belonged to. We adopted two different meta classifier approaches and used some data mining insights to improve the preprocessing and the meta classifier parameters. Especially, we focused on using different feature extraction methods and how to combine them, since they influenced very differently the performance of the system. Our system achieved second place out of 8 teams, with a macro averaged F-1 of 64.6{%}.
Tasks
Published	2018-08-01
URL	https://www.aclweb.org/anthology/W18-3925/
PDF	https://www.aclweb.org/anthology/W18-3925
PWC	https://paperswithcode.com/paper/twist-bytes-german-dialect-identification
Repo
Framework