July 26, 2019

2198 words 11 mins read

Paper Group NANR 106

Cross-Lingual Parser Selection for Low-Resource Languages. Fake News Detection Through Multi-Perspective Speaker Profiles. Leveraging Eventive Information for Better Metaphor Detection and Classification. Proceedings of the MultiLing 2017 Workshop on Summarization and Summary Evaluation Across Source Types and Genres. How Would You Say It? Elicitin …

Cross-Lingual Parser Selection for Low-Resource Languages


Title	Cross-Lingual Parser Selection for Low-Resource Languages
Authors	{\v{Z}}eljko Agi{'c}
Abstract
Tasks	Cross-Lingual Transfer, Dependency Parsing, Tokenization, Transfer Learning
Published	2017-05-01
URL	https://www.aclweb.org/anthology/W17-0401/
PDF	https://www.aclweb.org/anthology/W17-0401
PWC	https://paperswithcode.com/paper/cross-lingual-parser-selection-for-low
Repo
Framework

Fake News Detection Through Multi-Perspective Speaker Profiles


Title	Fake News Detection Through Multi-Perspective Speaker Profiles
Authors	Yunfei Long, Qin Lu, Rong Xiang, Minglei Li, Chu-Ren Huang
Abstract	Automatic fake news detection is an important, yet very challenging topic. Traditional methods using lexical features have only very limited success. This paper proposes a novel method to incorporate speaker profiles into an attention based LSTM model for fake news detection. Speaker profiles contribute to the model in two ways. One is to include them in the attention model. The other includes them as additional input data. By adding speaker profiles such as party affiliation, speaker title, location and credit history, our model outperforms the state-of-the-art method by 14.5{%} in accuracy using a benchmark fake news detection dataset. This proves that speaker profiles provide valuable information to validate the credibility of news articles.
Tasks	Fake News Detection
Published	2017-11-01
URL	https://www.aclweb.org/anthology/I17-2043/
PDF	https://www.aclweb.org/anthology/I17-2043
PWC	https://paperswithcode.com/paper/fake-news-detection-through-multi-perspective
Repo
Framework

Leveraging Eventive Information for Better Metaphor Detection and Classification


Title	Leveraging Eventive Information for Better Metaphor Detection and Classification
Authors	I-Hsuan Chen, Yunfei Long, Qin Lu, Chu-Ren Huang
Abstract	Metaphor detection has been both challenging and rewarding in natural language processing applications. This study offers a new approach based on eventive information in detecting metaphors by leveraging the Chinese writing system, which is a culturally bound ontological system organized according to the basic concepts represented by radicals. As such, the information represented is available in all Chinese text without pre-processing. Since metaphor detection is another culturally based conceptual representation, we hypothesize that sub-textual information can facilitate the identification and classification of the types of metaphoric events denoted in Chinese text. We propose a set of syntactic conditions crucial to event structures to improve the model based on the classification of radical groups. With the proposed syntactic conditions, the model achieves a performance of 0.8859 in terms of F-scores, making 1.7{%} of improvement than the same classifier with only Bag-of-word features. Results show that eventive information can improve the effectiveness of metaphor detection. Event information is rooted in every language, and thus this approach has a high potential to be applied to metaphor detection in other languages.
Tasks
Published	2017-08-01
URL	https://www.aclweb.org/anthology/K17-1006/
PDF	https://www.aclweb.org/anthology/K17-1006
PWC	https://paperswithcode.com/paper/leveraging-eventive-information-for-better
Repo
Framework

Proceedings of the MultiLing 2017 Workshop on Summarization and Summary Evaluation Across Source Types and Genres


Title	Proceedings of the MultiLing 2017 Workshop on Summarization and Summary Evaluation Across Source Types and Genres
Authors
Abstract
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-1000/
PDF	https://www.aclweb.org/anthology/W17-1000
PWC	https://paperswithcode.com/paper/proceedings-of-the-multiling-2017-workshop-on
Repo
Framework

How Would You Say It? Eliciting Lexically Diverse Dialogue for Supervised Semantic Parsing


Title	How Would You Say It? Eliciting Lexically Diverse Dialogue for Supervised Semantic Parsing
Authors	Ravich, Abhilasha er, Thomas Manzini, Matthias Grabmair, Graham Neubig, Jonathan Francis, Eric Nyberg
Abstract	Building dialogue interfaces for real-world scenarios often entails training semantic parsers starting from zero examples. How can we build datasets that better capture the variety of ways users might phrase their queries, and what queries are actually realistic? Wang et al. (2015) proposed a method to build semantic parsing datasets by generating canonical utterances using a grammar and having crowdworkers paraphrase them into natural wording. A limitation of this approach is that it induces bias towards using similar language as the canonical utterances. In this work, we present a methodology that elicits meaningful and lexically diverse queries from users for semantic parsing tasks. Starting from a seed lexicon and a generative grammar, we pair logical forms with mixed text-image representations and ask crowdworkers to paraphrase and confirm the plausibility of the queries that they generated. We use this method to build a semantic parsing dataset from scratch for a dialog agent in a smart-home simulation. We find evidence that this dataset, which we have named SmartHome, is demonstrably more lexically diverse and difficult to parse than existing domain-specific semantic parsing datasets.
Tasks	Semantic Parsing
Published	2017-08-01
URL	https://www.aclweb.org/anthology/W17-5545/
PDF	https://www.aclweb.org/anthology/W17-5545
PWC	https://paperswithcode.com/paper/how-would-you-say-it-eliciting-lexically
Repo
Framework

Robust Gaussian Graphical Model Estimation with Arbitrary Corruption


Title	Robust Gaussian Graphical Model Estimation with Arbitrary Corruption
Authors	Lingxiao Wang, Quanquan Gu
Abstract	We study the problem of estimating the high-dimensional Gaussian graphical model where the data are arbitrarily corrupted. We propose a robust estimator for the sparse precision matrix in the high-dimensional regime. At the core of our method is a robust covariance matrix estimator, which is based on truncated inner product. We establish the statistical guarantee of our estimator on both estimation error and model selection consistency. In particular, we show that provided that the number of corrupted samples $n_2$ for each variable satisfies $n_2 \lesssim \sqrt{n}/\sqrt{\log d}$, where $n$ is the sample size and $d$ is the number of variables, the proposed robust precision matrix estimator attains the same statistical rate as the standard estimator for Gaussian graphical models. In addition, we propose a hypothesis testing procedure to assess the uncertainty of our robust estimator. We demonstrate the effectiveness of our method through extensive experiments on both synthetic data and real-world genomic data.
Tasks	Model Selection
Published	2017-08-01
URL	https://icml.cc/Conferences/2017/Schedule?showEvent=718
PDF	http://proceedings.mlr.press/v70/wang17d/wang17d.pdf
PWC	https://paperswithcode.com/paper/robust-gaussian-graphical-model-estimation
Repo
Framework

Common Round: Application of Language Technologies to Large-Scale Web Debates


Title	Common Round: Application of Language Technologies to Large-Scale Web Debates
Authors	Hans Uszkoreit, Aleks Gabryszak, ra, Leonhard Hennig, J{"o}rg Steffen, Renlong Ai, Stephan Busemann, Jon Dehdari, Josef van Genabith, Georg Heigold, Nils Rethmeier, Raphael Rubino, Sven Schmeier, Philippe Thomas, He Wang, Feiyu Xu
Abstract	Web debates play an important role in enabling broad participation of constituencies in social, political and economic decision-taking. However, it is challenging to organize, structure, and navigate a vast number of diverse argumentations and comments collected from many participants over a long time period. In this paper we demonstrate Common Round, a next generation platform for large-scale web debates, which provides functions for eliciting the semantic content and structures from the contributions of participants. In particular, Common Round applies language technologies for the extraction of semantic essence from textual input, aggregation of the formulated opinions and arguments. The platform also provides a cross-lingual access to debates using machine translation.
Tasks	Decision Making, Machine Translation
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-3002/
PDF	https://www.aclweb.org/anthology/E17-3002
PWC	https://paperswithcode.com/paper/common-round-application-of-language
Repo
Framework

Predicting Depression for Japanese Blog Text


Title	Predicting Depression for Japanese Blog Text
Authors	Misato Hiraga
Abstract
Tasks	Feature Engineering, Text Classification
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-3018/
PDF	https://www.aclweb.org/anthology/P17-3018
PWC	https://paperswithcode.com/paper/predicting-depression-for-japanese-blog-text
Repo
Framework

Overview of the 4th Workshop on Asian Translation


Title	Overview of the 4th Workshop on Asian Translation
Authors	Toshiaki Nakazawa, Shohei Higashiyama, Chenchen Ding, Hideya Mino, Isao Goto, Hideto Kazawa, Yusuke Oda, Graham Neubig, Sadao Kurohashi
Abstract	This paper presents the results of the shared tasks from the 4th workshop on Asian translation (WAT2017) including J↔E, J↔C scientific paper translation subtasks, C↔J, K↔J, E↔J patent translation subtasks, H↔E mixed domain subtasks, J↔E newswire subtasks and J↔E recipe subtasks. For the WAT2017, 12 institutions participated in the shared tasks. About 300 translation results have been submitted to the automatic evaluation server, and selected submissions were manually evaluated.
Tasks	Machine Translation
Published	2017-11-01
URL	https://www.aclweb.org/anthology/W17-5701/
PDF	https://www.aclweb.org/anthology/W17-5701
PWC	https://paperswithcode.com/paper/overview-of-the-4th-workshop-on-asian
Repo
Framework

Proceedings of the Third Workshop on Computational Linguistics for Uralic Languages


Title	Proceedings of the Third Workshop on Computational Linguistics for Uralic Languages
Authors
Abstract
Tasks
Published	2017-01-01
URL	https://www.aclweb.org/anthology/W17-0600/
PDF	https://www.aclweb.org/anthology/W17-0600
PWC	https://paperswithcode.com/paper/proceedings-of-the-third-workshop-on-3
Repo
Framework

Countering Feedback Delays in Multi-Agent Learning


Title	Countering Feedback Delays in Multi-Agent Learning
Authors	Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Peter W. Glynn, Claire Tomlin
Abstract	We consider a model of game-theoretic learning based on online mirror descent (OMD) with asynchronous and delayed feedback information. Instead of focusing on specific games, we consider a broad class of continuous games defined by the general equilibrium stability notion, which we call λ-variational stability. Our first contribution is that, in this class of games, the actual sequence of play induced by OMD-based learning converges to Nash equilibria provided that the feedback delays faced by the players are synchronous and bounded. Subsequently, to tackle fully decentralized, asynchronous environments with (possibly) unbounded delays between actions and feedback, we propose a variant of OMD which we call delayed mirror descent (DMD), and which relies on the repeated leveraging of past information. With this modification, the algorithm converges to Nash equilibria with no feedback synchronicity assumptions and even when the delays grow superlinearly relative to the horizon of play.
Tasks
Published	2017-12-01
URL	http://papers.nips.cc/paper/7197-countering-feedback-delays-in-multi-agent-learning
PDF	http://papers.nips.cc/paper/7197-countering-feedback-delays-in-multi-agent-learning.pdf
PWC	https://paperswithcode.com/paper/countering-feedback-delays-in-multi-agent
Repo
Framework

The Importance of Communities for Learning to Influence


Title	The Importance of Communities for Learning to Influence
Authors	Eric Balkanski, Nicole Immorlica, Yaron Singer
Abstract	We consider the canonical problem of influence maximization in social networks. Since the seminal work of Kempe, Kleinberg, and Tardos there have been two, largely disjoint efforts on this problem. The first studies the problem associated with learning the generative model that produces cascades, and the second focuses on the algorithmic challenge of identifying a set of influencers, assuming the generative model is known. Recent results on learning and optimization imply that in general, if the generative model is not known but rather learned from training data, no algorithm for influence maximization can yield a constant factor approximation guarantee using polynomially-many samples, drawn from any distribution. In this paper we describe a simple algorithm for maximizing influence from training data. The main idea behind the algorithm is to leverage the strong community structure of social networks and identify a set of individuals who are influentials but whose communities have little overlap. Although in general, the approximation guarantee of such an algorithm is unbounded, we show that this algorithm performs well experimentally. To analyze its performance, we prove this algorithm obtains a constant factor approximation guarantee on graphs generated through the stochastic block model, traditionally used to model networks with community structure.
Tasks
Published	2017-12-01
URL	http://papers.nips.cc/paper/7168-the-importance-of-communities-for-learning-to-influence
PDF	http://papers.nips.cc/paper/7168-the-importance-of-communities-for-learning-to-influence.pdf
PWC	https://paperswithcode.com/paper/the-importance-of-communities-for-learning-to
Repo
Framework

Proceedings of the First Workshop on Language technology for Digital Humanities in Central and (South-)Eastern Europe


Title	Proceedings of the First Workshop on Language technology for Digital Humanities in Central and (South-)Eastern Europe
Authors	Anca Dinu, Petya Osenova, Cristina Vertan
Abstract
Tasks
Published	2017-09-01
URL	https://www.aclweb.org/anthology/papers/W17-8100/w17-8100
PDF	https://www.aclweb.org/anthology/W17-8100
PWC	https://paperswithcode.com/paper/proceedings-of-the-first-workshop-on-language
Repo
Framework

Improving Japanese-to-English Neural Machine Translation by Paraphrasing the Target Language


Title	Improving Japanese-to-English Neural Machine Translation by Paraphrasing the Target Language
Authors	Yuuki Sekizawa, Tomoyuki Kajiwara, Mamoru Komachi
Abstract	Neural machine translation (NMT) produces sentences that are more fluent than those produced by statistical machine translation (SMT). However, NMT has a very high computational cost because of the high dimensionality of the output layer. Generally, NMT restricts the size of vocabulary, which results in infrequent words being treated as out-of-vocabulary (OOV) and degrades the performance of the translation. In evaluation, we achieved a statistically significant BLEU score improvement of 0.55-0.77 over the baselines including the state-of-the-art method.
Tasks	Machine Translation
Published	2017-11-01
URL	https://www.aclweb.org/anthology/W17-5703/
PDF	https://www.aclweb.org/anthology/W17-5703
PWC	https://paperswithcode.com/paper/improving-japanese-to-english-neural-machine
Repo
Framework

A Comprehensive Analysis of Bilingual Lexicon Induction


Title	A Comprehensive Analysis of Bilingual Lexicon Induction
Authors	Ann Irvine, Chris Callison-Burch
Abstract	Bilingual lexicon induction is the task of inducing word translations from monolingual corpora in two languages. In this article we present the most comprehensive analysis of bilingual lexicon induction to date. We present experiments on a wide range of languages and data sizes. We examine translation into English from 25 foreign languages: Albanian, Azeri, Bengali, Bosnian, Bulgarian, Cebuano, Gujarati, Hindi, Hungarian, Indonesian, Latvian, Nepali, Romanian, Serbian, Slovak, Somali, Spanish, Swedish, Tamil, Telugu, Turkish, Ukrainian, Uzbek, Vietnamese, and Welsh. We analyze the behavior of bilingual lexicon induction on low-frequency words, rather than testing solely on high-frequency words, as previous research has done. Low-frequency words are more relevant to statistical machine translation, where systems typically lack translations of rare words that fall outside of their training data. We systematically explore a wide range of features and phenomena that affect the quality of the translations discovered by bilingual lexicon induction. We provide illustrative examples of the highest ranking translations for orthogonal signals of translation equivalence like contextual similarity and temporal similarity. We analyze the effects of frequency and burstiness, and the sizes of the seed bilingual dictionaries and the monolingual training corpora. Additionally, we introduce a novel discriminative approach to bilingual lexicon induction. Our discriminative model is capable of combining a wide variety of features that individually provide only weak indications of translation equivalence. When feature weights are discriminatively set, these signals produce dramatically higher translation quality than previous approaches that combined signals in an unsupervised fashion (e.g., using minimum reciprocal rank). We also directly compare our model{'}s performance against a sophisticated generative approach, the matching canonical correlation analysis (MCCA) algorithm used by Haghighi et al. (2008). Our algorithm achieves an accuracy of 42{%} versus MCCA{'}s 15{%}.
Tasks	Machine Translation
Published	2017-06-01
URL	https://www.aclweb.org/anthology/J17-2001/
PDF	https://www.aclweb.org/anthology/J17-2001
PWC	https://paperswithcode.com/paper/a-comprehensive-analysis-of-bilingual-lexicon
Repo
Framework