July 26, 2019

2145 words 11 mins read

Paper Group NANR 102

Paper Group NANR 102

Predictor-Estimator using Multilevel Task Learning with Stack Propagation for Neural Quality Estimation. Arabic POS Tagging: Don’t Abandon Feature Engineering Just Yet. Word vectors, reuse, and replicability: Towards a community repository of large-text resources. Using English Dictionaries to generate Commonsense Knowledge in Natural Language. A c …

Predictor-Estimator using Multilevel Task Learning with Stack Propagation for Neural Quality Estimation

Title Predictor-Estimator using Multilevel Task Learning with Stack Propagation for Neural Quality Estimation
Authors Hyun Kim, Jong-Hyeok Lee, Seung-Hoon Na
Abstract
Tasks Language Modelling, Machine Translation, Part-Of-Speech Tagging
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-4763/
PDF https://www.aclweb.org/anthology/W17-4763
PWC https://paperswithcode.com/paper/predictor-estimator-using-multilevel-task
Repo
Framework

Arabic POS Tagging: Don’t Abandon Feature Engineering Just Yet

Title Arabic POS Tagging: Don’t Abandon Feature Engineering Just Yet
Authors Kareem Darwish, Hamdy Mubarak, Ahmed Abdelali, Mohamed Eldesouki
Abstract This paper focuses on comparing between using Support Vector Machine based ranking (SVM-Rank) and Bidirectional Long-Short-Term-Memory (bi-LSTM) neural-network based sequence labeling in building a state-of-the-art Arabic part-of-speech tagging system. Using SVM-Rank leads to state-of-the-art results, but with a fair amount of feature engineering. Using bi-LSTM, particularly when combined with word embeddings, may lead to competitive POS-tagging results by automatically deducing latent linguistic features. However, we show that augmenting bi-LSTM sequence labeling with some of the features that we used for the SVM-Rank based tagger yields to further improvements. We also show that gains that realized by using embeddings may not be additive with the gains achieved by the features. We are open-sourcing both the SVM-Rank and the bi-LSTM based systems for free.
Tasks Feature Engineering, Named Entity Recognition, Part-Of-Speech Tagging, Word Embeddings
Published 2017-04-01
URL https://www.aclweb.org/anthology/W17-1316/
PDF https://www.aclweb.org/anthology/W17-1316
PWC https://paperswithcode.com/paper/arabic-pos-tagging-dont-abandon-feature
Repo
Framework

Word vectors, reuse, and replicability: Towards a community repository of large-text resources

Title Word vectors, reuse, and replicability: Towards a community repository of large-text resources
Authors Murhaf Fares, Andrey Kutuzov, Stephan Oepen, Erik Velldal
Abstract
Tasks Semantic Textual Similarity, Word Embeddings
Published 2017-05-01
URL https://www.aclweb.org/anthology/W17-0237/
PDF https://www.aclweb.org/anthology/W17-0237
PWC https://paperswithcode.com/paper/word-vectors-reuse-and-replicability-towards
Repo
Framework

Using English Dictionaries to generate Commonsense Knowledge in Natural Language

Title Using English Dictionaries to generate Commonsense Knowledge in Natural Language
Authors Ali Almiman, Allan Ramsay
Abstract This paper presents an approach to generating common sense knowledge written in raw English sentences. Instead of using public contributors to feed this source, this system chose to employ expert linguistics decisions by using definitions from English dictionaries. Because the definitions in English dictionaries are not prepared to be transformed into inference rules, some preprocessing steps were taken to turn each relation of word:definition in dictionaries into an inference rule in the form left-hand side ⇒ right-hand side. In this paper, we applied this mechanism using two dictionaries: The MacMillan Dictionary and WordNet definitions. A random set of 200 inference rules were extracted equally from the two dictionaries, and then we used human judgment as to whether these rules are {`}True{'} or not. For the MacMillan Dictionary the precision reaches 0.74 with 0.508 recall, and the WordNet definitions resulted in 0.73 precision with 0.09 recall. |
Tasks Common Sense Reasoning, Natural Language Inference
Published 2017-09-01
URL https://www.aclweb.org/anthology/R17-1009/
PDF https://doi.org/10.26615/978-954-452-049-6_009
PWC https://paperswithcode.com/paper/using-english-dictionaries-to-generate
Repo
Framework

A constrained graph algebra for semantic parsing with AMRs

Title A constrained graph algebra for semantic parsing with AMRs
Authors Jonas Groschwitz, Meaghan Fowlie, Mark Johnson, Alex Koller, er
Abstract
Tasks Semantic Parsing
Published 2017-01-01
URL https://www.aclweb.org/anthology/W17-6810/
PDF https://www.aclweb.org/anthology/W17-6810
PWC https://paperswithcode.com/paper/a-constrained-graph-algebra-for-semantic
Repo
Framework

Attentive Language Models

Title Attentive Language Models
Authors Giancarlo Salton, Robert Ross, John Kelleher
Abstract In this paper, we extend Recurrent Neural Network Language Models (RNN-LMs) with an attention mechanism. We show that an {}attentive{''} RNN-LM (with 11M parameters) achieves a better perplexity than larger RNN-LMs (with 66M parameters) and achieves performance comparable to an ensemble of 10 similar sized RNN-LMs. We also show that an {}attentive{''} RNN-LM needs less contextual information to achieve similar results to the state-of-the-art on the wikitext2 dataset.
Tasks Image Captioning, Machine Translation, Speech Recognition
Published 2017-11-01
URL https://www.aclweb.org/anthology/I17-1045/
PDF https://www.aclweb.org/anthology/I17-1045
PWC https://paperswithcode.com/paper/attentive-language-models
Repo
Framework

Work Hard, Play Hard: Email Classification on the Avocado and Enron Corpora

Title Work Hard, Play Hard: Email Classification on the Avocado and Enron Corpora
Authors Sakhar Alkhereyf, Owen Rambow
Abstract In this paper, we present an empirical study of email classification into two main categories {}Business{''} and {}Personal{''}. We train on the Enron email corpus, and test on the Enron and Avocado email corpora. We show that information from the email exchange networks improves the performance of classification. We represent the email exchange networks as social networks with graph structures. For this classification task, we extract social networks features from the graphs in addition to lexical features from email content and we compare the performance of SVM and Extra-Trees classifiers using these features. Combining graph features with lexical features improves the performance on both classifiers. We also provide manually annotated sets of the Avocado and Enron email corpora as a supplementary contribution.
Tasks
Published 2017-08-01
URL https://www.aclweb.org/anthology/W17-2408/
PDF https://www.aclweb.org/anthology/W17-2408
PWC https://paperswithcode.com/paper/work-hard-play-hard-email-classification-on
Repo
Framework

Integrating Deep Linguistic Features in Factuality Prediction over Unified Datasets

Title Integrating Deep Linguistic Features in Factuality Prediction over Unified Datasets
Authors Gabriel Stanovsky, Judith Eckle-Kohler, Yevgeniy Puzikov, Ido Dagan, Iryna Gurevych
Abstract Previous models for the assessment of commitment towards a predicate in a sentence (also known as factuality prediction) were trained and tested against a specific annotated dataset, subsequently limiting the generality of their results. In this work we propose an intuitive method for mapping three previously annotated corpora onto a single factuality scale, thereby enabling models to be tested across these corpora. In addition, we design a novel model for factuality prediction by first extending a previous rule-based factuality prediction system and applying it over an abstraction of dependency trees, and then using the output of this system in a supervised classifier. We show that this model outperforms previous methods on all three datasets. We make both the unified factuality corpus and our new model publicly available.
Tasks Knowledge Base Population, Question Answering
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-2056/
PDF https://www.aclweb.org/anthology/P17-2056
PWC https://paperswithcode.com/paper/integrating-deep-linguistic-features-in
Repo
Framework

Accuracy First: Selecting a Differential Privacy Level for Accuracy Constrained ERM

Title Accuracy First: Selecting a Differential Privacy Level for Accuracy Constrained ERM
Authors Katrina Ligett, Seth Neel, Aaron Roth, Bo Waggoner, Steven Z. Wu
Abstract Traditional approaches to differential privacy assume a fixed privacy requirement ε for a computation, and attempt to maximize the accuracy of the computation subject to the privacy constraint. As differential privacy is increasingly deployed in practical settings, it may often be that there is instead a fixed accuracy requirement for a given computation and the data analyst would like to maximize the privacy of the computation subject to the accuracy constraint. This raises the question of how to find and run a maximally private empirical risk minimizer subject to a given accuracy requirement. We propose a general “noise reduction” framework that can apply to a variety of private empirical risk minimization (ERM) algorithms, using them to “search” the space of privacy levels to find the empirically strongest one that meets the accuracy constraint, and incurring only logarithmic overhead in the number of privacy levels searched. The privacy analysis of our algorithm leads naturally to a version of differential privacy where the privacy parameters are dependent on the data, which we term ex-post privacy, and which is related to the recently introduced notion of privacy odometers. We also give an ex-post privacy analysis of the classical AboveThreshold privacy tool, modifying it to allow for queries chosen depending on the database. Finally, we apply our approach to two common objective functions, regularized linear and logistic regression, and empirically compare our noise reduction methods to (i) inverting the theoretical utility guarantees of standard private ERM algorithms and (ii) a stronger empirical baseline based on binary search.
Tasks
Published 2017-12-01
URL http://papers.nips.cc/paper/6850-accuracy-first-selecting-a-differential-privacy-level-for-accuracy-constrained-erm
PDF http://papers.nips.cc/paper/6850-accuracy-first-selecting-a-differential-privacy-level-for-accuracy-constrained-erm.pdf
PWC https://paperswithcode.com/paper/accuracy-first-selecting-a-differential
Repo
Framework

Uprooting and Rerooting Higher-Order Graphical Models

Title Uprooting and Rerooting Higher-Order Graphical Models
Authors Mark Rowland, Adrian Weller
Abstract The idea of uprooting and rerooting graphical models was introduced specifically for binary pairwise models by Weller (2016) as a way to transform a model to any of a whole equivalence class of related models, such that inference on any one model yields inference results for all others. This is very helpful since inference, or relevant bounds, may be much easier to obtain or more accurate for some model in the class. Here we introduce methods to extend the approach to models with higher-order potentials and develop theoretical insights. In particular, we show that the triplet-consistent polytope TRI is unique in being `universally rooted’. We demonstrate empirically that rerooting can significantly improve accuracy of methods of inference for higher-order models at negligible computational cost. |
Tasks
Published 2017-12-01
URL http://papers.nips.cc/paper/6625-uprooting-and-rerooting-higher-order-graphical-models
PDF http://papers.nips.cc/paper/6625-uprooting-and-rerooting-higher-order-graphical-models.pdf
PWC https://paperswithcode.com/paper/uprooting-and-rerooting-higher-order
Repo
Framework

SUT System Description for Anti-Spoofing 2017 Challenge

Title SUT System Description for Anti-Spoofing 2017 Challenge
Authors Mohammad Adiban, Hossein Sameti, Noushin Maghsoodi, Sajjad Shahsavari
Abstract
Tasks Quantization, Speaker Verification, Speech Synthesis
Published 2017-11-01
URL https://www.aclweb.org/anthology/O17-1025/
PDF https://www.aclweb.org/anthology/O17-1025
PWC https://paperswithcode.com/paper/sut-system-description-for-anti-spoofing-2017
Repo
Framework

Improving ROUGE for Timeline Summarization

Title Improving ROUGE for Timeline Summarization
Authors Sebastian Martschat, Katja Markert
Abstract Current evaluation metrics for timeline summarization either ignore the temporal aspect of the task or require strict date matching. We introduce variants of ROUGE that allow alignment of daily summaries via temporal distance or semantic similarity. We argue for the suitability of these variants in a theoretical analysis and demonstrate it in a battery of task-specific tests.
Tasks Semantic Similarity, Semantic Textual Similarity, Timeline Summarization
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-2046/
PDF https://www.aclweb.org/anthology/E17-2046
PWC https://paperswithcode.com/paper/improving-rouge-for-timeline-summarization
Repo
Framework

MappSent: a Textual Mapping Approach for Question-to-Question Similarity

Title MappSent: a Textual Mapping Approach for Question-to-Question Similarity
Authors Amir Hazem, Basma El Amel Boussaha, Hern, Nicolas ez
Abstract Since the advent of word embedding methods, the representation of longer pieces of texts such as sentences and paragraphs is gaining more and more interest, especially for textual similarity tasks. Mikolov et al. (2013) have demonstrated that words and phrases exhibit linear structures that allow to meaningfully combine words by an element-wise addition of their vector representations. Recently, Arora et al. (2017) have shown that removing the projections of the weighted average sum of word embedding vectors on their first principal components, outperforms sophisticated supervised methods including RNN{'}s and LSTM{'}s. Inspired by Mikolov et al. (2013) and Arora et al. (2017) findings and by a bilingual word mapping technique presented in Artetxe et al. (2016), we introduce MappSent, a novel approach for textual similarity. Based on a linear sentence embedding representation, its principle is to build a matrix that maps sentences in a joint-subspace where similar sets of sentences are pushed closer. We evaluate our approach on the SemEval 2016/2017 question-to-question similarity task and show that overall MappSent achieves competitive results and outperforms in most cases state-of-art methods.
Tasks Community Question Answering, Question Answering, Question Similarity, Sentence Embedding, Word Embeddings
Published 2017-09-01
URL https://www.aclweb.org/anthology/R17-1040/
PDF https://doi.org/10.26615/978-954-452-049-6_040
PWC https://paperswithcode.com/paper/mappsent-a-textual-mapping-approach-for
Repo
Framework

Doubly Greedy Primal-Dual Coordinate Descent for Sparse Empirical Risk Minimization

Title Doubly Greedy Primal-Dual Coordinate Descent for Sparse Empirical Risk Minimization
Authors Qi Lei, Ian En-Hsu Yen, Chao-yuan Wu, Inderjit S. Dhillon, Pradeep Ravikumar
Abstract We consider the popular problem of sparse empirical risk minimization with linear predictors and a large number of both features and observations. With a convex-concave saddle point objective reformulation, we propose a Doubly Greedy Primal-Dual Coordinate Descent algorithm that is able to exploit sparsity in both primal and dual variables. It enjoys a low cost per iteration and our theoretical analysis shows that it converges linearly with a good iteration complexity, provided that the set of primal variables is sparse. We then extend this algorithm further to leverage active sets. The resulting new algorithm is even faster, and experiments on large-scale Multi-class data sets show that our algorithm achieves up to 30 times speedup on several state-of-the-art optimization methods.
Tasks
Published 2017-08-01
URL https://icml.cc/Conferences/2017/Schedule?showEvent=864
PDF http://proceedings.mlr.press/v70/lei17b/lei17b.pdf
PWC https://paperswithcode.com/paper/doubly-greedy-primal-dual-coordinate-descent
Repo
Framework

Graph-based Event Extraction from Twitter

Title Graph-based Event Extraction from Twitter
Authors Amosse Edouard, Elena Cabrio, Sara Tonelli, Nhan Le-Thanh
Abstract Detecting which tweets describe a specific event and clustering them is one of the main challenging tasks related to Social Media currently addressed in the NLP community. Existing approaches have mainly focused on detecting spikes in clusters around specific keywords or Named Entities (NE). However, one of the main drawbacks of such approaches is the difficulty in understanding when the same keywords describe different events. In this paper, we propose a novel approach that exploits NE mentions in tweets and their entity context to create a temporal event graph. Then, using simple graph theory techniques and a PageRank-like algorithm, we process the event graphs to detect clusters of tweets describing the same events. Experiments on two gold standard datasets show that our approach achieves state-of-the-art results both in terms of evaluation performances and the quality of the detected events.
Tasks Sentiment Analysis
Published 2017-09-01
URL https://www.aclweb.org/anthology/R17-1031/
PDF https://doi.org/10.26615/978-954-452-049-6_031
PWC https://paperswithcode.com/paper/graph-based-event-extraction-from-twitter
Repo
Framework
comments powered by Disqus