May 5, 2019

2031 words 10 mins read

Paper Group NAWR 3

Paper Group NAWR 3

TweeTime : A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter. Learning local feature descriptors with triplets and shallow convolutional neural networks. Optimizing Statistical Machine Translation for Text Simplification. Deep Neural Networks with Inexact Matching for Person Re-Identification. MeTA: A Unified …

TweeTime : A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter

Title TweeTime : A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter
Authors Jeniya Tabassum, Alan Ritter, Wei Xu
Abstract
Tasks Information Retrieval, Knowledge Base Population
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1030/
PDF https://www.aclweb.org/anthology/D16-1030
PWC https://paperswithcode.com/paper/tweetime-a-minimally-supervised-method-for-1
Repo https://github.com/jeniyat/TweeTime
Framework none

Learning local feature descriptors with triplets and shallow convolutional neural networks

Title Learning local feature descriptors with triplets and shallow convolutional neural networks
Authors V. Balntas, E. Riba, D. Ponsa, and K. Mikolajczyk.
Abstract It has recently been demonstrated that local feature descriptors based on convolutional neural networks (CNN) can significantly improve the matching performance. Previous work on learning such descriptors has focused on exploiting pairs of positive and negative patches to learn discriminative CNN representations. In this work, we propose to utilize triplets of training samples, together with in-triplet mining of hard negatives. We show that our method achieves state of the art results, without the computational overhead typically associated with mining of negatives and with lower complexity of the network architecture. We compare our approach to recently introduced convolutional local feature descriptors, and demonstrate the advantages of the proposed methods in terms of performance and speed. We also examine different loss functions associated with triplets.
Tasks
Published 2016-09-01
URL http://www.bmva.org/bmvc/2016/papers/paper119/paper119.pdf
PDF http://www.bmva.org/bmvc/2016/papers/paper119/paper119.pdf
PWC https://paperswithcode.com/paper/learning-local-feature-descriptors-with
Repo https://github.com/vbalnt/tfeat
Framework pytorch

Optimizing Statistical Machine Translation for Text Simplification

Title Optimizing Statistical Machine Translation for Text Simplification
Authors Wei Xu, Courtney Napoles, Ellie Pavlick, Quanze Chen, Chris Callison-Burch
Abstract Most recent sentence simplification systems use basic machine translation models to learn lexical and syntactic paraphrases from a manually simplified parallel corpus. These methods are limited by the quality and quantity of manually simplified corpora, which are expensive to build. In this paper, we conduct an in-depth adaptation of statistical machine translation to perform text simplification, taking advantage of large-scale paraphrases learned from bilingual texts and a small amount of manual simplifications with multiple references. Our work is the first to design automatic metrics that are effective for tuning and evaluating simplification systems, which will facilitate iterative development for this task.
Tasks Machine Translation, Text Simplification
Published 2016-01-01
URL https://www.aclweb.org/anthology/Q16-1029/
PDF https://www.aclweb.org/anthology/Q16-1029
PWC https://paperswithcode.com/paper/optimizing-statistical-machine-translation
Repo https://github.com/cocoxu/simplification
Framework none

Deep Neural Networks with Inexact Matching for Person Re-Identification

Title Deep Neural Networks with Inexact Matching for Person Re-Identification
Authors Arulkumar Subramaniam, Moitreya Chatterjee, Anurag Mittal
Abstract Person Re-Identification is the task of matching images of a person across multiple camera views. Almost all prior approaches address this challenge by attempting to learn the possible transformations that relate the different views of a person from a training corpora. Then, they utilize these transformation patterns for matching a query image to those in a gallery image bank at test time. This necessitates learning good feature representations of the images and having a robust feature matching technique. Deep learning approaches, such as Convolutional Neural Networks (CNN), simultaneously do both and have shown great promise recently. In this work, we propose two CNN-based architectures for Person Re-Identification. In the first, given a pair of images, we extract feature maps from these images via multiple stages of convolution and pooling. A novel inexact matching technique then matches pixels in the first representation with those of the second. Furthermore, we search across a wider region in the second representation for matching. Our novel matching technique allows us to tackle the challenges posed by large viewpoint variations, illumination changes or partial occlusions. Our approach shows a promising performance and requires only about half the parameters as a current state-of-the-art technique. Nonetheless, it also suffers from false matches at times. In order to mitigate this issue, we propose a fused architecture that combines our inexact matching pipeline with a state-of-the-art exact matching technique. We observe substantial gains with the fused model over the current state-of-the-art on multiple challenging datasets of varying sizes, with gains of up to about 21%.
Tasks Person Re-Identification
Published 2016-12-01
URL http://papers.nips.cc/paper/6367-deep-neural-networks-with-inexact-matching-for-person-re-identification
PDF http://papers.nips.cc/paper/6367-deep-neural-networks-with-inexact-matching-for-person-re-identification.pdf
PWC https://paperswithcode.com/paper/deep-neural-networks-with-inexact-matching
Repo https://github.com/InnovArul/personreid_normxcorr
Framework torch

MeTA: A Unified Toolkit for Text Retrieval and Analysis

Title MeTA: A Unified Toolkit for Text Retrieval and Analysis
Authors Sean Massung, Chase Geigle, ChengXiang Zhai
Abstract
Tasks Document Classification, Information Retrieval, Tokenization
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-4016/
PDF https://www.aclweb.org/anthology/P16-4016
PWC https://paperswithcode.com/paper/meta-a-unified-toolkit-for-text-retrieval-and
Repo https://github.com/meta-toolkit/meta
Framework none

Neural Sentiment Classification with User and Product Attention

Title Neural Sentiment Classification with User and Product Attention
Authors Huimin Chen, Maosong Sun, Cunchao Tu, Yankai Lin, Zhiyuan Liu
Abstract
Tasks Feature Engineering, Sentiment Analysis, Speech Recognition, Text Classification
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1171/
PDF https://www.aclweb.org/anthology/D16-1171
PWC https://paperswithcode.com/paper/neural-sentiment-classification-with-user-and
Repo https://github.com/thunlp/NSC
Framework none

FlexTag: A Highly Flexible PoS Tagging Framework

Title FlexTag: A Highly Flexible PoS Tagging Framework
Authors Torsten Zesch, Tobias Horsmann
Abstract We present FlexTag, a highly flexible PoS tagging framework. In contrast to monolithic implementations that can only be retrained but not adapted otherwise, FlexTag enables users to modify the feature space and the classification algorithm. Thus, FlexTag makes it easy to quickly develop custom-made taggers exactly fitting the research problem.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1675/
PDF https://www.aclweb.org/anthology/L16-1675
PWC https://paperswithcode.com/paper/flextag-a-highly-flexible-pos-tagging
Repo https://github.com/Horsmann/FlexTag
Framework none

Supervised Machine Learning for Hybrid Meter

Title Supervised Machine Learning for Hybrid Meter
Authors Alex Estes, Christopher Hench
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0201/
PDF https://www.aclweb.org/anthology/W16-0201
PWC https://paperswithcode.com/paper/supervised-machine-learning-for-hybrid-meter
Repo https://github.com/henchc/CLFL
Framework none

A domain-agnostic approach for opinion prediction on speech

Title A domain-agnostic approach for opinion prediction on speech
Authors Pedro Bispo Santos, Lisa Beinborn, Iryna Gurevych
Abstract We explore a domain-agnostic approach for analyzing speech with the goal of opinion prediction. We represent the speech signal by mel-frequency cepstral coefficients and apply long short-term memory neural networks to automatically learn temporal regularities in speech. In contrast to previous work, our approach does not require complex feature engineering and works without textual transcripts. As a consequence, it can easily be applied on various speech analysis tasks for different languages and the results show that it can nevertheless be competitive to the state-of-the-art in opinion prediction. In a detailed error analysis for opinion mining we find that our approach performs well in identifying speaker-specific characteristics, but should be combined with additional information if subtle differences in the linguistic content need to be identified.
Tasks Emotion Recognition, Feature Engineering, Opinion Mining, Speaker Identification
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4318/
PDF https://www.aclweb.org/anthology/W16-4318
PWC https://paperswithcode.com/paper/a-domain-agnostic-approach-for-opinion
Repo https://github.com/UKPLab/coling-peoples2016-opinion-prediction
Framework none

Efficient High-Order Interaction-Aware Feature Selection Based on Conditional Mutual Information

Title Efficient High-Order Interaction-Aware Feature Selection Based on Conditional Mutual Information
Authors Alexander Shishkin, Anastasia Bezzubtseva, Alexey Drutsa, Ilia Shishkov, Ekaterina Gladkikh, Gleb Gusev, Pavel Serdyukov
Abstract This study introduces a novel feature selection approach CMICOT, which is a further evolution of filter methods with sequential forward selection (SFS) whose scoring functions are based on conditional mutual information (MI). We state and study a novel saddle point (max-min) optimization problem to build a scoring function that is able to identify joint interactions between several features. This method fills the gap of MI-based SFS techniques with high-order dependencies. In this high-dimensional case, the estimation of MI has prohibitively high sample complexity. We mitigate this cost using a greedy approximation and binary representatives what makes our technique able to be effectively used. The superiority of our approach is demonstrated by comparison with recently proposed interaction-aware filters and several interaction-agnostic state-of-the-art ones on ten publicly available benchmark datasets.
Tasks Feature Selection
Published 2016-12-01
URL http://papers.nips.cc/paper/6584-efficient-high-order-interaction-aware-feature-selection-based-on-conditional-mutual-information
PDF http://papers.nips.cc/paper/6584-efficient-high-order-interaction-aware-feature-selection-based-on-conditional-mutual-information.pdf
PWC https://paperswithcode.com/paper/efficient-high-order-interaction-aware
Repo https://github.com/yandex/CMICOT
Framework none

Universal Decompositional Semantics on Universal Dependencies

Title Universal Decompositional Semantics on Universal Dependencies
Authors Aaron Steven White, Drew Reisinger, Keisuke Sakaguchi, Tim Vieira, Sheng Zhang, Rachel Rudinger, Kyle Rawlins, Benjamin Van Durme
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1177/
PDF https://www.aclweb.org/anthology/D16-1177
PWC https://paperswithcode.com/paper/universal-decompositional-semantics-on
Repo https://github.com/hltcoe/PredPatt
Framework none

Incorporating Label Dependency for Answer Quality Tagging in Community Question Answering via CNN-LSTM-CRF

Title Incorporating Label Dependency for Answer Quality Tagging in Community Question Answering via CNN-LSTM-CRF
Authors Yang Xiang, Xiaoqiang Zhou, Qingcai Chen, Zhihui Zheng, Buzhou Tang, Xiaolong Wang, Yang Qin
Abstract In community question answering (cQA), the quality of answers are determined by the matching degree between question-answer pairs and the correlation among the answers. In this paper, we show that the dependency between the answer quality labels also plays a pivotal role. To validate the effectiveness of label dependency, we propose two neural network-based models, with different combination modes of Convolutional Neural Net-works, Long Short Term Memory and Conditional Random Fields. Extensive experi-ments are taken on the dataset released by the SemEval-2015 cQA shared task. The first model is a stacked ensemble of the networks. It achieves 58.96{%} on macro averaged F1, which improves the state-of-the-art neural network-based method by 2.82{%} and outper-forms the Top-1 system in the shared task by 1.77{%}. The second is a simple attention-based model whose input is the connection of the question and its corresponding answers. It produces promising results with 58.29{%} on overall F1 and gains the best performance on the Good and Bad categories.
Tasks Community Question Answering, Question Answering
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1117/
PDF https://www.aclweb.org/anthology/C16-1117
PWC https://paperswithcode.com/paper/incorporating-label-dependency-for-answer
Repo https://github.com/o0laika0o/CNN-LSTM-CRF-for-cQA-answer-tagging
Framework none

Context-Sensitive Lexicon Features for Neural Sentiment Analysis

Title Context-Sensitive Lexicon Features for Neural Sentiment Analysis
Authors Zhiyang Teng, Duy-Tin Vo, Yue Zhang
Abstract
Tasks Opinion Mining, Sentiment Analysis
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1169/
PDF https://www.aclweb.org/anthology/D16-1169
PWC https://paperswithcode.com/paper/context-sensitive-lexicon-features-for-neural
Repo https://github.com/zeeeyang/lexicon_rnn
Framework none

BioMedLAT Corpus: Annotation of the Lexical Answer Type for Biomedical Questions

Title BioMedLAT Corpus: Annotation of the Lexical Answer Type for Biomedical Questions
Authors Mariana Neves, Milena Kraus
Abstract Question answering (QA) systems need to provide exact answers for the questions that are posed to the system. However, this can only be achieved through a precise processing of the question. During this procedure, one important step is the detection of the expected type of answer that the system should provide by extracting the headword of the questions and identifying its semantic type. We have annotated the headword and assigned UMLS semantic types to 643 factoid/list questions from the BioASQ training data. We present statistics on the corpus and a preliminary evaluation in baseline experiments. We also discuss the challenges on both the manual annotation and the automatic detection of the headwords and the semantic types. We believe that this is a valuable resource for both training and evaluation of biomedical QA systems. The corpus is available at: \url{https://github.com/mariananeves/BioMedLAT}.
Tasks Part-Of-Speech Tagging, Question Answering, Semantic Role Labeling, Tokenization
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4407/
PDF https://www.aclweb.org/anthology/W16-4407
PWC https://paperswithcode.com/paper/biomedlat-corpus-annotation-of-the-lexical
Repo https://github.com/mariananeves/BioMedLAT
Framework none

Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries

Title Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries
Authors Marta Villegas, Maite Melero, N{'u}ria Bel, Jorge Gracia
Abstract The experiments presented here exploit the properties of the Apertium RDF Graph, principally cycle density and nodes{'} degree, to automatically generate new translation relations between words, and therefore to enrich existing bilingual dictionaries with new entries. Currently, the Apertium RDF Graph includes data from 22 Apertium bilingual dictionaries and constitutes a large unified array of linked lexical entries and translations that are available and accessible on the Web (http://linguistic.linkeddata.es/apertium/). In particular, its graph structure allows for interesting exploitation opportunities, some of which are addressed in this paper. Two {`}massive{'} experiments are reported: in the first one, the original EN-ES translation set was removed from the Apertium RDF Graph and a new EN-ES version was generated. The results were compared against the previously removed EN-ES data and against the Concise Oxford Spanish Dictionary. In the second experiment, a new non-existent EN-FR translation set was generated. In this case the results were compared against a converted wiktionary English-French file. The results we got are really good and perform well for the extreme case of correlated polysemy. This lead us to address the possibility to use cycles and nodes degree to identify potential oddities in the source data. If cycle density proves efficient when considering potential targets, we can assume that in dense graphs nodes with low degree may indicate potential errors. |
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1140/
PDF https://www.aclweb.org/anthology/L16-1140
PWC https://paperswithcode.com/paper/leveraging-rdf-graphs-for-crossing-multiple
Repo https://github.com/martavillegas/ApertiumRDF
Framework none
comments powered by Disqus