May 5, 2019

2031 words 10 mins read

Paper Group NAWR 3

TweeTime : A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter. Learning local feature descriptors with triplets and shallow convolutional neural networks. Optimizing Statistical Machine Translation for Text Simplification. Deep Neural Networks with Inexact Matching for Person Re-Identification. MeTA: A Unified …

TweeTime : A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter


Title	TweeTime : A Minimally Supervised Method for Recognizing and Normalizing Time Expressions in Twitter
Authors	Jeniya Tabassum, Alan Ritter, Wei Xu
Abstract
Tasks	Information Retrieval, Knowledge Base Population
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1030/
PDF	https://www.aclweb.org/anthology/D16-1030
PWC	https://paperswithcode.com/paper/tweetime-a-minimally-supervised-method-for-1
Repo	https://github.com/jeniyat/TweeTime
Framework	none

Learning local feature descriptors with triplets and shallow convolutional neural networks


Title	Learning local feature descriptors with triplets and shallow convolutional neural networks
Authors	V. Balntas, E. Riba, D. Ponsa, and K. Mikolajczyk.
Abstract	It has recently been demonstrated that local feature descriptors based on convolutional neural networks (CNN) can significantly improve the matching performance. Previous work on learning such descriptors has focused on exploiting pairs of positive and negative patches to learn discriminative CNN representations. In this work, we propose to utilize triplets of training samples, together with in-triplet mining of hard negatives. We show that our method achieves state of the art results, without the computational overhead typically associated with mining of negatives and with lower complexity of the network architecture. We compare our approach to recently introduced convolutional local feature descriptors, and demonstrate the advantages of the proposed methods in terms of performance and speed. We also examine different loss functions associated with triplets.
Tasks
Published	2016-09-01
URL	http://www.bmva.org/bmvc/2016/papers/paper119/paper119.pdf
PDF	http://www.bmva.org/bmvc/2016/papers/paper119/paper119.pdf
PWC	https://paperswithcode.com/paper/learning-local-feature-descriptors-with
Repo	https://github.com/vbalnt/tfeat
Framework	pytorch

Optimizing Statistical Machine Translation for Text Simplification


Title	Optimizing Statistical Machine Translation for Text Simplification
Authors	Wei Xu, Courtney Napoles, Ellie Pavlick, Quanze Chen, Chris Callison-Burch
Abstract	Most recent sentence simplification systems use basic machine translation models to learn lexical and syntactic paraphrases from a manually simplified parallel corpus. These methods are limited by the quality and quantity of manually simplified corpora, which are expensive to build. In this paper, we conduct an in-depth adaptation of statistical machine translation to perform text simplification, taking advantage of large-scale paraphrases learned from bilingual texts and a small amount of manual simplifications with multiple references. Our work is the first to design automatic metrics that are effective for tuning and evaluating simplification systems, which will facilitate iterative development for this task.
Tasks	Machine Translation, Text Simplification
Published	2016-01-01
URL	https://www.aclweb.org/anthology/Q16-1029/
PDF	https://www.aclweb.org/anthology/Q16-1029
PWC	https://paperswithcode.com/paper/optimizing-statistical-machine-translation
Repo	https://github.com/cocoxu/simplification
Framework	none

Deep Neural Networks with Inexact Matching for Person Re-Identification


Title	Deep Neural Networks with Inexact Matching for Person Re-Identification
Authors	Arulkumar Subramaniam, Moitreya Chatterjee, Anurag Mittal
Abstract	Person Re-Identification is the task of matching images of a person across multiple camera views. Almost all prior approaches address this challenge by attempting to learn the possible transformations that relate the different views of a person from a training corpora. Then, they utilize these transformation patterns for matching a query image to those in a gallery image bank at test time. This necessitates learning good feature representations of the images and having a robust feature matching technique. Deep learning approaches, such as Convolutional Neural Networks (CNN), simultaneously do both and have shown great promise recently. In this work, we propose two CNN-based architectures for Person Re-Identification. In the first, given a pair of images, we extract feature maps from these images via multiple stages of convolution and pooling. A novel inexact matching technique then matches pixels in the first representation with those of the second. Furthermore, we search across a wider region in the second representation for matching. Our novel matching technique allows us to tackle the challenges posed by large viewpoint variations, illumination changes or partial occlusions. Our approach shows a promising performance and requires only about half the parameters as a current state-of-the-art technique. Nonetheless, it also suffers from false matches at times. In order to mitigate this issue, we propose a fused architecture that combines our inexact matching pipeline with a state-of-the-art exact matching technique. We observe substantial gains with the fused model over the current state-of-the-art on multiple challenging datasets of varying sizes, with gains of up to about 21%.
Tasks	Person Re-Identification
Published	2016-12-01
URL	http://papers.nips.cc/paper/6367-deep-neural-networks-with-inexact-matching-for-person-re-identification
PDF	http://papers.nips.cc/paper/6367-deep-neural-networks-with-inexact-matching-for-person-re-identification.pdf
PWC	https://paperswithcode.com/paper/deep-neural-networks-with-inexact-matching
Repo	https://github.com/InnovArul/personreid_normxcorr
Framework	torch

MeTA: A Unified Toolkit for Text Retrieval and Analysis


Title	MeTA: A Unified Toolkit for Text Retrieval and Analysis
Authors	Sean Massung, Chase Geigle, ChengXiang Zhai
Abstract
Tasks	Document Classification, Information Retrieval, Tokenization
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-4016/
PDF	https://www.aclweb.org/anthology/P16-4016
PWC	https://paperswithcode.com/paper/meta-a-unified-toolkit-for-text-retrieval-and
Repo	https://github.com/meta-toolkit/meta
Framework	none

Neural Sentiment Classification with User and Product Attention


Title	Neural Sentiment Classification with User and Product Attention
Authors	Huimin Chen, Maosong Sun, Cunchao Tu, Yankai Lin, Zhiyuan Liu
Abstract
Tasks	Feature Engineering, Sentiment Analysis, Speech Recognition, Text Classification
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1171/
PDF	https://www.aclweb.org/anthology/D16-1171
PWC	https://paperswithcode.com/paper/neural-sentiment-classification-with-user-and
Repo	https://github.com/thunlp/NSC
Framework	none

FlexTag: A Highly Flexible PoS Tagging Framework


Title	FlexTag: A Highly Flexible PoS Tagging Framework
Authors	Torsten Zesch, Tobias Horsmann
Abstract	We present FlexTag, a highly flexible PoS tagging framework. In contrast to monolithic implementations that can only be retrained but not adapted otherwise, FlexTag enables users to modify the feature space and the classification algorithm. Thus, FlexTag makes it easy to quickly develop custom-made taggers exactly fitting the research problem.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1675/
PDF	https://www.aclweb.org/anthology/L16-1675
PWC	https://paperswithcode.com/paper/flextag-a-highly-flexible-pos-tagging
Repo	https://github.com/Horsmann/FlexTag
Framework	none

Supervised Machine Learning for Hybrid Meter


Title	Supervised Machine Learning for Hybrid Meter
Authors	Alex Estes, Christopher Hench
Abstract
Tasks
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0201/
PDF	https://www.aclweb.org/anthology/W16-0201
PWC	https://paperswithcode.com/paper/supervised-machine-learning-for-hybrid-meter
Repo	https://github.com/henchc/CLFL
Framework	none

A domain-agnostic approach for opinion prediction on speech


Title	A domain-agnostic approach for opinion prediction on speech
Authors	Pedro Bispo Santos, Lisa Beinborn, Iryna Gurevych
Abstract	We explore a domain-agnostic approach for analyzing speech with the goal of opinion prediction. We represent the speech signal by mel-frequency cepstral coefficients and apply long short-term memory neural networks to automatically learn temporal regularities in speech. In contrast to previous work, our approach does not require complex feature engineering and works without textual transcripts. As a consequence, it can easily be applied on various speech analysis tasks for different languages and the results show that it can nevertheless be competitive to the state-of-the-art in opinion prediction. In a detailed error analysis for opinion mining we find that our approach performs well in identifying speaker-specific characteristics, but should be combined with additional information if subtle differences in the linguistic content need to be identified.
Tasks	Emotion Recognition, Feature Engineering, Opinion Mining, Speaker Identification
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4318/
PDF	https://www.aclweb.org/anthology/W16-4318
PWC	https://paperswithcode.com/paper/a-domain-agnostic-approach-for-opinion
Repo	https://github.com/UKPLab/coling-peoples2016-opinion-prediction
Framework	none

Efficient High-Order Interaction-Aware Feature Selection Based on Conditional Mutual Information


Title	Efficient High-Order Interaction-Aware Feature Selection Based on Conditional Mutual Information
Authors	Alexander Shishkin, Anastasia Bezzubtseva, Alexey Drutsa, Ilia Shishkov, Ekaterina Gladkikh, Gleb Gusev, Pavel Serdyukov
Abstract	This study introduces a novel feature selection approach CMICOT, which is a further evolution of filter methods with sequential forward selection (SFS) whose scoring functions are based on conditional mutual information (MI). We state and study a novel saddle point (max-min) optimization problem to build a scoring function that is able to identify joint interactions between several features. This method fills the gap of MI-based SFS techniques with high-order dependencies. In this high-dimensional case, the estimation of MI has prohibitively high sample complexity. We mitigate this cost using a greedy approximation and binary representatives what makes our technique able to be effectively used. The superiority of our approach is demonstrated by comparison with recently proposed interaction-aware filters and several interaction-agnostic state-of-the-art ones on ten publicly available benchmark datasets.
Tasks	Feature Selection
Published	2016-12-01
URL	http://papers.nips.cc/paper/6584-efficient-high-order-interaction-aware-feature-selection-based-on-conditional-mutual-information
PDF	http://papers.nips.cc/paper/6584-efficient-high-order-interaction-aware-feature-selection-based-on-conditional-mutual-information.pdf
PWC	https://paperswithcode.com/paper/efficient-high-order-interaction-aware
Repo	https://github.com/yandex/CMICOT
Framework	none

Universal Decompositional Semantics on Universal Dependencies


Title	Universal Decompositional Semantics on Universal Dependencies
Authors	Aaron Steven White, Drew Reisinger, Keisuke Sakaguchi, Tim Vieira, Sheng Zhang, Rachel Rudinger, Kyle Rawlins, Benjamin Van Durme
Abstract
Tasks
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1177/
PDF	https://www.aclweb.org/anthology/D16-1177
PWC	https://paperswithcode.com/paper/universal-decompositional-semantics-on
Repo	https://github.com/hltcoe/PredPatt
Framework	none

Incorporating Label Dependency for Answer Quality Tagging in Community Question Answering via CNN-LSTM-CRF


Title	Incorporating Label Dependency for Answer Quality Tagging in Community Question Answering via CNN-LSTM-CRF
Authors	Yang Xiang, Xiaoqiang Zhou, Qingcai Chen, Zhihui Zheng, Buzhou Tang, Xiaolong Wang, Yang Qin
Abstract	In community question answering (cQA), the quality of answers are determined by the matching degree between question-answer pairs and the correlation among the answers. In this paper, we show that the dependency between the answer quality labels also plays a pivotal role. To validate the effectiveness of label dependency, we propose two neural network-based models, with different combination modes of Convolutional Neural Net-works, Long Short Term Memory and Conditional Random Fields. Extensive experi-ments are taken on the dataset released by the SemEval-2015 cQA shared task. The first model is a stacked ensemble of the networks. It achieves 58.96{%} on macro averaged F1, which improves the state-of-the-art neural network-based method by 2.82{%} and outper-forms the Top-1 system in the shared task by 1.77{%}. The second is a simple attention-based model whose input is the connection of the question and its corresponding answers. It produces promising results with 58.29{%} on overall F1 and gains the best performance on the Good and Bad categories.
Tasks	Community Question Answering, Question Answering
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1117/
PDF	https://www.aclweb.org/anthology/C16-1117
PWC	https://paperswithcode.com/paper/incorporating-label-dependency-for-answer
Repo	https://github.com/o0laika0o/CNN-LSTM-CRF-for-cQA-answer-tagging
Framework	none

Context-Sensitive Lexicon Features for Neural Sentiment Analysis


Title	Context-Sensitive Lexicon Features for Neural Sentiment Analysis
Authors	Zhiyang Teng, Duy-Tin Vo, Yue Zhang
Abstract
Tasks	Opinion Mining, Sentiment Analysis
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1169/
PDF	https://www.aclweb.org/anthology/D16-1169
PWC	https://paperswithcode.com/paper/context-sensitive-lexicon-features-for-neural
Repo	https://github.com/zeeeyang/lexicon_rnn
Framework	none

BioMedLAT Corpus: Annotation of the Lexical Answer Type for Biomedical Questions


Title	BioMedLAT Corpus: Annotation of the Lexical Answer Type for Biomedical Questions
Authors	Mariana Neves, Milena Kraus
Abstract	Question answering (QA) systems need to provide exact answers for the questions that are posed to the system. However, this can only be achieved through a precise processing of the question. During this procedure, one important step is the detection of the expected type of answer that the system should provide by extracting the headword of the questions and identifying its semantic type. We have annotated the headword and assigned UMLS semantic types to 643 factoid/list questions from the BioASQ training data. We present statistics on the corpus and a preliminary evaluation in baseline experiments. We also discuss the challenges on both the manual annotation and the automatic detection of the headwords and the semantic types. We believe that this is a valuable resource for both training and evaluation of biomedical QA systems. The corpus is available at: \url{https://github.com/mariananeves/BioMedLAT}.
Tasks	Part-Of-Speech Tagging, Question Answering, Semantic Role Labeling, Tokenization
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4407/
PDF	https://www.aclweb.org/anthology/W16-4407
PWC	https://paperswithcode.com/paper/biomedlat-corpus-annotation-of-the-lexical
Repo	https://github.com/mariananeves/BioMedLAT
Framework	none

Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries


Title	Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries
Authors	Marta Villegas, Maite Melero, N{'u}ria Bel, Jorge Gracia
Abstract	The experiments presented here exploit the properties of the Apertium RDF Graph, principally cycle density and nodes{'} degree, to automatically generate new translation relations between words, and therefore to enrich existing bilingual dictionaries with new entries. Currently, the Apertium RDF Graph includes data from 22 Apertium bilingual dictionaries and constitutes a large unified array of linked lexical entries and translations that are available and accessible on the Web (http://linguistic.linkeddata.es/apertium/). In particular, its graph structure allows for interesting exploitation opportunities, some of which are addressed in this paper. Two {`}massive{'} experiments are reported: in the first one, the original EN-ES translation set was removed from the Apertium RDF Graph and a new EN-ES version was generated. The results were compared against the previously removed EN-ES data and against the Concise Oxford Spanish Dictionary. In the second experiment, a new non-existent EN-FR translation set was generated. In this case the results were compared against a converted wiktionary English-French file. The results we got are really good and perform well for the extreme case of correlated polysemy. This lead us to address the possibility to use cycles and nodes degree to identify potential oddities in the source data. If cycle density proves efficient when considering potential targets, we can assume that in dense graphs nodes with low degree may indicate potential errors. \|
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1140/
PDF	https://www.aclweb.org/anthology/L16-1140
PWC	https://paperswithcode.com/paper/leveraging-rdf-graphs-for-crossing-multiple
Repo	https://github.com/martavillegas/ApertiumRDF
Framework	none