May 5, 2019

2024 words 10 mins read

Paper Group NAWR 6

An Open Corpus for Named Entity Recognition in Historic Newspapers. A Reading Comprehension Corpus for Machine Translation Evaluation. Normalising Medical Concepts in Social Media Texts by Learning Semantic Representation. Kyoto-NMT: a Neural Machine Translation implementation in Chainer. Learning a Lexicon and Translation Model from Phoneme Lattic …

An Open Corpus for Named Entity Recognition in Historic Newspapers


Title	An Open Corpus for Named Entity Recognition in Historic Newspapers
Authors	Clemens Neudecker
Abstract	The availability of openly available textual datasets ({`}corpora{''}) with highly accurate manual annotations ({`}gold standard{''}) of named entities (e.g. persons, locations, organizations, etc.) is crucial in the training and evaluation of named entity recognition systems. Currently there are only few such datasets available on the web, and even less for texts containing historical spelling variation. The production and subsequent release into the public domain of four such datasets with 100 pages each for the languages Dutch, French, German (including Austrian) as part of the Europeana Newspapers project is expected to contribute to the further development and improvement of named entity recognition systems with a focus on historical content. This paper describes how these datasets were produced, what challenges were encountered in their creation and informs about their final quality and availability.
Tasks	Named Entity Recognition
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1689/
PDF	https://www.aclweb.org/anthology/L16-1689
PWC	https://paperswithcode.com/paper/an-open-corpus-for-named-entity-recognition
Repo	https://github.com/EuropeanaNewspapers/ner-corpora
Framework	none

A Reading Comprehension Corpus for Machine Translation Evaluation


Title	A Reading Comprehension Corpus for Machine Translation Evaluation
Authors	Carolina Scarton, Lucia Specia
Abstract	Effectively assessing Natural Language Processing output tasks is a challenge for research in the area. In the case of Machine Translation (MT), automatic metrics are usually preferred over human evaluation, given time and budget constraints.However, traditional automatic metrics (such as BLEU) are not reliable for absolute quality assessment of documents, often producing similar scores for documents translated by the same MT system.For scenarios where absolute labels are necessary for building models, such as document-level Quality Estimation, these metrics can not be fully trusted. In this paper, we introduce a corpus of reading comprehension tests based on machine translated documents, where we evaluate documents based on answers to questions by fluent speakers of the target language. We describe the process of creating such a resource, the experiment design and agreement between the test takers. Finally, we discuss ways to convert the reading comprehension test into document-level quality scores.
Tasks	Machine Translation, Reading Comprehension
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1579/
PDF	https://www.aclweb.org/anthology/L16-1579
PWC	https://paperswithcode.com/paper/a-reading-comprehension-corpus-for-machine
Repo	https://github.com/carolscarton/CREG-MT-eval
Framework	none


Title	Normalising Medical Concepts in Social Media Texts by Learning Semantic Representation
Authors	Nut Limsopatham, Nigel Collier
Abstract
Tasks	Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1096/
PDF	https://www.aclweb.org/anthology/P16-1096
PWC	https://paperswithcode.com/paper/normalising-medical-concepts-in-social-media
Repo	https://github.com/nutli/concept_normalisation
Framework	tf

Kyoto-NMT: a Neural Machine Translation implementation in Chainer


Title	Kyoto-NMT: a Neural Machine Translation implementation in Chainer
Authors	Fabien Cromi{`e}res
Abstract	We present Kyoto-NMT, an open-source implementation of the Neural Machine Translation paradigm. This implementation is done in Python and Chainer, an easy-to-use Deep Learning Framework.
Tasks	Language Modelling, Machine Translation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-2064/
PDF	https://www.aclweb.org/anthology/C16-2064
PWC	https://paperswithcode.com/paper/kyoto-nmt-a-neural-machine-translation
Repo	https://github.com/fabiencro/knmt
Framework	none

Learning a Lexicon and Translation Model from Phoneme Lattices


Title	Learning a Lexicon and Translation Model from Phoneme Lattices
Authors	Oliver Adams, Graham Neubig, Trevor Cohn, Steven Bird, Quoc Truong Do, Satoshi Nakamura
Abstract
Tasks
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1263/
PDF	https://www.aclweb.org/anthology/D16-1263
PWC	https://paperswithcode.com/paper/learning-a-lexicon-and-translation-model-from
Repo	https://github.com/oadams/latticetm
Framework	none

Don’t Let Notes Be Misunderstood: A Negation Detection Method for Assessing Risk of Suicide in Mental Health Records


Title	Don’t Let Notes Be Misunderstood: A Negation Detection Method for Assessing Risk of Suicide in Mental Health Records
Authors	George Gkotsis, Sumithra Velupillai, Anika Oellrich, Harry Dean, Maria Liakata, Rina Dutta
Abstract
Tasks	Negation Detection
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0310/
PDF	https://www.aclweb.org/anthology/W16-0310
PWC	https://paperswithcode.com/paper/dont-let-notes-be-misunderstood-a-negation
Repo	https://github.com/gkotsis/negation-detection
Framework	none

Mnemonic Descent Method: A recurrent process applied for end-to-end face alignment


Title	Mnemonic Descent Method: A recurrent process applied for end-to-end face alignment
Authors	G. Trigeorgis, P. Snape, M. A. Nicolaou, E. Antonakos, S. Zafeiriou
Abstract	Cascaded regression has recently become the method of choice for solving non-linear least squares problems such as deformable image alignment. Given a sizeable training set, cascaded regression learns a set of generic rules that are sequentially applied to minimise the least squares problem. Despite the success of cascaded regression for problems such as face alignment and head pose estimation, there are several shortcomings arising in the strategies proposed thus far. Specifically,(a) the regressors are learnt independently,(b) the descent directions may cancel one another out and (c) handcrafted features (eg, HoGs, SIFT etc.) are mainly used to drive the cascade, which may be sub-optimal for the task at hand. In this paper, we propose a combined and jointly trained convolutional recurrent neural network architecture that allows the training of an end-to-end to system that attempts to alleviate the aforementioned drawbacks. The recurrent module facilitates the joint optimisation of the regressors by assuming the cascades form a nonlinear dynamical system, in effect fully utilising the information between all cascade levels by introducing a memory unit that shares information across all levels. The convolutional module allows the network to extract features that are specialised for the task at hand and are experimentally shown to outperform hand-crafted features. We show that the application of the proposed architecture for the problem of face alignment results in a strong improvement over the current state-of-the-art.
Tasks	Face Alignment, Head Pose Estimation, Pose Estimation
Published	2016-06-01
URL	https://www.ibug.doc.ic.ac.uk/media/uploads/documents/trigeorgis2016mnemonic.pdf
PDF	https://www.ibug.doc.ic.ac.uk/media/uploads/documents/trigeorgis2016mnemonic.pdf
PWC	https://paperswithcode.com/paper/mnemonic-descent-method-a-recurrent-process-1
Repo	https://github.com/trigeorgis/mdm
Framework	tf

Structure-From-Motion Revisited


Title	Structure-From-Motion Revisited
Authors	Johannes L. Schonberger, Jan-Michael Frahm
Abstract	Incremental Structure-from-Motion is a prevalent strategy for 3D reconstruction from unordered image collections. While incremental reconstruction systems have tremendously advanced in all regards, robustness, accuracy, completeness, and scalability remain the key problems towards building a truly general-purpose pipeline. We propose a new SfM technique that improves upon the state of the art to make a further step towards this ultimate goal. The full reconstruction pipeline is released to the public as an open-source implementation.
Tasks	3D Reconstruction
Published	2016-06-01
URL	http://openaccess.thecvf.com/content_cvpr_2016/html/Schonberger_Structure-From-Motion_Revisited_CVPR_2016_paper.html
PDF	http://openaccess.thecvf.com/content_cvpr_2016/papers/Schonberger_Structure-From-Motion_Revisited_CVPR_2016_paper.pdf
PWC	https://paperswithcode.com/paper/structure-from-motion-revisited
Repo	https://github.com/colmap/colmap
Framework	none

Accumulated Stability Voting: A Robust Descriptor From Descriptors of Multiple Scales


Title	Accumulated Stability Voting: A Robust Descriptor From Descriptors of Multiple Scales
Authors	Tsun-Yi Yang, Yen-Yu Lin, Yung-Yu Chuang
Abstract	This paper proposes a novel local descriptor through accumulated stability voting (ASV). The stability of feature dimensions is measured by their differences across scales. To be more robust to noise, the stability is further quantized by thresholding. The principle of maximum entropy is utilized for determining the best thresholds for maximizing discriminant power of the resultant descriptor. Accumulating stability renders a real-valued descriptor and it can be converted into a binary descriptor by an additional thresholding process. The real-valued descriptor attains high matching accuracy while the binary descriptor makes a good compromise between storage and accuracy. Our descriptors are simple yet effective, and easy to implement. In addition, our descriptors require no training. Experiments on popular benchmarks demonstrate the effectiveness of our descriptors and their superiority to the state-of-the-art descriptors.
Tasks
Published	2016-06-01
URL	http://openaccess.thecvf.com/content_cvpr_2016/html/Yang_Accumulated_Stability_Voting_CVPR_2016_paper.html
PDF	http://openaccess.thecvf.com/content_cvpr_2016/papers/Yang_Accumulated_Stability_Voting_CVPR_2016_paper.pdf
PWC	https://paperswithcode.com/paper/accumulated-stability-voting-a-robust
Repo	https://github.com/shamangary/ASV
Framework	none

Robust Gram Embeddings


Title	Robust Gram Embeddings
Authors	Taygun Keke{\c{c}}, David M. J. Tax
Abstract
Tasks	Sarcasm Detection, Sentiment Analysis, Word Embeddings
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1113/
PDF	https://www.aclweb.org/anthology/D16-1113
PWC	https://paperswithcode.com/paper/robust-gram-embeddings
Repo	https://github.com/taygunk/robust_gram_embeddings
Framework	none

Joint Event Extraction via Recurrent Neural Networks


Title	Joint Event Extraction via Recurrent Neural Networks
Authors	Thien Huu Nguyen, Kyunghyun Cho, Ralph Grishman
Abstract
Tasks	Structured Prediction
Published	2016-06-01
URL	https://www.aclweb.org/anthology/N16-1034/
PDF	https://www.aclweb.org/anthology/N16-1034
PWC	https://paperswithcode.com/paper/joint-event-extraction-via-recurrent-neural
Repo	https://github.com/anoperson/jointEE-NN
Framework	none

Learning compact binary descriptors with unsupervised deep neural networks


Title	Learning compact binary descriptors with unsupervised deep neural networks
Authors	Kevin Lin; Jiwen Lu; Chu-Song Chen; Jie Zhou
Abstract	In this paper, we propose a new unsupervised deep learning approach called DeepBit to learn compact binary descriptor for efficient visual object matching. Unlike most existing binary descriptors which were designed with random projections or linear hash functions, we develop a deep neural network to learn binary descriptors in an unsupervised manner. We enforce three criterions on binary codes which are learned at the top layer of our network: 1) minimal loss quantization, 2) evenly distributed codes and 3) uncorrelated bits. Then, we learn the parameters of the networks with a back-propagation technique. Experimental results on three different visual analysis tasks including image matching, image retrieval, and object recognition clearly demonstrate the effectiveness of the proposed approach.
Tasks	Image Retrieval, Object Recognition, Quantization
Published	2016-06-01
URL	https://ieeexplore.ieee.org/document/7780502/
PDF	https://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Lin_Learning_Compact_Binary_CVPR_2016_paper.pdf
PWC	https://paperswithcode.com/paper/learning-compact-binary-descriptors-with-1
Repo	https://github.com/kevinlin311tw/cvpr16-deepbit
Framework	none

A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection


Title	A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
Authors	J{'e}r{'e}my Ferrero, Fr{'e}d{'e}ric Agn{`e}s, Laurent Besacier, Didier Schwab
Abstract	In this paper we describe our effort to create a dataset for the evaluation of cross-language textual similarity detection. We present preexisting corpora and their limits and we explain the various gathered resources to overcome these limits and build our enriched dataset. The proposed dataset is multilingual, includes cross-language alignment for different granularities (from chunk to document), is based on both parallel and comparable corpora and contains human and machine translated texts. Moreover, it includes texts written by multiple types of authors (from average to professionals). With the obtained dataset, we conduct a systematic and rigorous evaluation of several state-of-the-art cross-language textual similarity detection methods. The evaluation results are reviewed and discussed. Finally, dataset and scripts are made publicly available on GitHub: http://github.com/FerreroJeremy/Cross-Language-Dataset.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1657/
PDF	https://www.aclweb.org/anthology/L16-1657
PWC	https://paperswithcode.com/paper/a-multilingual-multi-style-and-multi
Repo	https://github.com/FerreroJeremy/Cross-Language-Dataset
Framework	none

Dynamic Image Networks for Action Recognition


Title	Dynamic Image Networks for Action Recognition
Authors	Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi, Stephen Gould
Abstract	We introduce the concept of dynamic image, a novel compact representation of videos useful for video analysis especially when convolutional neural networks (CNNs) are used. The dynamic image is based on the rank pooling concept and is obtained through the parameters of a ranking machine that encodes the temporal evolution of the frames of the video. Dynamic images are obtained by directly applying rank pooling on the raw image pixels of a video producing a single RGB image per video. This idea is simple but powerful as it enables the use of existing CNN models directly on video data with fine-tuning. We present an efficient and effective approximate rank pooling operator, speeding it up orders of magnitude compared to rank pooling. Our new approximate rank pooling CNN layer allows us to generalize dynamic images to dynamic feature maps and we demonstrate the power of our new representations on standard benchmarks in action recognition achieving state-of-the-art performance.
Tasks	Temporal Action Localization
Published	2016-06-01
URL	http://openaccess.thecvf.com/content_cvpr_2016/html/Bilen_Dynamic_Image_Networks_CVPR_2016_paper.html
PDF	http://openaccess.thecvf.com/content_cvpr_2016/papers/Bilen_Dynamic_Image_Networks_CVPR_2016_paper.pdf
PWC	https://paperswithcode.com/paper/dynamic-image-networks-for-action-recognition
Repo	https://github.com/hbilen/dynamic-image-nets
Framework	none

Structural Deep Network Embedding


Title	Structural Deep Network Embedding
Authors	Daixin Wang1, Peng Cui1, Wenwu Zhu1
Abstract	Networkembeddingisanimportantmethodtolearnlow-dimensional representations of vertexes in networks, aiming to capture and preserve the network structure. Almost all the existing network embeddingmethodsadoptshallowmodels. However,sincetheunderlyingnetworkstructureiscomplex, shallowmodelscannotcapture the highly non-linear network structure, resulting in sub-optimal network representations. Therefore, how to ﬁnd a method that is able to effectively capture the highly non-linear network structure and preserve the global and local structure is an open yet important problem. To solve this problem, in this paper we propose a StructuralDeepNetworkEmbeddingmethod,namelySDNE.More speciﬁcally, we ﬁrst propose a semi-supervised deep model, which has multiple layers of non-linear functions, thereby being able to capture the highly non-linear network structure. Then we propose to exploit the ﬁrst-order and second-order proximity jointly to preserve the network structure. The second-order proximity is used bytheunsupervisedcomponenttocapturetheglobalnetworkstructure. Whiletheﬁrst-orderproximityisusedasthesupervisedinformation in the supervised component to preserve the local network structure. By jointly optimizing them in the semi-supervised deep model, our method can preserve both the local and global network structureandisrobusttosparsenetworks. Empirically,weconduct the experiments on ﬁve real-world networks, including a language network, a citation network and three social networks. The results show that compared to the baselines, our method can reconstruct the original network signiﬁcantly better and achieves substantial gains in three applications, i.e. multi-label classiﬁcation, link prediction and visualization.
Tasks	Graph Classification, Link Prediction, Network Embedding
Published	2016-06-01
URL	https://www.kdd.org/kdd2016/papers/files/rfp0191-wangAemb.pdf
PDF	https://www.kdd.org/kdd2016/papers/files/rfp0191-wangAemb.pdf
PWC	https://paperswithcode.com/paper/structural-deep-network-embedding
Repo	https://github.com/shenweichen/GraphEmbedding
Framework	tf