May 5, 2019

2024 words 10 mins read

Paper Group NAWR 6

Paper Group NAWR 6

An Open Corpus for Named Entity Recognition in Historic Newspapers. A Reading Comprehension Corpus for Machine Translation Evaluation. Normalising Medical Concepts in Social Media Texts by Learning Semantic Representation. Kyoto-NMT: a Neural Machine Translation implementation in Chainer. Learning a Lexicon and Translation Model from Phoneme Lattic …

An Open Corpus for Named Entity Recognition in Historic Newspapers

Title An Open Corpus for Named Entity Recognition in Historic Newspapers
Authors Clemens Neudecker
Abstract The availability of openly available textual datasets ({}corpora{''}) with highly accurate manual annotations ({}gold standard{''}) of named entities (e.g. persons, locations, organizations, etc.) is crucial in the training and evaluation of named entity recognition systems. Currently there are only few such datasets available on the web, and even less for texts containing historical spelling variation. The production and subsequent release into the public domain of four such datasets with 100 pages each for the languages Dutch, French, German (including Austrian) as part of the Europeana Newspapers project is expected to contribute to the further development and improvement of named entity recognition systems with a focus on historical content. This paper describes how these datasets were produced, what challenges were encountered in their creation and informs about their final quality and availability.
Tasks Named Entity Recognition
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1689/
PDF https://www.aclweb.org/anthology/L16-1689
PWC https://paperswithcode.com/paper/an-open-corpus-for-named-entity-recognition
Repo https://github.com/EuropeanaNewspapers/ner-corpora
Framework none

A Reading Comprehension Corpus for Machine Translation Evaluation

Title A Reading Comprehension Corpus for Machine Translation Evaluation
Authors Carolina Scarton, Lucia Specia
Abstract Effectively assessing Natural Language Processing output tasks is a challenge for research in the area. In the case of Machine Translation (MT), automatic metrics are usually preferred over human evaluation, given time and budget constraints.However, traditional automatic metrics (such as BLEU) are not reliable for absolute quality assessment of documents, often producing similar scores for documents translated by the same MT system.For scenarios where absolute labels are necessary for building models, such as document-level Quality Estimation, these metrics can not be fully trusted. In this paper, we introduce a corpus of reading comprehension tests based on machine translated documents, where we evaluate documents based on answers to questions by fluent speakers of the target language. We describe the process of creating such a resource, the experiment design and agreement between the test takers. Finally, we discuss ways to convert the reading comprehension test into document-level quality scores.
Tasks Machine Translation, Reading Comprehension
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1579/
PDF https://www.aclweb.org/anthology/L16-1579
PWC https://paperswithcode.com/paper/a-reading-comprehension-corpus-for-machine
Repo https://github.com/carolscarton/CREG-MT-eval
Framework none

Normalising Medical Concepts in Social Media Texts by Learning Semantic Representation

Title Normalising Medical Concepts in Social Media Texts by Learning Semantic Representation
Authors Nut Limsopatham, Nigel Collier
Abstract
Tasks Machine Translation
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1096/
PDF https://www.aclweb.org/anthology/P16-1096
PWC https://paperswithcode.com/paper/normalising-medical-concepts-in-social-media
Repo https://github.com/nutli/concept_normalisation
Framework tf

Kyoto-NMT: a Neural Machine Translation implementation in Chainer

Title Kyoto-NMT: a Neural Machine Translation implementation in Chainer
Authors Fabien Cromi{`e}res
Abstract We present Kyoto-NMT, an open-source implementation of the Neural Machine Translation paradigm. This implementation is done in Python and Chainer, an easy-to-use Deep Learning Framework.
Tasks Language Modelling, Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2064/
PDF https://www.aclweb.org/anthology/C16-2064
PWC https://paperswithcode.com/paper/kyoto-nmt-a-neural-machine-translation
Repo https://github.com/fabiencro/knmt
Framework none

Learning a Lexicon and Translation Model from Phoneme Lattices

Title Learning a Lexicon and Translation Model from Phoneme Lattices
Authors Oliver Adams, Graham Neubig, Trevor Cohn, Steven Bird, Quoc Truong Do, Satoshi Nakamura
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1263/
PDF https://www.aclweb.org/anthology/D16-1263
PWC https://paperswithcode.com/paper/learning-a-lexicon-and-translation-model-from
Repo https://github.com/oadams/latticetm
Framework none

Don’t Let Notes Be Misunderstood: A Negation Detection Method for Assessing Risk of Suicide in Mental Health Records

Title Don’t Let Notes Be Misunderstood: A Negation Detection Method for Assessing Risk of Suicide in Mental Health Records
Authors George Gkotsis, Sumithra Velupillai, Anika Oellrich, Harry Dean, Maria Liakata, Rina Dutta
Abstract
Tasks Negation Detection
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0310/
PDF https://www.aclweb.org/anthology/W16-0310
PWC https://paperswithcode.com/paper/dont-let-notes-be-misunderstood-a-negation
Repo https://github.com/gkotsis/negation-detection
Framework none

Mnemonic Descent Method: A recurrent process applied for end-to-end face alignment

Title Mnemonic Descent Method: A recurrent process applied for end-to-end face alignment
Authors G. Trigeorgis, P. Snape, M. A. Nicolaou, E. Antonakos, S. Zafeiriou
Abstract Cascaded regression has recently become the method of choice for solving non-linear least squares problems such as deformable image alignment. Given a sizeable training set, cascaded regression learns a set of generic rules that are sequentially applied to minimise the least squares problem. Despite the success of cascaded regression for problems such as face alignment and head pose estimation, there are several shortcomings arising in the strategies proposed thus far. Specifically,(a) the regressors are learnt independently,(b) the descent directions may cancel one another out and (c) handcrafted features (eg, HoGs, SIFT etc.) are mainly used to drive the cascade, which may be sub-optimal for the task at hand. In this paper, we propose a combined and jointly trained convolutional recurrent neural network architecture that allows the training of an end-to-end to system that attempts to alleviate the aforementioned drawbacks. The recurrent module facilitates the joint optimisation of the regressors by assuming the cascades form a nonlinear dynamical system, in effect fully utilising the information between all cascade levels by introducing a memory unit that shares information across all levels. The convolutional module allows the network to extract features that are specialised for the task at hand and are experimentally shown to outperform hand-crafted features. We show that the application of the proposed architecture for the problem of face alignment results in a strong improvement over the current state-of-the-art.
Tasks Face Alignment, Head Pose Estimation, Pose Estimation
Published 2016-06-01
URL https://www.ibug.doc.ic.ac.uk/media/uploads/documents/trigeorgis2016mnemonic.pdf
PDF https://www.ibug.doc.ic.ac.uk/media/uploads/documents/trigeorgis2016mnemonic.pdf
PWC https://paperswithcode.com/paper/mnemonic-descent-method-a-recurrent-process-1
Repo https://github.com/trigeorgis/mdm
Framework tf

Structure-From-Motion Revisited

Title Structure-From-Motion Revisited
Authors Johannes L. Schonberger, Jan-Michael Frahm
Abstract Incremental Structure-from-Motion is a prevalent strategy for 3D reconstruction from unordered image collections. While incremental reconstruction systems have tremendously advanced in all regards, robustness, accuracy, completeness, and scalability remain the key problems towards building a truly general-purpose pipeline. We propose a new SfM technique that improves upon the state of the art to make a further step towards this ultimate goal. The full reconstruction pipeline is released to the public as an open-source implementation.
Tasks 3D Reconstruction
Published 2016-06-01
URL http://openaccess.thecvf.com/content_cvpr_2016/html/Schonberger_Structure-From-Motion_Revisited_CVPR_2016_paper.html
PDF http://openaccess.thecvf.com/content_cvpr_2016/papers/Schonberger_Structure-From-Motion_Revisited_CVPR_2016_paper.pdf
PWC https://paperswithcode.com/paper/structure-from-motion-revisited
Repo https://github.com/colmap/colmap
Framework none

Accumulated Stability Voting: A Robust Descriptor From Descriptors of Multiple Scales

Title Accumulated Stability Voting: A Robust Descriptor From Descriptors of Multiple Scales
Authors Tsun-Yi Yang, Yen-Yu Lin, Yung-Yu Chuang
Abstract This paper proposes a novel local descriptor through accumulated stability voting (ASV). The stability of feature dimensions is measured by their differences across scales. To be more robust to noise, the stability is further quantized by thresholding. The principle of maximum entropy is utilized for determining the best thresholds for maximizing discriminant power of the resultant descriptor. Accumulating stability renders a real-valued descriptor and it can be converted into a binary descriptor by an additional thresholding process. The real-valued descriptor attains high matching accuracy while the binary descriptor makes a good compromise between storage and accuracy. Our descriptors are simple yet effective, and easy to implement. In addition, our descriptors require no training. Experiments on popular benchmarks demonstrate the effectiveness of our descriptors and their superiority to the state-of-the-art descriptors.
Tasks
Published 2016-06-01
URL http://openaccess.thecvf.com/content_cvpr_2016/html/Yang_Accumulated_Stability_Voting_CVPR_2016_paper.html
PDF http://openaccess.thecvf.com/content_cvpr_2016/papers/Yang_Accumulated_Stability_Voting_CVPR_2016_paper.pdf
PWC https://paperswithcode.com/paper/accumulated-stability-voting-a-robust
Repo https://github.com/shamangary/ASV
Framework none

Robust Gram Embeddings

Title Robust Gram Embeddings
Authors Taygun Keke{\c{c}}, David M. J. Tax
Abstract
Tasks Sarcasm Detection, Sentiment Analysis, Word Embeddings
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1113/
PDF https://www.aclweb.org/anthology/D16-1113
PWC https://paperswithcode.com/paper/robust-gram-embeddings
Repo https://github.com/taygunk/robust_gram_embeddings
Framework none

Joint Event Extraction via Recurrent Neural Networks

Title Joint Event Extraction via Recurrent Neural Networks
Authors Thien Huu Nguyen, Kyunghyun Cho, Ralph Grishman
Abstract
Tasks Structured Prediction
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1034/
PDF https://www.aclweb.org/anthology/N16-1034
PWC https://paperswithcode.com/paper/joint-event-extraction-via-recurrent-neural
Repo https://github.com/anoperson/jointEE-NN
Framework none

Learning compact binary descriptors with unsupervised deep neural networks

Title Learning compact binary descriptors with unsupervised deep neural networks
Authors Kevin Lin; Jiwen Lu; Chu-Song Chen; Jie Zhou
Abstract In this paper, we propose a new unsupervised deep learning approach called DeepBit to learn compact binary descriptor for efficient visual object matching. Unlike most existing binary descriptors which were designed with random projections or linear hash functions, we develop a deep neural network to learn binary descriptors in an unsupervised manner. We enforce three criterions on binary codes which are learned at the top layer of our network: 1) minimal loss quantization, 2) evenly distributed codes and 3) uncorrelated bits. Then, we learn the parameters of the networks with a back-propagation technique. Experimental results on three different visual analysis tasks including image matching, image retrieval, and object recognition clearly demonstrate the effectiveness of the proposed approach.
Tasks Image Retrieval, Object Recognition, Quantization
Published 2016-06-01
URL https://ieeexplore.ieee.org/document/7780502/
PDF https://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Lin_Learning_Compact_Binary_CVPR_2016_paper.pdf
PWC https://paperswithcode.com/paper/learning-compact-binary-descriptors-with-1
Repo https://github.com/kevinlin311tw/cvpr16-deepbit
Framework none

A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection

Title A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
Authors J{'e}r{'e}my Ferrero, Fr{'e}d{'e}ric Agn{`e}s, Laurent Besacier, Didier Schwab
Abstract In this paper we describe our effort to create a dataset for the evaluation of cross-language textual similarity detection. We present preexisting corpora and their limits and we explain the various gathered resources to overcome these limits and build our enriched dataset. The proposed dataset is multilingual, includes cross-language alignment for different granularities (from chunk to document), is based on both parallel and comparable corpora and contains human and machine translated texts. Moreover, it includes texts written by multiple types of authors (from average to professionals). With the obtained dataset, we conduct a systematic and rigorous evaluation of several state-of-the-art cross-language textual similarity detection methods. The evaluation results are reviewed and discussed. Finally, dataset and scripts are made publicly available on GitHub: http://github.com/FerreroJeremy/Cross-Language-Dataset.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1657/
PDF https://www.aclweb.org/anthology/L16-1657
PWC https://paperswithcode.com/paper/a-multilingual-multi-style-and-multi
Repo https://github.com/FerreroJeremy/Cross-Language-Dataset
Framework none

Dynamic Image Networks for Action Recognition

Title Dynamic Image Networks for Action Recognition
Authors Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi, Stephen Gould
Abstract We introduce the concept of dynamic image, a novel compact representation of videos useful for video analysis especially when convolutional neural networks (CNNs) are used. The dynamic image is based on the rank pooling concept and is obtained through the parameters of a ranking machine that encodes the temporal evolution of the frames of the video. Dynamic images are obtained by directly applying rank pooling on the raw image pixels of a video producing a single RGB image per video. This idea is simple but powerful as it enables the use of existing CNN models directly on video data with fine-tuning. We present an efficient and effective approximate rank pooling operator, speeding it up orders of magnitude compared to rank pooling. Our new approximate rank pooling CNN layer allows us to generalize dynamic images to dynamic feature maps and we demonstrate the power of our new representations on standard benchmarks in action recognition achieving state-of-the-art performance.
Tasks Temporal Action Localization
Published 2016-06-01
URL http://openaccess.thecvf.com/content_cvpr_2016/html/Bilen_Dynamic_Image_Networks_CVPR_2016_paper.html
PDF http://openaccess.thecvf.com/content_cvpr_2016/papers/Bilen_Dynamic_Image_Networks_CVPR_2016_paper.pdf
PWC https://paperswithcode.com/paper/dynamic-image-networks-for-action-recognition
Repo https://github.com/hbilen/dynamic-image-nets
Framework none

Structural Deep Network Embedding

Title Structural Deep Network Embedding
Authors Daixin Wang1, Peng Cui1, Wenwu Zhu1
Abstract Networkembeddingisanimportantmethodtolearnlow-dimensional representations of vertexes in networks, aiming to capture and preserve the network structure. Almost all the existing network embeddingmethodsadoptshallowmodels. However,sincetheunderlyingnetworkstructureiscomplex, shallowmodelscannotcapture the highly non-linear network structure, resulting in sub-optimal network representations. Therefore, how to find a method that is able to effectively capture the highly non-linear network structure and preserve the global and local structure is an open yet important problem. To solve this problem, in this paper we propose a StructuralDeepNetworkEmbeddingmethod,namelySDNE.More specifically, we first propose a semi-supervised deep model, which has multiple layers of non-linear functions, thereby being able to capture the highly non-linear network structure. Then we propose to exploit the first-order and second-order proximity jointly to preserve the network structure. The second-order proximity is used bytheunsupervisedcomponenttocapturetheglobalnetworkstructure. Whilethefirst-orderproximityisusedasthesupervisedinformation in the supervised component to preserve the local network structure. By jointly optimizing them in the semi-supervised deep model, our method can preserve both the local and global network structureandisrobusttosparsenetworks. Empirically,weconduct the experiments on five real-world networks, including a language network, a citation network and three social networks. The results show that compared to the baselines, our method can reconstruct the original network significantly better and achieves substantial gains in three applications, i.e. multi-label classification, link prediction and visualization.
Tasks Graph Classification, Link Prediction, Network Embedding
Published 2016-06-01
URL https://www.kdd.org/kdd2016/papers/files/rfp0191-wangAemb.pdf
PDF https://www.kdd.org/kdd2016/papers/files/rfp0191-wangAemb.pdf
PWC https://paperswithcode.com/paper/structural-deep-network-embedding
Repo https://github.com/shenweichen/GraphEmbedding
Framework tf
comments powered by Disqus