January 24, 2020

2882 words 14 mins read

Paper Group NANR 139

Paper Group NANR 139

Toponym Detection in the Bio-Medical Domain: A Hybrid Approach with Deep Learning. Dimensionality reduction: theoretical perspective on practical measures. Unsupervised dialogue intent detection via hierarchical topic model. Policy Preference Detection in Parliamentary Debate Motions. The MADAR Shared Task on Arabic Fine-Grained Dialect Identificat …

Toponym Detection in the Bio-Medical Domain: A Hybrid Approach with Deep Learning

Title Toponym Detection in the Bio-Medical Domain: A Hybrid Approach with Deep Learning
Authors Alistair Plum, Tharindu Ranasinghe, Constantin Orasan
Abstract This paper compares how different machine learning classifiers can be used together with simple string matching and named entity recognition to detect locations in texts. We compare five different state-of-the-art machine learning classifiers in order to predict whether a sentence contains a location or not. Following this classification task, we use a string matching algorithm with a gazetteer to identify the exact index of a toponym within the sentence. We evaluate different approaches in terms of machine learning classifiers, text pre-processing and location extraction on the SemEval-2019 Task 12 dataset, compiled for toponym resolution in the bio-medical domain. Finally, we compare the results with our system that was previously submitted to the SemEval-2019 task evaluation.
Tasks Named Entity Recognition
Published 2019-09-01
URL https://www.aclweb.org/anthology/R19-1106/
PDF https://www.aclweb.org/anthology/R19-1106
PWC https://paperswithcode.com/paper/toponym-detection-in-the-bio-medical-domain-a
Repo
Framework

Dimensionality reduction: theoretical perspective on practical measures

Title Dimensionality reduction: theoretical perspective on practical measures
Authors Yair Bartal, Nova Fandina, Ofer Neiman
Abstract Dimensionality reduction plays a central role in real-world applications for Machine Learning, among many fields. In particular, metric dimensionality reduction where data from a general metric is mapped into low dimensional space, is often used as a first step before applying machine learning algorithms. In almost all these applications the quality of the embedding is measured by various average case criteria. Metric dimensionality reduction has also been studied in Math and TCS, within the extremely fruitful and influential field of metric embedding. Yet, the vast majority of theoretical research has been devoted to analyzing the worst case behavior of embeddings and therefore has little relevance to practical settings. The goal of this paper is to bridge the gap between theory and practice view-points of metric dimensionality reduction, laying the foundation for a theoretical study of more practically oriented analysis. This paper can be viewed as providing a comprehensive theoretical framework addressing a line of research initiated by VL [NeuroIPS’ 18] who have set the goal of analyzing different distortion measurement criteria, with the lens of Machine Learning applicability, from both theoretical and practical perspectives. We complement their work by considering some important and vastly used average case criteria, some of which originated within the well-known Multi-Dimensional Scaling framework. While often studied in practice, no theoretical studies have thus far attempted at providing rigorous analysis of these criteria. In this paper we provide the first analysis of these, as well as the new distortion measure developed by [VL18] designed to possess Machine Learning desired properties. Moreover, we show that all measures considered can be adapted to possess similar qualities. The main consequences of our work are nearly tight bounds on the absolute values of all distortion criteria, as well as first approximation algorithms with provable guarantees.
Tasks Dimensionality Reduction
Published 2019-12-01
URL http://papers.nips.cc/paper/9243-dimensionality-reduction-theoretical-perspective-on-practical-measures
PDF http://papers.nips.cc/paper/9243-dimensionality-reduction-theoretical-perspective-on-practical-measures.pdf
PWC https://paperswithcode.com/paper/dimensionality-reduction-theoretical
Repo
Framework

Unsupervised dialogue intent detection via hierarchical topic model

Title Unsupervised dialogue intent detection via hierarchical topic model
Authors Artem Popov, Victor Bulatov, Darya Polyudova, Eugenia Veselova
Abstract One of the challenges during a task-oriented chatbot development is the scarce availability of the labeled training data. The best way of getting one is to ask the assessors to tag each dialogue according to its intent. Unfortunately, performing labeling without any provisional collection structure is difficult since the very notion of the intent is ill-defined. In this paper, we propose a hierarchical multimodal regularized topic model to obtain a first approximation of the intent set. Our rationale for hierarchical models usage is their ability to take into account several degrees of the dialogues relevancy. We attempt to build a model that can distinguish between subject-based (e.g. medicine and transport topics) and action-based (e.g. filing of an application and tracking application status) similarities. In order to achieve this, we divide set of all features into several groups according to part-of-speech analysis. Various feature groups are treated differently on different hierarchy levels.
Tasks Chatbot, Intent Detection
Published 2019-09-01
URL https://www.aclweb.org/anthology/R19-1108/
PDF https://www.aclweb.org/anthology/R19-1108
PWC https://paperswithcode.com/paper/unsupervised-dialogue-intent-detection-via
Repo
Framework

Policy Preference Detection in Parliamentary Debate Motions

Title Policy Preference Detection in Parliamentary Debate Motions
Authors Gavin Abercrombie, Federico Nanni, Riza Batista-Navarro, Simone Paolo Ponzetto
Abstract Debate motions (proposals) tabled in the UK Parliament contain information about the stated policy preferences of the Members of Parliament who propose them, and are key to the analysis of all subsequent speeches given in response to them. We attempt to automatically label debate motions with codes from a pre-existing coding scheme developed by political scientists for the annotation and analysis of political parties{'} manifestos. We develop annotation guidelines for the task of applying these codes to debate motions at two levels of granularity and produce a dataset of manually labelled examples. We evaluate the annotation process and the reliability and utility of the labelling scheme, finding that inter-annotator agreement is comparable with that of other studies conducted on manifesto data. Moreover, we test a variety of ways of automatically labelling motions with the codes, ranging from similarity matching to neural classification methods, and evaluate them against the gold standard labels. From these experiments, we note that established supervised baselines are not always able to improve over simple lexical heuristics. At the same time, we detect a clear and evident benefit when employing BERT, a state-of-the-art deep language representation model, even in classification scenarios with over 30 different labels and limited amounts of training data.
Tasks
Published 2019-11-01
URL https://www.aclweb.org/anthology/K19-1024/
PDF https://www.aclweb.org/anthology/K19-1024
PWC https://paperswithcode.com/paper/policy-preference-detection-in-parliamentary
Repo
Framework

The MADAR Shared Task on Arabic Fine-Grained Dialect Identification

Title The MADAR Shared Task on Arabic Fine-Grained Dialect Identification
Authors Houda Bouamor, Sabit Hassan, Nizar Habash
Abstract In this paper, we present the results and findings of the MADAR Shared Task on Arabic Fine-Grained Dialect Identification. This shared task was organized as part of The Fourth Arabic Natural Language Processing Workshop, collocated with ACL 2019. The shared task includes two subtasks: the MADAR Travel Domain Dialect Identification subtask (Subtask 1) and the MADAR Twitter User Dialect Identification subtask (Subtask 2). This shared task is the first to target a large set of dialect labels at the city and country levels. The data for the shared task was created or collected under the Multi-Arabic Dialect Applications and Resources (MADAR) project. A total of 21 teams from 15 countries participated in the shared task.
Tasks
Published 2019-08-01
URL https://www.aclweb.org/anthology/W19-4622/
PDF https://www.aclweb.org/anthology/W19-4622
PWC https://paperswithcode.com/paper/the-madar-shared-task-on-arabic-fine-grained
Repo
Framework

Ground-to-Aerial Image Geo-Localization With a Hard Exemplar Reweighting Triplet Loss

Title Ground-to-Aerial Image Geo-Localization With a Hard Exemplar Reweighting Triplet Loss
Authors Sudong Cai, Yulan Guo, Salman Khan, Jiwei Hu, Gongjian Wen
Abstract The task of ground-to-aerial image geo-localization can be achieved by matching a ground view query image to a reference database of aerial/satellite images. It is highly challenging due to the dramatic viewpoint changes and unknown orientations. In this paper, we propose a novel in-batch reweighting triplet loss to emphasize the positive effect of hard exemplars during end-to-end training. We also integrate an attention mechanism into our model using feature-level contextual information. To analyze the difficulty level of each triplet, we first enforce a modified logistic regression to triplets with a distance rectifying factor. Then, the reference negative distances for corresponding anchors are set, and the relative weights of triplets are computed by comparing their difficulty to the corresponding references. To reduce the influence of extreme hard data and less useful simple exemplars, the final weights are pruned using upper and lower bound constraints. Experiments on two benchmark datasets show that the proposed approach significantly outperforms the state-of-the-art methods.
Tasks
Published 2019-10-01
URL http://openaccess.thecvf.com/content_ICCV_2019/html/Cai_Ground-to-Aerial_Image_Geo-Localization_With_a_Hard_Exemplar_Reweighting_Triplet_Loss_ICCV_2019_paper.html
PDF http://openaccess.thecvf.com/content_ICCV_2019/papers/Cai_Ground-to-Aerial_Image_Geo-Localization_With_a_Hard_Exemplar_Reweighting_Triplet_Loss_ICCV_2019_paper.pdf
PWC https://paperswithcode.com/paper/ground-to-aerial-image-geo-localization-with
Repo
Framework

Generating Formality-Tuned Summaries Using Input-Dependent Rewards

Title Generating Formality-Tuned Summaries Using Input-Dependent Rewards
Authors Kushal Chawla, Balaji Vasan Srinivasan, Niyati Chhaya
Abstract Abstractive text summarization aims at generating human-like summaries by understanding and paraphrasing the given input content. Recent efforts based on sequence-to-sequence networks only allow the generation of a single summary. However, it is often desirable to accommodate the psycho-linguistic preferences of the intended audience while generating the summaries. In this work, we present a reinforcement learning based approach to generate formality-tailored summaries for an input article. Our novel input-dependent reward function aids in training the model with stylistic feedback on sampled and ground-truth summaries together. Once trained, the same model can generate formal and informal summary variants. Our automated and qualitative evaluations show the viability of the proposed framework.
Tasks Abstractive Text Summarization, Text Summarization
Published 2019-11-01
URL https://www.aclweb.org/anthology/K19-1078/
PDF https://www.aclweb.org/anthology/K19-1078
PWC https://paperswithcode.com/paper/generating-formality-tuned-summaries-using
Repo
Framework

Classification of Micro-Texts Using Sub-Word Embeddings

Title Classification of Micro-Texts Using Sub-Word Embeddings
Authors Mihir Joshi, Nur Zincir-Heywood
Abstract Extracting features and writing styles from short text messages is always a challenge. Short messages, like tweets, do not have enough data to perform statistical authorship attribution. Besides, the vocabulary used in these texts is sometimes improvised or misspelled. Therefore, in this paper, we propose combining four feature extraction techniques namely character n-grams, word n-grams, Flexible Patterns and a new sub-word embedding using the skip-gram model. Our system uses a Multi-Layer Perceptron to utilize these features from tweets to analyze short text messages. This proposed system achieves 85{%} accuracy, which is a considerable improvement over previous systems.
Tasks Word Embeddings
Published 2019-09-01
URL https://www.aclweb.org/anthology/R19-1062/
PDF https://www.aclweb.org/anthology/R19-1062
PWC https://paperswithcode.com/paper/classification-of-micro-texts-using-sub-word
Repo
Framework

Graph Embeddings for Frame Identification

Title Graph Embeddings for Frame Identification
Authors Alex Popov, er, Jennifer Sikos
Abstract Lexical resources such as WordNet (Miller, 1995) and FrameNet (Baker et al., 1998) are organized as graphs, where relationships between words are made explicit via the structure of the resource. This work explores how structural information from these lexical resources can lead to gains in a downstream task, namely frame identification. While much of the current work in frame identification uses various neural architectures to predict frames, those neural architectures only use representations of frames based on annotated corpus data. We demonstrate how incorporating knowledge directly from the FrameNet graph structure improves the performance of a neural network-based frame identification system. Specifically, we construct a bidirectional LSTM with a loss function that incorporates various graph- and corpus-based frame embeddings for learning and ultimately achieves strong performance gains with the graph-based embeddings over corpus-based embeddings alone.
Tasks
Published 2019-09-01
URL https://www.aclweb.org/anthology/R19-1109/
PDF https://www.aclweb.org/anthology/R19-1109
PWC https://paperswithcode.com/paper/graph-embeddings-for-frame-identification
Repo
Framework

On Committee Representations of Adversarial Learning Models for Question-Answer Ranking

Title On Committee Representations of Adversarial Learning Models for Question-Answer Ranking
Authors Sparsh Gupta, Vitor Carvalho
Abstract Adversarial training is a process in Machine Learning that explicitly trains models on adversarial inputs (inputs designed to deceive or trick the learning process) in order to make it more robust or accurate. In this paper we investigate how representing adversarial training models as committees can be used to effectively improve the performance of Question-Answer (QA) Ranking. We start by empirically probing the effects of adversarial training over multiple QA ranking algorithms, including the state-of-the-art Multihop Attention Network model. We evaluate these algorithms on several benchmark datasets and observe that, while adversarial training is beneficial to most baseline algorithms, there are cases where it may lead to overfitting and performance degradation. We investigate the causes of such degradation, and then propose a new representation procedure for this adversarial learning problem, based on committee learning, that not only is capable of consistently improving all baseline algorithms, but also outperforms the previous state-of-the-art algorithm by as much as 6{%} in NDCG.
Tasks
Published 2019-08-01
URL https://www.aclweb.org/anthology/W19-4325/
PDF https://www.aclweb.org/anthology/W19-4325
PWC https://paperswithcode.com/paper/on-committee-representations-of-adversarial
Repo
Framework

From Strings to Things: Knowledge-Enabled VQA Model That Can Read and Reason

Title From Strings to Things: Knowledge-Enabled VQA Model That Can Read and Reason
Authors Ajeet Kumar Singh, Anand Mishra, Shashank Shekhar, Anirban Chakraborty
Abstract Text present in images are not merely strings, they provide useful cues about the image. Despite their utility in better image understanding, scene texts are not used in traditional visual question answering (VQA) models. In this work, we present a VQA model which can read scene texts and perform reasoning on a knowledge graph to arrive at an accurate answer. Our proposed model has three mutually interacting modules: i. proposal module to get word and visual content proposals from the image, ii. fusion module to fuse these proposals, question and knowledge base to mine relevant facts, and represent these facts as multi-relational graph, iii. reasoning module to perform a novel gated graph neural network based reasoning on this graph. The performance of our knowledge-enabled VQA model is evaluated on our newly introduced dataset, viz. text-KVQA. To the best of our knowledge, this is the first dataset which identifies the need for bridging text recognition with knowledge graph based reasoning. Through extensive experiments, we show that our proposed method outperforms traditional VQA as well as question-answering over knowledge base-based methods on text-KVQA.
Tasks Question Answering, Visual Question Answering
Published 2019-10-01
URL http://openaccess.thecvf.com/content_ICCV_2019/html/Singh_From_Strings_to_Things_Knowledge-Enabled_VQA_Model_That_Can_Read_ICCV_2019_paper.html
PDF http://openaccess.thecvf.com/content_ICCV_2019/papers/Singh_From_Strings_to_Things_Knowledge-Enabled_VQA_Model_That_Can_Read_ICCV_2019_paper.pdf
PWC https://paperswithcode.com/paper/from-strings-to-things-knowledge-enabled-vqa
Repo
Framework

A Framework for Relation Extraction Across Multiple Datasets in Multiple Domains

Title A Framework for Relation Extraction Across Multiple Datasets in Multiple Domains
Authors Geeticka Chauhan, Matthew McDermott, Peter Szolovits
Abstract In this work, we aim to build a unifying framework for relation extraction (RE), applying this on 3 highly used datasets with the ability to be extendable to new datasets. At the moment, the domain suffers from lack of reproducibility as well as a lack of consensus on generalizable techniques. Our framework will be open-sourced and will aid in performing systematic exploration on the effect of different modeling techniques, pre-processing, training methodologies and evaluation metrics on the 3 datasets to help establish a consensus.
Tasks Relation Extraction
Published 2019-08-01
URL https://www.aclweb.org/anthology/papers/W/W19/W19-3608/
PDF https://www.aclweb.org/anthology/W19-3608
PWC https://paperswithcode.com/paper/a-framework-for-relation-extraction-across
Repo
Framework

Quantum-based subgraph convolutional neural networks

Title Quantum-based subgraph convolutional neural networks
Authors Zhihong Zhang, Dongdong Chen, Jianjia Wang, Lu Bai, Edwin R.Hancock
Abstract This paper proposes a new graph convolutional neural network architecture based on a depth-based representation of graph structure deriving from quantum walks, which we refer to as the quantum-based subgraph convolutional neural network (QS-CNNs). This new architecture captures both the global topological structure and the local connectivity structure within a graph. Specifically, we commence by establishing a family of K-layer expansion subgraphs for each vertex of a graph by quantum walks, which captures the global topological arrangement information for substructures contained within a graph. We then design a set of fixed-size convolution filters over the subgraphs, which helps to characterise multi-scale patterns residing in the data. The idea is to apply convolution filters sliding over the entire set of subgraphs rooted at a vertex to extract the local features analogous to the standard convolution operation on grid data. Experiments on eight graph-structured datasets demonstrate that QS-CNNs architecture is capable of outperforming fourteen state-of-the-art methods for the tasks of node classification and graph classification.
Tasks Graph Classification, Node Classification
Published 2019-04-01
URL https://doi.org/10.1016/j.patcog.2018.11.002
PDF http://eprints.whiterose.ac.uk/139142/
PWC https://paperswithcode.com/paper/quantum-based-subgraph-convolutional-neural
Repo
Framework

BNU-HKBU UIC NLP Team 2 at SemEval-2019 Task 6: Detecting Offensive Language Using BERT model

Title BNU-HKBU UIC NLP Team 2 at SemEval-2019 Task 6: Detecting Offensive Language Using BERT model
Authors Zhenghao Wu, Hao Zheng, Jianming Wang, Weifeng Su, Jefferson Fong
Abstract In this study we deal with the problem of identifying and categorizing offensive language in social media. Our group, BNU-HKBU UIC NLP Team2, use supervised classification along with multiple version of data generated by different ways of pre-processing the data. We then use the state-of-the-art model Bidirectional Encoder Representations from Transformers, or BERT (Devlin et al, 2018), to capture linguistic, syntactic and semantic features. Long range dependencies between each part of a sentence can be captured by BERT{'}s bidirectional encoder representations. Our results show 85.12{%} accuracy and 80.57{%} F1 scores in Subtask A (offensive language identification), 87.92{%} accuracy and 50{%} F1 scores in Subtask B (categorization of offense types), and 69.95{%} accuracy and 50.47{%} F1 score in Subtask C (offense target identification). Analysis of the results shows that distinguishing between targeted and untargeted offensive language is not a simple task. More work needs to be done on the unbalance data problem in Subtasks B and C. Some future work is also discussed.
Tasks Language Identification
Published 2019-06-01
URL https://www.aclweb.org/anthology/S19-2099/
PDF https://www.aclweb.org/anthology/S19-2099
PWC https://paperswithcode.com/paper/bnu-hkbu-uic-nlp-team-2-at-semeval-2019-task
Repo
Framework

The Universitat d’Alacant Submissions to the English-to-Kazakh News Translation Task at WMT 2019

Title The Universitat d’Alacant Submissions to the English-to-Kazakh News Translation Task at WMT 2019
Authors V{'\i}ctor M. S{'a}nchez-Cartagena, Juan Antonio P{'e}rez-Ortiz, Felipe S{'a}nchez-Mart{'\i}nez
Abstract This paper describes the two submissions of Universitat d{'}Alacant to the English-to-Kazakh news translation task at WMT 2019. Our submissions take advantage of monolingual data and parallel data from other language pairs by means of iterative backtranslation, pivot backtranslation and transfer learning. They also use linguistic information in two ways: morphological segmentation of Kazakh text, and integration of the output of a rule-based machine translation system. Our systems were ranked second in terms of chrF++ despite being built from an ensemble of only 2 independent training runs.
Tasks Machine Translation, Transfer Learning
Published 2019-08-01
URL https://www.aclweb.org/anthology/W19-5339/
PDF https://www.aclweb.org/anthology/W19-5339
PWC https://paperswithcode.com/paper/the-universitat-dalacant-submissions-to-the
Repo
Framework
comments powered by Disqus