January 24, 2020

2886 words 14 mins read

Paper Group NANR 119

YUN-HPCC at SemEval-2019 Task 3: Multi-Step Ensemble Neural Network for Sentiment Analysis in Textual Conversation. Reliable Uncertainty Estimates in Deep Neural Networks using Noise Contrastive Priors. Automated Cross-language Intelligibility Analysis of Parkinson’s Disease Patients Using Speech Recognition Technologies. A Little Linguistics Goes …

YUN-HPCC at SemEval-2019 Task 3: Multi-Step Ensemble Neural Network for Sentiment Analysis in Textual Conversation


Title	YUN-HPCC at SemEval-2019 Task 3: Multi-Step Ensemble Neural Network for Sentiment Analysis in Textual Conversation
Authors	Dawei Li, Jin Wang, Xuejie Zhang
Abstract	This paper describes our approach to the sentiment analysis of Twitter textual conversations based on deep learning. We analyze the syntax, abbreviations, and informal-writing of Twitter; and perform perfect data preprocessing on the data to convert them to normative text. We apply a multi-step ensemble strategy to solve the problem of extremely unbalanced data in the training set. This is achieved by taking the GloVe and Elmo word vectors as input into a combination model with four different deep neural networks. The experimental results from the development dataset demonstrate that the proposed model exhibits a strong generalization ability. For evaluation on the best dataset, we integrated the results using the stacking ensemble learning approach and achieved competitive results. According to the final official review, the results of our model ranked 10th out of 165 teams.
Tasks	Sentiment Analysis
Published	2019-06-01
URL	https://www.aclweb.org/anthology/S19-2063/
PDF	https://www.aclweb.org/anthology/S19-2063
PWC	https://paperswithcode.com/paper/yun-hpcc-at-semeval-2019-task-3-multi-step
Repo
Framework

Reliable Uncertainty Estimates in Deep Neural Networks using Noise Contrastive Priors


Title	Reliable Uncertainty Estimates in Deep Neural Networks using Noise Contrastive Priors
Authors	Danijar Hafner, Dustin Tran, Timothy Lillicrap, Alex Irpan, James Davidson
Abstract	Obtaining reliable uncertainty estimates of neural network predictions is a long standing challenge. Bayesian neural networks have been proposed as a solution, but it remains open how to specify their prior. In particular, the common practice of a standard normal prior in weight space imposes only weak regularities, causing the function posterior to possibly generalize in unforeseen ways on inputs outside of the training distribution. We propose noise contrastive priors (NCPs) to obtain reliable uncertainty estimates. The key idea is to train the model to output high uncertainty for data points outside of the training distribution. NCPs do so using an input prior, which adds noise to the inputs of the current mini batch, and an output prior, which is a wide distribution given these inputs. NCPs are compatible with any model that can output uncertainty estimates, are easy to scale, and yield reliable uncertainty estimates throughout training. Empirically, we show that NCPs prevent overfitting outside of the training distribution and result in uncertainty estimates that are useful for active learning. We demonstrate the scalability of our method on the flight delays data set, where we significantly improve upon previously published results.
Tasks	Active Learning
Published	2019-05-01
URL	https://openreview.net/forum?id=HkgxasA5Ym
PDF	https://openreview.net/pdf?id=HkgxasA5Ym
PWC	https://paperswithcode.com/paper/reliable-uncertainty-estimates-in-deep-neural-1
Repo
Framework

Automated Cross-language Intelligibility Analysis of Parkinson’s Disease Patients Using Speech Recognition Technologies


Title	Automated Cross-language Intelligibility Analysis of Parkinson’s Disease Patients Using Speech Recognition Technologies
Authors	Nina Hosseini-Kivanani, Juan Camilo V{'a}squez-Correa, Manfred Stede, Elmar N{"o}th
Abstract	Speech deficits are common symptoms amongParkinson{'}s Disease (PD) patients. The automatic assessment of speech signals is promising for the evaluation of the neurological state and the speech quality of the patients. Recently, progress has been made in applying machine learning and computational methods to automatically evaluate the speech of PD patients. In the present study, we plan to analyze the speech signals of PD patients and healthy control (HC) subjects in three different languages: German, Spanish, and Czech, with the aim to identify biomarkers to discriminate between PD patients and HC subjects and to evaluate the neurological state of the patients. Therefore, the main contribution of this study is the automatic classification of PD patients and HC subjects in different languages with focusing on phonation, articulation, and prosody. We will focus on an intelligibility analysis based on automatic speech recognition systems trained on these three languages. This is one of the first studies done that considers the evaluation of the speech of PD patients in different languages. The purpose of this research proposal is to build a model that can discriminate PD and HC subjects even when the language used for train and test is different.
Tasks	Speech Recognition
Published	2019-07-01
URL	https://www.aclweb.org/anthology/P19-2010/
PDF	https://www.aclweb.org/anthology/P19-2010
PWC	https://paperswithcode.com/paper/automated-cross-language-intelligibility
Repo
Framework

A Little Linguistics Goes a Long Way: Unsupervised Segmentation with Limited Language Specific Guidance


Title	A Little Linguistics Goes a Long Way: Unsupervised Segmentation with Limited Language Specific Guidance
Authors	Alex Erdmann, er, Salam Khalifa, Mai Oudah, Nizar Habash, Houda Bouamor
Abstract	We present de-lexical segmentation, a linguistically motivated alternative to greedy or other unsupervised methods, requiring only minimal language specific input. Our technique involves creating a small grammar of closed-class affixes which can be written in a few hours. The grammar over generates analyses for word forms attested in a raw corpus which are disambiguated based on features of the linguistic base proposed for each form. Extending the grammar to cover orthographic, morpho-syntactic or lexical variation is simple, making it an ideal solution for challenging corpora with noisy, dialect-inconsistent, or otherwise non-standard content. In two evaluations, we consistently outperform competitive unsupervised baselines and approach the performance of state-of-the-art supervised models trained on large amounts of data, providing evidence for the value of linguistic input during preprocessing.
Tasks
Published	2019-08-01
URL	https://www.aclweb.org/anthology/W19-4214/
PDF	https://www.aclweb.org/anthology/W19-4214
PWC	https://paperswithcode.com/paper/a-little-linguistics-goes-a-long-way
Repo
Framework

A Probabilistic Approach for Confidence Scoring in Speech Recognition


Title	A Probabilistic Approach for Confidence Scoring in Speech Recognition
Authors	Punnoose Kuriakose
Abstract
Tasks	Speech Recognition
Published	2019-09-01
URL	https://www.aclweb.org/anthology/W19-7410/
PDF	https://www.aclweb.org/anthology/W19-7410
PWC	https://paperswithcode.com/paper/a-probabilistic-approach-for-confidence
Repo
Framework

The Success Story of Mitra Translations


Title	The Success Story of Mitra Translations
Authors	Mina Ilieva, Mariya Kancheva
Abstract	Technologies and their constant updates and innovative nature drastically and irreversibly transformed this small business into a leading brand on the translation market, along with just few other LSPs integrating translation software solutions. Now, we are constantly following the new developments in software updates and online platforms and we are successfully keeping up with any new trend in the field of translation, localization, transcreation, revision, post-editing, etc. Ultimately, we are positive that proper implementation of technology (with focus on quality, cost and time) and hard work are the stepping stones in the way to become a trusted translation services provider.
Tasks
Published	2019-09-01
URL	https://www.aclweb.org/anthology/W19-8716/
PDF	https://www.aclweb.org/anthology/W19-8716
PWC	https://paperswithcode.com/paper/the-success-story-of-mitra-translations
Repo
Framework

ANU-CSIRO at MEDIQA 2019: Question Answering Using Deep Contextual Knowledge


Title	ANU-CSIRO at MEDIQA 2019: Question Answering Using Deep Contextual Knowledge
Authors	Vincent Nguyen, Sarvnaz Karimi, Zhenchang Xing
Abstract	We report on our system for textual inference and question entailment in the medical domain for the ACL BioNLP 2019 Shared Task, MEDIQA. Textual inference is the task of finding the semantic relationships between pairs of text. Question entailment involves identifying pairs of questions which have similar semantic content. To improve upon medical natural language inference and question entailment approaches to further medical question answering, we propose a system that incorporates open-domain and biomedical domain approaches to improve semantic understanding and ambiguity resolution. Our models achieve 80{%} accuracy on medical natural language inference (6.5{%} absolute improvement over the original baseline), 48.9{%} accuracy on recognising medical question entailment, 0.248 Spearman{'}s rho for question answering ranking and 68.6{%} accuracy for question answering classification.
Tasks	Natural Language Inference, Question Answering
Published	2019-08-01
URL	https://www.aclweb.org/anthology/W19-5051/
PDF	https://www.aclweb.org/anthology/W19-5051
PWC	https://paperswithcode.com/paper/anu-csiro-at-mediqa-2019-question-answering
Repo
Framework

Foundations of Collaborative Task-Oriented Dialogue: What’s in a Slot?


Title	Foundations of Collaborative Task-Oriented Dialogue: What’s in a Slot?
Authors	Philip Cohen
Abstract	In this paper, we examine the foundations of task-oriented dialogues, in which systems are requested to perform tasks for humans. We argue that the way this dialogue task has been framed has limited its applicability to processing simple requests with atomic {`}slot-fillers{''}. However, real task-oriented dialogues can contain more complex utterances that provide non-atomic constraints on slot values. For example, in response to the system{'}s question {`}What time do you want me to reserve the restaurant?{''}, a user should be able to say {`}the earliest time available,{''} which cannot be handled by classic {`}intent + slots{''} approaches that do not incorporate expressive logical form meaning representations. Furthermore, situations for which it would be desirable to build task-oriented dialogue systems, e.g., to engage in mixed-initiative, collaborative or multiparty dialogues, will require a more general approach. In order to overcome these limitations and to provide such an approach, we give a logical analysis of the {`}intent+slot{''} dialogue setting using a modal logic of intention and including a more expansive notion of {`}dialogue state{''}. Finally, we briefly discuss our program of research to build a next generation of plan-based dialogue systems that goes beyond {``}intent + slots{''}. \|
Tasks	Task-Oriented Dialogue Systems
Published	2019-09-01
URL	https://www.aclweb.org/anthology/W19-5924/
PDF	https://www.aclweb.org/anthology/W19-5924
PWC	https://paperswithcode.com/paper/foundations-of-collaborative-task-oriented
Repo
Framework

Abstract Text Summarization: A Low Resource Challenge


Title	Abstract Text Summarization: A Low Resource Challenge
Authors	Shantipriya Parida, Petr Motlicek
Abstract	Text summarization is considered as a challenging task in the NLP community. The availability of datasets for the task of multilingual text summarization is rare, and such datasets are difficult to construct. In this work, we build an abstract text summarizer for the German language text using the state-of-the-art {``}Transformer{''} model. We propose an iterative data augmentation approach which uses synthetic data along with the real summarization data for the German language. To generate synthetic data, the Common Crawl (German) dataset is exploited, which covers different domains. The synthetic data is effective for the low resource condition and is particularly helpful for our multilingual scenario where availability of summarizing data is still a challenging issue. The data are also useful in deep learning scenarios where the neural models require a large amount of training data for utilization of its capacity. The obtained summarization performance is measured in terms of ROUGE and BLEU score. We achieve an absolute improvement of +1.5 and +16.0 in ROUGE1 F1 (R1{_}F1) on the development and test sets, respectively, compared to the system which does not rely on data augmentation. \|
Tasks	Data Augmentation, Text Summarization
Published	2019-11-01
URL	https://www.aclweb.org/anthology/D19-1616/
PDF	https://www.aclweb.org/anthology/D19-1616
PWC	https://paperswithcode.com/paper/abstract-text-summarization-a-low-resource
Repo
Framework

CAMsterdam at SemEval-2019 Task 6: Neural and graph-based feature extraction for the identification of offensive tweets


Title	CAMsterdam at SemEval-2019 Task 6: Neural and graph-based feature extraction for the identification of offensive tweets
Authors	Guy Aglionby, Chris Davis, Pushkar Mishra, Andrew Caines, Helen Yannakoudakis, Marek Rei, Ekaterina Shutova, Paula Buttery
Abstract	We describe the CAMsterdam team entry to the SemEval-2019 Shared Task 6 on offensive language identification in Twitter data. Our proposed model learns to extract textual features using a multi-layer recurrent network, and then performs text classification using gradient-boosted decision trees (GBDT). A self-attention architecture enables the model to focus on the most relevant areas in the text. In order to enrich input representations, we use node2vec to learn globally optimised embeddings for hashtags, which are then given as additional features to the GBDT classifier. Our best model obtains 78.79{%} macro F1-score on detecting offensive language (subtask A), 66.32{%} on categorising offence types (targeted/untargeted; subtask B), and 55.36{%} on identifying the target of offence (subtask C).
Tasks	Language Identification, Text Classification
Published	2019-06-01
URL	https://www.aclweb.org/anthology/S19-2100/
PDF	https://www.aclweb.org/anthology/S19-2100
PWC	https://paperswithcode.com/paper/camsterdam-at-semeval-2019-task-6-neural-and
Repo
Framework

Multi-way Encoding for Robustness to Adversarial Attacks


Title	Multi-way Encoding for Robustness to Adversarial Attacks
Authors	Donghyun Kim, Sarah Adel Bargal, Jianming Zhang, Stan Sclaroff
Abstract	Deep models are state-of-the-art for many computer vision tasks including image classification and object detection. However, it has been shown that deep models are vulnerable to adversarial examples. We highlight how one-hot encoding directly contributes to this vulnerability and propose breaking away from this widely-used, but highly-vulnerable mapping. We demonstrate that by leveraging a different output encoding, multi-way encoding, we can make models more robust. Our approach makes it more difficult for adversaries to find useful gradients for generating adversarial attacks. We present state-of-the-art robustness results for black-box, white-box attacks, and achieve higher clean accuracy on four benchmark datasets: MNIST, CIFAR-10, CIFAR-100, and SVHN when combined with adversarial training. The strength of our approach is also presented in the form of an attack for model watermarking, raising challenges in detecting stolen models.
Tasks	Image Classification, Object Detection
Published	2019-05-01
URL	https://openreview.net/forum?id=B1xOYoA5tQ
PDF	https://openreview.net/pdf?id=B1xOYoA5tQ
PWC	https://paperswithcode.com/paper/multi-way-encoding-for-robustness-to
Repo
Framework

Predicting the Outcome of Deliberative Democracy: A Research Proposal


Title	Predicting the Outcome of Deliberative Democracy: A Research Proposal
Authors	Conor McKillop
Abstract	As liberal states across the world face a decline in political participation by citizens, deliberative democracy is a promising solution for the public{'}s decreasing confidence and apathy towards the democratic process. Deliberative dialogue is method of public interaction that is fundamental to the concept of deliberative democracy. The ability to identify and predict consensus in the dialogues could bring greater accessibility and transparency to the face-to-face participatory process. The paper sets out a research plan for the first steps at automatically identifying and predicting consensus in a corpus of German language debates on hydraulic fracking. It proposes the use of a unique combination of lexical, sentiment, durational and further {`}derivative{'} features of adjacency pairs to train traditional classification models. In addition to this, the use of deep learning techniques to improve the accuracy of the classification and prediction tasks is also discussed. Preliminary results at the classification of utterances are also presented, with an F1 between 0.61 and 0.64 demonstrating that the task of recognising agreement is demanding but possible. \|
Tasks
Published	2019-07-01
URL	https://www.aclweb.org/anthology/P19-2013/
PDF	https://www.aclweb.org/anthology/P19-2013
PWC	https://paperswithcode.com/paper/predicting-the-outcome-of-deliberative
Repo
Framework

Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle


Title	Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle
Authors	Simon S. Du, Yuping Luo, Ruosong Wang, Hanrui Zhang
Abstract	Q-learning with function approximation is one of the most popular methods in reinforcement learning. Though the idea of using function approximation was proposed at least 60 years ago, even in the simplest setup, i.e, approximating Q-functions with linear functions, it is still an open problem how to design a provably efficient algorithm that learns a near-optimal policy. The key challenges are how to efficiently explore the state space and how to decide when to stop exploring in conjunction with the function approximation scheme. The current paper presents a provably efficient algorithm for Q-learning with linear function approximation. Under certain regularity assumptions, our algorithm, Difference Maximization Q-learning, combined with linear function approximation, returns a near-optimal policy using polynomial number of trajectories. Our algorithm introduces a new notion, the Distribution Shift Error Checking (DSEC) oracle. This oracle tests whether there exists a function in the function class that predicts well on a distribution $\mathcal{D}_1$, but predicts poorly on another distribution $\mathcal{D}_2$, where $\mathcal{D}_1$ and $\mathcal{D}_2$ are distributions over states induced by two different exploration policies. For the linear function class, this oracle is equivalent to solving a top eigenvalue problem. We believe our algorithmic insights, especially the DSEC oracle, are also useful in designing and analyzing reinforcement learning algorithms with general function approximation.
Tasks	Q-Learning
Published	2019-12-01
URL	http://papers.nips.cc/paper/9018-provably-efficient-q-learning-with-function-approximation-via-distribution-shift-error-checking-oracle
PDF	http://papers.nips.cc/paper/9018-provably-efficient-q-learning-with-function-approximation-via-distribution-shift-error-checking-oracle.pdf
PWC	https://paperswithcode.com/paper/provably-efficient-q-learning-with-function-1
Repo
Framework

Motion feature augmented network for dynamic hand gesture recognition from skeletal data


Title	Motion feature augmented network for dynamic hand gesture recognition from skeletal data
Authors	Xinghao Chen, 1 Guijin Wang, Hengkai Guo, Cairong Zhang, Hang Wang, and Li Zhang
Abstract	Dynamic hand gesture recognition has attracted increasing attention because of its importance for human–computer interaction. In this paper, we propose a novel motion feature augmented network (MFA-Net) for dynamic hand gesture recognition from skeletal data. MFA-Net exploits motion features of finger and global movements to augment features of deep network for gesture recognition. To describe finger articulated movements, finger motion features are extracted from the hand skeleton sequence via a variational autoencoder. Global motion features are utilized to represent the global movements of hand skeleton. These motion features along with the skeleton sequence are then fed into three branches of a recurrent neural network (RNN), which augment the motion features for RNN and improve the classification performance. The proposed MFA-Net is evaluated on two challenging skeleton-based dynamic hand gesture datasets, including DHG-14/28 dataset and SHREC’17 dataset. Experimental results demonstrate that our proposed method achieves comparable performance on DHG-14/28 dataset and better performance on SHREC’17 dataset when compared with start-of-the-art methods.
Tasks	Gesture Recognition, Hand Gesture Recognition, Hand-Gesture Recognition, Skeleton Based Action Recognition
Published	2019-01-10
URL	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6359639/
PDF	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6359639/
PWC	https://paperswithcode.com/paper/motion-feature-augmented-network-for-dynamic
Repo
Framework

Improving Mongolian-Chinese Neural Machine Translation with Morphological Noise


Title	Improving Mongolian-Chinese Neural Machine Translation with Morphological Noise
Authors	Yatu Ji, Hongxu Hou, Chen Junjie, Nier Wu
Abstract	For the translation of agglutinative language such as typical Mongolian, unknown (UNK) words not only come from the quite restricted vocabulary, but also mostly from misunderstanding of the translation model to the morphological changes. In this study, we introduce a new adversarial training model to alleviate the UNK problem in Mongolian-Chinese machine translation. The training process can be described as three adversarial sub models (generator, value screener and discriminator), playing a win-win game. In this game, the added screener plays the role of emphasizing that the discriminator pays attention to the added Mongolian morphological noise in the form of pseudo-data and improving the training efficiency. The experimental results show that the newly emerged Mongolian-Chinese task is state-of-the-art. Under this premise, the training time is greatly shortened.
Tasks	Machine Translation
Published	2019-07-01
URL	https://www.aclweb.org/anthology/P19-2016/
PDF	https://www.aclweb.org/anthology/P19-2016
PWC	https://paperswithcode.com/paper/improving-mongolian-chinese-neural-machine
Repo
Framework