April 1, 2020

2838 words 14 mins read

Paper Group ANR 479

Experiments with Different Indexing Techniques for Text Retrieval tasks on Gujarati Language using Bag of Words Approach. Fast Generating A Large Number of Gumbel-Max Variables. NestedVAE: Isolating Common Factors via Weak Supervision. AQPDCITY Dataset: Picture-Based PM2.5 Monitoring in the Urban Area of Big Cities. Incremental Fast Subclass Discri …

Experiments with Different Indexing Techniques for Text Retrieval tasks on Gujarati Language using Bag of Words Approach


Title	Experiments with Different Indexing Techniques for Text Retrieval tasks on Gujarati Language using Bag of Words Approach
Authors	Dr. Jyoti Pareek, Hardik Joshi, Krunal Chauhan, Rushikesh Patel
Abstract	This paper presents results of various experiments carried out to improve text retrieval of gujarati text documents. Text retrieval involves searching and ranking of text documents for a given set of query terms. We have tested various retrieval models that uses bag-of-words approach. Bag-of-words approach is a traditional approach that is being used till date where the text document is represented as collection of words. Measures like frequency count, inverse document frequency etc. are used to signify and rank relevant documents for user queries. Different ranking models have been used to quantify ranking performance using the metric of mean average precision. Gujarati is a morphologically rich language, we have compared techniques like stop word removal, stemming and frequent case generation against baseline to measure the improvements in information retrieval tasks. Most of the techniques are language dependent and requires development of language specific tools. We used plain unprocessed word index as the baseline, we have seen significant improvements in comparison of MAP values after applying different indexing techniques when compared to the baseline.
Tasks	Information Retrieval
Published	2020-02-05
URL	https://arxiv.org/abs/2002.01792v1
PDF	https://arxiv.org/pdf/2002.01792v1.pdf
PWC	https://paperswithcode.com/paper/experiments-with-different-indexing
Repo
Framework

Fast Generating A Large Number of Gumbel-Max Variables


Title	Fast Generating A Large Number of Gumbel-Max Variables
Authors	Yiyan Qi, Pinghui Wang, Yuanming Zhang, Junzhou Zhao, Guangjian Tian, Xiaohong Guan
Abstract	The well-known Gumbel-Max Trick for sampling elements from a categorical distribution (or more generally a nonnegative vector) and its variants have been widely used in areas such as machine learning and information retrieval. To sample a random element $i$ (or a Gumbel-Max variable $i$) in proportion to its positive weight $v_i$, the Gumbel-Max Trick first computes a Gumbel random variable $g_i$ for each positive weight element $i$, and then samples the element $i$ with the largest value of $g_i+\ln v_i$. Recently, applications including similarity estimation and graph embedding require to generate $k$ independent Gumbel-Max variables from high dimensional vectors. However, it is computationally expensive for a large $k$ (e.g., hundreds or even thousands) when using the traditional Gumbel-Max Trick. To solve this problem, we propose a novel algorithm, \emph{FastGM}, that reduces the time complexity from $O(kn^+)$ to $O(k \ln k + n^+)$, where $n^+$ is the number of positive elements in the vector of interest. Instead of computing $k$ independent Gumbel random variables directly, we find that there exists a technique to generate these variables in descending order. Using this technique, our method FastGM computes variables $g_i+\ln v_i$ for all positive elements $i$ in descending order. As a result, FastGM significantly reduces the computation time because we can stop the procedure of Gumbel random variables computing for many elements especially for those with small weights. Experiments on a variety of real-world datasets show that FastGM is orders of magnitude faster than state-of-the-art methods without sacrificing accuracy and incurring additional expenses.
Tasks	Graph Embedding, Information Retrieval
Published	2020-02-02
URL	https://arxiv.org/abs/2002.00413v1
PDF	https://arxiv.org/pdf/2002.00413v1.pdf
PWC	https://paperswithcode.com/paper/fast-generating-a-large-number-of-gumbel-max
Repo
Framework

NestedVAE: Isolating Common Factors via Weak Supervision


Title	NestedVAE: Isolating Common Factors via Weak Supervision
Authors	Matthew J. Vowels, Necati Cihan Camgoz, Richard Bowden
Abstract	Fair and unbiased machine learning is an important and active field of research, as decision processes are increasingly driven by models that learn from data. Unfortunately, any biases present in the data may be learned by the model, thereby inappropriately transferring that bias into the decision making process. We identify the connection between the task of bias reduction and that of isolating factors common between domains whilst encouraging domain specific invariance. To isolate the common factors we combine the theory of deep latent variable models with information bottleneck theory for scenarios whereby data may be naturally paired across domains and no additional supervision is required. The result is the Nested Variational AutoEncoder (NestedVAE). Two outer VAEs with shared weights attempt to reconstruct the input and infer a latent space, whilst a nested VAE attempts to reconstruct the latent representation of one image, from the latent representation of its paired image. In so doing, the nested VAE isolates the common latent factors/causes and becomes invariant to unwanted factors that are not shared between paired images. We also propose a new metric to provide a balanced method of evaluating consistency and classifier performance across domains which we refer to as the Adjusted Parity metric. An evaluation of NestedVAE on both domain and attribute invariance, change detection, and learning common factors for the prediction of biological sex demonstrates that NestedVAE significantly outperforms alternative methods.
Tasks	Decision Making, Latent Variable Models
Published	2020-02-26
URL	https://arxiv.org/abs/2002.11576v1
PDF	https://arxiv.org/pdf/2002.11576v1.pdf
PWC	https://paperswithcode.com/paper/nestedvae-isolating-common-factors-via-weak
Repo
Framework

AQPDCITY Dataset: Picture-Based PM2.5 Monitoring in the Urban Area of Big Cities


Title	AQPDCITY Dataset: Picture-Based PM2.5 Monitoring in the Urban Area of Big Cities
Authors	Yonghui Zhang, Ke Gu
Abstract	Since Particulate Matters (PMs) are closely related to people’s living and health, it has become one of the most important indicator of air quality monitoring around the world. But the existing sensor-based methods for PM monitoring have remarkable disadvantages, such as low-density monitoring stations and high-requirement monitoring conditions. It is highly desired to devise a method that can obtain the PM concentration at any location for the following air quality control in time. The prior works indicate that the PM concentration can be monitored by using ubiquitous photos. To further investigate such issue, we gathered 1,500 photos in big cities to establish a new AQPDCITY dataset. Experiments conducted to check nine state-of-the-art methods on this dataset show that the performance of those above methods perform poorly in the AQPDCITY dataset.
Tasks
Published	2020-03-22
URL	https://arxiv.org/abs/2003.09784v1
PDF	https://arxiv.org/pdf/2003.09784v1.pdf
PWC	https://paperswithcode.com/paper/aqpdcity-dataset-picture-based-pm25
Repo
Framework

Incremental Fast Subclass Discriminant Analysis


Title	Incremental Fast Subclass Discriminant Analysis
Authors	Kateryna Chumachenko, Jenni Raitoharju, Moncef Gabbouj, Alexandros Iosifidis
Abstract	This paper proposes an incremental solution to Fast Subclass Discriminant Analysis (fastSDA). We present an exact and an approximate linear solution, along with an approximate kernelized variant. Extensive experiments on eight image datasets with different incremental batch sizes show the superiority of the proposed approach in terms of training time and accuracy being equal or close to fastSDA solution and outperforming other methods.
Tasks
Published	2020-02-11
URL	https://arxiv.org/abs/2002.04348v1
PDF	https://arxiv.org/pdf/2002.04348v1.pdf
PWC	https://paperswithcode.com/paper/incremental-fast-subclass-discriminant
Repo
Framework

Plasticity-Enhanced Domain-Wall MTJ Neural Networks for Energy-Efficient Online Learning


Title	Plasticity-Enhanced Domain-Wall MTJ Neural Networks for Energy-Efficient Online Learning
Authors	Christopher H. Bennett, T. Patrick Xiao, Can Cui, Naimul Hassan, Otitoaleke G. Akinola, Jean Anne C. Incorvia, Alvaro Velasquez, Joseph S. Friedman, Matthew J. Marinella
Abstract	Machine learning implements backpropagation via abundant training samples. We demonstrate a multi-stage learning system realized by a promising non-volatile memory device, the domain-wall magnetic tunnel junction (DW-MTJ). The system consists of unsupervised (clustering) as well as supervised sub-systems, and generalizes quickly (with few samples). We demonstrate interactions between physical properties of this device and optimal implementation of neuroscience-inspired plasticity learning rules, and highlight performance on a suite of tasks. Our energy analysis confirms the value of the approach, as the learning budget stays below 20 $\mu J$ even for large tasks used typically in machine learning.
Tasks
Published	2020-03-04
URL	https://arxiv.org/abs/2003.02357v1
PDF	https://arxiv.org/pdf/2003.02357v1.pdf
PWC	https://paperswithcode.com/paper/plasticity-enhanced-domain-wall-mtj-neural
Repo
Framework

Human Posture Recognition and Gesture Imitation with a Humanoid Robot


Title	Human Posture Recognition and Gesture Imitation with a Humanoid Robot
Authors	Amir Aly
Abstract	This study proposes different approaches for static and dynamic gesture analysis and imitation with the social robot Nao
Tasks
Published	2020-02-05
URL	https://arxiv.org/abs/2002.01779v3
PDF	https://arxiv.org/pdf/2002.01779v3.pdf
PWC	https://paperswithcode.com/paper/human-posture-recognition-and-gesture
Repo
Framework

Explaining Motion Relevance for Activity Recognition in Video Deep Learning Models


Title	Explaining Motion Relevance for Activity Recognition in Video Deep Learning Models
Authors	Liam Hiley, Alun Preece, Yulia Hicks, Supriyo Chakraborty, Prudhvi Gurram, Richard Tomsett
Abstract	A small subset of explainability techniques developed initially for image recognition models has recently been applied for interpretability of 3D Convolutional Neural Network models in activity recognition tasks. Much like the models themselves, the techniques require little or no modification to be compatible with 3D inputs. However, these explanation techniques regard spatial and temporal information jointly. Therefore, using such explanation techniques, a user cannot explicitly distinguish the role of motion in a 3D model’s decision. In fact, it has been shown that these models do not appropriately factor motion information into their decision. We propose a selective relevance method for adapting the 2D explanation techniques to provide motion-specific explanations, better aligning them with the human understanding of motion as conceptually separate from static spatial features. We demonstrate the utility of our method in conjunction with several widely-used 2D explanation methods, and show that it improves explanation selectivity for motion. Our results show that the selective relevance method can not only provide insight on the role played by motion in the model’s decision – in effect, revealing and quantifying the model’s spatial bias – but the method also simplifies the resulting explanations for human consumption.
Tasks	Activity Recognition
Published	2020-03-31
URL	https://arxiv.org/abs/2003.14285v1
PDF	https://arxiv.org/pdf/2003.14285v1.pdf
PWC	https://paperswithcode.com/paper/explaining-motion-relevance-for-activity
Repo
Framework

Ascertaining price formation in cryptocurrency markets with DeepLearning


Title	Ascertaining price formation in cryptocurrency markets with DeepLearning
Authors	Fan Fang, Waichung Chung, Carmine Ventre, Michail Basios, Leslie Kanthan, Lingbo Li, Fan Wu
Abstract	The cryptocurrency market is amongst the fastest-growing of all the financial markets in the world. Unlike traditional markets, such as equities, foreign exchange and commodities, cryptocurrency market is considered to have larger volatility and illiquidity. This paper is inspired by the recent success of using deep learning for stock market prediction. In this work, we analyze and present the characteristics of the cryptocurrency market in a high-frequency setting. In particular, we applied a deep learning approach to predict the direction of the mid-price changes on the upcoming tick. We monitored live tick-level data from $8$ cryptocurrency pairs and applied both statistical and machine learning techniques to provide a live prediction. We reveal that promising results are possible for cryptocurrencies, and in particular, we achieve a consistent $78%$ accuracy on the prediction of the mid-price movement on live exchange rate of Bitcoins vs US dollars.
Tasks	Stock Market Prediction
Published	2020-02-09
URL	https://arxiv.org/abs/2003.00803v1
PDF	https://arxiv.org/pdf/2003.00803v1.pdf
PWC	https://paperswithcode.com/paper/ascertaining-price-formation-in
Repo
Framework

Type-2 Fuzzy Set based Hesitant Fuzzy Linguistic Term Sets for Linguistic Decision Making


Title	Type-2 Fuzzy Set based Hesitant Fuzzy Linguistic Term Sets for Linguistic Decision Making
Authors	Taniya Seth, Pranab K. Muhuri
Abstract	Approaches based on computing with words find good applicability in decision making systems. Predominantly finding their basis in type-1 fuzzy sets, computing with words approaches employ type-1 fuzzy sets as semantics of the linguistic terms. However, type-2 fuzzy sets have been proven to be scientifically more appropriate to represent linguistic information in practical systems. They take into account both the intra-uncertainty as well as the inter-uncertainty in cases where the linguistic information comes from a group of experts. Hence in this paper, we propose to introduce linguistic terms whose semantics are denoted by interval type-2 fuzzy sets within the hesitant fuzzy linguistic term set framework, resulting in type-2 fuzzy sets based hesitant fuzzy linguistic term sets. We also introduce a novel method of computing type-2 fuzzy envelopes out of multiple interval type-2 fuzzy sets with trapezoidal membership functions. Furthermore, the proposed framework with interval type-2 fuzzy sets is applied on a supplier performance evaluation scenario. Since humans are predominantly involved in the entire process of supply chain, their feedback is crucial while deciding many factors. Towards the end of the paper, we compare our presented model with various existing models and demonstrate the advantages of the former.
Tasks	Decision Making
Published	2020-02-26
URL	https://arxiv.org/abs/2002.11714v1
PDF	https://arxiv.org/pdf/2002.11714v1.pdf
PWC	https://paperswithcode.com/paper/type-2-fuzzy-set-based-hesitant-fuzzy
Repo
Framework

The PHOTON Wizard – Towards Educational Machine Learning Code Generators


Title	The PHOTON Wizard – Towards Educational Machine Learning Code Generators
Authors	Ramona Leenings, Nils Ralf Winter, Kelvin Sarink, Jan Ernsting, Xiaoyi Jiang, Udo Dannlowski, Tim Hahn
Abstract	Despite the tremendous efforts to democratize machine learning, especially in applied-science, the application is still often hampered by the lack of coding skills. As we consider programmatic understanding key to building effective and efficient machine learning solutions, we argue for a novel educational approach that builds upon the accessibility and acceptance of graphical user interfaces to convey programming skills to an applied-science target group. We outline a proof-of-concept, open-source web application, the PHOTON Wizard, which dynamically translates GUI interactions into valid source code for the Python machine learning framework PHOTON. Thereby, users possessing theoretical machine learning knowledge gain key insights into the model development workflow as well as an intuitive understanding of custom implementations. Specifically, the PHOTON Wizard integrates the concept of Educational Machine Learning Code Generators to teach users how to write code for designing, training, optimizing and evaluating custom machine learning pipelines.
Tasks
Published	2020-02-13
URL	https://arxiv.org/abs/2002.05432v1
PDF	https://arxiv.org/pdf/2002.05432v1.pdf
PWC	https://paperswithcode.com/paper/the-photon-wizard-towards-educational-machine
Repo
Framework

Quantum Interference for Counting Clusters


Title	Quantum Interference for Counting Clusters
Authors	Rohit R Muthyala, Davi Geiger, Zvi M. Kedem
Abstract	Counting the number of clusters, when these clusters overlap significantly is a challenging problem in machine learning. We argue that a purely mathematical quantum theory, formulated using the path integral technique, when applied to non-physics modeling leads to non-physics quantum theories that are statistical in nature. We show that a quantum theory can be a more robust statistical theory to separate data to count overlapping clusters. The theory is also confirmed from data simulations.This works identify how quantum theory can be effective in counting clusters and hope to inspire the field to further apply such techniques.
Tasks
Published	2020-01-03
URL	https://arxiv.org/abs/2001.04251v1
PDF	https://arxiv.org/pdf/2001.04251v1.pdf
PWC	https://paperswithcode.com/paper/quantum-interference-for-counting-clusters
Repo
Framework

Regret Bound of Adaptive Control in Linear Quadratic Gaussian (LQG) Systems


Title	Regret Bound of Adaptive Control in Linear Quadratic Gaussian (LQG) Systems
Authors	Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar
Abstract	We study the problem of adaptive control in partially observable linear quadratic Gaussian control systems, where the model dynamics are unknown a priori. We propose LqgOpt, a novel reinforcement learning algorithm based on the principle of optimism in the face of uncertainty, to effectively minimize the overall control cost. We employ the predictor state evolution representation of the system dynamics and propose a new approach for closed-loop system identification, estimation, and confidence bound construction. LqgOpt efficiently explores the system dynamics, estimates the model parameters up to their confidence interval, and deploys the controller of the most optimistic model for further exploration and exploitation. We provide stability guarantees for LqgOpt, and prove the regret upper bound of $\tilde{\mathcal{O}}(\sqrt{T})$ for adaptive control of linear quadratic Gaussian (LQG) systems, where $T$ is the time horizon of the problem.
Tasks
Published	2020-03-12
URL	https://arxiv.org/abs/2003.05999v1
PDF	https://arxiv.org/pdf/2003.05999v1.pdf
PWC	https://paperswithcode.com/paper/regret-bound-of-adaptive-control-in-linear
Repo
Framework

Perspectives and Ethics of the Autonomous Artificial Thinking Systems


Title	Perspectives and Ethics of the Autonomous Artificial Thinking Systems
Authors	Joël Colloc
Abstract	The feasibility of autonomous artificial thinking systems needs to compare the way the human beings acquire their information and develops the thought with the current capacities of the autonomous information systems. Our model uses four hierarchies: the hierarchy of information systems, the cognitive hierarchy, the linguistic hierarchy and the digital informative hierarchy that combines artificial intelligence, the power of computers models, methods and tools to develop autonomous information systems. The question of the capability of autonomous system to provide a form of artificial thought arises with the ethical consequences on the social life and the perspective of transhumanism.
Tasks
Published	2020-01-13
URL	https://arxiv.org/abs/2001.04270v1
PDF	https://arxiv.org/pdf/2001.04270v1.pdf
PWC	https://paperswithcode.com/paper/perspectives-and-ethics-of-the-autonomous
Repo
Framework

Information-Theoretic Free Energy as Emotion Potential: Emotional Valence as a Function of Complexity and Novelty


Title	Information-Theoretic Free Energy as Emotion Potential: Emotional Valence as a Function of Complexity and Novelty
Authors	Hideyoshi Yanagisawa
Abstract	This study extends the mathematical model of emotion dimensions that we previously proposed (Yanagisawa, et al. 2019, Front Comput Neurosci) to consider perceived complexity as well as novelty, as a source of arousal potential. Berlyne’s hedonic function of arousal potential (or the inverse U-shaped curve, the so-called Wundt curve) is assumed. We modeled the arousal potential as information contents to be processed in the brain after sensory stimuli are perceived (or recognized), which we termed sensory surprisal. We mathematically demonstrated that sensory surprisal represents free energy, and it is equivalent to a summation of information gain (or information from novelty) and perceived complexity (or information from complexity), which are the collative variables forming the arousal potential. We demonstrated empirical evidence with visual stimuli (profile shapes of butterfly) supporting the hypothesis that the summation of perceived novelty and complexity shapes the inverse U-shaped beauty function. We discussed the potential of free energy as a mathematical principle explaining emotion initiators.
Tasks
Published	2020-03-23
URL	https://arxiv.org/abs/2003.10073v1
PDF	https://arxiv.org/pdf/2003.10073v1.pdf
PWC	https://paperswithcode.com/paper/information-theoretic-free-energy-as-emotion
Repo
Framework