October 16, 2019

3114 words 15 mins read

Paper Group ANR 1012

Paper Group ANR 1012

Towards a Question Answering System over the Semantic Web. A Pipeline for Creative Visual Storytelling. 2-gram-based Phonetic Feature Generation for Convolutional Neural Network in Assessment of Trademark Similarity. Nearly Optimal Dynamic $k$-Means Clustering for High-Dimensional Data. New Hybrid Neuro-Evolutionary Algorithms for Renewable Energy …

Towards a Question Answering System over the Semantic Web

Title Towards a Question Answering System over the Semantic Web
Authors Dennis Diefenbach, Andreas Both, Kamal Singh, Pierre Maret
Abstract Thanks to the development of the Semantic Web, a lot of new structured data has become available on the Web in the form of knowledge bases (KBs). Making this valuable data accessible and usable for end-users is one of the main goals of Question Answering (QA) over KBs. Most current QA systems query one KB, in one language (namely English). The existing approaches are not designed to be easily adaptable to new KBs and languages. We first introduce a new approach for translating natural language questions to SPARQL queries. It is able to query several KBs simultaneously, in different languages, and can easily be ported to other KBs and languages. In our evaluation, the impact of our approach is proven using 5 different well-known and large KBs: Wikidata, DBpedia, MusicBrainz, DBLP and Freebase as well as 5 different languages namely English, German, French, Italian and Spanish. Second, we show how we integrated our approach, to make it easily accessible by the research community and by end-users. To summarize, we provided a conceptional solution for multilingual, KB-agnostic Question Answering over the Semantic Web. The provided first approximation validates this concept.
Tasks Question Answering
Published 2018-03-02
URL http://arxiv.org/abs/1803.00832v1
PDF http://arxiv.org/pdf/1803.00832v1.pdf
PWC https://paperswithcode.com/paper/towards-a-question-answering-system-over-the
Repo
Framework

A Pipeline for Creative Visual Storytelling

Title A Pipeline for Creative Visual Storytelling
Authors Stephanie M. Lukin, Reginald Hobbs, Clare R. Voss
Abstract Computational visual storytelling produces a textual description of events and interpretations depicted in a sequence of images. These texts are made possible by advances and cross-disciplinary approaches in natural language processing, generation, and computer vision. We define a computational creative visual storytelling as one with the ability to alter the telling of a story along three aspects: to speak about different environments, to produce variations based on narrative goals, and to adapt the narrative to the audience. These aspects of creative storytelling and their effect on the narrative have yet to be explored in visual storytelling. This paper presents a pipeline of task-modules, Object Identification, Single-Image Inferencing, and Multi-Image Narration, that serve as a preliminary design for building a creative visual storyteller. We have piloted this design for a sequence of images in an annotation task. We present and analyze the collected corpus and describe plans towards automation.
Tasks Visual Storytelling
Published 2018-07-21
URL http://arxiv.org/abs/1807.08077v1
PDF http://arxiv.org/pdf/1807.08077v1.pdf
PWC https://paperswithcode.com/paper/a-pipeline-for-creative-visual-storytelling
Repo
Framework

2-gram-based Phonetic Feature Generation for Convolutional Neural Network in Assessment of Trademark Similarity

Title 2-gram-based Phonetic Feature Generation for Convolutional Neural Network in Assessment of Trademark Similarity
Authors Kyung Pyo Ko, Kwang Hee Lee, Mi So Jang, Gun Hong Park
Abstract A trademark is a mark used to identify various commodities. If same or similar trademark is registered for the same or similar commodity, the purchaser of the goods may be confused. Therefore, in the process of trademark registration examination, the examiner judges whether the trademark is the same or similar to the other applied or registered trademarks. The confusion in trademarks is based on the visual, phonetic or conceptual similarity of the marks. In this paper, we focus specifically on the phonetic similarity between trademarks. We propose a method to generate 2D phonetic feature for convolutional neural network in assessment of trademark similarity. This proposed algorithm is tested with 12,553 trademark phonetic similar pairs and 34,020 trademark phonetic non-similar pairs from 2010 to 2016. As a result, we have obtained approximately 92% judgment accuracy.
Tasks
Published 2018-02-10
URL http://arxiv.org/abs/1802.03581v1
PDF http://arxiv.org/pdf/1802.03581v1.pdf
PWC https://paperswithcode.com/paper/2-gram-based-phonetic-feature-generation-for
Repo
Framework

Nearly Optimal Dynamic $k$-Means Clustering for High-Dimensional Data

Title Nearly Optimal Dynamic $k$-Means Clustering for High-Dimensional Data
Authors Wei Hu, Zhao Song, Lin F. Yang, Peilin Zhong
Abstract We consider the $k$-means clustering problem in the dynamic streaming setting, where points from a discrete Euclidean space ${1, 2, \ldots, \Delta}^d$ can be dynamically inserted to or deleted from the dataset. For this problem, we provide a one-pass coreset construction algorithm using space $\tilde{O}(k\cdot \mathrm{poly}(d, \log\Delta))$, where $k$ is the target number of centers. To our knowledge, this is the first dynamic geometric data stream algorithm for $k$-means using space polynomial in dimension and nearly optimal (linear) in $k$.
Tasks
Published 2018-02-01
URL http://arxiv.org/abs/1802.00459v2
PDF http://arxiv.org/pdf/1802.00459v2.pdf
PWC https://paperswithcode.com/paper/nearly-optimal-dynamic-k-means-clustering-for
Repo
Framework

New Hybrid Neuro-Evolutionary Algorithms for Renewable Energy and Facilities Management Problems

Title New Hybrid Neuro-Evolutionary Algorithms for Renewable Energy and Facilities Management Problems
Authors L. Cornejo-Bueno
Abstract This Ph.D. thesis deals with the optimization of several renewable energy resources development as well as the improvement of facilities management in oceanic engineering and airports, using computational hybrid methods belonging to AI to this end. Energy is essential to our society in order to ensure a good quality of life. This means that predictions over the characteristics on which renewable energies depend are necessary, in order to know the amount of energy that will be obtained at any time. The second topic tackled in this thesis is related to the basic parameters that influence in different marine activities and airports, whose knowledge is necessary to develop a proper facilities management in these environments. Within this work, a study of the state-of-the-art Machine Learning have been performed to solve the problems associated with the topics above-mentioned, and several contributions have been proposed: One of the pillars of this work is focused on the estimation of the most important parameters in the exploitation of renewable resources. The second contribution of this thesis is related to feature selection problems. The proposed methodologies are applied to multiple problems: the prediction of $H_s$, relevant for marine energy applications and marine activities, the estimation of WPREs, undesirable variations in the electric power produced by a wind farm, the prediction of global solar radiation in areas from Spain and Australia, really important in terms of solar energy, and the prediction of low-visibility events at airports. All of these practical issues are developed with the consequent previous data analysis, normally, in terms of meteorological variables.
Tasks Feature Selection
Published 2018-06-05
URL http://arxiv.org/abs/1806.02654v1
PDF http://arxiv.org/pdf/1806.02654v1.pdf
PWC https://paperswithcode.com/paper/new-hybrid-neuro-evolutionary-algorithms-for
Repo
Framework

Mammographic Image Enhancement using Digital Image Processing Technique

Title Mammographic Image Enhancement using Digital Image Processing Technique
Authors Ardymulya Iswardani, Wahyu Hidayat
Abstract Abstract PURPOSES this study aims to perform microcalsification detection by performing image enhancement in mammography image by using transformation of negative image and histogram equalization. image mammography with .pgm format changed to. jpg format then processed into negative image result then processed again using histogram equalization. the results of the image enhancement process using negative image techniques and equalization histograms are compared and validated with MSE and PSNR on each mammographic image. CONCLUSION: Image enhancement process on mammography image can be done, however there are only some image that have improved quality, this affected by threshold usage, which have important role to get better visualization on mammographic image. Keywords-component; Image enhancement, image negative, histogram equalization, mammographic, breast cancer
Tasks Image Enhancement
Published 2018-06-29
URL http://arxiv.org/abs/1806.11496v1
PDF http://arxiv.org/pdf/1806.11496v1.pdf
PWC https://paperswithcode.com/paper/mammographic-image-enhancement-using-digital
Repo
Framework

Client Selection for Federated Learning with Heterogeneous Resources in Mobile Edge

Title Client Selection for Federated Learning with Heterogeneous Resources in Mobile Edge
Authors Takayuki Nishio, Ryo Yonetani
Abstract We envision a mobile edge computing (MEC) framework for machine learning (ML) technologies, which leverages distributed client data and computation resources for training high-performance ML models while preserving client privacy. Toward this future goal, this work aims to extend Federated Learning (FL), a decentralized learning framework that enables privacy-preserving training of models, to work with heterogeneous clients in a practical cellular network. The FL protocol iteratively asks random clients to download a trainable model from a server, update it with own data, and upload the updated model to the server, while asking the server to aggregate multiple client updates to further improve the model. While clients in this protocol are free from disclosing own private data, the overall training process can become inefficient when some clients are with limited computational resources (i.e. requiring longer update time) or under poor wireless channel conditions (longer upload time). Our new FL protocol, which we refer to as FedCS, mitigates this problem and performs FL efficiently while actively managing clients based on their resource conditions. Specifically, FedCS solves a client selection problem with resource constraints, which allows the server to aggregate as many client updates as possible and to accelerate performance improvement in ML models. We conducted an experimental evaluation using publicly-available large-scale image datasets to train deep neural networks on MEC environment simulations. The experimental results show that FedCS is able to complete its training process in a significantly shorter time compared to the original FL protocol.
Tasks
Published 2018-04-23
URL http://arxiv.org/abs/1804.08333v2
PDF http://arxiv.org/pdf/1804.08333v2.pdf
PWC https://paperswithcode.com/paper/client-selection-for-federated-learning-with
Repo
Framework

Personalized Classifier for Food Image Recognition

Title Personalized Classifier for Food Image Recognition
Authors Shota Horiguchi, Sosuke Amano, Makoto Ogawa, Kiyoharu Aizawa
Abstract Currently, food image recognition tasks are evaluated against fixed datasets. However, in real-world conditions, there are cases in which the number of samples in each class continues to increase and samples from novel classes appear. In particular, dynamic datasets in which each individual user creates samples and continues the updating process often have content that varies considerably between different users, and the number of samples per person is very limited. A single classifier common to all users cannot handle such dynamic data. Bridging the gap between the laboratory environment and the real world has not yet been accomplished on a large scale. Personalizing a classifier incrementally for each user is a promising way to do this. In this paper, we address the personalization problem, which involves adapting to the user’s domain incrementally using a very limited number of samples. We propose a simple yet effective personalization framework which is a combination of the nearest class mean classifier and the 1-nearest neighbor classifier based on deep features. To conduct realistic experiments, we made use of a new dataset of daily food images collected by a food-logging application. Experimental results show that our proposed method significantly outperforms existing methods.
Tasks
Published 2018-04-08
URL http://arxiv.org/abs/1804.04600v1
PDF http://arxiv.org/pdf/1804.04600v1.pdf
PWC https://paperswithcode.com/paper/personalized-classifier-for-food-image
Repo
Framework

Winter Road Surface Condition Recognition Using A Pretrained Deep Convolutional Network

Title Winter Road Surface Condition Recognition Using A Pretrained Deep Convolutional Network
Authors Guangyuan Pan, Liping Fu, Ruifan Yu, Matthew Muresan
Abstract This paper investigates the application of the latest machine learning technique deep neural networks for classifying road surface conditions (RSC) based on images from smartphones. Traditional machine learning techniques such as support vector machine (SVM) and random forests (RF) have been attempted in literature; however, their classification performance has been less than desirable due to challenges associated with image noises caused by sunlight glare and residual salts. A deep learning model based on convolutional neural network (CNN) is proposed and evaluated for its potential to address these challenges for improved classification accuracy. In the proposed approach we introduce the idea of applying an existing CNN model that has been pre-trained using millions of images with proven high recognition accuracy. The model is extended with two additional fully-connected layers of neurons for learning the specific features of the RSC images. The whole model is then trained with a low learning rate for fine-tuning by using a small set of RSC images. Results show that the proposed model has the highest classification performance in comparison to the traditional machine learning techniques. The testing accuracy with different training dataset sizes is also analyzed, showing the potential of achieving much higher accuracy with a larger training dataset.
Tasks
Published 2018-12-17
URL http://arxiv.org/abs/1812.06858v1
PDF http://arxiv.org/pdf/1812.06858v1.pdf
PWC https://paperswithcode.com/paper/winter-road-surface-condition-recognition
Repo
Framework

Corpus Conversion Service: A machine learning platform to ingest documents at scale [Poster abstract]

Title Corpus Conversion Service: A machine learning platform to ingest documents at scale [Poster abstract]
Authors Peter W J Staar, Michele Dolfi, Christoph Auer, Costas Bekas
Abstract Over the past few decades, the amount of scientific articles and technical literature has increased exponentially in size. Consequently, there is a great need for systems that can ingest these documents at scale and make their content discoverable. Unfortunately, both the format of these documents (e.g. the PDF format or bitmap images) as well as the presentation of the data (e.g. complex tables) make the extraction of qualitative and quantitive data extremely challenging. We present a platform to ingest documents at scale which is powered by Machine Learning techniques and allows the user to train custom models on document collections. We show precision/recall results greater than 97% with regard to conversion to structured formats, as well as scaling evidence for each of the microservices constituting the platform.
Tasks
Published 2018-05-15
URL http://arxiv.org/abs/1805.09687v1
PDF http://arxiv.org/pdf/1805.09687v1.pdf
PWC https://paperswithcode.com/paper/corpus-conversion-service-a-machine-learning
Repo
Framework

Triangular Architecture for Rare Language Translation

Title Triangular Architecture for Rare Language Translation
Authors Shuo Ren, Wenhu Chen, Shujie Liu, Mu Li, Ming Zhou, Shuai Ma
Abstract Neural Machine Translation (NMT) performs poor on the low-resource language pair $(X,Z)$, especially when $Z$ is a rare language. By introducing another rich language $Y$, we propose a novel triangular training architecture (TA-NMT) to leverage bilingual data $(Y,Z)$ (may be small) and $(X,Y)$ (can be rich) to improve the translation performance of low-resource pairs. In this triangular architecture, $Z$ is taken as the intermediate latent variable, and translation models of $Z$ are jointly optimized with a unified bidirectional EM algorithm under the goal of maximizing the translation likelihood of $(X,Y)$. Empirical results demonstrate that our method significantly improves the translation quality of rare languages on MultiUN and IWSLT2012 datasets, and achieves even better performance combining back-translation methods.
Tasks Machine Translation
Published 2018-05-13
URL http://arxiv.org/abs/1805.04813v2
PDF http://arxiv.org/pdf/1805.04813v2.pdf
PWC https://paperswithcode.com/paper/triangular-architecture-for-rare-language
Repo
Framework

Neuron inspired data encoding memristive multi-level memory cell

Title Neuron inspired data encoding memristive multi-level memory cell
Authors Aidana Irmanova, Alex Pappachen James
Abstract Mapping neuro-inspired algorithms to sensor backplanes of on-chip hardware require shifting the signal processing from digital to the analog domain, demanding memory technologies beyond conventional CMOS binary storage units. Using memristors for building analog data storage is one of the promising approaches amongst emerging non-volatile memory technologies. Recently, a memristive multi-level memory (MLM) cell for storing discrete analog values has been developed in which memory system is implemented combining memristors in voltage divider configuration. In given example, the memory cell of 3 sub-cells with a memristor in each was programmed to store ternary bits which overall achieved 10 and 27 discrete voltage levels. However, for further use of proposed memory cell in analog signal processing circuits data encoder is required to generate control voltages for programming memristors to store discrete analog values. In this paper, we present the design and performance analysis of data encoder that generates write pattern signals for 10 level memristive memory.
Tasks
Published 2018-03-14
URL http://arxiv.org/abs/1803.05132v1
PDF http://arxiv.org/pdf/1803.05132v1.pdf
PWC https://paperswithcode.com/paper/neuron-inspired-data-encoding-memristive
Repo
Framework

Physical Signal Classification Via Deep Neural Networks

Title Physical Signal Classification Via Deep Neural Networks
Authors Benjamin Epstein, Roy H. Olsson III
Abstract A Deep Neural Network is applied to classify physical signatures obtained from physical sensor measurements of running gasoline and diesel-powered vehicles and other devices. The classification provides information on the target identities as to vehicle type and even vehicle model. The physical measurements include acoustic, acceleration (vibration), geophonic, and magnetic.
Tasks
Published 2018-11-15
URL http://arxiv.org/abs/1811.06349v1
PDF http://arxiv.org/pdf/1811.06349v1.pdf
PWC https://paperswithcode.com/paper/physical-signal-classification-via-deep
Repo
Framework

Enabling Pedestrian Safety using Computer Vision Techniques: A Case Study of the 2018 Uber Inc. Self-driving Car Crash

Title Enabling Pedestrian Safety using Computer Vision Techniques: A Case Study of the 2018 Uber Inc. Self-driving Car Crash
Authors Puneet Kohli, Anjali Chadha
Abstract Human lives are important. The decision to allow self-driving vehicles operate on our roads carries great weight. This has been a hot topic of debate between policy-makers, technologists and public safety institutions. The recent Uber Inc. self-driving car crash, resulting in the death of a pedestrian, has strengthened the argument that autonomous vehicle technology is still not ready for deployment on public roads. In this work, we analyze the Uber car crash and shed light on the question, “Could the Uber Car Crash have been avoided?". We apply state-of-the-art Computer Vision models to this highly practical scenario. More generally, our experimental results are an evaluation of various image enhancement and object recognition techniques for enabling pedestrian safety in low-lighting conditions using the Uber crash as a case study.
Tasks Image Enhancement, Object Recognition
Published 2018-05-30
URL http://arxiv.org/abs/1805.11815v1
PDF http://arxiv.org/pdf/1805.11815v1.pdf
PWC https://paperswithcode.com/paper/enabling-pedestrian-safety-using-computer
Repo
Framework

A robust hierarchical nominal classification method based on similarity and dissimilarity using loss function and an improved version of the deck of cards method

Title A robust hierarchical nominal classification method based on similarity and dissimilarity using loss function and an improved version of the deck of cards method
Authors Ana Sara Costa, Salvatore Corrente, Salvatore Greco, José Rui Figueira, José Borbinha
Abstract Cat-SD is a multiple criteria decision aiding method for dealing with nominal classification problems. Actions are assessed according to multiple criteria and assigned to one or more categories. A set of reference actions is used for defining each category. The assignment of an action to a given category depends on the comparison of the action to each reference set according to likeness thresholds. Distinct sets of criteria weights, interaction coefficients, and likeness thresholds can be defined per category. We propose to apply Multiple Criteria Hierarchy Process (MCHP) to Cat-SD. An adapted MCHP is proposed to take into account possible interaction effects between criteria structured in a hierarchical way. On the basis of the known deck of cards method, we also consider an imprecise elicitation of parameters permitting to take into account interactions and antagonistic effects between criteria. The elicitation procedure we are proposing can be applied to any Electre method. With the purpose of exploring the assignments obtained by Cat-SD considering possible sets of parameters, we propose to apply the Stochastic Multicriteria Acceptability Analysis (SMAA). The SMAA methodology allows to draw statistical conclusions on the classification of the actions. The proposed method, SMAA-hCat-SD, helps the decision maker to check the effects of the variation of parameters on the classification at different levels of the hierarchy. We propose also a procedure, based on the concept of loss function, to obtain a final classification fulfilling some requirements given by the decision maker and taking into account the hierarchy of criteria and the probabilistic assignments obtained applying SMAA. Also this procedure can be applied to any classification Electre method.
Tasks
Published 2018-12-12
URL https://arxiv.org/abs/1812.08596v2
PDF https://arxiv.org/pdf/1812.08596v2.pdf
PWC https://paperswithcode.com/paper/a-robust-hierarchical-nominal-classification
Repo
Framework
comments powered by Disqus