July 28, 2019

2964 words 14 mins read

Paper Group ANR 208

Joint Background Reconstruction and Foreground Segmentation via A Two-stage Convolutional Neural Network. A filter based approach for inbetweening. Argumentation-based Security for Social Good. Is Natural Language a Perigraphic Process? The Theorem about Facts and Words Revisited. Hierarchical Representations for Efficient Architecture Search. Can …

Joint Background Reconstruction and Foreground Segmentation via A Two-stage Convolutional Neural Network


Title	Joint Background Reconstruction and Foreground Segmentation via A Two-stage Convolutional Neural Network
Authors	Xu Zhao, Yingying Chen, Ming Tang, Jinqiao Wang
Abstract	Foreground segmentation in video sequences is a classic topic in computer vision. Due to the lack of semantic and prior knowledge, it is difficult for existing methods to deal with sophisticated scenes well. Therefore, in this paper, we propose an end-to-end two-stage deep convolutional neural network (CNN) framework for foreground segmentation in video sequences. In the first stage, a convolutional encoder-decoder sub-network is employed to reconstruct the background images and encode rich prior knowledge of background scenes. In the second stage, the reconstructed background and current frame are input into a multi-channel fully-convolutional sub-network (MCFCN) for accurate foreground segmentation. In the two-stage CNN, the reconstruction loss and segmentation loss are jointly optimized. The background images and foreground objects are output simultaneously in an end-to-end way. Moreover, by incorporating the prior semantic knowledge of foreground and background in the pre-training process, our method could restrain the background noise and keep the integrity of foreground objects at the same time. Experiments on CDNet 2014 show that our method outperforms the state-of-the-art by 4.9%.
Tasks
Published	2017-07-24
URL	http://arxiv.org/abs/1707.07584v1
PDF	http://arxiv.org/pdf/1707.07584v1.pdf
PWC	https://paperswithcode.com/paper/joint-background-reconstruction-and
Repo
Framework

A filter based approach for inbetweening


Title	A filter based approach for inbetweening
Authors	Yuichi Yagi
Abstract	We present a filter based approach for inbetweening. We train a convolutional neural network to generate intermediate frames. This network aim to generate smooth animation of line drawings. Our method can process scanned images directly. Our method does not need to compute correspondence of lines and topological changes explicitly. We experiment our method with real animation production data. The results show that our method can generate intermediate frames partially.
Tasks
Published	2017-06-12
URL	http://arxiv.org/abs/1706.03497v1
PDF	http://arxiv.org/pdf/1706.03497v1.pdf
PWC	https://paperswithcode.com/paper/a-filter-based-approach-for-inbetweening
Repo
Framework


Title	Argumentation-based Security for Social Good
Authors	Erisa Karafili, Antonis C. Kakas, Nikolaos I. Spanoudakis, Emil C. Lupu
Abstract	The increase of connectivity and the impact it has in every day life is raising new and existing security problems that are becoming important for social good. We introduce two particular problems: cyber attack attribution and regulatory data sharing. For both problems, decisions about which rules to apply, should be taken under incomplete and context dependent information. The solution we propose is based on argumentation reasoning, that is a well suited technique for implementing decision making mechanisms under conflicting and incomplete information. Our proposal permits us to identify the attacker of a cyber attack and decide the regulation rule that should be used while using and sharing data. We illustrate our solution through concrete examples.
Tasks	Decision Making
Published	2017-05-01
URL	http://arxiv.org/abs/1705.00732v1
PDF	http://arxiv.org/pdf/1705.00732v1.pdf
PWC	https://paperswithcode.com/paper/argumentation-based-security-for-social-good
Repo
Framework

Is Natural Language a Perigraphic Process? The Theorem about Facts and Words Revisited


Title	Is Natural Language a Perigraphic Process? The Theorem about Facts and Words Revisited
Authors	Łukasz Dębowski
Abstract	As we discuss, a stationary stochastic process is nonergodic when a random persistent topic can be detected in the infinite random text sampled from the process, whereas we call the process strongly nonergodic when an infinite sequence of independent random bits, called probabilistic facts, is needed to describe this topic completely. Replacing probabilistic facts with an algorithmically random sequence of bits, called algorithmic facts, we adapt this property back to ergodic processes. Subsequently, we call a process perigraphic if the number of algorithmic facts which can be inferred from a finite text sampled from the process grows like a power of the text length. We present a simple example of such a process. Moreover, we demonstrate an assertion which we call the theorem about facts and words. This proposition states that the number of probabilistic or algorithmic facts which can be inferred from a text drawn from a process must be roughly smaller than the number of distinct word-like strings detected in this text by means of the PPM compression algorithm. We also observe that the number of the word-like strings for a sample of plays by Shakespeare follows an empirical stepwise power law, in a stark contrast to Markov processes. Hence we suppose that natural language considered as a process is not only non-Markov but also perigraphic.
Tasks
Published	2017-06-14
URL	http://arxiv.org/abs/1706.04432v2
PDF	http://arxiv.org/pdf/1706.04432v2.pdf
PWC	https://paperswithcode.com/paper/is-natural-language-a-perigraphic-process-the
Repo
Framework

Hierarchical Representations for Efficient Architecture Search


Title	Hierarchical Representations for Efficient Architecture Search
Authors	Hanxiao Liu, Karen Simonyan, Oriol Vinyals, Chrisantha Fernando, Koray Kavukcuoglu
Abstract	We explore efficient neural architecture search methods and show that a simple yet powerful evolutionary algorithm can discover new architectures with excellent performance. Our approach combines a novel hierarchical genetic representation scheme that imitates the modularized design pattern commonly adopted by human experts, and an expressive search space that supports complex topologies. Our algorithm efficiently discovers architectures that outperform a large number of manually designed models for image classification, obtaining top-1 error of 3.6% on CIFAR-10 and 20.3% when transferred to ImageNet, which is competitive with the best existing neural architecture search approaches. We also present results using random search, achieving 0.3% less top-1 accuracy on CIFAR-10 and 0.1% less on ImageNet whilst reducing the search time from 36 hours down to 1 hour.
Tasks	Image Classification, Neural Architecture Search
Published	2017-11-01
URL	http://arxiv.org/abs/1711.00436v2
PDF	http://arxiv.org/pdf/1711.00436v2.pdf
PWC	https://paperswithcode.com/paper/hierarchical-representations-for-efficient
Repo
Framework

Can We See Photosynthesis? Magnifying the Tiny Color Changes of Plant Green Leaves Using Eulerian Video Magnification


Title	Can We See Photosynthesis? Magnifying the Tiny Color Changes of Plant Green Leaves Using Eulerian Video Magnification
Authors	Islam A. T. F. Taj-Eddin, Mahmoud Afifi, Mostafa Korashy, Ali H. Ahmed, Ng Yoke Cheng, Evelyng Hernandez, Salma M. Abdel-latif
Abstract	Plant aliveness is proven through laboratory experiments and special scientific instruments. In this paper, we aim to detect the degree of animation of plants based on the magnification of the small color changes in the plant’s green leaves using the Eulerian video magnification. Capturing the video under a controlled environment, e.g., using a tripod and direct current (DC) light sources, reduces camera movements and minimizes light fluctuations; we aim to reduce the external factors as much as possible. The acquired video is then stabilized and a proposed algorithm used to reduce the illumination variations. Lastly, the Euler magnification is utilized to magnify the color changes on the light invariant video. The proposed system does not require any special purpose instruments as it uses a digital camera with a regular frame rate. The results of magnified color changes on both natural and plastic leaves show that the live green leaves have color changes in contrast to the plastic leaves. Hence, we can argue that the color changes of the leaves are due to biological operations, such as photosynthesis. To date, this is possibly the first work that focuses on interpreting visually, some biological operations of plants without any special purpose instruments.
Tasks
Published	2017-06-12
URL	http://arxiv.org/abs/1706.03867v3
PDF	http://arxiv.org/pdf/1706.03867v3.pdf
PWC	https://paperswithcode.com/paper/can-we-see-photosynthesis-magnifying-the-tiny
Repo
Framework

Natural Language Guided Visual Relationship Detection


Title	Natural Language Guided Visual Relationship Detection
Authors	Wentong Liao, Lin Shuai, Bodo Rosenhahn, Michael Ying Yang
Abstract	Reasoning about the relationships between object pairs in images is a crucial task for holistic scene understanding. Most of the existing works treat this task as a pure visual classification task: each type of relationship or phrase is classified as a relation category based on the extracted visual features. However, each kind of relationships has a wide variety of object combination and each pair of objects has diverse interactions. Obtaining sufficient training samples for all possible relationship categories is difficult and expensive. In this work, we propose a natural language guided framework to tackle this problem. We propose to use a generic bi-directional recurrent neural network to predict the semantic connection between the participating objects in the relationship from the aspect of natural language. The proposed simple method achieves the state-of-the-art on the Visual Relationship Detection (VRD) and Visual Genome datasets, especially when predicting unseen relationships (e.g. recall improved from 76.42% to 89.79% on VRD zero-shot testing set).
Tasks	Scene Understanding
Published	2017-11-16
URL	http://arxiv.org/abs/1711.06032v2
PDF	http://arxiv.org/pdf/1711.06032v2.pdf
PWC	https://paperswithcode.com/paper/natural-language-guided-visual-relationship
Repo
Framework

Tracking Typological Traits of Uralic Languages in Distributed Language Representations


Title	Tracking Typological Traits of Uralic Languages in Distributed Language Representations
Authors	Johannes Bjerva, Isabelle Augenstein
Abstract	Although linguistic typology has a long history, computational approaches have only recently gained popularity. The use of distributed representations in computational linguistics has also become increasingly popular. A recent development is to learn distributed representations of language, such that typologically similar languages are spatially close to one another. Although empirical successes have been shown for such language representations, they have not been subjected to much typological probing. In this paper, we first look at whether this type of language representations are empirically useful for model transfer between Uralic languages in deep neural networks. We then investigate which typological features are encoded in these representations by attempting to predict features in the World Atlas of Language Structures, at various stages of fine-tuning of the representations. We focus on Uralic languages, and find that some typological traits can be automatically inferred with accuracies well above a strong baseline.
Tasks
Published	2017-11-15
URL	http://arxiv.org/abs/1711.05468v1
PDF	http://arxiv.org/pdf/1711.05468v1.pdf
PWC	https://paperswithcode.com/paper/tracking-typological-traits-of-uralic
Repo
Framework


Title	Predicting the Quality of Short Narratives from Social Media
Authors	Tong Wang, Ping Chen, Boyang Li
Abstract	An important and difficult challenge in building computational models for narratives is the automatic evaluation of narrative quality. Quality evaluation connects narrative understanding and generation as generation systems need to evaluate their own products. To circumvent difficulties in acquiring annotations, we employ upvotes in social media as an approximate measure for story quality. We collected 54,484 answers from a crowd-powered question-and-answer website, Quora, and then used active learning to build a classifier that labeled 28,320 answers as stories. To predict the number of upvotes without the use of social network features, we create neural networks that model textual regions and the interdependence among regions, which serve as strong benchmarks for future research. To our best knowledge, this is the first large-scale study for automatic evaluation of narrative quality.
Tasks	Active Learning
Published	2017-07-08
URL	http://arxiv.org/abs/1707.02499v1
PDF	http://arxiv.org/pdf/1707.02499v1.pdf
PWC	https://paperswithcode.com/paper/predicting-the-quality-of-short-narratives
Repo
Framework

How Important is Syntactic Parsing Accuracy? An Empirical Evaluation on Rule-Based Sentiment Analysis


Title	How Important is Syntactic Parsing Accuracy? An Empirical Evaluation on Rule-Based Sentiment Analysis
Authors	Carlos Gómez-Rodríguez, Iago Alonso-Alonso, David Vilares
Abstract	Syntactic parsing, the process of obtaining the internal structure of sentences in natural languages, is a crucial task for artificial intelligence applications that need to extract meaning from natural language text or speech. Sentiment analysis is one example of application for which parsing has recently proven useful. In recent years, there have been significant advances in the accuracy of parsing algorithms. In this article, we perform an empirical, task-oriented evaluation to determine how parsing accuracy influences the performance of a state-of-the-art rule-based sentiment analysis system that determines the polarity of sentences from their parse trees. In particular, we evaluate the system using four well-known dependency parsers, including both current models with state-of-the-art accuracy and more innacurate models which, however, require less computational resources. The experiments show that all of the parsers produce similarly good results in the sentiment analysis task, without their accuracy having any relevant influence on the results. Since parsing is currently a task with a relatively high computational cost that varies strongly between algorithms, this suggests that sentiment analysis researchers and users should prioritize speed over accuracy when choosing a parser; and parsing researchers should investigate models that improve speed further, even at some cost to accuracy.
Tasks	Sentiment Analysis
Published	2017-06-07
URL	http://arxiv.org/abs/1706.02141v3
PDF	http://arxiv.org/pdf/1706.02141v3.pdf
PWC	https://paperswithcode.com/paper/how-important-is-syntactic-parsing-accuracy
Repo
Framework

Discrete-Continuous ADMM for Transductive Inference in Higher-Order MRFs


Title	Discrete-Continuous ADMM for Transductive Inference in Higher-Order MRFs
Authors	Emanuel Laude, Jan-Hendrik Lange, Jonas Schüpfer, Csaba Domokos, Laura Leal-Taixé, Frank R. Schmidt, Bjoern Andres, Daniel Cremers
Abstract	This paper introduces a novel algorithm for transductive inference in higher-order MRFs, where the unary energies are parameterized by a variable classifier. The considered task is posed as a joint optimization problem in the continuous classifier parameters and the discrete label variables. In contrast to prior approaches such as convex relaxations, we propose an advantageous decoupling of the objective function into discrete and continuous subproblems and a novel, efficient optimization method related to ADMM. This approach preserves integrality of the discrete label variables and guarantees global convergence to a critical point. We demonstrate the advantages of our approach in several experiments including video object segmentation on the DAVIS data set and interactive image segmentation.
Tasks	Semantic Segmentation, Video Object Segmentation, Video Semantic Segmentation
Published	2017-05-14
URL	http://arxiv.org/abs/1705.05020v5
PDF	http://arxiv.org/pdf/1705.05020v5.pdf
PWC	https://paperswithcode.com/paper/discrete-continuous-admm-for-transductive
Repo
Framework

Autism Classification Using Brain Functional Connectivity Dynamics and Machine Learning


Title	Autism Classification Using Brain Functional Connectivity Dynamics and Machine Learning
Authors	Ravi Tejwani, Adam Liska, Hongyuan You, Jenna Reinen, Payel Das
Abstract	The goal of the present study is to identify autism using machine learning techniques and resting-state brain imaging data, leveraging the temporal variability of the functional connections (FC) as the only information. We estimated and compared the FC variability across brain regions between typical, healthy subjects and autistic population by analyzing brain imaging data from a world-wide multi-site database known as ABIDE (Autism Brain Imaging Data Exchange). Our analysis revealed that patients diagnosed with autism spectrum disorder (ASD) show increased FC variability in several brain regions that are associated with low FC variability in the typical brain. We then used the enhanced FC variability of brain regions as features for training machine learning models for ASD classification and achieved 65% accuracy in identification of ASD versus control subjects within the dataset. We also used node strength estimated from number of functional connections per node averaged over the whole scan as features for ASD classification.The results reveal that the dynamic FC measures outperform or are comparable with the static FC measures in predicting ASD.
Tasks
Published	2017-12-21
URL	http://arxiv.org/abs/1712.08041v1
PDF	http://arxiv.org/pdf/1712.08041v1.pdf
PWC	https://paperswithcode.com/paper/autism-classification-using-brain-functional
Repo
Framework


Title	Social welfare and profit maximization from revealed preferences
Authors	Ziwei Ji, Ruta Mehta, Matus Telgarsky
Abstract	Consider the seller’s problem of finding optimal prices for her $n$ (divisible) goods when faced with a set of $m$ consumers, given that she can only observe their purchased bundles at posted prices, i.e., revealed preferences. We study both social welfare and profit maximization with revealed preferences. Although social welfare maximization is a seemingly non-convex optimization problem in prices, we show that (i) it can be reduced to a dual convex optimization problem in prices, and (ii) the revealed preferences can be interpreted as supergradients of the concave conjugate of valuation, with which subgradients of the dual function can be computed. We thereby obtain a simple subgradient-based algorithm for strongly concave valuations and convex cost, with query complexity $O(m^2/\epsilon^2)$, where $\epsilon$ is the additive difference between the social welfare induced by our algorithm and the optimum social welfare. We also study social welfare maximization under the online setting, specifically the random permutation model, where consumers arrive one-by-one in a random order. For the case where consumer valuations can be arbitrary continuous functions, we propose a price posting mechanism that achieves an expected social welfare up to an additive factor of $O(\sqrt{mn})$ from the maximum social welfare. Finally, for profit maximization (which may be non-convex in simple cases), we give nearly matching upper and lower bounds on the query complexity for separable valuations and cost (i.e., each good can be treated independently).
Tasks
Published	2017-11-06
URL	http://arxiv.org/abs/1711.02211v2
PDF	http://arxiv.org/pdf/1711.02211v2.pdf
PWC	https://paperswithcode.com/paper/social-welfare-and-profit-maximization-from
Repo
Framework

Six Challenges for Neural Machine Translation


Title	Six Challenges for Neural Machine Translation
Authors	Philipp Koehn, Rebecca Knowles
Abstract	We explore six challenges for neural machine translation: domain mismatch, amount of training data, rare words, long sentences, word alignment, and beam search. We show both deficiencies and improvements over the quality of phrase-based statistical machine translation.
Tasks	Machine Translation, Word Alignment
Published	2017-06-12
URL	http://arxiv.org/abs/1706.03872v1
PDF	http://arxiv.org/pdf/1706.03872v1.pdf
PWC	https://paperswithcode.com/paper/six-challenges-for-neural-machine-translation
Repo
Framework

A Fast Integrated Planning and Control Framework for Autonomous Driving via Imitation Learning


Title	A Fast Integrated Planning and Control Framework for Autonomous Driving via Imitation Learning
Authors	Liting Sun, Cheng Peng, Wei Zhan, Masayoshi Tomizuka
Abstract	For safe and efficient planning and control in autonomous driving, we need a driving policy which can achieve desirable driving quality in long-term horizon with guaranteed safety and feasibility. Optimization-based approaches, such as Model Predictive Control (MPC), can provide such optimal policies, but their computational complexity is generally unacceptable for real-time implementation. To address this problem, we propose a fast integrated planning and control framework that combines learning- and optimization-based approaches in a two-layer hierarchical structure. The first layer, defined as the “policy layer”, is established by a neural network which learns the long-term optimal driving policy generated by MPC. The second layer, called the “execution layer”, is a short-term optimization-based controller that tracks the reference trajecotries given by the “policy layer” with guaranteed short-term safety and feasibility. Moreover, with efficient and highly-representative features, a small-size neural network is sufficient in the “policy layer” to handle many complicated driving scenarios. This renders online imitation learning with Dataset Aggregation (DAgger) so that the performance of the “policy layer” can be improved rapidly and continuously online. Several exampled driving scenarios are demonstrated to verify the effectiveness and efficiency of the proposed framework.
Tasks	Autonomous Driving, Imitation Learning
Published	2017-07-09
URL	http://arxiv.org/abs/1707.02515v1
PDF	http://arxiv.org/pdf/1707.02515v1.pdf
PWC	https://paperswithcode.com/paper/a-fast-integrated-planning-and-control
Repo
Framework

Paper Group ANR 208

Joint Background Reconstruction and Foreground Segmentation via A Two-stage Convolutional Neural Network

A filter based approach for inbetweening

Argumentation-based Security for Social Good

Is Natural Language a Perigraphic Process? The Theorem about Facts and Words Revisited

Hierarchical Representations for Efficient Architecture Search

Can We See Photosynthesis? Magnifying the Tiny Color Changes of Plant Green Leaves Using Eulerian Video Magnification

Natural Language Guided Visual Relationship Detection

Tracking Typological Traits of Uralic Languages in Distributed Language Representations

Predicting the Quality of Short Narratives from Social Media

How Important is Syntactic Parsing Accuracy? An Empirical Evaluation on Rule-Based Sentiment Analysis

Discrete-Continuous ADMM for Transductive Inference in Higher-Order MRFs

Autism Classification Using Brain Functional Connectivity Dynamics and Machine Learning

Social welfare and profit maximization from revealed preferences

Six Challenges for Neural Machine Translation

A Fast Integrated Planning and Control Framework for Autonomous Driving via Imitation Learning

Paper Group ANR 780

Paper Group ANR 554

Paper Group ANR 705