Paper Group ANR 271
A Growing Long-term Episodic & Semantic Memory. Co-Occuring Directions Sketching for Approximate Matrix Multiply. A Closed Form Solution to Multi-View Low-Rank Regression. From quantum foundations via natural language meaning to a theory of everything. Semantic Parsing with Semi-Supervised Sequential Autoencoders. Generating Binary Tags for Fast Me …
A Growing Long-term Episodic & Semantic Memory
Title | A Growing Long-term Episodic & Semantic Memory |
Authors | Marc Pickett, Rami Al-Rfou, Louis Shao, Chris Tar |
Abstract | The long-term memory of most connectionist systems lies entirely in the weights of the system. Since the number of weights is typically fixed, this bounds the total amount of knowledge that can be learned and stored. Though this is not normally a problem for a neural network designed for a specific task, such a bound is undesirable for a system that continually learns over an open range of domains. To address this, we describe a lifelong learning system that leverages a fast, though non-differentiable, content-addressable memory which can be exploited to encode both a long history of sequential episodic knowledge and semantic knowledge over many episodes for an unbounded number of domains. This opens the door for investigation into transfer learning, and leveraging prior knowledge that has been learned over a lifetime of experiences to new domains. |
Tasks | Transfer Learning |
Published | 2016-10-20 |
URL | http://arxiv.org/abs/1610.06402v1 |
http://arxiv.org/pdf/1610.06402v1.pdf | |
PWC | https://paperswithcode.com/paper/a-growing-long-term-episodic-semantic-memory |
Repo | |
Framework | |
Co-Occuring Directions Sketching for Approximate Matrix Multiply
Title | Co-Occuring Directions Sketching for Approximate Matrix Multiply |
Authors | Youssef Mroueh, Etienne Marcheret, Vaibhava Goel |
Abstract | We introduce co-occurring directions sketching, a deterministic algorithm for approximate matrix product (AMM), in the streaming model. We show that co-occuring directions achieves a better error bound for AMM than other randomized and deterministic approaches for AMM. Co-occurring directions gives a $1 + \epsilon$ -approximation of the optimal low rank approximation of a matrix product. Empirically our algorithm outperforms competing methods for AMM, for a small sketch size. We validate empirically our theoretical findings and algorithms |
Tasks | |
Published | 2016-10-25 |
URL | http://arxiv.org/abs/1610.07686v1 |
http://arxiv.org/pdf/1610.07686v1.pdf | |
PWC | https://paperswithcode.com/paper/co-occuring-directions-sketching-for |
Repo | |
Framework | |
A Closed Form Solution to Multi-View Low-Rank Regression
Title | A Closed Form Solution to Multi-View Low-Rank Regression |
Authors | Shuai Zheng, Xiao Cai, Chris Ding, Feiping Nie, Heng Huang |
Abstract | Real life data often includes information from different channels. For example, in computer vision, we can describe an image using different image features, such as pixel intensity, color, HOG, GIST feature, SIFT features, etc.. These different aspects of the same objects are often called multi-view (or multi-modal) data. Low-rank regression model has been proved to be an effective learning mechanism by exploring the low-rank structure of real life data. But previous low-rank regression model only works on single view data. In this paper, we propose a multi-view low-rank regression model by imposing low-rank constraints on multi-view regression model. Most importantly, we provide a closed-form solution to the multi-view low-rank regression model. Extensive experiments on 4 multi-view datasets show that the multi-view low-rank regression model outperforms single-view regression model and reveals that multi-view low-rank structure is very helpful. |
Tasks | |
Published | 2016-10-14 |
URL | http://arxiv.org/abs/1610.04668v1 |
http://arxiv.org/pdf/1610.04668v1.pdf | |
PWC | https://paperswithcode.com/paper/a-closed-form-solution-to-multi-view-low-rank |
Repo | |
Framework | |
From quantum foundations via natural language meaning to a theory of everything
Title | From quantum foundations via natural language meaning to a theory of everything |
Authors | Bob Coecke |
Abstract | In this paper we argue for a paradigmatic shift from reductionism' to togetherness’. In particular, we show how interaction between systems in quantum theory naturally carries over to modelling how word meanings interact in natural language. Since meaning in natural language, depending on the subject domain, encompasses discussions within any scientific discipline, we obtain a template for theories such as social interaction, animal behaviour, and many others. |
Tasks | |
Published | 2016-02-22 |
URL | http://arxiv.org/abs/1602.07618v1 |
http://arxiv.org/pdf/1602.07618v1.pdf | |
PWC | https://paperswithcode.com/paper/from-quantum-foundations-via-natural-language |
Repo | |
Framework | |
Semantic Parsing with Semi-Supervised Sequential Autoencoders
Title | Semantic Parsing with Semi-Supervised Sequential Autoencoders |
Authors | Tomáš Kočiský, Gábor Melis, Edward Grefenstette, Chris Dyer, Wang Ling, Phil Blunsom, Karl Moritz Hermann |
Abstract | We present a novel semi-supervised approach for sequence transduction and apply it to semantic parsing. The unsupervised component is based on a generative model in which latent sentences generate the unpaired logical forms. We apply this method to a number of semantic parsing tasks focusing on domains with limited access to labelled training data and extend those datasets with synthetically generated logical forms. |
Tasks | Semantic Parsing |
Published | 2016-09-29 |
URL | http://arxiv.org/abs/1609.09315v1 |
http://arxiv.org/pdf/1609.09315v1.pdf | |
PWC | https://paperswithcode.com/paper/semantic-parsing-with-semi-supervised |
Repo | |
Framework | |
Generating Binary Tags for Fast Medical Image Retrieval Based on Convolutional Nets and Radon Transform
Title | Generating Binary Tags for Fast Medical Image Retrieval Based on Convolutional Nets and Radon Transform |
Authors | Xinran Liu, Hamid R. Tizhoosh, Jonathan Kofman |
Abstract | Content-based image retrieval (CBIR) in large medical image archives is a challenging and necessary task. Generally, different feature extraction methods are used to assign expressive and invariant features to each image such that the search for similar images comes down to feature classification and/or matching. The present work introduces a new image retrieval method for medical applications that employs a convolutional neural network (CNN) with recently introduced Radon barcodes. We combine neural codes for global classification with Radon barcodes for the final retrieval. We also examine image search based on regions of interest (ROI) matching after image retrieval. The IRMA dataset with more than 14,000 x-rays images is used to evaluate the performance of our method. Experimental results show that our approach is superior to many published works. |
Tasks | Content-Based Image Retrieval, Image Retrieval, Medical Image Retrieval |
Published | 2016-04-16 |
URL | http://arxiv.org/abs/1604.04676v1 |
http://arxiv.org/pdf/1604.04676v1.pdf | |
PWC | https://paperswithcode.com/paper/generating-binary-tags-for-fast-medical-image |
Repo | |
Framework | |
Exploiting Depth from Single Monocular Images for Object Detection and Semantic Segmentation
Title | Exploiting Depth from Single Monocular Images for Object Detection and Semantic Segmentation |
Authors | Yuanzhouhan Cao, Chunhua Shen, Heng Tao Shen |
Abstract | Augmenting RGB data with measured depth has been shown to improve the performance of a range of tasks in computer vision including object detection and semantic segmentation. Although depth sensors such as the Microsoft Kinect have facilitated easy acquisition of such depth information, the vast majority of images used in vision tasks do not contain depth information. In this paper, we show that augmenting RGB images with estimated depth can also improve the accuracy of both object detection and semantic segmentation. Specifically, we first exploit the recent success of depth estimation from monocular images and learn a deep depth estimation model. Then we learn deep depth features from the estimated depth and combine with RGB features for object detection and semantic segmentation. Additionally, we propose an RGB-D semantic segmentation method which applies a multi-task training scheme: semantic label prediction and depth value regression. We test our methods on several datasets and demonstrate that incorporating information from estimated depth improves the performance of object detection and semantic segmentation remarkably. |
Tasks | Depth Estimation, Object Detection, Semantic Segmentation |
Published | 2016-10-06 |
URL | http://arxiv.org/abs/1610.01706v1 |
http://arxiv.org/pdf/1610.01706v1.pdf | |
PWC | https://paperswithcode.com/paper/exploiting-depth-from-single-monocular-images |
Repo | |
Framework | |
Guidefill: GPU Accelerated, Artist Guided Geometric Inpainting for 3D Conversion
Title | Guidefill: GPU Accelerated, Artist Guided Geometric Inpainting for 3D Conversion |
Authors | L. Robert Hocking, Russell MacKenzie, Carola-Bibiane Schoenlieb |
Abstract | The conversion of traditional film into stereo 3D has become an important problem in the past decade. One of the main bottlenecks is a disocclusion step, which in commercial 3D conversion is usually done by teams of artists armed with a toolbox of inpainting algorithms. A current difficulty in this is that most available algorithms are either too slow for interactive use, or provide no intuitive means for users to tweak the output. In this paper we present a new fast inpainting algorithm based on transporting along automatically detected splines, which the user may edit. Our algorithm is implemented on the GPU and fills the inpainting domain in successive shells that adapt their shape on the fly. In order to allocate GPU resources as efficiently as possible, we propose a parallel algorithm to track the inpainting interface as it evolves, ensuring that no resources are wasted on pixels that are not currently being worked on. Theoretical analysis of the time and processor complexiy of our algorithm without and with tracking (as well as numerous numerical experiments) demonstrate the merits of the latter. Our transport mechanism is similar to the one used in coherence transport, but improves upon it by corrected a “kinking” phenomena whereby extrapolated isophotes may bend at the boundary of the inpainting domain. Theoretical results explaining this phenomena and its resolution are presented. Although our method ignores texture, in many cases this is not a problem due to the thin inpainting domains in 3D conversion. Experimental results show that our method can achieve a visual quality that is competitive with the state-of-the-art while maintaining interactive speeds and providing the user with an intuitive interface to tweak the results. |
Tasks | |
Published | 2016-11-16 |
URL | http://arxiv.org/abs/1611.05319v3 |
http://arxiv.org/pdf/1611.05319v3.pdf | |
PWC | https://paperswithcode.com/paper/guidefill-gpu-accelerated-artist-guided |
Repo | |
Framework | |
Hierarchical Multi-resolution Mesh Networks for Brain Decoding
Title | Hierarchical Multi-resolution Mesh Networks for Brain Decoding |
Authors | Itir Onal Ertugrul, Mete Ozay, Fatos Tunay Yarman Vural |
Abstract | We propose a new framework, called Hierarchical Multi-resolution Mesh Networks (HMMNs), which establishes a set of brain networks at multiple time resolutions of fMRI signal to represent the underlying cognitive process. The suggested framework, first, decomposes the fMRI signal into various frequency subbands using wavelet transforms. Then, a brain network, called mesh network, is formed at each subband by ensembling a set of local meshes. The locality around each anatomic region is defined with respect to a neighborhood system based on functional connectivity. The arc weights of a mesh are estimated by ridge regression formed among the average region time series. In the final step, the adjacency matrices of mesh networks obtained at different subbands are ensembled for brain decoding under a hierarchical learning architecture, called, fuzzy stacked generalization (FSG). Our results on Human Connectome Project task-fMRI dataset reflect that the suggested HMMN model can successfully discriminate tasks by extracting complementary information obtained from mesh arc weights of multiple subbands. We study the topological properties of the mesh networks at different resolutions using the network measures, namely, node degree, node strength, betweenness centrality and global efficiency; and investigate the connectivity of anatomic regions, during a cognitive task. We observe significant variations among the network topologies obtained for different subbands. We, also, analyze the diversity properties of classifier ensemble, trained by the mesh networks in multiple subbands and observe that the classifiers in the ensemble collaborate with each other to fuse the complementary information freed at each subband. We conclude that the fMRI data, recorded during a cognitive task, embed diverse information across the anatomic regions at each resolution. |
Tasks | Brain Decoding, Time Series |
Published | 2016-07-12 |
URL | http://arxiv.org/abs/1607.07695v2 |
http://arxiv.org/pdf/1607.07695v2.pdf | |
PWC | https://paperswithcode.com/paper/hierarchical-multi-resolution-mesh-networks |
Repo | |
Framework | |
Estimating mutual information in high dimensions via classification error
Title | Estimating mutual information in high dimensions via classification error |
Authors | Charles Y. Zheng, Yuval Benjamini |
Abstract | Multivariate pattern analyses approaches in neuroimaging are fundamentally concerned with investigating the quantity and type of information processed by various regions of the human brain; typically, estimates of classification accuracy are used to quantify information. While a extensive and powerful library of methods can be applied to train and assess classifiers, it is not always clear how to use the resulting measures of classification performance to draw scientific conclusions: e.g. for the purpose of evaluating redundancy between brain regions. An additional confound for interpreting classification performance is the dependence of the error rate on the number and choice of distinct classes obtained for the classification task. In contrast, mutual information is a quantity defined independently of the experimental design, and has ideal properties for comparative analyses. Unfortunately, estimating the mutual information based on observations becomes statistically infeasible in high dimensions without some kind of assumption or prior. In this paper, we construct a novel classification-based estimator of mutual information based on high-dimensional asymptotics. We show that in a particular limiting regime, the mutual information is an invertible function of the expected $k$-class Bayes error. While the theory is based on a large-sample, high-dimensional limit, we demonstrate through simulations that our proposed estimator has superior performance to the alternatives in problems of moderate dimensionality. |
Tasks | |
Published | 2016-06-16 |
URL | http://arxiv.org/abs/1606.05229v2 |
http://arxiv.org/pdf/1606.05229v2.pdf | |
PWC | https://paperswithcode.com/paper/estimating-mutual-information-in-high |
Repo | |
Framework | |
Multi-Field Structural Decomposition for Question Answering
Title | Multi-Field Structural Decomposition for Question Answering |
Authors | Tomasz Jurczyk, Jinho D. Choi |
Abstract | This paper presents a precursory yet novel approach to the question answering task using structural decomposition. Our system first generates linguistic structures such as syntactic and semantic trees from text, decomposes them into multiple fields, then indexes the terms in each field. For each question, it decomposes the question into multiple fields, measures the relevance score of each field to the indexed ones, then ranks all documents by their relevance scores and weights associated with the fields, where the weights are learned through statistical modeling. Our final model gives an absolute improvement of over 40% to the baseline approach using simple search for detecting documents containing answers. |
Tasks | Question Answering |
Published | 2016-04-04 |
URL | http://arxiv.org/abs/1604.00938v1 |
http://arxiv.org/pdf/1604.00938v1.pdf | |
PWC | https://paperswithcode.com/paper/multi-field-structural-decomposition-for |
Repo | |
Framework | |
A Robust Framework for Classifying Evolving Document Streams in an Expert-Machine-Crowd Setting
Title | A Robust Framework for Classifying Evolving Document Streams in an Expert-Machine-Crowd Setting |
Authors | Muhammad Imran, Sanjay Chawla, Carlos Castillo |
Abstract | An emerging challenge in the online classification of social media data streams is to keep the categories used for classification up-to-date. In this paper, we propose an innovative framework based on an Expert-Machine-Crowd (EMC) triad to help categorize items by continuously identifying novel concepts in heterogeneous data streams often riddled with outliers. We unify constrained clustering and outlier detection by formulating a novel optimization problem: COD-Means. We design an algorithm to solve the COD-Means problem and show that COD-Means will not only help detect novel categories but also seamlessly discover human annotation errors and improve the overall quality of the categorization process. Experiments on diverse real data sets demonstrate that our approach is both effective and efficient. |
Tasks | Outlier Detection |
Published | 2016-10-06 |
URL | http://arxiv.org/abs/1610.01858v1 |
http://arxiv.org/pdf/1610.01858v1.pdf | |
PWC | https://paperswithcode.com/paper/a-robust-framework-for-classifying-evolving |
Repo | |
Framework | |
Introduction: Cognitive Issues in Natural Language Processing
Title | Introduction: Cognitive Issues in Natural Language Processing |
Authors | Thierry Poibeau, Shravan Vasishth |
Abstract | This special issue is dedicated to get a better picture of the relationships between computational linguistics and cognitive science. It specifically raises two questions: “what is the potential contribution of computational language modeling to cognitive science?” and conversely: “what is the influence of cognitive science in contemporary computational linguistics?” |
Tasks | Language Modelling |
Published | 2016-10-24 |
URL | http://arxiv.org/abs/1610.07365v1 |
http://arxiv.org/pdf/1610.07365v1.pdf | |
PWC | https://paperswithcode.com/paper/introduction-cognitive-issues-in-natural |
Repo | |
Framework | |
Concept based Attention
Title | Concept based Attention |
Authors | Jie You, Xin Yang, Matthias Hub |
Abstract | Attention endows animals an ability to concentrate on the most relevant information among a deluge of distractors at any given time, either through volitionally ‘top-down’ biasing, or driven by automatically ‘bottom-up’ saliency of stimuli, in favour of advantageous competition in neural modulations for information processing. Nevertheless, instead of being limited to perceive simple features, human and other advanced animals adaptively learn the world into categories and abstract concepts from experiences, imparting the world meanings. This thesis suggests that the high-level cognitive ability of human is more likely driven by attention basing on abstract perceptions, which is defined as concept based attention (CbA). |
Tasks | |
Published | 2016-05-11 |
URL | http://arxiv.org/abs/1605.03416v1 |
http://arxiv.org/pdf/1605.03416v1.pdf | |
PWC | https://paperswithcode.com/paper/concept-based-attention |
Repo | |
Framework | |
Effective Combination of Language and Vision Through Model Composition and the R-CCA Method
Title | Effective Combination of Language and Vision Through Model Composition and the R-CCA Method |
Authors | Hagar Loeub, Roi Reichart |
Abstract | We address the problem of integrating textual and visual information in vector space models for word meaning representation. We first present the Residual CCA (R-CCA) method, that complements the standard CCA method by representing, for each modality, the difference between the original signal and the signal projected to the shared, max correlation, space. We then show that constructing visual and textual representations and then post-processing them through composition of common modeling motifs such as PCA, CCA, R-CCA and linear interpolation (a.k.a sequential modeling) yields high quality models. On five standard semantic benchmarks our sequential models outperform recent multimodal representation learning alternatives, including ones that rely on joint representation learning. For two of these benchmarks our R-CCA method is part of the Best configuration our algorithm yields. |
Tasks | Representation Learning |
Published | 2016-09-28 |
URL | http://arxiv.org/abs/1609.08810v2 |
http://arxiv.org/pdf/1609.08810v2.pdf | |
PWC | https://paperswithcode.com/paper/effective-combination-of-language-and-vision |
Repo | |
Framework | |