Paper Group NAWR 26
iComposer: An Automatic Songwriting System for Chinese Popular Music. Distributed Low-rank Matrix Factorization With Exact Consensus. XQA: A Cross-lingual Open-domain Question Answering Dataset. Rethinking Deep Neural Network Ownership Verification: Embedding Passports to Defeat Ambiguity Attacks. Ghmerti at SemEval-2019 Task 6: A Deep Word- and Ch …
iComposer: An Automatic Songwriting System for Chinese Popular Music
Title | iComposer: An Automatic Songwriting System for Chinese Popular Music |
Authors | Hsin-Pei Lee, Jhih-Sheng Fang, Wei-Yun Ma |
Abstract | In this paper, we introduce iComposer, an interactive web-based songwriting system designed to assist human creators by greatly simplifying music production. iComposer automatically creates melodies to accompany any given text. It also enables users to generate a set of lyrics given arbitrary melodies. iComposer is based on three sequence-to-sequence models, which are used to predict melody, rhythm, and lyrics, respectively. Songs generated by iComposer are compared with human-composed and randomly-generated ones in a subjective test, the experimental results of which demonstrate the capability of the proposed system to write pleasing melodies and meaningful lyrics at a level similar to that of humans. |
Tasks | |
Published | 2019-06-01 |
URL | https://www.aclweb.org/anthology/N19-4015/ |
https://www.aclweb.org/anthology/N19-4015 | |
PWC | https://paperswithcode.com/paper/icomposer-an-automatic-songwriting-system-for |
Repo | https://github.com/hhpslily/iComposer |
Framework | pytorch |
Distributed Low-rank Matrix Factorization With Exact Consensus
Title | Distributed Low-rank Matrix Factorization With Exact Consensus |
Authors | Zhihui Zhu, Qiuwei Li, Xinshuo Yang, Gongguo Tang, Michael B. Wakin |
Abstract | Low-rank matrix factorization is a problem of broad importance, owing to the ubiquity of low-rank models in machine learning contexts. In spite of its non- convexity, this problem has a well-behaved geometric landscape, permitting local search algorithms such as gradient descent to converge to global minimizers. In this paper, we study low-rank matrix factorization in the distributed setting, where local variables at each node encode parts of the overall matrix factors, and consensus is encouraged among certain such variables. We identify conditions under which this new problem also has a well-behaved geometric landscape, and we propose an extension of distributed gradient descent (DGD) to solve this problem. The favorable landscape allows us to prove convergence to global optimality with exact consensus, a stronger result than what is provided by off-the-shelf DGD theory. |
Tasks | |
Published | 2019-12-01 |
URL | http://papers.nips.cc/paper/9050-distributed-low-rank-matrix-factorization-with-exact-consensus |
http://papers.nips.cc/paper/9050-distributed-low-rank-matrix-factorization-with-exact-consensus.pdf | |
PWC | https://paperswithcode.com/paper/distributed-low-rank-matrix-factorization |
Repo | https://github.com/xinshuoyang/DGD-LOCAL |
Framework | none |
XQA: A Cross-lingual Open-domain Question Answering Dataset
Title | XQA: A Cross-lingual Open-domain Question Answering Dataset |
Authors | Jiahua Liu, Yankai Lin, Zhiyuan Liu, Maosong Sun |
Abstract | Open-domain question answering (OpenQA) aims to answer questions through text retrieval and reading comprehension. Recently, lots of neural network-based models have been proposed and achieved promising results in OpenQA. However, the success of these models relies on a massive volume of training data (usually in English), which is not available in many other languages, especially for those low-resource languages. Therefore, it is essential to investigate cross-lingual OpenQA. In this paper, we construct a novel dataset XQA for cross-lingual OpenQA research. It consists of a training set in English as well as development and test sets in eight other languages. Besides, we provide several baseline systems for cross-lingual OpenQA, including two machine translation-based methods and one zero-shot cross-lingual method (multilingual BERT). Experimental results show that the multilingual BERT model achieves the best results in almost all target languages, while the performance of cross-lingual OpenQA is still much lower than that of English. Our analysis indicates that the performance of cross-lingual OpenQA is related to not only how similar the target language and English are, but also how difficult the question set of the target language is. The XQA dataset is publicly available at http://github.com/thunlp/XQA. |
Tasks | Machine Translation, Open-Domain Question Answering, Question Answering, Reading Comprehension |
Published | 2019-07-01 |
URL | https://www.aclweb.org/anthology/P19-1227/ |
https://www.aclweb.org/anthology/P19-1227 | |
PWC | https://paperswithcode.com/paper/xqa-a-cross-lingual-open-domain-question |
Repo | https://github.com/thunlp/XQA |
Framework | tf |
Rethinking Deep Neural Network Ownership Verification: Embedding Passports to Defeat Ambiguity Attacks
Title | Rethinking Deep Neural Network Ownership Verification: Embedding Passports to Defeat Ambiguity Attacks |
Authors | Lixin Fan, Kam Woh Ng, Chee Seng Chan |
Abstract | With substantial amount of time, resources and human (team) efforts invested to explore and develop successful deep neural networks (DNN), there emerges an urgent need to protect these inventions from being illegally copied, redistributed, or abused without respecting the intellectual properties of legitimate owners. Following recent progresses along this line, we investigate a number of watermark-based DNN ownership verification methods in the face of ambiguity attacks, which aim to cast doubts on the ownership verification by forging counterfeit watermarks. It is shown that ambiguity attacks pose serious threats to existing DNN watermarking methods. As remedies to the above-mentioned loophole, this paper proposes novel passport-based DNN ownership verification schemes which are both robust to network modifications and resilient to ambiguity attacks. The gist of embedding digital passports is to design and train DNN models in a way such that, the DNN inference performance of an original task will be significantly deteriorated due to forged passports. In other words, genuine passports are not only verified by looking for the predefined signatures, but also reasserted by the unyielding DNN model inference performances. Extensive experimental results justify the effectiveness of the proposed passport-based DNN ownership verification schemes. Code and models are available at https://github.com/kamwoh/DeepIPR |
Tasks | |
Published | 2019-12-01 |
URL | http://papers.nips.cc/paper/8719-rethinking-deep-neural-network-ownership-verification-embedding-passports-to-defeat-ambiguity-attacks |
http://papers.nips.cc/paper/8719-rethinking-deep-neural-network-ownership-verification-embedding-passports-to-defeat-ambiguity-attacks.pdf | |
PWC | https://paperswithcode.com/paper/rethinking-deep-neural-network-ownership-1 |
Repo | https://github.com/kamwoh/DeepIPR |
Framework | pytorch |
Ghmerti at SemEval-2019 Task 6: A Deep Word- and Character-based Approach to Offensive Language Identification
Title | Ghmerti at SemEval-2019 Task 6: A Deep Word- and Character-based Approach to Offensive Language Identification |
Authors | Ehsan Doostmohammadi, Hossein Sameti, Ali Saffar |
Abstract | This paper presents the models submitted by Ghmerti team for subtasks A and B of the OffensEval shared task at SemEval 2019. OffensEval addresses the problem of identifying and categorizing offensive language in social media in three subtasks; whether or not a content is offensive (subtask A), whether it is targeted (subtask B) towards an individual, a group, or other entities (subtask C). The proposed approach includes character-level Convolutional Neural Network, word-level Recurrent Neural Network, and some preprocessing. The performance achieved by the proposed model is 77.93{%} macro-averaged F1-score. |
Tasks | Language Identification |
Published | 2019-06-01 |
URL | https://www.aclweb.org/anthology/S19-2110/ |
https://www.aclweb.org/anthology/S19-2110 | |
PWC | https://paperswithcode.com/paper/ghmerti-at-semeval-2019-task-6-a-deep-word |
Repo | https://github.com/edoost/offenseval |
Framework | tf |
Latent Ordinary Differential Equations for Irregularly-Sampled Time Series
Title | Latent Ordinary Differential Equations for Irregularly-Sampled Time Series |
Authors | Yulia Rubanova, Tian Qi Chen, David K. Duvenaud |
Abstract | Time series with non-uniform intervals occur in many applications, and are difficult to model using standard recurrent neural networks (RNNs). We generalize RNNs to have continuous-time hidden dynamics defined by ordinary differential equations (ODEs), a model we call ODE-RNNs. Furthermore, we use ODE-RNNs to replace the recognition network of the recently-proposed Latent ODE model. Both ODE-RNNs and Latent ODEs can naturally handle arbitrary time gaps between observations, and can explicitly model the probability of observation times using Poisson processes. We show experimentally that these ODE-based models outperform their RNN-based counterparts on irregularly-sampled data. |
Tasks | Time Series |
Published | 2019-12-01 |
URL | http://papers.nips.cc/paper/8773-latent-ordinary-differential-equations-for-irregularly-sampled-time-series |
http://papers.nips.cc/paper/8773-latent-ordinary-differential-equations-for-irregularly-sampled-time-series.pdf | |
PWC | https://paperswithcode.com/paper/latent-ordinary-differential-equations-for |
Repo | https://github.com/YuliaRubanova/latent_ode |
Framework | pytorch |
DC2Anet: Generating Lumbar Spine MR Images from CT Scan Data Based on Semi-Supervised Learning
Title | DC2Anet: Generating Lumbar Spine MR Images from CT Scan Data Based on Semi-Supervised Learning |
Authors | Cheng-Bin Jin, Hakil Kim, Mingjie Liu, In Ho Han, Jae Il Lee, Jung Hwan Lee, Seongsu Joo, Eunsik Park, Young Saem Ahn, Xuenan Cui |
Abstract | Magnetic resonance imaging (MRI) plays a significant role in the diagnosis of lumbar disc disease. However, the use of MRI is limited because of its high cost and significant operating and processing time. More importantly, MRI is contraindicated for some patients with claustrophobia or cardiac pacemakers due to the possibility of injury. In contrast, computed tomography (CT) scans are much less expensive, are faster, and do not face the same limitations. In this paper, we propose a method for estimating lumbar spine MR images based on CT images using a novel objective function and a dual cycle-consistent adversarial network (DC2Anet) with semi-supervised learning. The objective function includes six independent loss terms to balance quantitative and qualitative losses, enabling the generation of a realistic and accurate synthetic MR image. DC2Anet is also capable of semi-supervised learning, and the network is general enough for supervised or unsupervised setups. Experimental results prove that the method is accurate, being able to construct MR images that closely approximate reference MR images, while also outperforming four other state-of-the-art methods. |
Tasks | Computed Tomography (CT) |
Published | 2019-06-18 |
URL | https://www.mdpi.com/2076-3417/9/12/2521 |
https://www.mdpi.com/2076-3417/9/12/2521 | |
PWC | https://paperswithcode.com/paper/dc2anet-generating-lumbar-spine-mr-images |
Repo | https://github.com/ChengBinJin/SpineC2M |
Framework | tf |
Acoustic Non-Line-Of-Sight Imaging
Title | Acoustic Non-Line-Of-Sight Imaging |
Authors | David B. Lindell, Gordon Wetzstein, Vladlen Koltun |
Abstract | Non-line-of-sight (NLOS) imaging enables unprecedented capabilities in a wide range of applications, including robotic and machine vision, remote sensing, autonomous vehicle navigation, and medical imaging. Recent approaches to solving this challenging problem employ optical time-of-flight imaging systems with highly sensitive time-resolved photodetectors and ultra-fast pulsed lasers. However, despite recent successes in NLOS imaging using these systems, widespread implementation and adoption of the technology remains a challenge because of the requirement for specialized, expensive hardware. We introduce acoustic NLOS imaging, which is orders of magnitude less expensive than most optical systems and captures hidden 3D geometry at longer ranges with shorter acquisition times compared to state-of-the-art optical methods. Inspired by hardware setups used in radar and algorithmic approaches to model and invert wave-based image formation models developed in the seismic imaging community, we demonstrate a new approach to seeing around corners. |
Tasks | |
Published | 2019-06-01 |
URL | http://openaccess.thecvf.com/content_CVPR_2019/html/Lindell_Acoustic_Non-Line-Of-Sight_Imaging_CVPR_2019_paper.html |
http://openaccess.thecvf.com/content_CVPR_2019/papers/Lindell_Acoustic_Non-Line-Of-Sight_Imaging_CVPR_2019_paper.pdf | |
PWC | https://paperswithcode.com/paper/acoustic-non-line-of-sight-imaging |
Repo | https://github.com/computational-imaging/AcousticNLOS |
Framework | none |
On two ways to use determinantal point processes for Monte Carlo integration
Title | On two ways to use determinantal point processes for Monte Carlo integration |
Authors | Guillaume Gautier, Rémi Bardenet, Michal Valko |
Abstract | When approximating an integral by a weighted sum of function evaluations, determinantal point processes (DPPs) provide a way to enforce repulsion between the evaluation points. This negative dependence is encoded by a kernel. Fifteen years before the discovery of DPPs, Ermakov & Zolotukhin (EZ, 1960) had the intuition of sampling a DPP and solving a linear system to compute an unbiased Monte Carlo estimator of the integral. In the absence of DPP machinery to derive an efficient sampler and analyze their estimator, the idea of Monte Carlo integration with DPPs was stored in the cellar of numerical integration. Recently, Bardenet & Hardy (BH, 2019) came up with a more natural estimator with a fast central limit theorem (CLT). In this paper, we first take the EZ estimator out of the cellar, and analyze it using modern arguments. Second, we provide an efficient implementation to sample exactly a particular multidimensional DPP called multivariate Jacobi ensemble. The latter satisfies the assumptions of the aforementioned CLT. Third, our new implementation lets us investigate the behavior of the two unbiased Monte Carlo estimators in yet unexplored regimes. We demonstrate experimentally good properties when the kernel is adapted to basis of functions in which the integrand is sparse or has fast-decaying coefficients. If such a basis and the level of sparsity are known (e.g., we integrate a linear combination of kernel eigenfunctions), the EZ estimator can be the right choice, but otherwise it can display an erratic behavior. |
Tasks | Point Processes |
Published | 2019-12-01 |
URL | http://papers.nips.cc/paper/8992-on-two-ways-to-use-determinantal-point-processes-for-monte-carlo-integration |
http://papers.nips.cc/paper/8992-on-two-ways-to-use-determinantal-point-processes-for-monte-carlo-integration.pdf | |
PWC | https://paperswithcode.com/paper/on-two-ways-to-use-determinantal-point |
Repo | https://github.com/guilgautier/DPPy |
Framework | none |
Random Path Selection for Continual Learning
Title | Random Path Selection for Continual Learning |
Authors | Jathushan Rajasegaran, Munawar Hayat, Salman H. Khan, Fahad Shahbaz Khan, Ling Shao |
Abstract | Incremental life-long learning is a main challenge towards the long-standing goal of Artificial General Intelligence. In real-life settings, learning tasks arrive in a sequence and machine learning models must continually learn to increment already acquired knowledge. The existing incremental learning approaches fall well below the state-of-the-art cumulative models that use all training classes at once. In this paper, we propose a random path selection algorithm, called RPS-Net, that progressively chooses optimal paths for the new tasks while encouraging parameter sharing and reuse. Our approach avoids the overhead introduced by computationally expensive evolutionary and reinforcement learning based path selection strategies while achieving considerable performance gains. As an added novelty, the proposed model integrates knowledge distillation and retrospection along with the path selection strategy to overcome catastrophic forgetting. In order to maintain an equilibrium between previous and newly acquired knowledge, we propose a simple controller to dynamically balance the model plasticity. Through extensive experiments, we demonstrate that the proposed method surpasses the state-of-the-art performance on incremental learning and by utilizing parallel computation this method can run in constant time with nearly the same efficiency as a conventional deep convolutional neural network. |
Tasks | Continual Learning |
Published | 2019-12-01 |
URL | http://papers.nips.cc/paper/9429-random-path-selection-for-continual-learning |
http://papers.nips.cc/paper/9429-random-path-selection-for-continual-learning.pdf | |
PWC | https://paperswithcode.com/paper/random-path-selection-for-continual-learning |
Repo | https://github.com/brjathu/RPSnet |
Framework | pytorch |
Pay Attention! - Robustifying a Deep Visuomotor Policy Through Task-Focused Visual Attention
Title | Pay Attention! - Robustifying a Deep Visuomotor Policy Through Task-Focused Visual Attention |
Authors | Pooya Abolghasemi, Amir Mazaheri, Mubarak Shah, Ladislau Boloni |
Abstract | Several recent studies have demonstrated the promise of deep visuomotor policies for robot manipulator control. Despite impressive progress, these systems are known to be vulnerable to physical disturbances, such as accidental or adversarial bumps that make them drop the manipulated object. They also tend to be distracted by visual disturbances such as objects moving in the robot’s field of view, even if the disturbance does not physically prevent the execution of the task. In this paper, we propose an approach for augmenting a deep visuomotor policy trained through demonstrations with Task Focused visual Attention (TFA). The manipulation task is specified with a natural language text such as “move the red bowl to the left”. This allows the visual attention component to concentrate on the current object that the robot needs to manipulate. We show that even in benign environments, the TFA allows the policy to consistently outperform a variant with no attention mechanism. More importantly, the new policy is significantly more robust: it regularly recovers from severe physical disturbances (such as bumps causing it to drop the object) from which the baseline policy, i.e. with no visual attention, almost never recovers. In addition, we show that the proposed policy performs correctly in the presence of a wide class of visual disturbances, exhibiting a behavior reminiscent of human selective visual attention experiments. |
Tasks | Imitation Learning |
Published | 2019-06-01 |
URL | http://openaccess.thecvf.com/content_CVPR_2019/html/Abolghasemi_Pay_Attention_-_Robustifying_a_Deep_Visuomotor_Policy_Through_Task-Focused_CVPR_2019_paper.html |
http://openaccess.thecvf.com/content_CVPR_2019/papers/Abolghasemi_Pay_Attention_-_Robustifying_a_Deep_Visuomotor_Policy_Through_Task-Focused_CVPR_2019_paper.pdf | |
PWC | https://paperswithcode.com/paper/pay-attention-robustifying-a-deep-visuomotor-1 |
Repo | https://github.com/pouyaAB/Pay-Attention |
Framework | none |
UnOS: Unified Unsupervised Optical-Flow and Stereo-Depth Estimation by Watching Videos
Title | UnOS: Unified Unsupervised Optical-Flow and Stereo-Depth Estimation by Watching Videos |
Authors | Yang Wang, Peng Wang, Zhenheng Yang, Chenxu Luo, Yi Yang, Wei Xu |
Abstract | In this paper, we propose UnOS, an unified system for unsupervised optical flow and stereo depth estimation using convolutional neural network (CNN) by taking advantages of their inherent geometrical consistency based on the rigid-scene assumption. UnOS significantly outperforms other state-of-the-art (SOTA) unsupervised approaches that treated the two tasks independently. Specifically, given two consecutive stereo image pairs from a video, UnOS estimates per-pixel stereo depth images, camera ego-motion and optical flow with three parallel CNNs. Based on these quantities, UnOS computes rigid optical flow and compares it against the optical flow estimated from the FlowNet, yielding pixels satisfying the rigid-scene assumption. Then, we encourage geometrical consistency between the two estimated flows within rigid regions, from which we derive a rigid-aware direct visual odometry (RDVO) module. We also propose rigid and occlusion-aware flow-consistency losses for the learning of UnOS. We evaluated our results on the popular KITTI dataset over 4 related tasks, i.e. stereo depth, optical flow, visual odometry and motion segmentation. |
Tasks | Depth Estimation, Motion Segmentation, Optical Flow Estimation, Stereo Depth Estimation, Visual Odometry |
Published | 2019-06-01 |
URL | http://openaccess.thecvf.com/content_CVPR_2019/html/Wang_UnOS_Unified_Unsupervised_Optical-Flow_and_Stereo-Depth_Estimation_by_Watching_Videos_CVPR_2019_paper.html |
http://openaccess.thecvf.com/content_CVPR_2019/papers/Wang_UnOS_Unified_Unsupervised_Optical-Flow_and_Stereo-Depth_Estimation_by_Watching_Videos_CVPR_2019_paper.pdf | |
PWC | https://paperswithcode.com/paper/unos-unified-unsupervised-optical-flow-and |
Repo | https://github.com/baidu-research/UnDepthflow |
Framework | tf |
Rethinking Complex Neural Network Architectures for Document Classification
Title | Rethinking Complex Neural Network Architectures for Document Classification |
Authors | Ashutosh Adhikari, Achyudh Ram, Raphael Tang, Jimmy Lin |
Abstract | Neural network models for many NLP tasks have grown increasingly complex in recent years, making training and deployment more difficult. A number of recent papers have questioned the necessity of such architectures and found that well-executed, simpler models are quite effective. We show that this is also the case for document classification: in a large-scale reproducibility study of several recent neural models, we find that a simple BiLSTM architecture with appropriate regularization yields accuracy and F1 that are either competitive or exceed the state of the art on four standard benchmark datasets. Surprisingly, our simple model is able to achieve these results without attention mechanisms. While these regularization techniques, borrowed from language modeling, are not novel, to our knowledge we are the first to apply them in this context. Our work provides an open-source platform and the foundation for future work in document classification. |
Tasks | Document Classification, Language Modelling |
Published | 2019-06-01 |
URL | https://www.aclweb.org/anthology/N19-1408/ |
https://www.aclweb.org/anthology/N19-1408 | |
PWC | https://paperswithcode.com/paper/rethinking-complex-neural-network |
Repo | https://github.com/castorini/hedwig |
Framework | pytorch |
Speak up, Fight Back! Detection of Social Media Disclosures of Sexual Harassment
Title | Speak up, Fight Back! Detection of Social Media Disclosures of Sexual Harassment |
Authors | Arijit Ghosh Chowdhury, Ramit Sawhney, Puneet Mathur, Debanjan Mahata, Rajiv Ratn Shah |
Abstract | The {#}MeToo movement is an ongoing prevalent phenomenon on social media aiming to demonstrate the frequency and widespread of sexual harassment by providing a platform to speak narrate personal experiences of such harassment. The aggregation and analysis of such disclosures pave the way to development of technology-based prevention of sexual harassment. We contend that the lack of specificity in generic sentence classification models may not be the best way to tackle text subtleties that intrinsically prevail in a classification task as complex as identifying disclosures of sexual harassment. We propose the Disclosure Language Model, a three part ULMFiT architecture, consisting of a Language model, a Medium-Specific (Twitter) model and a Task-Specific classifier to tackle this problem and create a manually annotated real-world dataset to test our technique on this, to show that using a Discourse Language Model often yields better classification performance over (i) Generic deep learning based sentence classification models (ii) existing models that rely on handcrafted stylistic features. An extensive comparison with state-of-the-art generic and specific models along with a detailed error analysis presents the case for our proposed methodology. |
Tasks | Language Modelling, Sentence Classification |
Published | 2019-06-01 |
URL | https://www.aclweb.org/anthology/N19-3018/ |
https://www.aclweb.org/anthology/N19-3018 | |
PWC | https://paperswithcode.com/paper/speak-up-fight-back-detection-of-social-media |
Repo | https://github.com/ramitsawhney27/NAACLSRW19meToo |
Framework | none |
Offline Contextual Bayesian Optimization
Title | Offline Contextual Bayesian Optimization |
Authors | Ian Char, Youngseog Chung, Willie Neiswanger, Kirthevasan Kandasamy, Oak Nelson, Mark Boyer, Egemen Kolemen |
Abstract | In black-box optimization, an agent repeatedly chooses a configuration to test, so as to find an optimal configuration. In many practical problems of interest, one would like to optimize several systems, or tasks'', simultaneously; however, in most of these scenarios the current task is determined by nature. In this work, we explore the offline’’ case in which one is able to bypass nature and choose the next task to evaluate (e.g. via a simulator). Because some tasks may be easier to optimize and others may be more critical, it is crucial to leverage algorithms that not only consider which configurations to try next, but also which tasks to make evaluations for. In this work, we describe a theoretically grounded Bayesian optimization method to tackle this problem. We also demonstrate that if the model of the reward structure does a poor job of capturing variation in difficulty between tasks, then algorithms that actively pick tasks for evaluation may end up doing more harm than good. Following this, we show how our approach can be used for real world applications in science and engineering, including optimizing tokamak controls for nuclear fusion. |
Tasks | |
Published | 2019-12-01 |
URL | http://papers.nips.cc/paper/8711-offline-contextual-bayesian-optimization |
http://papers.nips.cc/paper/8711-offline-contextual-bayesian-optimization.pdf | |
PWC | https://paperswithcode.com/paper/offline-contextual-bayesian-optimization |
Repo | https://github.com/fusion-ml/OCBO |
Framework | none |