October 15, 2019

2107 words 10 mins read

Paper Group NANR 113

Fast Query Expansion on an Accounting Corpus using Sub-Word Embeddings. The Best of Both Worlds: Combining CNNs and Geometric Constraints for Hierarchical Motion Segmentation. Very Large-Scale Lexical Resources to Enhance Chinese and Japanese Machine Translation. Inference in Higher Order MRF-MAP Problems With Small and Large Cliques. Parallel Form …

Fast Query Expansion on an Accounting Corpus using Sub-Word Embeddings


Title	Fast Query Expansion on an Accounting Corpus using Sub-Word Embeddings
Authors	Hrishikesh Ganu, Viswa Datha P.
Abstract	We present early results from a system under development which uses sub-word embeddings for query expansion in presence of mis-spelled words and other aberrations. We work for a company which creates accounting software and the end goal is to improve customer experience when they search for help on our {``}Customer Care{''} portal. Our customers use colloquial language, non-standard acronyms and sometimes mis-spell words when they use our Search portal or interact over other channels. However, our Knowledge Base has curated content which leverages technical terms and is in language which is quite formal. This results in the answer not being retrieved even though the answer might actually be present in the documentation (as assessed by a human). We address this problem by creating equivalence classes of words with similar meanings (with the additional property that the mappings to these equivalence classes are robust to mis-spellings) using sub-word embeddings and then use them to fine tune an Elasticsearch index to improve recall. We demonstrate through an end-end system that using sub-word embeddings leads to a significant lift in correct answers retrieved for an accounting corpus available in the public domain. \|
Tasks	Information Retrieval, Lemmatization, Word Embeddings
Published	2018-06-01
URL	https://www.aclweb.org/anthology/W18-1208/
PDF	https://www.aclweb.org/anthology/W18-1208
PWC	https://paperswithcode.com/paper/fast-query-expansion-on-an-accounting-corpus
Repo
Framework

The Best of Both Worlds: Combining CNNs and Geometric Constraints for Hierarchical Motion Segmentation


Title	The Best of Both Worlds: Combining CNNs and Geometric Constraints for Hierarchical Motion Segmentation
Authors	Pia Bideau, Aruni RoyChowdhury, Rakesh R. Menon, Erik Learned-Miller
Abstract	Traditional methods of motion segmentation use powerful geometric constraints to understand motion, but fail to leverage the semantics of high-level image understanding. Modern CNN methods of motion analysis, on the other hand, excel at identifying well-known structures, but may not precisely characterize well-known geometric constraints. In this work, we build a new statistical model of rigid motion flow based on classical perspective projection constraints. We then combine piecewise rigid motions into complex deformable and articulated objects, guided by semantic segmentation from CNNs and a second ``object-level” statistical model. This combination of classical geometric knowledge combined with the pattern recognition abilities of CNNs yields excellent performance on a wide range of motion segmentation benchmarks, from complex geometric scenes to camouflaged animals. \|
Tasks	Motion Segmentation, Semantic Segmentation
Published	2018-06-01
URL	http://openaccess.thecvf.com/content_cvpr_2018/html/Bideau_The_Best_of_CVPR_2018_paper.html
PDF	http://openaccess.thecvf.com/content_cvpr_2018/papers/Bideau_The_Best_of_CVPR_2018_paper.pdf
PWC	https://paperswithcode.com/paper/the-best-of-both-worlds-combining-cnns-and
Repo
Framework

Very Large-Scale Lexical Resources to Enhance Chinese and Japanese Machine Translation


Title	Very Large-Scale Lexical Resources to Enhance Chinese and Japanese Machine Translation
Authors	Jack Halpern
Abstract
Tasks	Lemmatization, Machine Translation
Published	2018-05-01
URL	https://www.aclweb.org/anthology/L18-1137/
PDF	https://www.aclweb.org/anthology/L18-1137
PWC	https://paperswithcode.com/paper/very-large-scale-lexical-resources-to-enhance
Repo
Framework

Inference in Higher Order MRF-MAP Problems With Small and Large Cliques


Title	Inference in Higher Order MRF-MAP Problems With Small and Large Cliques
Authors	Ishant Shanu, Chetan Arora, S.N. Maheshwari
Abstract	Higher Order MRF-MAP formulation has been a popular technique for solving many problems in computer vision. Inference in a general MRF-MAP problem is NP Hard, but can be performed in polynomial time for the special case when potential functions are submodular. Two popular combinatorial approaches for solving such formulations are flow based and polyhedral approaches. Flow based approaches work well with small cliques and in that mode can handle problems with millions of variables. Polyhedral approaches can handle large cliques but in small numbers. We show in this paper that the variables in these seemingly disparate techniques can be mapped to each other. This allows us to combine the two styles in a joint framework exploiting the strength of both of them. Using the proposed joint framework, we are able to perform tractable inference in MRF-MAP problems with millions of variables and a mix of small and large cliques, a formulation which can not be solved by either of the two styles individually. We show applicability of this hybrid framework on object segmentation problem as an example of a situation where quality of results is significantly better than systems which are based only on the use of small or large cliques.
Tasks	Semantic Segmentation
Published	2018-06-01
URL	http://openaccess.thecvf.com/content_cvpr_2018/html/Shanu_Inference_in_Higher_CVPR_2018_paper.html
PDF	http://openaccess.thecvf.com/content_cvpr_2018/papers/Shanu_Inference_in_Higher_CVPR_2018_paper.pdf
PWC	https://paperswithcode.com/paper/inference-in-higher-order-mrf-map-problems
Repo
Framework

Parallel Forms in Estonian Finite State Morphology


Title	Parallel Forms in Estonian Finite State Morphology
Authors	Heiki-Jaan Kaalep
Abstract
Tasks
Published	2018-01-01
URL	https://www.aclweb.org/anthology/W18-0212/
PDF	https://www.aclweb.org/anthology/W18-0212
PWC	https://paperswithcode.com/paper/parallel-forms-in-estonian-finite-state
Repo
Framework

ASR for Documenting Acutely Under-Resourced Indigenous Languages


Title	ASR for Documenting Acutely Under-Resourced Indigenous Languages
Authors	Robbie Jimerson, Emily Prud{'}hommeaux
Abstract
Tasks	Language Modelling, Speech Recognition
Published	2018-05-01
URL	https://www.aclweb.org/anthology/L18-1657/
PDF	https://www.aclweb.org/anthology/L18-1657
PWC	https://paperswithcode.com/paper/asr-for-documenting-acutely-under-resourced
Repo
Framework

An SLA Corpus Annotated with Pedagogically Relevant Grammatical Structures


Title	An SLA Corpus Annotated with Pedagogically Relevant Grammatical Structures
Authors	Leonardo Zilio, Rodrigo Wilkens, C{'e}drick Fairon
Abstract
Tasks
Published	2018-05-01
URL	https://www.aclweb.org/anthology/L18-1650/
PDF	https://www.aclweb.org/anthology/L18-1650
PWC	https://paperswithcode.com/paper/an-sla-corpus-annotated-with-pedagogically
Repo
Framework

Improving Crowdsourcing-Based Annotation of Japanese Discourse Relations


Title	Improving Crowdsourcing-Based Annotation of Japanese Discourse Relations
Authors	Yudai Kishimoto, Shinnosuke Sawada, Yugo Murawaki, Daisuke Kawahara, Sadao Kurohashi
Abstract
Tasks
Published	2018-05-01
URL	https://www.aclweb.org/anthology/L18-1637/
PDF	https://www.aclweb.org/anthology/L18-1637
PWC	https://paperswithcode.com/paper/improving-crowdsourcing-based-annotation-of
Repo
Framework

Point-to-Point Regression PointNet for 3D Hand Pose Estimation


Title	Point-to-Point Regression PointNet for 3D Hand Pose Estimation
Authors	Liuhao Ge, Zhou Ren, Junsong Yuan
Abstract	Convolutional Neural Networks (CNNs)-based methods for 3D hand pose estimation with depth cameras usually take 2D depth images as input and directly regress holistic 3D hand pose. Different from these methods, our proposed Point-to-Point Regression PointNet directly takes the 3D point cloud as input and outputs point-wise estimations, i.e., heat-maps and unit vector fields on the point cloud, representing the closeness and direction from every point in the point cloud to the hand joint. The point-wise estimations are used to infer 3D joint locations with weighted fusion. To better capture 3D spatial information in the point cloud, we apply a stacked network architecture for PointNet with intermediate supervision, which is trained end-to-end. Experiments show that our method can achieve outstanding results when compared with state-of-the-art methods on three challenging hand pose datasets.
Tasks	Hand Pose Estimation, Pose Estimation
Published	2018-09-01
URL	http://openaccess.thecvf.com/content_ECCV_2018/html/Liuhao_Ge_Point-to-Point_Regression_PointNet_ECCV_2018_paper.html
PDF	http://openaccess.thecvf.com/content_ECCV_2018/papers/Liuhao_Ge_Point-to-Point_Regression_PointNet_ECCV_2018_paper.pdf
PWC	https://paperswithcode.com/paper/point-to-point-regression-pointnet-for-3d
Repo
Framework


Title	Not that much power: Linguistic alignment is influenced more by low-level linguistic features rather than social power
Authors	Yang Xu, Jeremy Cole, David Reitter
Abstract	Linguistic alignment between dialogue partners has been claimed to be affected by their relative social power. A common finding has been that interlocutors of higher power tend to receive more alignment than those of lower power. However, these studies overlook some low-level linguistic features that can also affect alignment, which casts doubts on these findings. This work characterizes the effect of power on alignment with logistic regression models in two datasets, finding that the effect vanishes or is reversed after controlling for low-level features such as utterance length. Thus, linguistic alignment is explained better by low-level features than by social power. We argue that a wider range of factors, especially cognitive factors, need to be taken into account for future studies on observational data when social factors of language use are in question.
Tasks
Published	2018-07-01
URL	https://www.aclweb.org/anthology/P18-1056/
PDF	https://www.aclweb.org/anthology/P18-1056
PWC	https://paperswithcode.com/paper/not-that-much-power-linguistic-alignment-is
Repo
Framework


Title	Learning Data Terms for Non-blind Deblurring
Authors	Jiangxin Dong, Jinshan Pan, Deqing Sun, Zhixun Su, Ming-Hsuan Yang
Abstract	Existing deblurring methods mainly focus on developing effective image priors and assume that blurred images contain insignificant amounts of noise. However, state-of-the-art deblurring methods do not perform well on real-world images degraded with significant noise or outliers. To address these issues, we show that it is critical to learn data fitting terms beyond the commonly used L1 or L2 norm. We propose a simple and effective discriminative framework to learn data terms that can adaptively handle blurred images in the presence of severe noise and outliers. Instead of learning the distribution of the data fitting errors, we directly learn the associated shrinkage function for the data term using a cascaded architecture, which is more flexible and efficient. Our analysis shows that the shrinkage functions learned at the intermediate stages can effectively suppress noise and preserve image structures. Extensive experimental results show that the proposed algorithm performs favorably against state-of-the-art methods.
Tasks	Deblurring
Published	2018-09-01
URL	http://openaccess.thecvf.com/content_ECCV_2018/html/Jiangxin_Dong_Learning_Data_Terms_ECCV_2018_paper.html
PDF	http://openaccess.thecvf.com/content_ECCV_2018/papers/Jiangxin_Dong_Learning_Data_Terms_ECCV_2018_paper.pdf
PWC	https://paperswithcode.com/paper/learning-data-terms-for-non-blind-deblurring
Repo
Framework

Scientific Discovery as Link Prediction in Influence and Citation Graphs


Title	Scientific Discovery as Link Prediction in Influence and Citation Graphs
Authors	Fan Luo, Marco A. Valenzuela-Esc{'a}rcega, Gus Hahn-Powell, Mihai Surdeanu
Abstract	We introduce a machine learning approach for the identification of {`}white spaces{''} in scientific knowledge. Our approach addresses this task as link prediction over a graph that contains over 2M influence statements such as {`}CTCF activates FOXA1{''}, which were automatically extracted using open-domain machine reading. We model this prediction task using graph-based features extracted from the above influence graph, as well as from a citation graph that captures scientific communities. We evaluated the proposed approach through backtesting. Although the data is heavily unbalanced (50 times more negative examples than positives), our approach predicts which influence links will be discovered in the {``}near future{''} with a F1 score of 27 points, and a mean average precision of 68{%}. \|
Tasks	Link Prediction, Reading Comprehension
Published	2018-06-01
URL	https://www.aclweb.org/anthology/W18-1701/
PDF	https://www.aclweb.org/anthology/W18-1701
PWC	https://paperswithcode.com/paper/scientific-discovery-as-link-prediction-in
Repo
Framework

Mean Field for the Stochastic Blockmodel: Optimization Landscape and Convergence Issues


Title	Mean Field for the Stochastic Blockmodel: Optimization Landscape and Convergence Issues
Authors	Soumendu Sundar Mukherjee, Purnamrita Sarkar, Y. X. Rachel Wang, Bowei Yan
Abstract	Variational approximation has been widely used in large-scale Bayesian inference recently, the simplest kind of which involves imposing a mean field assumption to approximate complicated latent structures. Despite the computational scalability of mean field, theoretical studies of its loss function surface and the convergence behavior of iterative updates for optimizing the loss are far from complete. In this paper, we focus on the problem of community detection for a simple two-class Stochastic Blockmodel (SBM). Using batch co-ordinate ascent (BCAVI) for updates, we give a complete characterization of all the critical points and show different convergence behaviors with respect to initializations. When the parameters are known, we show a significant proportion of random initializations will converge to ground truth. On the other hand, when the parameters themselves need to be estimated, a random initialization will converge to an uninformative local optimum.
Tasks	Bayesian Inference, Community Detection
Published	2018-12-01
URL	http://papers.nips.cc/paper/8268-mean-field-for-the-stochastic-blockmodel-optimization-landscape-and-convergence-issues
PDF	http://papers.nips.cc/paper/8268-mean-field-for-the-stochastic-blockmodel-optimization-landscape-and-convergence-issues.pdf
PWC	https://paperswithcode.com/paper/mean-field-for-the-stochastic-blockmodel
Repo
Framework

Learning Comment Controversy Prediction in Web Discussions Using Incidentally Supervised Multi-Task CNNs


Title	Learning Comment Controversy Prediction in Web Discussions Using Incidentally Supervised Multi-Task CNNs
Authors	Nils Rethmeier, Marc H{"u}bner, Leonhard Hennig
Abstract	Comments on web news contain controversies that manifest as inter-group agreement-conflicts. Tracking such \textit{rapidly evolving controversy} could ease conflict resolution or journalist-user interaction. However, this presupposes controversy online-prediction that scales to diverse domains using incidental supervision signals instead of manual labeling. To more deeply interpret comment-controversy model decisions we frame prediction as binary classification and evaluate baselines and multi-task CNNs that use an auxiliary news-genre-encoder. Finally, we use ablation and interpretability methods to determine the impacts of topic, discourse and sentiment indicators, contextual vs. global word influence, as well as genre-keywords vs. per-genre-controversy keywords {–} to find that the models learn plausible controversy features using only incidentally supervised signals.
Tasks	Language Modelling
Published	2018-10-01
URL	https://www.aclweb.org/anthology/W18-6246/
PDF	https://www.aclweb.org/anthology/W18-6246
PWC	https://paperswithcode.com/paper/learning-comment-controversy-prediction-in
Repo
Framework

Abbreviation Expander - a Web-based System for Easy Reading of Technical Documents


Title	Abbreviation Expander - a Web-based System for Easy Reading of Technical Documents
Authors	Manuel R. Ciosici, Ira Assent
Abstract	Abbreviations and acronyms are a part of textual communication in most domains. However, abbreviations are not necessarily defined in documents that employ them. Understanding all abbreviations used in a given document often requires extensive knowledge of the target domain and the ability to disambiguate based on context. This creates considerable entry barriers to newcomers and difficulties in automated document processing. Existing abbreviation expansion systems or tools require substantial technical knowledge for set up or make strong assumptions which limit their use in practice. Here, we present Abbreviation Expander, a system that builds on state of the art methods for identification of abbreviations, acronyms and their definitions and a novel disambiguator for abbreviation expansion in an easily accessible web-based solution.
Tasks
Published	2018-08-01
URL	https://www.aclweb.org/anthology/C18-2001/
PDF	https://www.aclweb.org/anthology/C18-2001
PWC	https://paperswithcode.com/paper/abbreviation-expander-a-web-based-system-for
Repo
Framework