October 15, 2019

2075 words 10 mins read

Paper Group NANR 215

Meaningless yet meaningful: Morphology grounded subword-level NMT. Learning multiview embeddings for assessing dementia. Integrating Predictions from Neural-Network Relation Classifiers into Coreference and Bridging Resolution. The Word Analogy Testing Caveat. Towards Bridging Resolution in German: Data Analysis and Rule-based Experiments. Sisyphus …

Meaningless yet meaningful: Morphology grounded subword-level NMT


Title	Meaningless yet meaningful: Morphology grounded subword-level NMT
Authors	Tamali Banerjee, Pushpak Bhattacharyya
Abstract	We explore the use of two independent subsystems Byte Pair Encoding (BPE) and Morfessor as basic units for subword-level neural machine translation (NMT). We show that, for linguistically distant language-pairs Morfessor-based segmentation algorithm produces significantly better quality translation than BPE. However, for close language-pairs BPE-based subword-NMT may translate better than Morfessor-based subword-NMT. We propose a combined approach of these two segmentation algorithms Morfessor-BPE (M-BPE) which outperforms these two baseline systems in terms of BLEU score. Our results are supported by experiments on three language-pairs: English-Hindi, Bengali-Hindi and English-Bengali.
Tasks	Machine Translation, Transliteration
Published	2018-06-01
URL	https://www.aclweb.org/anthology/W18-1207/
PDF	https://www.aclweb.org/anthology/W18-1207
PWC	https://paperswithcode.com/paper/meaningless-yet-meaningful-morphology
Repo
Framework

Learning multiview embeddings for assessing dementia


Title	Learning multiview embeddings for assessing dementia
Authors	Chlo{'e} Pou-Prom, Frank Rudzicz
Abstract	As the incidence of Alzheimer{'}s Disease (AD) increases, early detection becomes crucial. Unfortunately, datasets for AD assessment are often sparse and incomplete. In this work, we leverage the multiview nature of a small AD dataset, DementiaBank, to learn an embedding that captures different modes of cognitive impairment. We apply generalized canonical correlation analysis (GCCA) to our dataset and demonstrate the added benefit of using multiview embeddings in two downstream tasks: identifying AD and predicting clinical scores. By including multiview embeddings, we obtain an F1 score of 0.82 in the classification task and a mean absolute error of 3.42 in the regression task. Furthermore, we show that multiview embeddings can be obtained from other datasets as well.
Tasks
Published	2018-10-01
URL	https://www.aclweb.org/anthology/D18-1304/
PDF	https://www.aclweb.org/anthology/D18-1304
PWC	https://paperswithcode.com/paper/learning-multiview-embeddings-for-assessing
Repo
Framework

Integrating Predictions from Neural-Network Relation Classifiers into Coreference and Bridging Resolution


Title	Integrating Predictions from Neural-Network Relation Classifiers into Coreference and Bridging Resolution
Authors	Ina Roesiger, Maximilian K{"o}per, Kim Anh Nguyen, Sabine Schulte im Walde
Abstract	Cases of coreference and bridging resolution often require knowledge about semantic relations between anaphors and antecedents. We suggest state-of-the-art neural-network classifiers trained on relation benchmarks to predict and integrate likelihoods for relations. Two experiments with representations differing in noise and complexity improve our bridging but not our coreference resolver.
Tasks	Coreference Resolution
Published	2018-06-01
URL	https://www.aclweb.org/anthology/W18-0705/
PDF	https://www.aclweb.org/anthology/W18-0705
PWC	https://paperswithcode.com/paper/integrating-predictions-from-neural-network
Repo
Framework

The Word Analogy Testing Caveat


Title	The Word Analogy Testing Caveat
Authors	Natalie Schluter
Abstract	There are some important problems in the evaluation of word embeddings using standard word analogy tests. In particular, in virtue of the assumptions made by systems generating the embeddings, these remain tests over randomness. We show that even supposing there were such word analogy regularities that should be detected in the word embeddings obtained via unsupervised means, standard word analogy test implementation practices provide distorted or contrived results. We raise concerns regarding the use of Principal Component Analysis to 2 or 3 dimensions as a provision of visual evidence for the existence of word analogy relations in embeddings. Finally, we propose some solutions to these problems.
Tasks	Semantic Textual Similarity, Transfer Learning, Word Embeddings
Published	2018-06-01
URL	https://www.aclweb.org/anthology/N18-2039/
PDF	https://www.aclweb.org/anthology/N18-2039
PWC	https://paperswithcode.com/paper/the-word-analogy-testing-caveat
Repo
Framework

Towards Bridging Resolution in German: Data Analysis and Rule-based Experiments


Title	Towards Bridging Resolution in German: Data Analysis and Rule-based Experiments
Authors	Janis Pagel, Ina Roesiger
Abstract	Bridging resolution is the task of recognising bridging anaphors and linking them to their antecedents. While there is some work on bridging resolution for English, there is only little work for German. We present two datasets which contain bridging annotations, namely DIRNDL and GRAIN, and compare the performance of a rule-based system with a simple baseline approach on these two corpora. The performance for full bridging resolution ranges between an F1 score of 13.6{%} for DIRNDL and 11.8{%} for GRAIN. An analysis using oracle lists suggests that the system could, to a certain extent, benefit from ranking and re-ranking antecedent candidates. Furthermore, we investigate the importance of single features and show that the features used in our work seem promising for future bridging resolution approaches.
Tasks
Published	2018-06-01
URL	https://www.aclweb.org/anthology/W18-0706/
PDF	https://www.aclweb.org/anthology/W18-0706
PWC	https://paperswithcode.com/paper/towards-bridging-resolution-in-german-data
Repo
Framework

Sisyphus, a Workflow Manager Designed for Machine Translation and Automatic Speech Recognition


Title	Sisyphus, a Workflow Manager Designed for Machine Translation and Automatic Speech Recognition
Authors	Jan-Thorsten Peter, Eugen Beck, Hermann Ney
Abstract	Training and testing many possible parameters or model architectures of state-of-the-art machine translation or automatic speech recognition system is a cumbersome task. They usually require a long pipeline of commands reaching from pre-processing the training data to post-processing and evaluating the output.
Tasks	Machine Translation, Speech Recognition
Published	2018-11-01
URL	https://www.aclweb.org/anthology/D18-2015/
PDF	https://www.aclweb.org/anthology/D18-2015
PWC	https://paperswithcode.com/paper/sisyphus-a-workflow-manager-designed-for
Repo
Framework

Deep Domain Generalization via Conditional Invariant Adversarial Networks


Title	Deep Domain Generalization via Conditional Invariant Adversarial Networks
Authors	Ya Li, Xinmei Tian, Mingming Gong, Yajing Liu, Tongliang Liu, Kun Zhang, Dacheng Tao
Abstract	Domain generalization aims to learn a classification model from multiple source domains and generalize it to unseen target domains. A critical problem in domain generalization involves learning domain-invariant representations. Let $X$ and $Y$ denote the features and the labels, respectively. Under the assumption that the conditional distribution $P(YX)$ remains unchanged across domains, earlier approaches to domain generalization learned the invariant representation $T(X)$ by minimizing the discrepancy of the marginal distribution $P(T(X))$. However, such an assumption of stable $P(YX)$ does not necessarily hold in practice. In addition, the representation learning function $T(X)$ is usually constrained to a simple linear transformation or shallow networks. To address the above two drawbacks, we propose an end-to-end conditional invariant deep domain generalization approach by leveraging deep neural networks for domain-invariant representation learning. The domain-invariance property is guaranteed through a conditional invariant adversarial network that can learn domain-invariant representations w.r.t. the joint distribution $P(T(X),Y)$ if the target domain data are not severely class unbalanced. We perform various experiments to demonstrate the effectiveness of the proposed method.
Tasks	Domain Generalization, Representation Learning
Published	2018-09-01
URL	http://openaccess.thecvf.com/content_ECCV_2018/html/Ya_Li_Deep_Domain_Generalization_ECCV_2018_paper.html
PDF	http://openaccess.thecvf.com/content_ECCV_2018/papers/Ya_Li_Deep_Domain_Generalization_ECCV_2018_paper.pdf
PWC	https://paperswithcode.com/paper/deep-domain-generalization-via-conditional
Repo
Framework

Indra: A Word Embedding and Semantic Relatedness Server


Title	Indra: A Word Embedding and Semantic Relatedness Server
Authors	Juliano Efson Sales, Leonardo Souza, Siamak Barzegar, Brian Davis, Andr{'e} Freitas, H, Siegfried schuh
Abstract
Tasks	Semantic Textual Similarity
Published	2018-05-01
URL	https://www.aclweb.org/anthology/L18-1211/
PDF	https://www.aclweb.org/anthology/L18-1211
PWC	https://paperswithcode.com/paper/indra-a-word-embedding-and-semantic
Repo
Framework

A Corpus of Drug Usage Guidelines Annotated with Type of Advice


Title	A Corpus of Drug Usage Guidelines Annotated with Type of Advice
Authors	Sarah Masud Preum, Md. Rizwan Parvez, Kai-Wei Chang, John Stankovic
Abstract
Tasks
Published	2018-05-01
URL	https://www.aclweb.org/anthology/L18-1190/
PDF	https://www.aclweb.org/anthology/L18-1190
PWC	https://paperswithcode.com/paper/a-corpus-of-drug-usage-guidelines-annotated
Repo
Framework

Mesoscopic Facial Geometry Inference Using Deep Neural Networks


Title	Mesoscopic Facial Geometry Inference Using Deep Neural Networks
Authors	Loc Huynh, Weikai Chen, Shunsuke Saito, Jun Xing, Koki Nagano, Andrew Jones, Paul Debevec, Hao Li
Abstract	We present a learning-based approach for synthesizing facial geometry at medium and fine scales from diffusely-lit facial texture maps. When applied to an image sequence, the synthesized detail is temporally coherent. Unlike current state-of-the-art methods, which assume “dark is deep”, our model is trained with measured facial detail collected using polarized gradient illumination in a Light Stage. This enables us to produce plausible facial detail across the entire face, including where previous approaches may incorrectly interpret dark features as concavities such as at moles, hair stubble, and occluded pores. Instead of directly inferring 3D geometry, we propose to encode fine details in high-resolution displacement maps which are learned through a hybrid network adopting the state-of-the-art image-to-image translation network and super resolution network. To effectively capture geometric detail at both mid- and high frequencies, we factorize the learning into two separate sub-networks, enabling the full range of facial detail to be modeled. Results from our learning-based approach compare favorably with a high-quality active facial scanning technique, and require only a single passive lighting condition without a complex scanning setup.
Tasks	Image-to-Image Translation, Super-Resolution
Published	2018-06-01
URL	http://openaccess.thecvf.com/content_cvpr_2018/html/Huynh_Mesoscopic_Facial_Geometry_CVPR_2018_paper.html
PDF	http://openaccess.thecvf.com/content_cvpr_2018/papers/Huynh_Mesoscopic_Facial_Geometry_CVPR_2018_paper.pdf
PWC	https://paperswithcode.com/paper/mesoscopic-facial-geometry-inference-using
Repo
Framework

Variable Ring Light Imaging: Capturing Transient Subsurface Scattering with An Ordinary Camera


Title	Variable Ring Light Imaging: Capturing Transient Subsurface Scattering with An Ordinary Camera
Authors	Ko Nishino, Art Subpa-asa, Yuta Asano, Mihoko Shimano, Imari Sato
Abstract	Subsurface scattering plays a significant role in determining the appearance of real-world surfaces. A light ray penetrating into the subsurface is repeatedly scattered and absorbed by particles along its path before reemerging from the outer interface, which determines its spectral radiance. We introduce a novel imaging method that enables the decomposition of the appearance of a fronto-parallel real-world surface into images of light with bounded path lengths, i.e., transient subsurface light transport. Our key idea is to observe each surface point under a variable ring light: a circular illumination pattern of increasingly larger radius centered on it. We show that the path length of light captured in each of these observations is naturally lower-bounded by the ring light radius. By taking the difference of ring light images of incrementally larger radii, we compute transient images that encode light with bounded path lengths. Experimental results on synthetic and complex real-world surfaces demonstrate that the recovered transient images reveal the subsurface structure of general translucent inhomogeneous surfaces. We further show that their differences reveal the surface colors at different surface depths. The proposed method is the first to enable the unveiling of dense and continuous subsurface structures from steady-state external appearance using ordinary camera and illumination.
Tasks
Published	2018-09-01
URL	http://openaccess.thecvf.com/content_ECCV_2018/html/Ko_Nishino_Variable_Ring_Light_ECCV_2018_paper.html
PDF	http://openaccess.thecvf.com/content_ECCV_2018/papers/Ko_Nishino_Variable_Ring_Light_ECCV_2018_paper.pdf
PWC	https://paperswithcode.com/paper/variable-ring-light-imaging-capturing
Repo
Framework

Parsivar: A Language Processing Toolkit for Persian


Title	Parsivar: A Language Processing Toolkit for Persian
Authors	Salar Mohtaj, Behnam Roshanfekr, Atefeh Zafarian, Habibollah Asghari
Abstract
Tasks	Morphological Analysis, Tokenization
Published	2018-05-01
URL	https://www.aclweb.org/anthology/L18-1179/
PDF	https://www.aclweb.org/anthology/L18-1179
PWC	https://paperswithcode.com/paper/parsivar-a-language-processing-toolkit-for
Repo
Framework

QUD-Based Annotation of Discourse Structure and Information Structure: Tool and Evaluation


Title	QUD-Based Annotation of Discourse Structure and Information Structure: Tool and Evaluation
Authors	Kordula De Kuthy, Nils Reiter, Arndt Riester
Abstract
Tasks
Published	2018-05-01
URL	https://www.aclweb.org/anthology/L18-1304/
PDF	https://www.aclweb.org/anthology/L18-1304
PWC	https://paperswithcode.com/paper/qud-based-annotation-of-discourse-structure
Repo
Framework

Automatically Extracting Qualia Relations for the Rich Event Ontology


Title	Automatically Extracting Qualia Relations for the Rich Event Ontology
Authors	Ghazaleh Kazeminejad, Claire Bonial, Susan Windisch Brown, Martha Palmer
Abstract	Commonsense, real-world knowledge about the events that entities or {``}things in the world{''} are typically involved in, as well as part-whole relationships, is valuable for allowing computational systems to draw everyday inferences about the world. Here, we focus on automatically extracting information about (1) the events that typically bring about certain entities (origins), (2) the events that are the typical functions of entities, and (3) part-whole relationships in entities. These correspond to the agentive, telic and constitutive qualia central to the Generative Lexicon. We describe our motivations and methods for extracting these qualia relations from the Suggested Upper Merged Ontology (SUMO) and show that human annotators overwhelmingly find the information extracted to be reasonable. Because ontologies provide a way of structuring this information and making it accessible to agents and computational systems generally, efforts are underway to incorporate the extracted information to an ontology hub of Natural Language Processing semantic role labeling resources, the Rich Event Ontology. \|
Tasks	Semantic Role Labeling
Published	2018-08-01
URL	https://www.aclweb.org/anthology/C18-1224/
PDF	https://www.aclweb.org/anthology/C18-1224
PWC	https://paperswithcode.com/paper/automatically-extracting-qualia-relations-for
Repo
Framework

A dataset for identifying actionable feedback in collaborative software development


Title	A dataset for identifying actionable feedback in collaborative software development
Authors	Benjamin S. Meyers, Nuthan Munaiah, Emily Prud{'}hommeaux, Andrew Meneely, Josephine Wolff, Cecilia Ovesdotter Alm, Pradeep Murukannaiah
Abstract	Software developers and testers have long struggled with how to elicit proactive responses from their coworkers when reviewing code for security vulnerabilities and errors. For a code review to be successful, it must not only identify potential problems but also elicit an active response from the colleague responsible for modifying the code. To understand the factors that contribute to this outcome, we analyze a novel dataset of more than one million code reviews for the Google Chromium project, from which we extract linguistic features of feedback that elicited responsive actions from coworkers. Using a manually-labeled subset of reviewer comments, we trained a highly accurate classifier to identify acted-upon comments (AUC = 0.85). Our results demonstrate the utility of our dataset, the feasibility of using NLP for this new task, and the potential of NLP to improve our understanding of how communications between colleagues can be authored to elicit positive, proactive responses.
Tasks
Published	2018-07-01
URL	https://www.aclweb.org/anthology/P18-2021/
PDF	https://www.aclweb.org/anthology/P18-2021
PWC	https://paperswithcode.com/paper/a-dataset-for-identifying-actionable-feedback
Repo
Framework