January 25, 2020

2263 words 11 mins read

Paper Group NANR 35

A Machine Learning Approach for Identifying Compound Words from a Sanskrit Text. Reflexives in Czech from a Dependency Perspective. hpGAT: High-order Proximity Informed Graph Attention Network. Experiments on human incremental parsing. Amharic Question Answering for Biography, Definition, and Description Questions. On reducing translation shifts in …

A Machine Learning Approach for Identifying Compound Words from a Sanskrit Text


Title	A Machine Learning Approach for Identifying Compound Words from a Sanskrit Text
Authors	Premjith B, Ch V, ni Ch, ran, Shriganesh Bhat, Soman Kp, Prabaharan P
Abstract
Tasks
Published	2019-10-01
URL	https://www.aclweb.org/anthology/W19-7504/
PDF	https://www.aclweb.org/anthology/W19-7504
PWC	https://paperswithcode.com/paper/a-machine-learning-approach-for-identifying-1
Repo
Framework

Reflexives in Czech from a Dependency Perspective


Title	Reflexives in Czech from a Dependency Perspective
Authors	Vaclava Kettnerova, Marketa Lopatkova
Abstract
Tasks
Published	2019-08-01
URL	https://www.aclweb.org/anthology/W19-7704/
PDF	https://www.aclweb.org/anthology/W19-7704
PWC	https://paperswithcode.com/paper/reflexives-in-czech-from-a-dependency
Repo
Framework

hpGAT: High-order Proximity Informed Graph Attention Network


Title	hpGAT: High-order Proximity Informed Graph Attention Network
Authors	Zhining Liu, Weiyi Liu, Pin-yu Chen, Chenyi Zhuang, Chengyun Song
Abstract	Graph neural networks (GNNs) have recently made remarkable breakthroughs in the paradigm of learning with graph-structured data. However, most existing GNNs limit the receptive field of the node on each layer to its connected (one-hop) neighbors, which disregards the fact that large receptive field has been proven to be a critical factor in state-of-the-art neural networks. In this paper, we propose a novel approach to appropriately define a variable receptive field for GNNs by incorporating high-order proximity information extracted from the hierarchical topological structure of the input graph. Specifically, multiscale groups obtained from trainable hierarchical semi-nonnegative matrix factorization are used for adjusting the weights when aggregating one-hop neighbors. Integrated with the graph attention mechanism on attributes of neighboring nodes, the learnable parameters within the process of aggregation are optimized in an end-to-end manner. Extensive experiments show that the proposed method (hpGAT) outperforms state-of-the-art methods and demonstrate the importance of exploiting high-order proximity in handling noisy information of local neighborhood.
Tasks	Node Classification
Published	2019-08-28
URL	https://doi.org/10.1109/ACCESS.2019.2938039
PDF	https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8818141
PWC	https://paperswithcode.com/paper/hpgat-high-order-proximity-informed-graph
Repo
Framework

Experiments on human incremental parsing


Title	Experiments on human incremental parsing
Authors	Leonid Mityushin, Leonid Iomdin
Abstract
Tasks
Published	2019-08-01
URL	https://www.aclweb.org/anthology/W19-7725/
PDF	https://www.aclweb.org/anthology/W19-7725
PWC	https://paperswithcode.com/paper/experiments-on-human-incremental-parsing
Repo
Framework

Amharic Question Answering for Biography, Definition, and Description Questions


Title	Amharic Question Answering for Biography, Definition, and Description Questions
Authors	Tilahun Abedissa Taffa, Mulugeta Libsie
Abstract	A broad range of information needs can often be stated as a question. Question Answering (QA) systems attempt to provide users concise answer(s) to natural language questions. The existing Amharic QA systems handle fact-based questions that usually take named entities as an answer. To deal with more complex information needs we developed an Amharic non-factoid QA for biography, definition, and description questions. A hybrid approach has been used for the question classification. For document filtering and answer extraction we have used lexical patterns. On the other hand to answer biography questions we have used a summarizer and the generated summary is validated using a text classifier. Our QA system is evaluated and has shown a promising result.
Tasks	Question Answering
Published	2019-08-01
URL	https://www.aclweb.org/anthology/papers/W/W19/W19-3635/
PDF	https://www.aclweb.org/anthology/W19-3635
PWC	https://paperswithcode.com/paper/amharic-question-answering-for-biography
Repo
Framework

On reducing translation shifts in translations intended for MT evaluation


Title	On reducing translation shifts in translations intended for MT evaluation
Authors	Maja Popovic
Abstract
Tasks
Published	2019-08-01
URL	https://www.aclweb.org/anthology/W19-6712/
PDF	https://www.aclweb.org/anthology/W19-6712
PWC	https://paperswithcode.com/paper/on-reducing-translation-shifts-in
Repo
Framework

Memory Grounded Conversational Reasoning


Title	Memory Grounded Conversational Reasoning
Authors	Seungwhan Moon, Pararth Shah, Rajen Subba, Anuj Kumar
Abstract	We demonstrate a conversational system which engages the user through a multi-modal, multi-turn dialog over the user{'}s memories. The system can perform QA over memories by responding to user queries to recall specific attributes and associated media (e.g. photos) of past episodic memories. The system can also make proactive suggestions to surface related events or facts from past memories to make conversations more engaging and natural. To implement such a system, we collect a new corpus of memory grounded conversations, which comprises human-to-human role-playing dialogs given synthetic memory graphs with simulated attributes. Our proof-of-concept system operates on these synthetic memory graphs, however it can be trained and applied to real-world user memory data (e.g. photo albums, etc.) We present the architecture of the proposed conversational system, and example queries that the system supports.
Tasks
Published	2019-11-01
URL	https://www.aclweb.org/anthology/D19-3025/
PDF	https://www.aclweb.org/anthology/D19-3025
PWC	https://paperswithcode.com/paper/memory-grounded-conversational-reasoning
Repo
Framework

Unsupervised Learning of Discourse-Aware Text Representation for Essay Scoring


Title	Unsupervised Learning of Discourse-Aware Text Representation for Essay Scoring
Authors	Farjana Sultana Mim, Naoya Inoue, Paul Reisert, Hiroki Ouchi, Kentaro Inui
Abstract	Existing document embedding approaches mainly focus on capturing sequences of words in documents. However, some document classification and regression tasks such as essay scoring need to consider discourse structure of documents. Although some prior approaches consider this issue and utilize discourse structure of text for document classification, these approaches are dependent on computationally expensive parsers. In this paper, we propose an unsupervised approach to capture discourse structure in terms of coherence and cohesion for document embedding that does not require any expensive parser or annotation. Extrinsic evaluation results show that the document representation obtained from our approach improves the performance of essay Organization scoring and Argument Strength scoring.
Tasks	Document Classification, Document Embedding
Published	2019-07-01
URL	https://www.aclweb.org/anthology/P19-2053/
PDF	https://www.aclweb.org/anthology/P19-2053
PWC	https://paperswithcode.com/paper/unsupervised-learning-of-discourse-aware-text
Repo
Framework

Using Meta-Morph Rules to develop Morphological Analysers: A case study concerning Tamil


Title	Using Meta-Morph Rules to develop Morphological Analysers: A case study concerning Tamil
Authors	Kengatharaiyer Sarveswaran, Gihan Dias, Miriam Butt
Abstract	This paper describes a new and larger coverage Finite-State Morphological Analyser (FSM) and Generator for the Dravidian language Tamil. The FSM has been developed in the context of computational grammar engineering, adhering to the standards of the ParGram effort. Tamil is a morphologically rich language and the interaction between linguistic analysis and formal implementation is complex, resulting in a challenging task. In order to allow the development of the FSM to focus more on the linguistic analysis and less on the formal details, we have developed a system of meta-morph(ology) rules along with a script which translates these rules into FSM processable representations. The introduction of meta-morph rules makes it possible for computationally naive linguists to interact with the system and to expand it in future work. We found that the meta-morph rules help to express linguistic generalisations and reduce the manual effort of writing lexical classes for morphological analysis. Our Tamil FSM currently handles mainly the inflectional morphology of 3,300 verb roots and their 260 forms. Further, it also has a lexicon of approximately 100,000 nouns along with a guesser to handle out-of-vocabulary items. Although the Tamil FSM was primarily developed to be part of a computational grammar, it can also be used as a web or stand-alone application for other NLP tasks, as per general ParGram practice.
Tasks	Morphological Analysis
Published	2019-09-01
URL	https://www.aclweb.org/anthology/W19-3111/
PDF	https://www.aclweb.org/anthology/W19-3111
PWC	https://paperswithcode.com/paper/using-meta-morph-rules-to-develop
Repo
Framework

MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input


Title	MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input
Authors	Jun Hao Liew, Scott Cohen, Brian Price, Long Mai, Sim-Heng Ong, Jiashi Feng
Abstract	Existing deep learning-based interactive image segmentation approaches typically assume the target-of-interest is always a single object and fail to account for the potential diversity in user expectations, thus requiring excessive user input when it comes to segmenting an object part or a group of objects instead. Motivated by the observation that the object part, full object, and a collection of objects essentially differ in size, we propose a new concept called scale-diversity, which characterizes the spectrum of segmentations w.r.t. different scales. To address this, we present MultiSeg, a scale-diverse interactive image segmentation network that incorporates a set of two-dimensional scale priors into the model to generate a set of scale-varying proposals that conform to the user input. We explicitly encourage segmentation diversity during training by synthesizing diverse training samples for a given image. As a result, our method allows the user to quickly locate the closest segmentation target for further refinement if necessary. Despite its simplicity, experimental results demonstrate that our proposed model is capable of quickly producing diverse yet plausible segmentation outputs, reducing the user interaction required, especially in cases where many types of segmentations (object parts or groups) are expected.
Tasks	Semantic Segmentation
Published	2019-10-01
URL	http://openaccess.thecvf.com/content_ICCV_2019/html/Liew_MultiSeg_Semantically_Meaningful_Scale-Diverse_Segmentations_From_Minimal_User_Input_ICCV_2019_paper.html
PDF	http://openaccess.thecvf.com/content_ICCV_2019/papers/Liew_MultiSeg_Semantically_Meaningful_Scale-Diverse_Segmentations_From_Minimal_User_Input_ICCV_2019_paper.pdf
PWC	https://paperswithcode.com/paper/multiseg-semantically-meaningful-scale
Repo
Framework

The Private Income Tax Shock Premium


Title	The Private Income Tax Shock Premium
Authors	Zornitsa Todorova
Abstract	This paper investigates the asset pricing implications of tax policy changes. News about tax cuts decreases future tax revenues and increases future consumer demand and output. Using cross-sectional variation in industry exposure to structurally identified tax news, I develop a factor mimicking private income tax shocks. I construct an investment strategy, which generates annualized risk-adjusted returns of 5.16 % over the Fama-French 3-factor model. I rationalize the finding by arguing that firms with more elastic demands bear higher consumption risk, which works through a wealth effect.
Tasks
Published	2019-07-30
URL	https://ijbassnet.com/publication/248/details
PDF	https://ijbassnet.com/storage/app/publications/5d40181c3bf5911564481564.pdf
PWC	https://paperswithcode.com/paper/the-private-income-tax-shock-premium
Repo
Framework

Phonetic Normalization for Machine Translation of User Generated Content


Title	Phonetic Normalization for Machine Translation of User Generated Content
Authors	Jos{'e} Carlos Rosales N{'u}{~n}ez, Djam{'e} Seddah, Guillaume Wisniewski
Abstract	We present an approach to correct noisy User Generated Content (UGC) in French aiming to produce a pretreatement pipeline to improve Machine Translation for this kind of non-canonical corpora. In order to do so, we have implemented a character-based neural model phonetizer to produce IPA pronunciations of words. In this way, we intend to correct grammar, vocabulary and accentuation errors often present in noisy UGC corpora. Our method leverages on the fact that some errors are due to confusion induced by words with similar pronunciation which can be corrected using a phonetic look-up table to produce normalization candidates. These potential corrections are then encoded in a lattice and ranked using a language model to output the most probable corrected phrase. Compare to using other phonetizers, our method boosts a transformer-based machine translation system on UGC.
Tasks	Language Modelling, Machine Translation
Published	2019-11-01
URL	https://www.aclweb.org/anthology/D19-5553/
PDF	https://www.aclweb.org/anthology/D19-5553
PWC	https://paperswithcode.com/paper/phonetic-normalization-for-machine
Repo
Framework

Grounded Word Sense Translation


Title	Grounded Word Sense Translation
Authors	Chiraag Lala, Pranava Madhyastha, Lucia Specia
Abstract	Recent work on visually grounded language learning has focused on broader applications of grounded representations, such as visual question answering and multimodal machine translation. In this paper we consider grounded word sense translation, i.e. the task of correctly translating an ambiguous source word given the corresponding textual and visual context. Our main objective is to investigate the extent to which images help improve word-level (lexical) translation quality. We do so by first studying the dataset for this task to understand the scope and challenges of the task. We then explore different data settings, image features, and ways of grounding to investigate the gain from using images in each of the combinations. We find that grounding on the image is specially beneficial in weaker unidirectional recurrent translation models. We observe that adding structured image information leads to stronger gains in lexical translation accuracy.
Tasks	Machine Translation, Multimodal Machine Translation, Question Answering, Visual Question Answering
Published	2019-06-01
URL	https://www.aclweb.org/anthology/W19-1808/
PDF	https://www.aclweb.org/anthology/W19-1808
PWC	https://paperswithcode.com/paper/grounded-word-sense-translation
Repo
Framework

HITSZ-ICRC: A Report for SMM4H Shared Task 2019-Automatic Classification and Extraction of Adverse Effect Mentions in Tweets


Title	HITSZ-ICRC: A Report for SMM4H Shared Task 2019-Automatic Classification and Extraction of Adverse Effect Mentions in Tweets
Authors	Shuai Chen, Yuanhang Huang, Xiaowei Huang, Haoming Qin, Jun Yan, Buzhou Tang
Abstract	This is the system description of the Harbin Institute of Technology Shenzhen (HITSZ) team for the first and second subtasks of the fourth Social Media Mining for Health Applications (SMM4H) shared task in 2019. The two subtasks are automatic classification and extraction of adverse effect mentions in tweets. The systems for the two subtasks are based on bidirectional encoder representations from transformers (BERT), and achieves promising results. Among the systems we developed for subtask1, the best F1-score was 0.6457, for subtask2, the best relaxed F1-score and the best strict F1-score were 0.614 and 0.407 respectively. Our system ranks first among all systems on subtask1.
Tasks
Published	2019-08-01
URL	https://www.aclweb.org/anthology/W19-3206/
PDF	https://www.aclweb.org/anthology/W19-3206
PWC	https://paperswithcode.com/paper/hitsz-icrc-a-report-for-smm4h-shared-task
Repo
Framework

KFU NLP Team at SMM4H 2019 Tasks: Want to Extract Adverse Drugs Reactions from Tweets? BERT to The Rescue


Title	KFU NLP Team at SMM4H 2019 Tasks: Want to Extract Adverse Drugs Reactions from Tweets? BERT to The Rescue
Authors	Zulfat Miftahutdinov, Ilseyar Alimova, Elena Tutubalina
Abstract	This paper describes a system developed for the Social Media Mining for Health (SMM4H) 2019 shared tasks. Specifically, we participated in three tasks. The goals of the first two tasks are to classify whether a tweet contains mentions of adverse drug reactions (ADR) and extract these mentions, respectively. The objective of the third task is to build an end-to-end solution: first, detect ADR mentions and then map these entities to concepts in a controlled vocabulary. We investigate the use of a language representation model BERT trained to obtain semantic representations of social media texts. Our experiments on a dataset of user reviews showed that BERT is superior to state-of-the-art models based on recurrent neural networks. The BERT-based system for Task 1 obtained an F1 of 57.38{%}, with improvements up to +7.19{%} F1 over a score averaged across all 43 submissions. The ensemble of neural networks with a voting scheme for named entity recognition ranked first among 9 teams at the SMM4H 2019 Task 2 and obtained a relaxed F1 of 65.8{%}. The end-to-end model based on BERT for ADR normalization ranked first at the SMM4H 2019 Task 3 and obtained a relaxed F1 of 43.2{%}.
Tasks	Named Entity Recognition
Published	2019-08-01
URL	https://www.aclweb.org/anthology/W19-3207/
PDF	https://www.aclweb.org/anthology/W19-3207
PWC	https://paperswithcode.com/paper/kfu-nlp-team-at-smm4h-2019-tasks-want-to
Repo
Framework