May 4, 2019

1869 words 9 mins read

Paper Group NANR 225

Deeper syntax for better semantic parsing. Determining the Multiword Expression Inventory of a Surprise Language. The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.. The Multiple Quantile Graphical Model. Using Graphs of Classifiers to Impose Constraints on Semi-supervised Rela …

Deeper syntax for better semantic parsing


Title	Deeper syntax for better semantic parsing
Authors	Olivier Michalon, Corentin Ribeyre, C, Marie ito, Alexis Nasr
Abstract	Syntax plays an important role in the task of predicting the semantic structure of a sentence. But syntactic phenomena such as alternations, control and raising tend to obfuscate the relation between syntax and semantics. In this paper we predict the semantic structure of a sentence using a deeper syntax than what is usually done. This deep syntactic representation abstracts away from purely syntactic phenomena and proposes a structural organization of the sentence that is closer to the semantic representation. Experiments conducted on a French corpus annotated with semantic frames showed that a semantic parser reaches better performances with such a deep syntactic input.
Tasks	Semantic Parsing, Semantic Role Labeling
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1040/
PDF	https://www.aclweb.org/anthology/C16-1040
PWC	https://paperswithcode.com/paper/deeper-syntax-for-better-semantic-parsing
Repo
Framework

Determining the Multiword Expression Inventory of a Surprise Language


Title	Determining the Multiword Expression Inventory of a Surprise Language
Authors	Bahar Salehi, Paul Cook, Timothy Baldwin
Abstract	Much previous research on multiword expressions (MWEs) has focused on the token- and type-level tasks of MWE identification and extraction, respectively. Such studies typically target known prevalent MWE types in a given language. This paper describes the first attempt to learn the MWE inventory of a {``}surprise{''} language for which we have no explicit prior knowledge of MWE patterns, certainly no annotated MWE data, and not even a parallel corpus. Our proposed model is trained on a treebank with MWE relations of a source language, and can be applied to the monolingual corpus of the surprise language to identify its MWE construction types. \|
Tasks	Machine Translation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1046/
PDF	https://www.aclweb.org/anthology/C16-1046
PWC	https://paperswithcode.com/paper/determining-the-multiword-expression
Repo
Framework

The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.


Title	The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.
Authors	Miguel Matos, Alberto Abad, Ant{'o}nio Serralheiro
Abstract	In this paper, we describe a new corpus -named DIRHA-L2F RealCorpus- composed of typical home automation speech interactions in European Portuguese that has been recorded by the INESC-ID{'}s Spoken Language Systems Laboratory (L2F) to support the activities of the Distant-speech Interaction for Robust Home Applications (DIRHA) EU-funded project. The corpus is a multi-microphone and multi-room database of real continuous audio sequences containing read phonetically rich sentences, read and spontaneous keyword activation sentences, and read and spontaneous home automation commands. The background noise conditions are controlled and randomly recreated with noises typically found in home environments. Experimental validation on this corpus is reported in comparison with the results obtained on a simulated corpus using a fully automated speech processing pipeline for two fundamental automatic speech recognition tasks of typical {`}always-listening{'} home-automation scenarios: system activation and voice command recognition. Attending to results on both corpora, the presence of overlapping voice-like noise is shown as the main problem: simulated sequences contain concurrent speakers that result in general in a more challenging corpus, while real sequences performance drops drastically when TV or radio is on. \|
Tasks	Speech Recognition
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1633/
PDF	https://www.aclweb.org/anthology/L16-1633
PWC	https://paperswithcode.com/paper/the-dirha-portuguese-corpus-a-comparison-of
Repo
Framework

The Multiple Quantile Graphical Model


Title	The Multiple Quantile Graphical Model
Authors	Alnur Ali, J. Zico Kolter, Ryan J. Tibshirani
Abstract	We introduce the Multiple Quantile Graphical Model (MQGM), which extends the neighborhood selection approach of Meinshausen and Buhlmann for learning sparse graphical models. The latter is defined by the basic subproblem of modeling the conditional mean of one variable as a sparse function of all others. Our approach models a set of conditional quantiles of one variable as a sparse function of all others, and hence offers a much richer, more expressive class of conditional distribution estimates. We establish that, under suitable regularity conditions, the MQGM identifies the exact conditional independencies with probability tending to one as the problem size grows, even outside of the usual homoskedastic Gaussian data model. We develop an efficient algorithm for fitting the MQGM using the alternating direction method of multipliers. We also describe a strategy for sampling from the joint distribution that underlies the MQGM estimate. Lastly, we present detailed experiments that demonstrate the flexibility and effectiveness of the MQGM in modeling hetereoskedastic non-Gaussian data.
Tasks
Published	2016-12-01
URL	http://papers.nips.cc/paper/6092-the-multiple-quantile-graphical-model
PDF	http://papers.nips.cc/paper/6092-the-multiple-quantile-graphical-model.pdf
PWC	https://paperswithcode.com/paper/the-multiple-quantile-graphical-model
Repo
Framework

Using Graphs of Classifiers to Impose Constraints on Semi-supervised Relation Extraction


Title	Using Graphs of Classifiers to Impose Constraints on Semi-supervised Relation Extraction
Authors	Lidong Bing, William Cohen, Bhuwan Dhingra, Richard Wang
Abstract
Tasks	Relation Extraction
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-1301/
PDF	https://www.aclweb.org/anthology/W16-1301
PWC	https://paperswithcode.com/paper/using-graphs-of-classifiers-to-impose-1
Repo
Framework

What Makes Word-level Neural Machine Translation Hard: A Case Study on English-German Translation


Title	What Makes Word-level Neural Machine Translation Hard: A Case Study on English-German Translation
Authors	Fabian Hirschmann, Jinseok Nam, Johannes F{"u}rnkranz
Abstract	Traditional machine translation systems often require heavy feature engineering and the combination of multiple techniques for solving different subproblems. In recent years, several end-to-end learning architectures based on recurrent neural networks have been proposed. Unlike traditional systems, Neural Machine Translation (NMT) systems learn the parameters of the model and require only minimal preprocessing. Memory and time constraints allow to take only a fixed number of words into account, which leads to the out-of-vocabulary (OOV) problem. In this work, we analyze why the OOV problem arises and why it is considered a serious problem in German. We study the effectiveness of compound word splitters for alleviating the OOV problem, resulting in a 2.5+ BLEU points improvement over a baseline on the WMT{'}14 German-to-English translation task. For English-to-German translation, we use target-side compound splitting through a special syntax during training that allows the model to merge compound words and gain 0.2 BLEU points.
Tasks	Feature Engineering, Machine Translation, Tokenization
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1301/
PDF	https://www.aclweb.org/anthology/C16-1301
PWC	https://paperswithcode.com/paper/what-makes-word-level-neural-machine
Repo
Framework

Coordinate-wise Power Method


Title	Coordinate-wise Power Method
Authors	Qi Lei, Kai Zhong, Inderjit S. Dhillon
Abstract	In this paper, we propose a coordinate-wise version of the power method from an optimization viewpoint. The vanilla power method simultaneously updates all the coordinates of the iterate, which is essential for its convergence analysis. However, different coordinates converge to the optimal value at different speeds. Our proposed algorithm, which we call coordinate-wise power method, is able to select and update the most important k coordinates in O(kn) time at each iteration, where n is the dimension of the matrix and k <= n is the size of the active set. Inspired by the ‘‘greedy’’ nature of our method, we further propose a greedy coordinate descent algorithm applied on a non-convex objective function specialized for symmetric matrices. We provide convergence analyses for both methods. Experimental results on both synthetic and real data show that our methods achieve up to 20 times speedup over the basic power method. Meanwhile, due to their coordinate-wise nature, our methods are very suitable for the important case when data cannot fit into memory. Finally, we introduce how the coordinate-wise mechanism could be applied to other iterative methods that are used in machine learning.
Tasks
Published	2016-12-01
URL	http://papers.nips.cc/paper/6103-coordinate-wise-power-method
PDF	http://papers.nips.cc/paper/6103-coordinate-wise-power-method.pdf
PWC	https://paperswithcode.com/paper/coordinate-wise-power-method
Repo
Framework

Information-based Modeling of Diachronic Linguistic Change: from Typicality to Productivity


Title	Information-based Modeling of Diachronic Linguistic Change: from Typicality to Productivity
Authors	Stefania Degaetano-Ortlieb, Elke Teich
Abstract
Tasks	Information Retrieval
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2121/
PDF	https://www.aclweb.org/anthology/W16-2121
PWC	https://paperswithcode.com/paper/information-based-modeling-of-diachronic
Repo
Framework

String Kernels for Native Language Identification: Insights from Behind the Curtains


Title	String Kernels for Native Language Identification: Insights from Behind the Curtains
Authors	Radu Tudor Ionescu, Marius Popescu, Aoife Cahill
Abstract
Tasks	Language Identification, Native Language Identification
Published	2016-09-01
URL	https://www.aclweb.org/anthology/J16-3005/
PDF	https://www.aclweb.org/anthology/J16-3005
PWC	https://paperswithcode.com/paper/string-kernels-for-native-language
Repo
Framework

Overfitting at SemEval-2016 Task 3: Detecting Semantically Similar Questions in Community Question Answering Forums with Word Embeddings


Title	Overfitting at SemEval-2016 Task 3: Detecting Semantically Similar Questions in Community Question Answering Forums with Word Embeddings
Authors	Hujie Wang, Pascal Poupart
Abstract
Tasks	Community Question Answering, Question Answering, Question Similarity, Semantic Textual Similarity, Word Embeddings
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1133/
PDF	https://www.aclweb.org/anthology/S16-1133
PWC	https://paperswithcode.com/paper/overfitting-at-semeval-2016-task-3-detecting
Repo
Framework

Ranking Automatically Generated Questions Using Common Human Queries


Title	Ranking Automatically Generated Questions Using Common Human Queries
Authors	Yllias Chali, Sina Golestanirad
Abstract
Tasks	Question Answering, Text Generation
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-6635/
PDF	https://www.aclweb.org/anthology/W16-6635
PWC	https://paperswithcode.com/paper/ranking-automatically-generated-questions
Repo
Framework

DAG-Structured Long Short-Term Memory for Semantic Compositionality


Title	DAG-Structured Long Short-Term Memory for Semantic Compositionality
Authors	Xiaodan Zhu, Parinaz Sobhani, Hongyu Guo
Abstract
Tasks	Machine Translation, Semantic Composition, Sentiment Analysis, Speech Recognition
Published	2016-06-01
URL	https://www.aclweb.org/anthology/N16-1106/
PDF	https://www.aclweb.org/anthology/N16-1106
PWC	https://paperswithcode.com/paper/dag-structured-long-short-term-memory-for
Repo
Framework

What topic do you want to hear about? A bilingual talking robot using English and Japanese Wikipedias


Title	What topic do you want to hear about? A bilingual talking robot using English and Japanese Wikipedias
Authors	Graham Wilcock, Kristiina Jokinen, Seiichi Yamamoto
Abstract	We demonstrate a bilingual robot application, WikiTalk, that can talk fluently in both English and Japanese about almost any topic using information from English and Japanese Wikipedias. The English version of the system has been demonstrated previously, but we now present a live demo with a Nao robot that speaks English and Japanese and switches language on request. The robot supports the verbal interaction with face-tracking, nodding and communicative gesturing. One of the key features of the WikiTalk system is that the robot can switch from the current topic to related topics during the interaction in order to navigate around Wikipedia following the user{'}s individual interests.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-2025/
PDF	https://www.aclweb.org/anthology/C16-2025
PWC	https://paperswithcode.com/paper/what-topic-do-you-want-to-hear-about-a
Repo
Framework

Overview of the 3rd Workshop on Asian Translation


Title	Overview of the 3rd Workshop on Asian Translation
Authors	Toshiaki Nakazawa, Chenchen Ding, Hideya Mino, Isao Goto, Graham Neubig, Sadao Kurohashi
Abstract	This paper presents the results of the shared tasks from the 3rd workshop on Asian translation (WAT2016) including J ↔ E, J ↔ C scientific paper translation subtasks, C ↔ J, K ↔ J, E ↔ J patent translation subtasks, I ↔ E newswire subtasks and H ↔ E, H ↔ J mixed domain subtasks. For the WAT2016, 15 institutions participated in the shared tasks. About 500 translation results have been submitted to the automatic evaluation server, and selected submissions were manually evaluated.
Tasks	Machine Translation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4601/
PDF	https://www.aclweb.org/anthology/W16-4601
PWC	https://paperswithcode.com/paper/overview-of-the-3rd-workshop-on-asian
Repo
Framework

Evaluating Argumentative and Narrative Essays using Graphs


Title	Evaluating Argumentative and Narrative Essays using Graphs
Authors	Swapna Somasundaran, Brian Riordan, Binod Gyawali, Su-Youn Yoon
Abstract	This work investigates whether the development of ideas in writing can be captured by graph properties derived from the text. Focusing on student essays, we represent the essay as a graph, and encode a variety of graph properties including PageRank as features for modeling essay scores related to quality of development. We demonstrate that our approach improves on a state-of-the-art system on the task of holistic scoring of persuasive essays and on the task of scoring narrative essays along the development dimension.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1148/
PDF	https://www.aclweb.org/anthology/C16-1148
PWC	https://paperswithcode.com/paper/evaluating-argumentative-and-narrative-essays
Repo
Framework