May 4, 2019

1869 words 9 mins read

Paper Group NANR 225

Paper Group NANR 225

Deeper syntax for better semantic parsing. Determining the Multiword Expression Inventory of a Surprise Language. The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.. The Multiple Quantile Graphical Model. Using Graphs of Classifiers to Impose Constraints on Semi-supervised Rela …

Deeper syntax for better semantic parsing

Title Deeper syntax for better semantic parsing
Authors Olivier Michalon, Corentin Ribeyre, C, Marie ito, Alexis Nasr
Abstract Syntax plays an important role in the task of predicting the semantic structure of a sentence. But syntactic phenomena such as alternations, control and raising tend to obfuscate the relation between syntax and semantics. In this paper we predict the semantic structure of a sentence using a deeper syntax than what is usually done. This deep syntactic representation abstracts away from purely syntactic phenomena and proposes a structural organization of the sentence that is closer to the semantic representation. Experiments conducted on a French corpus annotated with semantic frames showed that a semantic parser reaches better performances with such a deep syntactic input.
Tasks Semantic Parsing, Semantic Role Labeling
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1040/
PDF https://www.aclweb.org/anthology/C16-1040
PWC https://paperswithcode.com/paper/deeper-syntax-for-better-semantic-parsing
Repo
Framework

Determining the Multiword Expression Inventory of a Surprise Language

Title Determining the Multiword Expression Inventory of a Surprise Language
Authors Bahar Salehi, Paul Cook, Timothy Baldwin
Abstract Much previous research on multiword expressions (MWEs) has focused on the token- and type-level tasks of MWE identification and extraction, respectively. Such studies typically target known prevalent MWE types in a given language. This paper describes the first attempt to learn the MWE inventory of a {``}surprise{''} language for which we have no explicit prior knowledge of MWE patterns, certainly no annotated MWE data, and not even a parallel corpus. Our proposed model is trained on a treebank with MWE relations of a source language, and can be applied to the monolingual corpus of the surprise language to identify its MWE construction types. |
Tasks Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1046/
PDF https://www.aclweb.org/anthology/C16-1046
PWC https://paperswithcode.com/paper/determining-the-multiword-expression
Repo
Framework

The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.

Title The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.
Authors Miguel Matos, Alberto Abad, Ant{'o}nio Serralheiro
Abstract In this paper, we describe a new corpus -named DIRHA-L2F RealCorpus- composed of typical home automation speech interactions in European Portuguese that has been recorded by the INESC-ID{'}s Spoken Language Systems Laboratory (L2F) to support the activities of the Distant-speech Interaction for Robust Home Applications (DIRHA) EU-funded project. The corpus is a multi-microphone and multi-room database of real continuous audio sequences containing read phonetically rich sentences, read and spontaneous keyword activation sentences, and read and spontaneous home automation commands. The background noise conditions are controlled and randomly recreated with noises typically found in home environments. Experimental validation on this corpus is reported in comparison with the results obtained on a simulated corpus using a fully automated speech processing pipeline for two fundamental automatic speech recognition tasks of typical {`}always-listening{'} home-automation scenarios: system activation and voice command recognition. Attending to results on both corpora, the presence of overlapping voice-like noise is shown as the main problem: simulated sequences contain concurrent speakers that result in general in a more challenging corpus, while real sequences performance drops drastically when TV or radio is on. |
Tasks Speech Recognition
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1633/
PDF https://www.aclweb.org/anthology/L16-1633
PWC https://paperswithcode.com/paper/the-dirha-portuguese-corpus-a-comparison-of
Repo
Framework

The Multiple Quantile Graphical Model

Title The Multiple Quantile Graphical Model
Authors Alnur Ali, J. Zico Kolter, Ryan J. Tibshirani
Abstract We introduce the Multiple Quantile Graphical Model (MQGM), which extends the neighborhood selection approach of Meinshausen and Buhlmann for learning sparse graphical models. The latter is defined by the basic subproblem of modeling the conditional mean of one variable as a sparse function of all others. Our approach models a set of conditional quantiles of one variable as a sparse function of all others, and hence offers a much richer, more expressive class of conditional distribution estimates. We establish that, under suitable regularity conditions, the MQGM identifies the exact conditional independencies with probability tending to one as the problem size grows, even outside of the usual homoskedastic Gaussian data model. We develop an efficient algorithm for fitting the MQGM using the alternating direction method of multipliers. We also describe a strategy for sampling from the joint distribution that underlies the MQGM estimate. Lastly, we present detailed experiments that demonstrate the flexibility and effectiveness of the MQGM in modeling hetereoskedastic non-Gaussian data.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6092-the-multiple-quantile-graphical-model
PDF http://papers.nips.cc/paper/6092-the-multiple-quantile-graphical-model.pdf
PWC https://paperswithcode.com/paper/the-multiple-quantile-graphical-model
Repo
Framework

Using Graphs of Classifiers to Impose Constraints on Semi-supervised Relation Extraction

Title Using Graphs of Classifiers to Impose Constraints on Semi-supervised Relation Extraction
Authors Lidong Bing, William Cohen, Bhuwan Dhingra, Richard Wang
Abstract
Tasks Relation Extraction
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-1301/
PDF https://www.aclweb.org/anthology/W16-1301
PWC https://paperswithcode.com/paper/using-graphs-of-classifiers-to-impose-1
Repo
Framework

What Makes Word-level Neural Machine Translation Hard: A Case Study on English-German Translation

Title What Makes Word-level Neural Machine Translation Hard: A Case Study on English-German Translation
Authors Fabian Hirschmann, Jinseok Nam, Johannes F{"u}rnkranz
Abstract Traditional machine translation systems often require heavy feature engineering and the combination of multiple techniques for solving different subproblems. In recent years, several end-to-end learning architectures based on recurrent neural networks have been proposed. Unlike traditional systems, Neural Machine Translation (NMT) systems learn the parameters of the model and require only minimal preprocessing. Memory and time constraints allow to take only a fixed number of words into account, which leads to the out-of-vocabulary (OOV) problem. In this work, we analyze why the OOV problem arises and why it is considered a serious problem in German. We study the effectiveness of compound word splitters for alleviating the OOV problem, resulting in a 2.5+ BLEU points improvement over a baseline on the WMT{'}14 German-to-English translation task. For English-to-German translation, we use target-side compound splitting through a special syntax during training that allows the model to merge compound words and gain 0.2 BLEU points.
Tasks Feature Engineering, Machine Translation, Tokenization
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1301/
PDF https://www.aclweb.org/anthology/C16-1301
PWC https://paperswithcode.com/paper/what-makes-word-level-neural-machine
Repo
Framework

Coordinate-wise Power Method

Title Coordinate-wise Power Method
Authors Qi Lei, Kai Zhong, Inderjit S. Dhillon
Abstract In this paper, we propose a coordinate-wise version of the power method from an optimization viewpoint. The vanilla power method simultaneously updates all the coordinates of the iterate, which is essential for its convergence analysis. However, different coordinates converge to the optimal value at different speeds. Our proposed algorithm, which we call coordinate-wise power method, is able to select and update the most important k coordinates in O(kn) time at each iteration, where n is the dimension of the matrix and k <= n is the size of the active set. Inspired by the ‘‘greedy’’ nature of our method, we further propose a greedy coordinate descent algorithm applied on a non-convex objective function specialized for symmetric matrices. We provide convergence analyses for both methods. Experimental results on both synthetic and real data show that our methods achieve up to 20 times speedup over the basic power method. Meanwhile, due to their coordinate-wise nature, our methods are very suitable for the important case when data cannot fit into memory. Finally, we introduce how the coordinate-wise mechanism could be applied to other iterative methods that are used in machine learning.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6103-coordinate-wise-power-method
PDF http://papers.nips.cc/paper/6103-coordinate-wise-power-method.pdf
PWC https://paperswithcode.com/paper/coordinate-wise-power-method
Repo
Framework

Information-based Modeling of Diachronic Linguistic Change: from Typicality to Productivity

Title Information-based Modeling of Diachronic Linguistic Change: from Typicality to Productivity
Authors Stefania Degaetano-Ortlieb, Elke Teich
Abstract
Tasks Information Retrieval
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2121/
PDF https://www.aclweb.org/anthology/W16-2121
PWC https://paperswithcode.com/paper/information-based-modeling-of-diachronic
Repo
Framework

String Kernels for Native Language Identification: Insights from Behind the Curtains

Title String Kernels for Native Language Identification: Insights from Behind the Curtains
Authors Radu Tudor Ionescu, Marius Popescu, Aoife Cahill
Abstract
Tasks Language Identification, Native Language Identification
Published 2016-09-01
URL https://www.aclweb.org/anthology/J16-3005/
PDF https://www.aclweb.org/anthology/J16-3005
PWC https://paperswithcode.com/paper/string-kernels-for-native-language
Repo
Framework

Overfitting at SemEval-2016 Task 3: Detecting Semantically Similar Questions in Community Question Answering Forums with Word Embeddings

Title Overfitting at SemEval-2016 Task 3: Detecting Semantically Similar Questions in Community Question Answering Forums with Word Embeddings
Authors Hujie Wang, Pascal Poupart
Abstract
Tasks Community Question Answering, Question Answering, Question Similarity, Semantic Textual Similarity, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1133/
PDF https://www.aclweb.org/anthology/S16-1133
PWC https://paperswithcode.com/paper/overfitting-at-semeval-2016-task-3-detecting
Repo
Framework

Ranking Automatically Generated Questions Using Common Human Queries

Title Ranking Automatically Generated Questions Using Common Human Queries
Authors Yllias Chali, Sina Golestanirad
Abstract
Tasks Question Answering, Text Generation
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-6635/
PDF https://www.aclweb.org/anthology/W16-6635
PWC https://paperswithcode.com/paper/ranking-automatically-generated-questions
Repo
Framework

DAG-Structured Long Short-Term Memory for Semantic Compositionality

Title DAG-Structured Long Short-Term Memory for Semantic Compositionality
Authors Xiaodan Zhu, Parinaz Sobhani, Hongyu Guo
Abstract
Tasks Machine Translation, Semantic Composition, Sentiment Analysis, Speech Recognition
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1106/
PDF https://www.aclweb.org/anthology/N16-1106
PWC https://paperswithcode.com/paper/dag-structured-long-short-term-memory-for
Repo
Framework

What topic do you want to hear about? A bilingual talking robot using English and Japanese Wikipedias

Title What topic do you want to hear about? A bilingual talking robot using English and Japanese Wikipedias
Authors Graham Wilcock, Kristiina Jokinen, Seiichi Yamamoto
Abstract We demonstrate a bilingual robot application, WikiTalk, that can talk fluently in both English and Japanese about almost any topic using information from English and Japanese Wikipedias. The English version of the system has been demonstrated previously, but we now present a live demo with a Nao robot that speaks English and Japanese and switches language on request. The robot supports the verbal interaction with face-tracking, nodding and communicative gesturing. One of the key features of the WikiTalk system is that the robot can switch from the current topic to related topics during the interaction in order to navigate around Wikipedia following the user{'}s individual interests.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2025/
PDF https://www.aclweb.org/anthology/C16-2025
PWC https://paperswithcode.com/paper/what-topic-do-you-want-to-hear-about-a
Repo
Framework

Overview of the 3rd Workshop on Asian Translation

Title Overview of the 3rd Workshop on Asian Translation
Authors Toshiaki Nakazawa, Chenchen Ding, Hideya Mino, Isao Goto, Graham Neubig, Sadao Kurohashi
Abstract This paper presents the results of the shared tasks from the 3rd workshop on Asian translation (WAT2016) including J ↔ E, J ↔ C scientific paper translation subtasks, C ↔ J, K ↔ J, E ↔ J patent translation subtasks, I ↔ E newswire subtasks and H ↔ E, H ↔ J mixed domain subtasks. For the WAT2016, 15 institutions participated in the shared tasks. About 500 translation results have been submitted to the automatic evaluation server, and selected submissions were manually evaluated.
Tasks Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4601/
PDF https://www.aclweb.org/anthology/W16-4601
PWC https://paperswithcode.com/paper/overview-of-the-3rd-workshop-on-asian
Repo
Framework

Evaluating Argumentative and Narrative Essays using Graphs

Title Evaluating Argumentative and Narrative Essays using Graphs
Authors Swapna Somasundaran, Brian Riordan, Binod Gyawali, Su-Youn Yoon
Abstract This work investigates whether the development of ideas in writing can be captured by graph properties derived from the text. Focusing on student essays, we represent the essay as a graph, and encode a variety of graph properties including PageRank as features for modeling essay scores related to quality of development. We demonstrate that our approach improves on a state-of-the-art system on the task of holistic scoring of persuasive essays and on the task of scoring narrative essays along the development dimension.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1148/
PDF https://www.aclweb.org/anthology/C16-1148
PWC https://paperswithcode.com/paper/evaluating-argumentative-and-narrative-essays
Repo
Framework
comments powered by Disqus