July 26, 2019

1890 words 9 mins read

Paper Group NANR 33

Control vs. Raising in English. A Dependency Grammar Account. Semisupervised Clustering, AND-Queries and Locally Encodable Source Coding. Querying Multi-word Expressions Annotation with CQL. Towards Harnessing Memory Networks for Coreference Resolution. Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems …

Control vs. Raising in English. A Dependency Grammar Account


Title	Control vs. Raising in English. A Dependency Grammar Account
Authors	Timothy Osborne, Matthew Reeve
Abstract
Tasks
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-6521/
PDF	https://www.aclweb.org/anthology/W17-6521
PWC	https://paperswithcode.com/paper/control-vs-raising-in-english-a-dependency
Repo
Framework

Semisupervised Clustering, AND-Queries and Locally Encodable Source Coding


Title	Semisupervised Clustering, AND-Queries and Locally Encodable Source Coding
Authors	Arya Mazumdar, Soumyabrata Pal
Abstract	Source coding is the canonical problem of data compression in information theory. In a locally encodable source coding, each compressed bit depends on only few bits of the input. In this paper, we show that a recently popular model of semisupervised clustering is equivalent to locally encodable source coding. In this model, the task is to perform multiclass labeling of unlabeled elements. At the beginning, we can ask in parallel a set of simple queries to an oracle who provides (possibly erroneous) binary answers to the queries. The queries cannot involve more than two (or a fixed constant number $\Delta$ of) elements. Now the labeling of all the elements (or clustering) must be performed based on the (noisy) query answers. The goal is to recover all the correct labelings while minimizing the number of such queries. The equivalence to locally encodable source codes leads us to find lower bounds on the number of queries required in variety of scenarios. We are also able to show fundamental limitations of pairwise `same cluster’ queries - and propose pairwise AND queries, that provably performs better in many situations. \|
Tasks
Published	2017-12-01
URL	http://papers.nips.cc/paper/7227-semisupervised-clustering-and-queries-and-locally-encodable-source-coding
PDF	http://papers.nips.cc/paper/7227-semisupervised-clustering-and-queries-and-locally-encodable-source-coding.pdf
PWC	https://paperswithcode.com/paper/semisupervised-clustering-and-queries-and
Repo
Framework

Querying Multi-word Expressions Annotation with CQL


Title	Querying Multi-word Expressions Annotation with CQL
Authors	Natalia Klyueva, Anna Vernerov{'a}, Behrang Qasemizadeh
Abstract
Tasks	Machine Translation
Published	2017-01-01
URL	https://www.aclweb.org/anthology/W17-7611/
PDF	https://www.aclweb.org/anthology/W17-7611
PWC	https://paperswithcode.com/paper/querying-multi-word-expressions-annotation
Repo
Framework

Towards Harnessing Memory Networks for Coreference Resolution


Title	Towards Harnessing Memory Networks for Coreference Resolution
Authors	Joe Cheri, Pushpak Bhattacharyya
Abstract	Coreference resolution task demands comprehending a discourse, especially for anaphoric mentions which require semantic information for resolving antecedents. We investigate into how memory networks can be helpful for coreference resolution when posed as question answering problem. The comprehension capability of memory networks assists coreference resolution, particularly for the mentions that require semantic and context information. We experiment memory networks for coreference resolution, with 4 synthetic datasets generated for coreference resolution with varying difficulty levels. Our system{'}s performance is compared with a traditional coreference resolution system to show why memory network can be promising for coreference resolution.
Tasks	Coreference Resolution, Question Answering, Representation Learning
Published	2017-08-01
URL	https://www.aclweb.org/anthology/W17-2605/
PDF	https://www.aclweb.org/anthology/W17-2605
PWC	https://paperswithcode.com/paper/towards-harnessing-memory-networks-for
Repo
Framework

Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems


Title	Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems
Authors	Wang Ling, Dani Yogatama, Chris Dyer, Phil Blunsom
Abstract	Solving algebraic word problems requires executing a series of arithmetic operations{—}a program{—}to obtain a final answer. However, since programs can be arbitrarily complicated, inducing them directly from question-answer pairs is a formidable challenge. To make this task more feasible, we solve these problems by generating answer rationales, sequences of natural language and human-readable mathematical expressions that derive the final answer through a series of small steps. Although rationales do not explicitly specify programs, they provide a scaffolding for their structure via intermediate milestones. To evaluate our approach, we have created a new 100,000-sample dataset of questions, answers and rationales. Experimental results show that indirect supervision of program learning via answer rationales is a promising strategy for inducing arithmetic programs.
Tasks	Decision Making
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-1015/
PDF	https://www.aclweb.org/anthology/P17-1015
PWC	https://paperswithcode.com/paper/program-induction-by-rationale-generation-1
Repo
Framework

Vowel and Consonant Classification through Spectral Decomposition


Title	Vowel and Consonant Classification through Spectral Decomposition
Authors	Patricia Thaine, Gerald Penn
Abstract	We consider two related problems in this paper. Given an undeciphered alphabetic writing system or mono-alphabetic cipher, determine: (1) which of its letters are vowels and which are consonants; and (2) whether the writing system is a vocalic alphabet or an abjad. We are able to show that a very simple spectral decomposition based on character co-occurrences provides nearly perfect performance with respect to answering both question types.
Tasks
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-4112/
PDF	https://www.aclweb.org/anthology/W17-4112
PWC	https://paperswithcode.com/paper/vowel-and-consonant-classification-through
Repo
Framework


Title	All that is English may be Hindi: Enhancing language identification through automatic ranking of the likeliness of word borrowing in social media
Authors	Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Abhipsa Basu, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee
Abstract	n this paper, we present a set of computational methods to identify the likeliness of a word being borrowed, based on the signals from social media. In terms of Spearman{'}s correlation values, our methods perform more than two times better (∼ 0.62) in predicting the borrowing likeliness compared to the best performing baseline (∼ 0.26) reported in literature. Based on this likeliness estimate we asked annotators to re-annotate the language tags of foreign words in predominantly native contexts. In 88{%} of cases the annotators felt that the foreign language tag should be replaced by native language tag, thus indicating a huge scope for improvement of automatic language identification systems.
Tasks	Language Identification
Published	2017-09-01
URL	https://www.aclweb.org/anthology/D17-1240/
PDF	https://www.aclweb.org/anthology/D17-1240
PWC	https://paperswithcode.com/paper/all-that-is-english-may-be-hindi-enhancing-1
Repo
Framework

Mama Edha at SemEval-2017 Task 8: Stance Classification with CNN and Rules


Title	Mama Edha at SemEval-2017 Task 8: Stance Classification with CNN and Rules
Authors	Marianela Garc{'\i}a Lozano, Hanna Lilja, Edward Tj{"o}rnhammar, Maja Karasalo
Abstract	For the competition SemEval-2017 we investigated the possibility of performing stance classification (support, deny, query or comment) for messages in Twitter conversation threads related to rumours. Stance classification is interesting since it can provide a basis for rumour veracity assessment. Our ensemble classification approach of combining convolutional neural networks with both automatic rule mining and manually written rules achieved a final accuracy of 74.9{%} on the competition{'}s test data set for Task 8A. To improve classification we also experimented with data relabeling and using the grammatical structure of the tweet contents for classification.
Tasks	Rumour Detection
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-2084/
PDF	https://www.aclweb.org/anthology/S17-2084
PWC	https://paperswithcode.com/paper/mama-edha-at-semeval-2017-task-8-stance
Repo
Framework

Word Representations in Factored Neural Machine Translation


Title	Word Representations in Factored Neural Machine Translation
Authors	Franck Burlot, Mercedes Garc{'\i}a-Mart{'\i}nez, Lo{"\i}c Barrault, Fethi Bougares, Fran{\c{c}}ois Yvon
Abstract
Tasks	Machine Translation
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-4703/
PDF	https://www.aclweb.org/anthology/W17-4703
PWC	https://paperswithcode.com/paper/word-representations-in-factored-neural
Repo
Framework

SemEval-2017 Task 11: End-User Development using Natural Language


Title	SemEval-2017 Task 11: End-User Development using Natural Language
Authors	Juliano Sales, H, Siegfried schuh, Andr{'e} Freitas
Abstract	This task proposes a challenge to support the interaction between users and applications, micro-services and software APIs using natural language. The task aims for supporting the evaluation and evolution of the discussions surrounding the natural language processing approaches within the context of end-user natural language programming, under scenarios of high semantic heterogeneity/gap.
Tasks	Semantic Parsing
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-2092/
PDF	https://www.aclweb.org/anthology/S17-2092
PWC	https://paperswithcode.com/paper/semeval-2017-task-11-end-user-development
Repo
Framework

An Eye-tracking Study of Named Entity Annotation


Title	An Eye-tracking Study of Named Entity Annotation
Authors	Takenobu Tokunaga, Hitoshi Nishikawa, Tomoya Iwakura
Abstract	Utilising effective features in machine learning-based natural language processing (NLP) is crucial in achieving good performance for a given NLP task. The paper describes a pilot study on the analysis of eye-tracking data during named entity (NE) annotation, aiming at obtaining insights into effective features for the NE recognition task. The eye gaze data were collected from 10 annotators and analysed regarding working time and fixation distribution. The results of the preliminary qualitative analysis showed that human annotators tend to look at broader contexts around the target NE than recent state-of-the-art automatic NE recognition systems and to use predicate argument relations to identify the NE categories.
Tasks	Active Learning, Coreference Resolution, Eye Tracking, Named Entity Recognition, Sentence Compression, Sentiment Analysis, Word Sense Disambiguation
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-1097/
PDF	https://doi.org/10.26615/978-954-452-049-6_097
PWC	https://paperswithcode.com/paper/an-eye-tracking-study-of-named-entity
Repo
Framework

The representation and extraction of qunatitative information


Title	The representation and extraction of qunatitative information
Authors	Tianyong Hao, Yunyan We, Jiaqi Qiang, Haitao Wang, Kiyong Lee
Abstract
Tasks	Information Retrieval, Question Answering
Published	2017-01-01
URL	https://www.aclweb.org/anthology/W17-7408/
PDF	https://www.aclweb.org/anthology/W17-7408
PWC	https://paperswithcode.com/paper/the-representation-and-extraction-of
Repo
Framework

Affine-Invariant Online Optimization and the Low-rank Experts Problem


Title	Affine-Invariant Online Optimization and the Low-rank Experts Problem
Authors	Tomer Koren, Roi Livni
Abstract	We present a new affine-invariant optimization algorithm called Online Lazy Newton. The regret of Online Lazy Newton is independent of conditioning: the algorithm’s performance depends on the best possible preconditioning of the problem in retrospect and on its \emph{intrinsic} dimensionality. As an application, we show how Online Lazy Newton can be used to achieve an optimal regret of order $\sqrt{rT}$ for the low-rank experts problem, improving by a $\sqrt{r}$ factor over the previously best known bound and resolving an open problem posed by Hazan et al (2016).
Tasks
Published	2017-12-01
URL	http://papers.nips.cc/paper/7060-affine-invariant-online-optimization-and-the-low-rank-experts-problem
PDF	http://papers.nips.cc/paper/7060-affine-invariant-online-optimization-and-the-low-rank-experts-problem.pdf
PWC	https://paperswithcode.com/paper/affine-invariant-online-optimization-and-the
Repo
Framework

Train-O-Matic: Large-Scale Supervised Word Sense Disambiguation in Multiple Languages without Manual Training Data


Title	Train-O-Matic: Large-Scale Supervised Word Sense Disambiguation in Multiple Languages without Manual Training Data
Authors	Tommaso Pasini, Roberto Navigli
Abstract	Annotating large numbers of sentences with senses is the heaviest requirement of current Word Sense Disambiguation. We present Train-O-Matic, a language-independent method for generating millions of sense-annotated training instances for virtually all meanings of words in a language{'}s vocabulary. The approach is fully automatic: no human intervention is required and the only type of human knowledge used is a WordNet-like resource. Train-O-Matic achieves consistently state-of-the-art performance across gold standard datasets and languages, while at the same time removing the burden of manual annotation. All the training data is available for research purposes at \url{http://trainomatic.org}.
Tasks	Word Sense Disambiguation
Published	2017-09-01
URL	https://www.aclweb.org/anthology/D17-1008/
PDF	https://www.aclweb.org/anthology/D17-1008
PWC	https://paperswithcode.com/paper/train-o-matic-large-scale-supervised-word
Repo
Framework

Language Technologies in Teaching Bugarian at Primary and Secondary School Level: the NBU Platform of Language Teaching (PLT)


Title	Language Technologies in Teaching Bugarian at Primary and Secondary School Level: the NBU Platform of Language Teaching (PLT)
Authors	Maria Stambolieva, Valentina Ivanova, Mariana Raykova, Milka Hadjikoteva, Mariya Neykova
Abstract	The NBU Language Teaching Platform (PLT) was initially designed for teaching foreign languages for specific purposes; at a second stage, some of its functionalities were extended to answer the needs of teaching general foreign language. New functionalities have now been created for the purpose of providing e-support for Bulgarian language and literature teaching at primary and secondary school level. The article presents the general structure of the platform and the functionalities specifically developed to match the standards and expected results set by the Ministry of Education. The E-platform integrates: 1/ an environment for creating, organizing and maintaining electronic text archives, for extracting text corpora and aligning corpora; 2/ a linguistic database; 3/ a concordancer; 4/ a set of modules for the generation and editing of practice exercises for each text or corpus; 5/ functionalities for export from the platform and import to other educational platforms. For Moodle, modules were created for test generation, performance assessment and feedback. The PLT allows centralized presentation of abundant teaching content, control of the educational process, fast and reliable feedback on performance.
Tasks
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-8105/
PDF	http://doi.org/10.26615/978-954-452-046-5_005
PWC	https://paperswithcode.com/paper/language-technologies-in-teaching-bugarian-at
Repo
Framework