May 4, 2019

999 words 5 mins read

Paper Group NANR 230

Paper Group NANR 230

Together we stand: Siamese Networks for Similar Question Retrieval. Selection Criteria for Low Resource Language Programs. Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages. Extra-Specific Multiword Expressions for Language-Endowed Intelligent Agents. Opinion Holder and Target Extraction on Opinio …

Together we stand: Siamese Networks for Similar Question Retrieval

Title Together we stand: Siamese Networks for Similar Question Retrieval
Authors Arpita Das, Harish Yenala, Manoj Chinnakotla, Manish Shrivastava
Abstract
Tasks Community Question Answering, Question Answering, Semantic Textual Similarity, Topic Models
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1036/
PDF https://www.aclweb.org/anthology/P16-1036
PWC https://paperswithcode.com/paper/together-we-stand-siamese-networks-for
Repo
Framework

Selection Criteria for Low Resource Language Programs

Title Selection Criteria for Low Resource Language Programs
Authors Christopher Cieri, Mike Maxwell, Stephanie Strassel, Jennifer Tracey
Abstract This paper documents and describes the criteria used to select languages for study within programs that include low resource languages whether given that label or another similar one. It focuses on five US common task, Human Language Technology research and development programs in which the authors have provided information or consulting related to the choice of language. The paper does not describe the actual selection process which is the responsibility of program management and highly specific to a program{'}s individual goals and context. Instead it concentrates on the data and criteria that have been considered relevant previously with the thought that future program managers and their consultants may adapt these and apply them with different prioritization to future programs.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1720/
PDF https://www.aclweb.org/anthology/L16-1720
PWC https://paperswithcode.com/paper/selection-criteria-for-low-resource-language
Repo
Framework

Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages

Title Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages
Authors John Sylak-Glassman, Christo Kirov, David Yarowsky
Abstract Structured, complete inflectional paradigm data exists for very few of the world{'}s languages, but is crucial to training morphological analysis tools. We present methods inspired by linguistic fieldwork for gathering inflectional paradigm data in a machine-readable, interoperable format from remotely-located speakers of any language. Informants are tasked with completing language-specific paradigm elicitation templates. Templates are constructed by linguists using grammatical reference materials to ensure completeness. Each cell in a template is associated with contextual prompts designed to help informants with varying levels of linguistic expertise (from professional translators to untrained native speakers) provide the desired inflected form. To facilitate downstream use in interoperable NLP/HLT applications, each cell is also associated with a language-independent machine-readable set of morphological tags from the UniMorph Schema. This data is useful for seeding morphological analysis and generation software, particularly when the data is representative of the range of surface morphological variation in the language. At present, we have obtained 792 lemmas and 25,056 inflected forms from 15 languages.
Tasks Morphological Analysis
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1497/
PDF https://www.aclweb.org/anthology/L16-1497
PWC https://paperswithcode.com/paper/remote-elicitation-of-inflectional-paradigms
Repo
Framework

Extra-Specific Multiword Expressions for Language-Endowed Intelligent Agents

Title Extra-Specific Multiword Expressions for Language-Endowed Intelligent Agents
Authors Marjorie McShane, Sergei Nirenburg
Abstract Language-endowed intelligent agents benefit from leveraging lexical knowledge falling at different points along a spectrum of compositionality. This means that robust computational lexicons should include not only the compositional expectations of argument-taking words, but also non-compositional collocations (idioms), semi-compositional collocations that might be difficult for an agent to interpret (e.g., standard metaphors), and even collocations that could be compositionally analyzed but are so frequently encountered that recording their meaning increases the efficiency of interpretation. In this paper we argue that yet another type of string-to-meaning mapping can also be useful to intelligent agents: remembered semantic analyses of actual text inputs. These can be viewed as super-specific multi-word expressions whose recorded interpretations mimic a person{'}s memories of knowledge previously learned from language input. These differ from typical annotated corpora in two ways. First, they provide a full, context-sensitive semantic interpretation rather than select features. Second, they are are formulated in the ontologically-grounded metalanguage used in a particular agent environment, meaning that the interpretations contribute to the dynamically evolving cognitive capabilites of agents configured in that environment.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3805/
PDF https://www.aclweb.org/anthology/W16-3805
PWC https://paperswithcode.com/paper/extra-specific-multiword-expressions-for
Repo
Framework

Opinion Holder and Target Extraction on Opinion Compounds – A Linguistic Approach

Title Opinion Holder and Target Extraction on Opinion Compounds – A Linguistic Approach
Authors Michael Wiegand, Christine Bocionek, Josef Ruppenhofer
Abstract
Tasks Question Answering, Sentiment Analysis
Published 2016-06-01
URL https://www.aclweb.org/anthology/papers/N16-1094/n16-1094
PDF https://www.aclweb.org/anthology/N16-1094
PWC https://paperswithcode.com/paper/opinion-holder-and-target-extraction-on
Repo
Framework

Phrase Representations for Multiword Expressions

Title Phrase Representations for Multiword Expressions
Authors Joėl Legrand, Ronan Collobert
Abstract
Tasks Chunking, Named Entity Recognition, Part-Of-Speech Tagging, Semantic Role Labeling, Word Embeddings
Published 2016-08-01
URL https://www.aclweb.org/anthology/papers/W16-1810/w16-1810
PDF https://www.aclweb.org/anthology/W16-1810
PWC https://paperswithcode.com/paper/phrase-representations-for-multiword
Repo
Framework

Situation entity types: automatic classification of clause-level aspect

Title Situation entity types: automatic classification of clause-level aspect
Authors Annemarie Friedrich, Alexis Palmer, Manfred Pinkal
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1166/
PDF https://www.aclweb.org/anthology/P16-1166
PWC https://paperswithcode.com/paper/situation-entity-types-automatic
Repo
Framework

Bi-LSTM Neural Networks for Chinese Grammatical Error Diagnosis

Title Bi-LSTM Neural Networks for Chinese Grammatical Error Diagnosis
Authors Shen Huang, Houfeng Wang
Abstract Grammatical Error Diagnosis for Chinese has always been a challenge for both foreign learners and NLP researchers, for the variousity of grammar and the flexibility of expression. In this paper, we present a model based on Bidirectional Long Short-Term Memory(Bi-LSTM) neural networks, which treats the task as a sequence labeling problem, so as to detect Chinese grammatical errors, to identify the error types and to locate the error positions. In the corpora of this year{'}s shared task, there can be multiple errors in a single offset of a sentence, to address which, we simutaneously train three Bi-LSTM models sharing word embeddings which label Missing, Redundant and Selection errors respectively. We regard word ordering error as a special kind of word selection error which is longer during training phase, and then separate them by length during testing phase. In NLP-TEA 3 shared task for Chinese Grammatical Error Diagnosis(CGED), Our system achieved relatively high F1 for all the three levels in the traditional Chinese track and for the detection level in the Simpified Chinese track.
Tasks Grammatical Error Detection, Word Embeddings
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4919/
PDF https://www.aclweb.org/anthology/W16-4919
PWC https://paperswithcode.com/paper/bi-lstm-neural-networks-for-chinese
Repo
Framework
comments powered by Disqus