May 5, 2019

1861 words 9 mins read

Paper Group NANR 78

Paper Group NANR 78

High-Rank Matrix Completion and Clustering under Self-Expressive Models. Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial3). Forecasting Emerging Trends from Scientific Literature. If You Even Don’t Have a Bit of Bible: Learning Delexicalized POS Taggers. A Low-Rank Approximation Approach to Learning J …

High-Rank Matrix Completion and Clustering under Self-Expressive Models

Title High-Rank Matrix Completion and Clustering under Self-Expressive Models
Authors Ehsan Elhamifar
Abstract We propose efficient algorithms for simultaneous clustering and completion of incomplete high-dimensional data that lie in a union of low-dimensional subspaces. We cast the problem as finding a completion of the data matrix so that each point can be reconstructed as a linear or affine combination of a few data points. Since the problem is NP-hard, we propose a lifting framework and reformulate the problem as a group-sparse recovery of each incomplete data point in a dictionary built using incomplete data, subject to rank-one constraints. To solve the problem efficiently, we propose a rank pursuit algorithm and a convex relaxation. The solution of our algorithms recover missing entries and provides a similarity matrix for clustering. Our algorithms can deal with both low-rank and high-rank matrices, does not suffer from initialization, does not need to know dimensions of subspaces and can work with a small number of data points. By extensive experiments on synthetic data and real problems of video motion segmentation and completion of motion capture data, we show that when the data matrix is low-rank, our algorithm performs on par with or better than low-rank matrix completion methods, while for high-rank data matrices, our method significantly outperforms existing algorithms.
Tasks Low-Rank Matrix Completion, Matrix Completion, Motion Capture, Motion Segmentation
Published 2016-12-01
URL http://papers.nips.cc/paper/6357-high-rank-matrix-completion-and-clustering-under-self-expressive-models
PDF http://papers.nips.cc/paper/6357-high-rank-matrix-completion-and-clustering-under-self-expressive-models.pdf
PWC https://paperswithcode.com/paper/high-rank-matrix-completion-and-clustering
Repo
Framework

Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial3)

Title Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial3)
Authors
Abstract
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4800/
PDF https://www.aclweb.org/anthology/W16-4800
PWC https://paperswithcode.com/paper/proceedings-of-the-third-workshop-on-nlp-for
Repo
Framework
Title Forecasting Emerging Trends from Scientific Literature
Authors Kartik Asooja, Georgeta Bordea, Gabriela Vulcu, Paul Buitelaar
Abstract Text analysis methods for the automatic identification of emerging technologies by analyzing the scientific publications, are gaining attention because of their socio-economic impact. The approaches so far have been mainly focused on retrospective analysis by mapping scientific topic evolution over time. We propose regression based approaches to predict future keyword distribution. The prediction is based on historical data of the keywords, which in our case, are LREC conference proceedings. Considering the insufficient number of data points available from LREC proceedings, we do not employ standard time series forecasting methods. We form a dataset by extracting the keywords from previous year proceedings and quantify their yearly relevance using tf-idf scores. This dataset additionally contains ranked lists of related keywords and experts for each keyword.
Tasks Time Series, Time Series Forecasting
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1066/
PDF https://www.aclweb.org/anthology/L16-1066
PWC https://paperswithcode.com/paper/forecasting-emerging-trends-from-scientific
Repo
Framework

If You Even Don’t Have a Bit of Bible: Learning Delexicalized POS Taggers

Title If You Even Don’t Have a Bit of Bible: Learning Delexicalized POS Taggers
Authors Zhiwei Yu, David Mare{\v{c}}ek, Zden{\v{e}}k {\v{Z}}abokrtsk{'y}, Daniel Zeman
Abstract Part-of-speech (POS) induction is one of the most popular tasks in research on unsupervised NLP. Various unsupervised and semi-supervised methods have been proposed to tag an unseen language. However, many of them require some partial understanding of the target language because they rely on dictionaries or parallel corpora such as the Bible. In this paper, we propose a different method named delexicalized tagging, for which we only need a raw corpus of the target language. We transfer tagging models trained on annotated corpora of one or more resource-rich languages. We employ language-independent features such as word length, frequency, neighborhood entropy, character classes (alphabetic vs. numeric vs. punctuation) etc. We demonstrate that such features can, to certain extent, serve as predictors of the part of speech, represented by the universal POS tag.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1015/
PDF https://www.aclweb.org/anthology/L16-1015
PWC https://paperswithcode.com/paper/if-you-even-dont-have-a-bit-of-bible-learning
Repo
Framework

A Low-Rank Approximation Approach to Learning Joint Embeddings of News Stories and Images for Timeline Summarization

Title A Low-Rank Approximation Approach to Learning Joint Embeddings of News Stories and Images for Timeline Summarization
Authors William Yang Wang, Yashar Mehdad, Dragomir R. Radev, Am Stent, a
Abstract
Tasks Feature Engineering, Recommendation Systems, Representation Learning, Timeline Summarization
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1008/
PDF https://www.aclweb.org/anthology/N16-1008
PWC https://paperswithcode.com/paper/a-low-rank-approximation-approach-to-learning
Repo
Framework

SemAligner: A Method and Tool for Aligning Chunks with Semantic Relation Types and Semantic Similarity Scores

Title SemAligner: A Method and Tool for Aligning Chunks with Semantic Relation Types and Semantic Similarity Scores
Authors Nabin Maharjan, Rajendra Banjade, Nobal Bikram Niraula, Vasile Rus
Abstract This paper introduces a ruled-based method and software tool, called SemAligner, for aligning chunks across texts in a given pair of short English texts. The tool, based on the top performing method at the Interpretable Short Text Similarity shared task at SemEval 2015, where it was used with human annotated (gold) chunks, can now additionally process plain text-pairs using two powerful chunkers we developed, e.g. using Conditional Random Fields. Besides aligning chunks, the tool automatically assigns semantic relations to the aligned chunks (such as EQUI for equivalent and OPPO for opposite) and semantic similarity scores that measure the strength of the semantic relation between the aligned chunks. Experiments show that SemAligner performs competitively for system generated chunks and that these results are also comparable to results obtained on gold chunks. SemAligner has other capabilities such as handling various input formats and chunkers as well as extending lookup resources.
Tasks Semantic Similarity, Semantic Textual Similarity
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1192/
PDF https://www.aclweb.org/anthology/L16-1192
PWC https://paperswithcode.com/paper/semaligner-a-method-and-tool-for-aligning
Repo
Framework

Joining-in-type Humanoid Robot Assisted Language Learning System

Title Joining-in-type Humanoid Robot Assisted Language Learning System
Authors AlBara Khalifa, Tsuneo Kato, Seiichi Yamamoto
Abstract Dialogue robots are attractive to people, and in language learning systems, they motivate learners and let them practice conversational skills in more realistic environment. However, automatic speech recognition (ASR) of the second language (L2) learners is still a challenge, because their speech contains not just pronouncing, lexical, grammatical errors, but is sometimes totally disordered. Hence, we propose a novel robot assisted language learning (RALL) system using two robots, one as a teacher and the other as an advanced learner. The system is designed to simulate multiparty conversation, expecting implicit learning and enhancement of predictability of learners{'} utterance through an alignment similar to {``}interactive alignment{''}, which is observed in human-human conversation. We collected a database with the prototypes, and measured how much the alignment phenomenon observed in the database with initial analysis. |
Tasks Speech Recognition
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1037/
PDF https://www.aclweb.org/anthology/L16-1037
PWC https://paperswithcode.com/paper/joining-in-type-humanoid-robot-assisted
Repo
Framework

Protein contact prediction from amino acid co-evolution using convolutional networks for graph-valued images

Title Protein contact prediction from amino acid co-evolution using convolutional networks for graph-valued images
Authors Vladimir Golkov, Marcin J. Skwark, Antonij Golkov, Alexey Dosovitskiy, Thomas Brox, Jens Meiler, Daniel Cremers
Abstract Proteins are the “building blocks of life”, the most abundant organic molecules, and the central focus of most areas of biomedicine. Protein structure is strongly related to protein function, thus structure prediction is a crucial task on the way to solve many biological questions. A contact map is a compact representation of the three-dimensional structure of a protein via the pairwise contacts between the amino acid constituting the protein. We use a convolutional network to calculate protein contact maps from inferred statistical coupling between positions in the protein sequence. The input to the network has an image-like structure amenable to convolutions, but every “pixel” instead of color channels contains a bipartite undirected edge-weighted graph. We propose several methods for treating such “graph-valued images” in a convolutional network. The proposed method outperforms state-of-the-art methods by a large margin. It also allows for a great flexibility with regard to the input data, which makes it useful for studying a wide range of problems.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6488-protein-contact-prediction-from-amino-acid-co-evolution-using-convolutional-networks-for-graph-valued-images
PDF http://papers.nips.cc/paper/6488-protein-contact-prediction-from-amino-acid-co-evolution-using-convolutional-networks-for-graph-valued-images.pdf
PWC https://paperswithcode.com/paper/protein-contact-prediction-from-amino-acid-co
Repo
Framework

Cro36WSD: A Lexical Sample for Croatian Word Sense Disambiguation

Title Cro36WSD: A Lexical Sample for Croatian Word Sense Disambiguation
Authors Domagoj Alagi{'c}, Jan {\v{S}}najder
Abstract We introduce Cro36WSD, a freely-available medium-sized lexical sample for Croatian word sense disambiguation (WSD).Cro36WSD comprises 36 words: 12 adjectives, 12 nouns, and 12 verbs, balanced across both frequency bands and polysemy levels. We adopt the multi-label annotation scheme in the hope of lessening the drawbacks of discrete sense inventories and obtaining more realistic annotations from human experts. Sense-annotated data is collected through multiple annotation rounds to ensure high-quality annotations: with a 115 person-hours effort we reached an inter-annotator agreement score of 0.877. We analyze the obtained data and perform a correlation analysis between several relevant variables, including word frequency, number of senses, sense distribution skewness, average annotation time, and the observed inter-annotator agreement (IAA). Using the obtained data, we compile multi- and single-labeled dataset variants using different label aggregation schemes. Finally, we evaluate three different baseline WSD models on both dataset variants and report on the insights gained. We make both dataset variants freely available.
Tasks Word Sense Disambiguation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1267/
PDF https://www.aclweb.org/anthology/L16-1267
PWC https://paperswithcode.com/paper/cro36wsd-a-lexical-sample-for-croatian-word
Repo
Framework

Verbal fields in Hungarian simple sentences and infinitival clausal complements

Title Verbal fields in Hungarian simple sentences and infinitival clausal complements
Authors Kata Balogh
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-3306/
PDF https://www.aclweb.org/anthology/W16-3306
PWC https://paperswithcode.com/paper/verbal-fields-in-hungarian-simple-sentences
Repo
Framework

Annotating Temporally-Anchored Spatial Knowledge on Top of OntoNotes Semantic Roles

Title Annotating Temporally-Anchored Spatial Knowledge on Top of OntoNotes Semantic Roles
Authors Alakan Vempala, a, Eduardo Blanco
Abstract This paper presents a two-step methodology to annotate spatial knowledge on top of OntoNotes semantic roles. First, we manipulate semantic roles to automatically generate potential additional spatial knowledge. Second, we crowdsource annotations with Amazon Mechanical Turk to either validate or discard the potential additional spatial knowledge. The resulting annotations indicate whether entities are or are not located somewhere with a degree of certainty, and temporally anchor this spatial information. Crowdsourcing experiments show that the additional spatial knowledge is ubiquitous and intuitive to humans, and experimental results show that it can be inferred automatically using standard supervised machine learning techniques.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1604/
PDF https://www.aclweb.org/anthology/L16-1604
PWC https://paperswithcode.com/paper/annotating-temporally-anchored-spatial-2
Repo
Framework

Decomposing Bilexical Dependencies into Semantic and Syntactic Vectors

Title Decomposing Bilexical Dependencies into Semantic and Syntactic Vectors
Authors Jeff Mitchell
Abstract
Tasks Chunking, Language Modelling, Named Entity Recognition, Part-Of-Speech Tagging, Representation Learning, Semantic Role Labeling
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-1615/
PDF https://www.aclweb.org/anthology/W16-1615
PWC https://paperswithcode.com/paper/decomposing-bilexical-dependencies-into
Repo
Framework

Keyphrase Extraction Using Deep Recurrent Neural Networks on Twitter

Title Keyphrase Extraction Using Deep Recurrent Neural Networks on Twitter
Authors Qi Zhang, Yang Wang, Yeyun Gong, Xuanjing Huang
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1080/
PDF https://www.aclweb.org/anthology/D16-1080
PWC https://paperswithcode.com/paper/keyphrase-extraction-using-deep-recurrent
Repo
Framework

Generating Paraphrases from DBPedia using Deep Learning

Title Generating Paraphrases from DBPedia using Deep Learning
Authors Amin Sleimi, Claire Gardent
Abstract
Tasks Language Modelling, Text Generation
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3511/
PDF https://www.aclweb.org/anthology/W16-3511
PWC https://paperswithcode.com/paper/generating-paraphrases-from-dbpedia-using
Repo
Framework

ReadME generation from an OWL ontology describing NLP tools

Title ReadME generation from an OWL ontology describing NLP tools
Authors Driss Sadoun, Satenik Mkhitaryan, Damien Nouvel, Mathieu Valette
Abstract
Tasks Text Generation
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3509/
PDF https://www.aclweb.org/anthology/W16-3509
PWC https://paperswithcode.com/paper/readme-generation-from-an-owl-ontology
Repo
Framework
comments powered by Disqus