May 4, 2019

1416 words 7 mins read

Paper Group NANR 177

Paper Group NANR 177

plWordNet 3.0 – a Comprehensive Lexical-Semantic Resource. MediaGist: A Cross-lingual Analyser of Aggregated News and Commentaries. UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement. Empirical Determination of Basic Heuristics for Narrative Content Planning. Proce …

plWordNet 3.0 – a Comprehensive Lexical-Semantic Resource

Title plWordNet 3.0 – a Comprehensive Lexical-Semantic Resource
Authors Marek Maziarz, Maciej Piasecki, Ewa Rudnicka, Stan Szpakowicz, Pawe{\l} K{\k{e}}dzia
Abstract We have released plWordNet 3.0, a very large wordnet for Polish. In addition to what is expected in wordnets {–} richly interrelated synsets {–} it contains sentiment and emotion annotations, a large set of multi-word expressions, and a mapping onto WordNet 3.1. Part of the release is enWordNet 1.0, a substantially enlarged copy of WordNet 3.1, with material added to allow for a more complete mapping. The paper discusses the design principles of plWordNet, its content, its statistical portrait, a comparison with similar resources, and a partial list of applications.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1213/
PDF https://www.aclweb.org/anthology/C16-1213
PWC https://paperswithcode.com/paper/plwordnet-30-a-a-comprehensive-lexical
Repo
Framework

MediaGist: A Cross-lingual Analyser of Aggregated News and Commentaries

Title MediaGist: A Cross-lingual Analyser of Aggregated News and Commentaries
Authors Josef Steinberger
Abstract
Tasks Sentiment Analysis
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-4025/
PDF https://www.aclweb.org/anthology/P16-4025
PWC https://paperswithcode.com/paper/mediagist-a-cross-lingual-analyser-of
Repo
Framework

UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement

Title UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement
Authors Hua He, John Wieting, Kevin Gimpel, Jinfeng Rao, Jimmy Lin
Abstract
Tasks Feature Engineering, Question Answering, Semantic Textual Similarity, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1170/
PDF https://www.aclweb.org/anthology/S16-1170
PWC https://paperswithcode.com/paper/umd-ttic-uw-at-semeval-2016-task-1-attention
Repo
Framework

Empirical Determination of Basic Heuristics for Narrative Content Planning

Title Empirical Determination of Basic Heuristics for Narrative Content Planning
Authors Pablo Gerv{'a}s
Abstract
Tasks Text Generation
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-5503/
PDF https://www.aclweb.org/anthology/W16-5503
PWC https://paperswithcode.com/paper/empirical-determination-of-basic-heuristics
Repo
Framework

Proceedings of the 2nd International Workshop on Natural Language Generation and the Semantic Web (WebNLG 2016)

Title Proceedings of the 2nd International Workshop on Natural Language Generation and the Semantic Web (WebNLG 2016)
Authors
Abstract
Tasks Text Generation
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3500/
PDF https://www.aclweb.org/anthology/W16-3500
PWC https://paperswithcode.com/paper/proceedings-of-the-2nd-international-workshop
Repo
Framework

Learning User Perceived Clusters with Feature-Level Supervision

Title Learning User Perceived Clusters with Feature-Level Supervision
Authors Ting-Yu Cheng, Guiguan Lin, Xinyang Gong, Kang-Jun Liu, Shan-Hung Wu
Abstract Semi-supervised clustering algorithms have been proposed to identify data clusters that align with user perceived ones via the aid of side information such as seeds or pairwise constrains. However, traditional side information is mostly at the instance level and subject to the sampling bias, where non-randomly sampled instances in the supervision can mislead the algorithms to wrong clusters. In this paper, we propose learning from the feature-level supervision. We show that this kind of supervision can be easily obtained in the form of perception vectors in many applications. Then we present novel algorithms, called Perception Embedded (PE) clustering, that exploit the perception vectors as well as traditional side information to find clusters perceived by the user. Extensive experiments are conducted on real datasets and the results demonstrate the effectiveness of PE empirically.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6260-learning-user-perceived-clusters-with-feature-level-supervision
PDF http://papers.nips.cc/paper/6260-learning-user-perceived-clusters-with-feature-level-supervision.pdf
PWC https://paperswithcode.com/paper/learning-user-perceived-clusters-with-feature
Repo
Framework

Convex Two-Layer Modeling with Latent Structure

Title Convex Two-Layer Modeling with Latent Structure
Authors Vignesh Ganapathiraman, Xinhua Zhang, Yaoliang Yu, Junfeng Wen
Abstract Unsupervised learning of structured predictors has been a long standing pursuit in machine learning. Recently a conditional random field auto-encoder has been proposed in a two-layer setting, allowing latent structured representation to be automatically inferred. Aside from being nonconvex, it also requires the demanding inference of normalization. In this paper, we develop a convex relaxation of two-layer conditional model which captures latent structure and estimates model parameters, jointly and optimally. We further expand its applicability by resorting to a weaker form of inference—maximum a-posteriori. The flexibility of the model is demonstrated on two structures based on total unimodularity—graph matching and linear chain. Experimental results confirm the promise of the method.
Tasks Graph Matching
Published 2016-12-01
URL http://papers.nips.cc/paper/6314-convex-two-layer-modeling-with-latent-structure
PDF http://papers.nips.cc/paper/6314-convex-two-layer-modeling-with-latent-structure.pdf
PWC https://paperswithcode.com/paper/convex-two-layer-modeling-with-latent
Repo
Framework

A Neural Network Architecture for Multilingual Punctuation Generation

Title A Neural Network Architecture for Multilingual Punctuation Generation
Authors Miguel Ballesteros, Leo Wanner
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1111/
PDF https://www.aclweb.org/anthology/D16-1111
PWC https://paperswithcode.com/paper/a-neural-network-architecture-for
Repo
Framework

The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource

Title The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource
Authors Andrej {\v{Z}}gank, Mirjam Sepesy Mau{\v{c}}ec, Darinka Verdonik
Abstract This paper presents a new Slovenian spoken language resource built from TEDx Talks. The speech database contains 242 talks in total duration of 54 hours. The annotation and transcription of acquired spoken material was generated automatically, applying acoustic segmentation and automatic speech recognition. The development and evaluation subset was also manually transcribed using the guidelines specified for the Slovenian GOS corpus. The manual transcriptions were used to evaluate the quality of unsupervised transcriptions. The average word error rate for the SI TEDx-UM evaluation subset was 50.7{%}, with out of vocabulary rate of 24{%} and language model perplexity of 390. The unsupervised transcriptions contain 372k tokens, where 32k of them were different.
Tasks Language Modelling, Speech Recognition
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1740/
PDF https://www.aclweb.org/anthology/L16-1740
PWC https://paperswithcode.com/paper/the-si-tedx-um-speech-database-a-new
Repo
Framework

Generation from Abstract Meaning Representation using Tree Transducers

Title Generation from Abstract Meaning Representation using Tree Transducers
Authors Jeffrey Flanigan, Chris Dyer, Noah A. Smith, Jaime Carbonell
Abstract
Tasks Language Modelling, Machine Translation, Text Generation
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1087/
PDF https://www.aclweb.org/anthology/N16-1087
PWC https://paperswithcode.com/paper/generation-from-abstract-meaning
Repo
Framework

Process Based Evaluation of Computer Generated Poetry

Title Process Based Evaluation of Computer Generated Poetry
Authors Stephen McGregor, Matthew Purver, Geraint Wiggins
Abstract
Tasks Text Generation
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-5508/
PDF https://www.aclweb.org/anthology/W16-5508
PWC https://paperswithcode.com/paper/process-based-evaluation-of-computer
Repo
Framework

Identifying Teacher Questions Using Automatic Speech Recognition in Classrooms

Title Identifying Teacher Questions Using Automatic Speech Recognition in Classrooms
Authors Nathaniel Blanchard, Patrick Donnelly, Andrew M. Olney, Samei Borhan, Brooke Ward, Xiaoyi Sun, Sean Kelly, Martin Nystrand, Sidney K. D’Mello
Abstract
Tasks Speech Recognition
Published 2016-09-01
URL https://www.aclweb.org/anthology/papers/W16-3623/w16-3623
PDF https://www.aclweb.org/anthology/W16-3623
PWC https://paperswithcode.com/paper/identifying-teacher-questions-using-automatic
Repo
Framework

International Journal of Computational Linguistics & Chinese Language Processing, Volume 21, Number 2, December 2016

Title International Journal of Computational Linguistics & Chinese Language Processing, Volume 21, Number 2, December 2016
Authors
Abstract
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/O16-3000/
PDF https://www.aclweb.org/anthology/O16-3000
PWC https://paperswithcode.com/paper/international-journal-of-computational-3
Repo
Framework

Syllable based DNN-HMM Cantonese Speech to Text System

Title Syllable based DNN-HMM Cantonese Speech to Text System
Authors Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin Lu, Minglei Li, Dan Xiong, Roy Shing Yu, Vincent T.Y. Ng
Abstract This paper reports our work on building up a Cantonese Speech-to-Text (STT) system with a syllable based acoustic model. This is a part of an effort in building a STT system to aid dyslexic students who have cognitive deficiency in writing skills but have no problem expressing their ideas through speech. For Cantonese speech recognition, the basic unit of acoustic models can either be the conventional Initial-Final (IF) syllables, or the Onset-Nucleus-Coda (ONC) syllables where finals are further split into nucleus and coda to reflect the intra-syllable variations in Cantonese. By using the Kaldi toolkit, our system is trained using the stochastic gradient descent optimization model with the aid of GPUs for the hybrid Deep Neural Network and Hidden Markov Model (DNN-HMM) with and without I-vector based speaker adaptive training technique. The input features of the same Gaussian Mixture Model with speaker adaptive training (GMM-SAT) to DNN are used in all cases. Experiments show that the ONC-based syllable acoustic modeling with I-vector based DNN-HMM achieves the best performance with the word error rate (WER) of 9.66{%} and the real time factor (RTF) of 1.38812.
Tasks Speech Recognition
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1610/
PDF https://www.aclweb.org/anthology/L16-1610
PWC https://paperswithcode.com/paper/syllable-based-dnn-hmm-cantonese-speech-to
Repo
Framework

TextPro-AL: An Active Learning Platform for Flexible and Efficient Production of Training Data for NLP Tasks

Title TextPro-AL: An Active Learning Platform for Flexible and Efficient Production of Training Data for NLP Tasks
Authors Bernardo Magnini, Anne-Lyse Minard, Mohammed R. H. Qwaider, Manuela Speranza
Abstract This paper presents TextPro-AL (Active Learning for Text Processing), a platform where human annotators can efficiently work to produce high quality training data for new domains and new languages exploiting Active Learning methodologies. TextPro-AL is a web-based application integrating four components: a machine learning based NLP pipeline, an annotation editor for task definition and text annotations, an incremental re-training procedure based on active learning selection from a large pool of unannotated data, and a graphical visualization of the learning status of the system.
Tasks Active Learning, Domain Adaptation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2028/
PDF https://www.aclweb.org/anthology/C16-2028
PWC https://paperswithcode.com/paper/textpro-al-an-active-learning-platform-for
Repo
Framework
comments powered by Disqus