May 4, 2019

1416 words 7 mins read

Paper Group NANR 177

plWordNet 3.0 – a Comprehensive Lexical-Semantic Resource. MediaGist: A Cross-lingual Analyser of Aggregated News and Commentaries. UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement. Empirical Determination of Basic Heuristics for Narrative Content Planning. Proce …

plWordNet 3.0 – a Comprehensive Lexical-Semantic Resource


Title	plWordNet 3.0 – a Comprehensive Lexical-Semantic Resource
Authors	Marek Maziarz, Maciej Piasecki, Ewa Rudnicka, Stan Szpakowicz, Pawe{\l} K{\k{e}}dzia
Abstract	We have released plWordNet 3.0, a very large wordnet for Polish. In addition to what is expected in wordnets {–} richly interrelated synsets {–} it contains sentiment and emotion annotations, a large set of multi-word expressions, and a mapping onto WordNet 3.1. Part of the release is enWordNet 1.0, a substantially enlarged copy of WordNet 3.1, with material added to allow for a more complete mapping. The paper discusses the design principles of plWordNet, its content, its statistical portrait, a comparison with similar resources, and a partial list of applications.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1213/
PDF	https://www.aclweb.org/anthology/C16-1213
PWC	https://paperswithcode.com/paper/plwordnet-30-a-a-comprehensive-lexical
Repo
Framework

MediaGist: A Cross-lingual Analyser of Aggregated News and Commentaries


Title	MediaGist: A Cross-lingual Analyser of Aggregated News and Commentaries
Authors	Josef Steinberger
Abstract
Tasks	Sentiment Analysis
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-4025/
PDF	https://www.aclweb.org/anthology/P16-4025
PWC	https://paperswithcode.com/paper/mediagist-a-cross-lingual-analyser-of
Repo
Framework

UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement


Title	UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement
Authors	Hua He, John Wieting, Kevin Gimpel, Jinfeng Rao, Jimmy Lin
Abstract
Tasks	Feature Engineering, Question Answering, Semantic Textual Similarity, Word Embeddings
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1170/
PDF	https://www.aclweb.org/anthology/S16-1170
PWC	https://paperswithcode.com/paper/umd-ttic-uw-at-semeval-2016-task-1-attention
Repo
Framework

Empirical Determination of Basic Heuristics for Narrative Content Planning


Title	Empirical Determination of Basic Heuristics for Narrative Content Planning
Authors	Pablo Gerv{'a}s
Abstract
Tasks	Text Generation
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-5503/
PDF	https://www.aclweb.org/anthology/W16-5503
PWC	https://paperswithcode.com/paper/empirical-determination-of-basic-heuristics
Repo
Framework

Proceedings of the 2nd International Workshop on Natural Language Generation and the Semantic Web (WebNLG 2016)


Title	Proceedings of the 2nd International Workshop on Natural Language Generation and the Semantic Web (WebNLG 2016)
Authors
Abstract
Tasks	Text Generation
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-3500/
PDF	https://www.aclweb.org/anthology/W16-3500
PWC	https://paperswithcode.com/paper/proceedings-of-the-2nd-international-workshop
Repo
Framework

Learning User Perceived Clusters with Feature-Level Supervision


Title	Learning User Perceived Clusters with Feature-Level Supervision
Authors	Ting-Yu Cheng, Guiguan Lin, Xinyang Gong, Kang-Jun Liu, Shan-Hung Wu
Abstract	Semi-supervised clustering algorithms have been proposed to identify data clusters that align with user perceived ones via the aid of side information such as seeds or pairwise constrains. However, traditional side information is mostly at the instance level and subject to the sampling bias, where non-randomly sampled instances in the supervision can mislead the algorithms to wrong clusters. In this paper, we propose learning from the feature-level supervision. We show that this kind of supervision can be easily obtained in the form of perception vectors in many applications. Then we present novel algorithms, called Perception Embedded (PE) clustering, that exploit the perception vectors as well as traditional side information to find clusters perceived by the user. Extensive experiments are conducted on real datasets and the results demonstrate the effectiveness of PE empirically.
Tasks
Published	2016-12-01
URL	http://papers.nips.cc/paper/6260-learning-user-perceived-clusters-with-feature-level-supervision
PDF	http://papers.nips.cc/paper/6260-learning-user-perceived-clusters-with-feature-level-supervision.pdf
PWC	https://paperswithcode.com/paper/learning-user-perceived-clusters-with-feature
Repo
Framework

Convex Two-Layer Modeling with Latent Structure


Title	Convex Two-Layer Modeling with Latent Structure
Authors	Vignesh Ganapathiraman, Xinhua Zhang, Yaoliang Yu, Junfeng Wen
Abstract	Unsupervised learning of structured predictors has been a long standing pursuit in machine learning. Recently a conditional random field auto-encoder has been proposed in a two-layer setting, allowing latent structured representation to be automatically inferred. Aside from being nonconvex, it also requires the demanding inference of normalization. In this paper, we develop a convex relaxation of two-layer conditional model which captures latent structure and estimates model parameters, jointly and optimally. We further expand its applicability by resorting to a weaker form of inference—maximum a-posteriori. The flexibility of the model is demonstrated on two structures based on total unimodularity—graph matching and linear chain. Experimental results confirm the promise of the method.
Tasks	Graph Matching
Published	2016-12-01
URL	http://papers.nips.cc/paper/6314-convex-two-layer-modeling-with-latent-structure
PDF	http://papers.nips.cc/paper/6314-convex-two-layer-modeling-with-latent-structure.pdf
PWC	https://paperswithcode.com/paper/convex-two-layer-modeling-with-latent
Repo
Framework

A Neural Network Architecture for Multilingual Punctuation Generation


Title	A Neural Network Architecture for Multilingual Punctuation Generation
Authors	Miguel Ballesteros, Leo Wanner
Abstract
Tasks
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1111/
PDF	https://www.aclweb.org/anthology/D16-1111
PWC	https://paperswithcode.com/paper/a-neural-network-architecture-for
Repo
Framework

The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource


Title	The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource
Authors	Andrej {\v{Z}}gank, Mirjam Sepesy Mau{\v{c}}ec, Darinka Verdonik
Abstract	This paper presents a new Slovenian spoken language resource built from TEDx Talks. The speech database contains 242 talks in total duration of 54 hours. The annotation and transcription of acquired spoken material was generated automatically, applying acoustic segmentation and automatic speech recognition. The development and evaluation subset was also manually transcribed using the guidelines specified for the Slovenian GOS corpus. The manual transcriptions were used to evaluate the quality of unsupervised transcriptions. The average word error rate for the SI TEDx-UM evaluation subset was 50.7{%}, with out of vocabulary rate of 24{%} and language model perplexity of 390. The unsupervised transcriptions contain 372k tokens, where 32k of them were different.
Tasks	Language Modelling, Speech Recognition
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1740/
PDF	https://www.aclweb.org/anthology/L16-1740
PWC	https://paperswithcode.com/paper/the-si-tedx-um-speech-database-a-new
Repo
Framework

Generation from Abstract Meaning Representation using Tree Transducers


Title	Generation from Abstract Meaning Representation using Tree Transducers
Authors	Jeffrey Flanigan, Chris Dyer, Noah A. Smith, Jaime Carbonell
Abstract
Tasks	Language Modelling, Machine Translation, Text Generation
Published	2016-06-01
URL	https://www.aclweb.org/anthology/N16-1087/
PDF	https://www.aclweb.org/anthology/N16-1087
PWC	https://paperswithcode.com/paper/generation-from-abstract-meaning
Repo
Framework

Process Based Evaluation of Computer Generated Poetry


Title	Process Based Evaluation of Computer Generated Poetry
Authors	Stephen McGregor, Matthew Purver, Geraint Wiggins
Abstract
Tasks	Text Generation
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-5508/
PDF	https://www.aclweb.org/anthology/W16-5508
PWC	https://paperswithcode.com/paper/process-based-evaluation-of-computer
Repo
Framework

Identifying Teacher Questions Using Automatic Speech Recognition in Classrooms


Title	Identifying Teacher Questions Using Automatic Speech Recognition in Classrooms
Authors	Nathaniel Blanchard, Patrick Donnelly, Andrew M. Olney, Samei Borhan, Brooke Ward, Xiaoyi Sun, Sean Kelly, Martin Nystrand, Sidney K. D’Mello
Abstract
Tasks	Speech Recognition
Published	2016-09-01
URL	https://www.aclweb.org/anthology/papers/W16-3623/w16-3623
PDF	https://www.aclweb.org/anthology/W16-3623
PWC	https://paperswithcode.com/paper/identifying-teacher-questions-using-automatic
Repo
Framework

International Journal of Computational Linguistics & Chinese Language Processing, Volume 21, Number 2, December 2016


Title	International Journal of Computational Linguistics & Chinese Language Processing, Volume 21, Number 2, December 2016
Authors
Abstract
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/O16-3000/
PDF	https://www.aclweb.org/anthology/O16-3000
PWC	https://paperswithcode.com/paper/international-journal-of-computational-3
Repo
Framework

Syllable based DNN-HMM Cantonese Speech to Text System


Title	Syllable based DNN-HMM Cantonese Speech to Text System
Authors	Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin Lu, Minglei Li, Dan Xiong, Roy Shing Yu, Vincent T.Y. Ng
Abstract	This paper reports our work on building up a Cantonese Speech-to-Text (STT) system with a syllable based acoustic model. This is a part of an effort in building a STT system to aid dyslexic students who have cognitive deficiency in writing skills but have no problem expressing their ideas through speech. For Cantonese speech recognition, the basic unit of acoustic models can either be the conventional Initial-Final (IF) syllables, or the Onset-Nucleus-Coda (ONC) syllables where finals are further split into nucleus and coda to reflect the intra-syllable variations in Cantonese. By using the Kaldi toolkit, our system is trained using the stochastic gradient descent optimization model with the aid of GPUs for the hybrid Deep Neural Network and Hidden Markov Model (DNN-HMM) with and without I-vector based speaker adaptive training technique. The input features of the same Gaussian Mixture Model with speaker adaptive training (GMM-SAT) to DNN are used in all cases. Experiments show that the ONC-based syllable acoustic modeling with I-vector based DNN-HMM achieves the best performance with the word error rate (WER) of 9.66{%} and the real time factor (RTF) of 1.38812.
Tasks	Speech Recognition
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1610/
PDF	https://www.aclweb.org/anthology/L16-1610
PWC	https://paperswithcode.com/paper/syllable-based-dnn-hmm-cantonese-speech-to
Repo
Framework

TextPro-AL: An Active Learning Platform for Flexible and Efficient Production of Training Data for NLP Tasks


Title	TextPro-AL: An Active Learning Platform for Flexible and Efficient Production of Training Data for NLP Tasks
Authors	Bernardo Magnini, Anne-Lyse Minard, Mohammed R. H. Qwaider, Manuela Speranza
Abstract	This paper presents TextPro-AL (Active Learning for Text Processing), a platform where human annotators can efficiently work to produce high quality training data for new domains and new languages exploiting Active Learning methodologies. TextPro-AL is a web-based application integrating four components: a machine learning based NLP pipeline, an annotation editor for task definition and text annotations, an incremental re-training procedure based on active learning selection from a large pool of unannotated data, and a graphical visualization of the learning status of the system.
Tasks	Active Learning, Domain Adaptation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-2028/
PDF	https://www.aclweb.org/anthology/C16-2028
PWC	https://paperswithcode.com/paper/textpro-al-an-active-learning-platform-for
Repo
Framework