July 26, 2019

1607 words 8 mins read

Paper Group NANR 165

Paper Group NANR 165

Robust Coreference Resolution and Entity Linking on Dialogues: Character Identification on TV Show Transcripts. Will my auxiliary tagging task help? Estimating Auxiliary Tasks Effectivity in Multi-Task Learning. CIC-FBK Approach to Native Language Identification. Carrier Sentence Selection for Fill-in-the-blank Items. Vision and Language Integratio …

Robust Coreference Resolution and Entity Linking on Dialogues: Character Identification on TV Show Transcripts

Title Robust Coreference Resolution and Entity Linking on Dialogues: Character Identification on TV Show Transcripts
Authors Henry Y. Chen, Ethan Zhou, Jinho D. Choi
Abstract This paper presents a novel approach to character identification, that is an entity linking task that maps mentions to characters in dialogues from TV show transcripts. We first augment and correct several cases of annotation errors in an existing corpus so the corpus is clearer and cleaner for statistical learning. We also introduce the agglomerative convolutional neural network that takes groups of features and learns mention and mention-pair embeddings for coreference resolution. We then propose another neural model that employs the embeddings learned and creates cluster embeddings for entity linking. Our coreference resolution model shows comparable results to other state-of-the-art systems. Our entity linking model significantly outperforms the previous work, showing the F1 score of 86.76{%} and the accuracy of 95.30{%} for character identification.
Tasks Coreference Resolution, Entity Linking, Entity Resolution, Question Answering
Published 2017-08-01
URL https://www.aclweb.org/anthology/K17-1023/
PDF https://www.aclweb.org/anthology/K17-1023
PWC https://paperswithcode.com/paper/robust-coreference-resolution-and-entity
Repo
Framework

Will my auxiliary tagging task help? Estimating Auxiliary Tasks Effectivity in Multi-Task Learning

Title Will my auxiliary tagging task help? Estimating Auxiliary Tasks Effectivity in Multi-Task Learning
Authors Johannes Bjerva
Abstract
Tasks Multi-Task Learning
Published 2017-05-01
URL https://www.aclweb.org/anthology/W17-0225/
PDF https://www.aclweb.org/anthology/W17-0225
PWC https://paperswithcode.com/paper/will-my-auxiliary-tagging-task-help
Repo
Framework

CIC-FBK Approach to Native Language Identification

Title CIC-FBK Approach to Native Language Identification
Authors Ilia Markov, Lingzhen Chen, Carlo Strapparava, Grigori Sidorov
Abstract We present the CIC-FBK system, which took part in the Native Language Identification (NLI) Shared Task 2017. Our approach combines features commonly used in previous NLI research, i.e., word n-grams, lemma n-grams, part-of-speech n-grams, and function words, with recently introduced character n-grams from misspelled words, and features that are novel in this task, such as typed character n-grams, and syntactic n-grams of words and of syntactic relation tags. We use log-entropy weighting scheme and perform classification using the Support Vector Machines (SVM) algorithm. Our system achieved 0.8808 macro-averaged F1-score and shared the 1st rank in the NLI Shared Task 2017 scoring.
Tasks Language Identification, Native Language Identification
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-5042/
PDF https://www.aclweb.org/anthology/W17-5042
PWC https://paperswithcode.com/paper/cic-fbk-approach-to-native-language
Repo
Framework

Carrier Sentence Selection for Fill-in-the-blank Items

Title Carrier Sentence Selection for Fill-in-the-blank Items
Authors Shu Jiang, John Lee
Abstract Fill-in-the-blank items are a common form of exercise in computer-assisted language learning systems. To automatically generate an effective item, the system must be able to select a high-quality carrier sentence that illustrates the usage of the target word. Previous approaches for carrier sentence selection have considered sentence length, vocabulary difficulty, the position of the target word and the presence of finite verbs. This paper investigates the utility of word co-occurrence statistics and lexical similarity as selection criteria. In an evaluation on generating fill-in-the-blank items for learning Chinese as a foreign language, we show that these two criteria can improve carrier sentence quality.
Tasks
Published 2017-12-01
URL https://www.aclweb.org/anthology/W17-5903/
PDF https://www.aclweb.org/anthology/W17-5903
PWC https://paperswithcode.com/paper/carrier-sentence-selection-for-fill-in-the
Repo
Framework

Vision and Language Integration: Moving beyond Objects

Title Vision and Language Integration: Moving beyond Objects
Authors Ravi Shekhar, S Pezzelle, ro, Aur{'e}lie Herbelot, Moin Nabi, Enver Sangineto, Raffaella Bernardi
Abstract
Tasks Action Classification, Image Captioning, Object Classification, Question Answering, Visual Question Answering
Published 2017-01-01
URL https://www.aclweb.org/anthology/W17-6938/
PDF https://www.aclweb.org/anthology/W17-6938
PWC https://paperswithcode.com/paper/vision-and-language-integration-moving-beyond
Repo
Framework

Hindi Shabdamitra: A Wordnet based E-Learning Tool for Language Learning and Teaching

Title Hindi Shabdamitra: A Wordnet based E-Learning Tool for Language Learning and Teaching
Authors Hanumant Redkar, S Singh, hya, Meenakshi Somasundaram, Dhara Gorasia, Malhar Kulkarni, Pushpak Bhattacharyya
Abstract In today{'}s technology driven digital era, education domain is undergoing a transformation from traditional approaches to more learner controlled and flexible methods of learning. This transformation has opened the new avenues for interdisciplinary research in the field of educational technology and natural language processing in developing quality digital aids for learning and teaching. The tool presented here - Hindi Shabhadamitra, developed using Hindi Wordnet for Hindi language learning, is one such e-learning tool. It has been developed as a teaching and learning aid suitable for formal school based curriculum and informal setup for self learning users. Besides vocabulary, it also provides word based grammar along with images and pronunciation for better learning and retention. This aid demonstrates that how a rich lexical resource like wordnet can be systematically remodeled for practical usage in the educational domain.
Tasks
Published 2017-12-01
URL https://www.aclweb.org/anthology/W17-5904/
PDF https://www.aclweb.org/anthology/W17-5904
PWC https://paperswithcode.com/paper/hindi-shabdamitra-a-wordnet-based-e-learning
Repo
Framework

Suggesting Sentences for ESL using Kernel Embeddings

Title Suggesting Sentences for ESL using Kernel Embeddings
Authors Kent Shioda, Mamoru Komachi, Rue Ikeya, Daichi Mochihashi
Abstract Sentence retrieval is an important NLP application for English as a Second Language (ESL) learners. ESL learners are familiar with web search engines, but generic web search results may not be adequate for composing documents in a specific domain. However, if we build our own search system specialized to a domain, it may be subject to the data sparseness problem. Recently proposed word2vec partially addresses the data sparseness problem, but fails to extract sentences relevant to queries owing to the modeling of the latent intent of the query. Thus, we propose a method of retrieving example sentences using kernel embeddings and N-gram windows. This method implicitly models latent intent of query and sentences, and alleviates the problem of noisy alignment. Our results show that our method achieved higher precision in sentence retrieval for ESL in the domain of a university press release corpus, as compared to a previous unsupervised method used for a semantic textual similarity task.
Tasks Semantic Textual Similarity
Published 2017-12-01
URL https://www.aclweb.org/anthology/W17-5911/
PDF https://www.aclweb.org/anthology/W17-5911
PWC https://paperswithcode.com/paper/suggesting-sentences-for-esl-using-kernel
Repo
Framework

The Sentimental Value of Chinese Sub-Character Components

Title The Sentimental Value of Chinese Sub-Character Components
Authors Yassine Benajiba, Or Biran, Zhiliang Weng, Yong Zhang, Jin Sun
Abstract Sub-character components of Chinese characters carry important semantic information, and recent studies have shown that utilizing this information can improve performance on core semantic tasks. In this paper, we hypothesize that in addition to semantic information, sub-character components may also carry emotional information, and that utilizing it should improve performance on sentiment analysis tasks. We conduct a series of experiments on four Chinese sentiment data sets and show that we can significantly improve the performance in various tasks over that of a character-level embeddings baseline. We then focus on qualitatively assessing multiple examples and trying to explain how the sub-character components affect the results in each case.
Tasks Sentiment Analysis, Word Embeddings
Published 2017-12-01
URL https://www.aclweb.org/anthology/W17-6003/
PDF https://www.aclweb.org/anthology/W17-6003
PWC https://paperswithcode.com/paper/the-sentimental-value-of-chinese-sub
Repo
Framework

Character Sequence-to-Sequence Model with Global Attention for Universal Morphological Reinflection

Title Character Sequence-to-Sequence Model with Global Attention for Universal Morphological Reinflection
Authors Qile Zhu, Yanjun Li, Xiaolin Li
Abstract
Tasks Machine Translation, Question Answering
Published 2017-08-01
URL https://www.aclweb.org/anthology/K17-2009/
PDF https://www.aclweb.org/anthology/K17-2009
PWC https://paperswithcode.com/paper/character-sequence-to-sequence-model-with
Repo
Framework

基於鑑別式自編碼解碼器之錄音回放攻擊偵測系統 (A Replay Spoofing Detection System Based on Discriminative Autoencoders) [In Chinese]

Title 基於鑑別式自編碼解碼器之錄音回放攻擊偵測系統 (A Replay Spoofing Detection System Based on Discriminative Autoencoders) [In Chinese]
Authors Yu-Ding Lu, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang
Abstract
Tasks Speaker Verification
Published 2017-11-01
URL https://www.aclweb.org/anthology/O17-1010/
PDF https://www.aclweb.org/anthology/O17-1010
PWC https://paperswithcode.com/paper/ao14eaa14eac-c14ec14a-a1ee3a34a-c3c-a-replay-1
Repo
Framework

Rule-Enhanced Penalized Regression by Column Generation using Rectangular Maximum Agreement

Title Rule-Enhanced Penalized Regression by Column Generation using Rectangular Maximum Agreement
Authors Jonathan Eckstein, Noam Goldberg, Ai Kagawa
Abstract We describe a learning procedure enhancing L1-penalized regression by adding dynamically generated rules describing multidimensional “box” sets. Our rule-adding procedure is based on the classical column generation method for high-dimensional linear programming. The pricing problem for our column generation procedure reduces to the NP-hard rectangular maximum agreement (RMA) problem of finding a box that best discriminates between two weighted datasets. We solve this problem exactly using a parallel branch-and-bound procedure. The resulting rule-enhanced regression procedure is computation-intensive, but has promising prediction performance.
Tasks
Published 2017-08-01
URL https://icml.cc/Conferences/2017/Schedule?showEvent=604
PDF http://proceedings.mlr.press/v70/eckstein17a/eckstein17a.pdf
PWC https://paperswithcode.com/paper/rule-enhanced-penalized-regression-by-column
Repo
Framework

Coordination in TAG without the Conjoin Operation

Title Coordination in TAG without the Conjoin Operation
Authors Chung-hye Han, Anoop Sarkar
Abstract
Tasks
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-6205/
PDF https://www.aclweb.org/anthology/W17-6205
PWC https://paperswithcode.com/paper/coordination-in-tag-without-the-conjoin
Repo
Framework

Multiword Expression-Aware A* TAG Parsing Revisited

Title Multiword Expression-Aware A* TAG Parsing Revisited
Authors Jakub Waszczuk, Agata Savary, Yannick Parmentier
Abstract
Tasks
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-6209/
PDF https://www.aclweb.org/anthology/W17-6209
PWC https://paperswithcode.com/paper/multiword-expression-aware-a-tag-parsing
Repo
Framework

TAG Parser Evaluation using Textual Entailments

Title TAG Parser Evaluation using Textual Entailments
Authors Pauli Xu, Robert Frank, Jungo Kasai, Owen Rambow
Abstract
Tasks Natural Language Inference
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-6214/
PDF https://www.aclweb.org/anthology/W17-6214
PWC https://paperswithcode.com/paper/tag-parser-evaluation-using-textual
Repo
Framework

Nyström Method with Kernel K-means++ Samples as Landmarks

Title Nyström Method with Kernel K-means++ Samples as Landmarks
Authors Dino Oglic, Thomas Gärtner
Abstract We investigate, theoretically and empirically, the effectiveness of kernel K-means++ samples as landmarks in the Nyström method for low-rank approximation of kernel matrices. Previous empirical studies (Zhang et al., 2008; Kumar et al.,2012) observe that the landmarks obtained using (kernel) K-means clustering define a good low-rank approximation of kernel matrices. However, the existing work does not provide a theoretical guarantee on the approximation error for this approach to landmark selection. We close this gap and provide the first bound on the approximation error of the Nyström method with kernel K-means++ samples as landmarks. Moreover, for the frequently used Gaussian kernel we provide a theoretically sound motivation for performing Lloyd refinements of kernel K-means++ landmarks in the instance space. We substantiate our theoretical results empirically by comparing the approach to several state-of-the-art algorithms.
Tasks
Published 2017-08-01
URL https://icml.cc/Conferences/2017/Schedule?showEvent=595
PDF http://proceedings.mlr.press/v70/oglic17a/oglic17a.pdf
PWC https://paperswithcode.com/paper/nystrom-method-with-kernel-k-means-samples-as
Repo
Framework
comments powered by Disqus