May 5, 2019

1389 words 7 mins read

Paper Group NANR 19

Paper Group NANR 19

The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus. Automatic Identification of Narrative Diegesis and Point of View. An Information Foraging Approach to Determining the Number of Relevant Features. Aligning Texts and Knowledge Bases with Semantic Sentence Simplification. DeepLife: An Entity-aware Search, Analytics and Exploration Pl …

The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus

Title The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus
Authors Akira Hayakawa, Saturnino Luz, Loredana Cerrato, Nick Campbell
Abstract This paper presents the multimodal Interlingual Map Task Corpus (ILMT-s2s corpus) collected at Trinity College Dublin, and discuss some of the issues related to the collection and analysis of the data. The corpus design is inspired by the HCRC Map Task Corpus which was initially designed to support the investigation of linguistic phenomena, and has been the focus of a variety of studies of communicative behaviour. The simplicity of the task, and the complexity of phenomena it can elicit, make the map task an ideal object of study. Although there are studies that used replications of the map task to investigate communication in computer mediated tasks, this ILMT-s2s corpus is, to the best of our knowledge, the first investigation of communicative behaviour in the presence of three additional {``}filters{''}: Automatic Speech Recognition (ASR), Machine Translation (MT) and Text To Speech (TTS) synthesis, where the instruction giver and the instruction follower speak different languages. This paper details the data collection setup and completed annotation of the ILMT-s2s corpus, and outlines preliminary results obtained from the data. |
Tasks Machine Translation, Speech Recognition
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1096/
PDF https://www.aclweb.org/anthology/L16-1096
PWC https://paperswithcode.com/paper/the-ilmt-s2s-corpus-a-a-multimodal
Repo
Framework

Automatic Identification of Narrative Diegesis and Point of View

Title Automatic Identification of Narrative Diegesis and Point of View
Authors Joshua Eisenberg, Mark Finlayson
Abstract
Tasks Natural Language Inference
Published 2016-11-01
URL https://www.aclweb.org/anthology/W16-5705/
PDF https://www.aclweb.org/anthology/W16-5705
PWC https://paperswithcode.com/paper/automatic-identification-of-narrative
Repo
Framework

An Information Foraging Approach to Determining the Number of Relevant Features

Title An Information Foraging Approach to Determining the Number of Relevant Features
Authors Brian Connolly, Benjamin Glass, John Pestian
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2923/
PDF https://www.aclweb.org/anthology/W16-2923
PWC https://paperswithcode.com/paper/an-information-foraging-approach-to
Repo
Framework

Aligning Texts and Knowledge Bases with Semantic Sentence Simplification

Title Aligning Texts and Knowledge Bases with Semantic Sentence Simplification
Authors Yassine Mrabet, Pavlos Vougiouklis, Halil Kilicoglu, Claire Gardent, Dina Demner-Fushman, Jonathon Hare, Elena Simperl
Abstract
Tasks Information Retrieval, Text Generation, Text Simplification
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3506/
PDF https://www.aclweb.org/anthology/W16-3506
PWC https://paperswithcode.com/paper/aligning-texts-and-knowledge-bases-with
Repo
Framework

DeepLife: An Entity-aware Search, Analytics and Exploration Platform for Health and Life Sciences

Title DeepLife: An Entity-aware Search, Analytics and Exploration Platform for Health and Life Sciences
Authors Patrick Ernst, Amy Siu, Dragan Milchevski, Johannes Hoffart, Gerhard Weikum
Abstract
Tasks Entity Linking
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-4004/
PDF https://www.aclweb.org/anthology/P16-4004
PWC https://paperswithcode.com/paper/deeplife-an-entity-aware-search-analytics-and
Repo
Framework

Neural Morphological Analysis: Encoding-Decoding Canonical Segments

Title Neural Morphological Analysis: Encoding-Decoding Canonical Segments
Authors Katharina Kann, Ryan Cotterell, Hinrich Sch{"u}tze
Abstract
Tasks Keyword Spotting, Machine Translation, Morphological Analysis, Speech Recognition
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1097/
PDF https://www.aclweb.org/anthology/D16-1097
PWC https://paperswithcode.com/paper/neural-morphological-analysis-encoding
Repo
Framework

Proceedings of the Clinical Natural Language Processing Workshop (ClinicalNLP)

Title Proceedings of the Clinical Natural Language Processing Workshop (ClinicalNLP)
Authors
Abstract
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4200/
PDF https://www.aclweb.org/anthology/W16-4200
PWC https://paperswithcode.com/paper/proceedings-of-the-clinical-natural-language
Repo
Framework

Synset Ranking of Hindi WordNet

Title Synset Ranking of Hindi WordNet
Authors Sudha Bhingardive, Rajita Shukla, Jaya Saraswati, Laxmi Kashyap, Dhirendra Singh, Pushpak Bhattacharyya
Abstract Word Sense Disambiguation (WSD) is one of the open problems in the area of natural language processing. Various supervised, unsupervised and knowledge based approaches have been proposed for automatically determining the sense of a word in a particular context. It has been observed that such approaches often find it difficult to beat the WordNet First Sense (WFS) baseline which assigns the sense irrespective of context. In this paper, we present our work on creating the WFS baseline for Hindi language by manually ranking the synsets of Hindi WordNet. A ranking tool is developed where human experts can see the frequency of the word senses in the sense-tagged corpora and have been asked to rank the senses of a word by using this information and also his/her intuition. The accuracy of WFS baseline is tested on several standard datasets. F-score is found to be 60{%}, 65{%} and 55{%} on Health, Tourism and News datasets respectively. The created rankings can also be used in other NLP applications viz., Machine Translation, Information Retrieval, Text Summarization, etc.
Tasks Information Retrieval, Machine Translation, Text Summarization, Word Sense Disambiguation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1485/
PDF https://www.aclweb.org/anthology/L16-1485
PWC https://paperswithcode.com/paper/synset-ranking-of-hindi-wordnet
Repo
Framework

Proceedings of the 5th Workshop on Automated Knowledge Base Construction

Title Proceedings of the 5th Workshop on Automated Knowledge Base Construction
Authors
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-1300/
PDF https://www.aclweb.org/anthology/W16-1300
PWC https://paperswithcode.com/paper/proceedings-of-the-5th-workshop-on-automated
Repo
Framework

Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context

Title Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context
Authors F{'e}licien Vallet, Jim Uro, J{'e}r{'e}my Andriamakaoly, Hakim Nabi, Mathieu Derval, Jean Carrive
Abstract With the increasing amount of audiovisual and digital data deriving from televisual and radiophonic sources, professional archives such as INA, France{'}s national audiovisual institute, acknowledge a growing need for efficient indexing tools. In this paper, we describe the Speech Trax system that aims at analyzing the audio content of TV and radio documents. In particular, we focus on the speaker tracking task that is very valuable for indexing purposes. First, we detail the overall architecture of the system and show the results obtained on a large-scale experiment, the largest to our knowledge for this type of content (about 1,300 speakers). Then, we present the Speech Trax demonstrator that gathers the results of various automatic speech processing techniques on top of our speaker tracking system (speaker diarization, speech transcription, etc.). Finally, we provide insight on the obtained performances and suggest hints for future improvements.
Tasks Speaker Diarization
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1318/
PDF https://www.aclweb.org/anthology/L16-1318
PWC https://paperswithcode.com/paper/speech-trax-a-bottom-to-the-top-approach-for
Repo
Framework

Antecedent Selection for Sluicing: Structure and Content

Title Antecedent Selection for Sluicing: Structure and Content
Authors Pranav Anand, Daniel Hardt
Abstract
Tasks Question Answering
Published 2016-11-01
URL https://www.aclweb.org/anthology/papers/D16-1131/d16-1131
PDF https://www.aclweb.org/anthology/D16-1131
PWC https://paperswithcode.com/paper/antecedent-selection-for-sluicing-structure
Repo
Framework

Wikipedia Titles As Noun Tag Predictors

Title Wikipedia Titles As Noun Tag Predictors
Authors Armin Hoenen
Abstract In this paper, we investigate a covert labeling cue, namely the probability that a title (by example of the Wikipedia titles) is a noun. If this probability is very large, any list such as or comparable to the Wikipedia titles can be used as a reliable word-class (or part-of-speech tag) predictor or noun lexicon. This may be especially useful in the case of Low Resource Languages (LRL) where labeled data is lacking and putatively for Natural Language Processing (NLP) tasks such as Word Sense Disambiguation, Sentiment Analysis and Machine Translation. Profitting from the ease of digital publication on the web as opposed to print, LRL speaker communities produce resources such as Wikipedia and Wiktionary, which can be used for an assessment. We provide statistical evidence for a strong noun bias for the Wikipedia titles from 2 corpora (English, Persian) and a dictionary (Japanese) and for a typologically balanced set of 17 languages including LRLs. Additionally, we conduct a small experiment on predicting noun tags for out-of-vocabulary items in part-of-speech tagging for English.
Tasks Machine Translation, Part-Of-Speech Tagging, Sentiment Analysis, Word Sense Disambiguation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1335/
PDF https://www.aclweb.org/anthology/L16-1335
PWC https://paperswithcode.com/paper/wikipedia-titles-as-noun-tag-predictors
Repo
Framework

TALN at SemEval-2016 Task 11: Modelling Complex Words by Contextual, Lexical and Semantic Features

Title TALN at SemEval-2016 Task 11: Modelling Complex Words by Contextual, Lexical and Semantic Features
Authors Francesco Ronzano, Ahmed Abura{'}ed, Luis Espinosa-Anke, Horacio Saggion
Abstract
Tasks Complex Word Identification, Lexical Simplification
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1157/
PDF https://www.aclweb.org/anthology/S16-1157
PWC https://paperswithcode.com/paper/taln-at-semeval-2016-task-11-modelling
Repo
Framework

MacSaar at SemEval-2016 Task 11: Zipfian and Character Features for ComplexWord Identification

Title MacSaar at SemEval-2016 Task 11: Zipfian and Character Features for ComplexWord Identification
Authors Marcos Zampieri, Liling Tan, Josef van Genabith
Abstract
Tasks Complex Word Identification, Lexical Simplification, Text Simplification
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1155/
PDF https://www.aclweb.org/anthology/S16-1155
PWC https://paperswithcode.com/paper/macsaar-at-semeval-2016-task-11-zipfian-and
Repo
Framework

Don’t Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation

Title Don’t Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation
Authors Josiah Wang, Robert Gaizauskas
Abstract
Tasks Image Retrieval, Learning-To-Rank, Text Generation
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-6631/
PDF https://www.aclweb.org/anthology/W16-6631
PWC https://paperswithcode.com/paper/dont-mention-the-shoe-a-learning-to-rank
Repo
Framework
comments powered by Disqus