May 4, 2019

1704 words 8 mins read

Paper Group NANR 178

Paper Group NANR 178

When Hyperparameters Help: Beneficial Parameter Combinations in Distributional Semantic Models. Using TEI for textbook research. DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents. Morphological reinflection with conditional random fields and unsupervised features. Deconvolving Feedback Loops in Recommender …

When Hyperparameters Help: Beneficial Parameter Combinations in Distributional Semantic Models

Title When Hyperparameters Help: Beneficial Parameter Combinations in Distributional Semantic Models
Authors Alicia Krebs, Denis Paperno
Abstract
Tasks Hyperparameter Optimization, Word Embeddings
Published 2016-08-01
URL https://www.aclweb.org/anthology/papers/S16-2011/s16-2011
PDF https://www.aclweb.org/anthology/S16-2011v2
PWC https://paperswithcode.com/paper/when-hyperparameters-help-beneficial
Repo
Framework

Using TEI for textbook research

Title Using TEI for textbook research
Authors Lena-Luise Stahn, Steffen Hennicke, Ernesto William De Luca
Abstract The following paper describes the first steps in the development of an ontology for the textbook research discipline. The aim of the project WorldViews is to establish a digital edition focussing on views of the world depicted in textbooks. For this purpose an initial TEI profile has been formalised and tested as a use case to enable the semantical encoding of the resource {`}textbook{'}. This profile shall provide a basic data model describing major facets of the textbook{'}s structure relevant to historians. |
Tasks Information Retrieval
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4024/
PDF https://www.aclweb.org/anthology/W16-4024
PWC https://paperswithcode.com/paper/using-tei-for-textbook-research
Repo
Framework

DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents

Title DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents
Authors Zhao Yan, Nan Duan, Junwei Bao, Peng Chen, Ming Zhou, Zhoujun Li, Jianshe Zhou
Abstract
Tasks Chatbot, Community Question Answering, Information Retrieval, Learning-To-Rank, Machine Translation, Question Answering, Short-Text Conversation, Text Generation
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1049/
PDF https://www.aclweb.org/anthology/P16-1049
PWC https://paperswithcode.com/paper/docchat-an-information-retrieval-approach-for
Repo
Framework

Morphological reinflection with conditional random fields and unsupervised features

Title Morphological reinflection with conditional random fields and unsupervised features
Authors Ling Liu, Lingshuang Jack Mao
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2006/
PDF https://www.aclweb.org/anthology/W16-2006
PWC https://paperswithcode.com/paper/morphological-reinflection-with-conditional
Repo
Framework

Deconvolving Feedback Loops in Recommender Systems

Title Deconvolving Feedback Loops in Recommender Systems
Authors Ayan Sinha, David F. Gleich, Karthik Ramani
Abstract Collaborative filtering is a popular technique to infer users’ preferences on new content based on the collective information of all users preferences. Recommender systems then use this information to make personalized suggestions to users. When users accept these recommendations it creates a feedback loop in the recommender system, and these loops iteratively influence the collaborative filtering algorithm’s predictions over time. We investigate whether it is possible to identify items affected by these feedback loops. We state sufficient assumptions to deconvolve the feedback loops while keeping the inverse solution tractable. We furthermore develop a metric to unravel the recommender system’s influence on the entire user-item rating matrix. We use this metric on synthetic and real-world datasets to (1) identify the extent to which the recommender system affects the final rating matrix, (2) rank frequently recommended items, and (3) distinguish whether a user’s rated item was recommended or an intrinsic preference. Our results indicate that it is possible to recover the ratings matrix of intrinsic user preferences using a single snapshot of the ratings matrix without any temporal information.
Tasks Recommendation Systems
Published 2016-12-01
URL http://papers.nips.cc/paper/6283-deconvolving-feedback-loops-in-recommender-systems
PDF http://papers.nips.cc/paper/6283-deconvolving-feedback-loops-in-recommender-systems.pdf
PWC https://paperswithcode.com/paper/deconvolving-feedback-loops-in-recommender
Repo
Framework

Automatic Translation of English Text to Indian Sign Language Synthetic Animations

Title Automatic Translation of English Text to Indian Sign Language Synthetic Animations
Authors Lalit Goyal, Vishal Goyal
Abstract
Tasks Lemmatization
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-6319/
PDF https://www.aclweb.org/anthology/W16-6319
PWC https://paperswithcode.com/paper/automatic-translation-of-english-text-to
Repo
Framework

Event Nugget and Event Coreference Annotation

Title Event Nugget and Event Coreference Annotation
Authors Zhiyi Song, Ann Bies, Stephanie Strassel, Joe Ellis, Teruko Mitamura, Hoa Trang Dang, Yukari Yamakawa, Sue Holm
Abstract
Tasks Knowledge Base Population
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-1005/
PDF https://www.aclweb.org/anthology/W16-1005
PWC https://paperswithcode.com/paper/event-nugget-and-event-coreference-annotation
Repo
Framework

UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing

Title UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing
Authors Milan Straka, Jan Haji{\v{c}}, Jana Strakov{'a}
Abstract Automatic natural language processing of large texts often presents recurring challenges in multiple languages: even for most advanced tasks, the texts are first processed by basic processing steps {–} from tokenization to parsing. We present an extremely simple-to-use tool consisting of one binary and one model (per language), which performs these tasks for multiple languages without the need for any other external data. UDPipe, a pipeline processing CoNLL-U-formatted files, performs tokenization, morphological analysis, part-of-speech tagging, lemmatization and dependency parsing for nearly all treebanks of Universal Dependencies 1.2 (namely, the whole pipeline is currently available for 32 out of 37 treebanks). In addition, the pipeline is easily trainable with training data in CoNLL-U format (and in some cases also with additional raw corpora) and requires minimal linguistic knowledge on the users{'} part. The training code is also released.
Tasks Dependency Parsing, Lemmatization, Morphological Analysis, Part-Of-Speech Tagging, Tokenization
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1680/
PDF https://www.aclweb.org/anthology/L16-1680
PWC https://paperswithcode.com/paper/udpipe-trainable-pipeline-for-processing
Repo
Framework

The George Washington University System for the Code-Switching Workshop Shared Task 2016

Title The George Washington University System for the Code-Switching Workshop Shared Task 2016
Authors Mohamed Al-Badrashiny, Mona Diab
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/W16-5813/
PDF https://www.aclweb.org/anthology/W16-5813
PWC https://paperswithcode.com/paper/the-george-washington-university-system-for
Repo
Framework

DeQue: A Lexicon of Complex Prepositions and Conjunctions in French

Title DeQue: A Lexicon of Complex Prepositions and Conjunctions in French
Authors Carlos Ramisch, Alexis Nasr, Andr{'e} Valli, Jos{'e} Deulofeu
Abstract We introduce DeQue, a lexicon covering French complex prepositions (CPRE) like {}{\`a} partir de{''} (from) and complex conjunctions (CCONJ) like {}bien que{''} (although). The lexicon includes fine-grained linguistic description based on empirical evidence. We describe the general characteristics of CPRE and CCONJ in French, with special focus on syntactic ambiguity. Then, we list the selection criteria used to build the lexicon and the corpus-based methodology employed to collect entries. Finally, we quantify the ambiguity of each construction by annotating around 100 sentences randomly taken from the FRWaC. In addition to its theoretical value, the resource has many potential practical applications. We intend to employ DeQue for treebank annotation and to train a dependency parser that can takes complex constructions into account.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1363/
PDF https://www.aclweb.org/anthology/L16-1363
PWC https://paperswithcode.com/paper/deque-a-lexicon-of-complex-prepositions-and
Repo
Framework

IMS HotCoref DE: A Data-driven Co-reference Resolver for German

Title IMS HotCoref DE: A Data-driven Co-reference Resolver for German
Authors Ina Roesiger, Jonas Kuhn
Abstract This paper presents a data-driven co-reference resolution system for German that has been adapted from IMS HotCoref, a co-reference resolver for English. It describes the difficulties when resolving co-reference in German text, the adaptation process and the features designed to address linguistic challenges brought forth by German. We report performance on the reference dataset T{"u}Ba-D/Z and include a post-task SemEval 2010 evaluation, showing that the resolver achieves state-of-the-art performance. We also include ablation experiments that indicate that integrating linguistic features increases results. The paper also describes the steps and the format necessary to use the resolver on new texts. The tool is freely available for download.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1024/
PDF https://www.aclweb.org/anthology/L16-1024
PWC https://paperswithcode.com/paper/ims-hotcoref-de-a-data-driven-co-reference
Repo
Framework

Language Resource Citation: the ISLRN Dissemination and Further Developments

Title Language Resource Citation: the ISLRN Dissemination and Further Developments
Authors Val{'e}rie Mapelli, Vladimir Popescu, Lin Liu, Khalid Choukri
Abstract This article presents the latest dissemination activities and technical developments that were carried out for the International Standard Language Resource Number (ISLRN) service. It also recalls the main principle and submission process for providers to obtain their 13-digit ISLRN identifier. Up to March 2016, 2100 Language Resources were allocated an ISLRN number, not only ELRA{'}s and LDC{'}s catalogued Language Resources, but also the ones from other important organisations like the Joint Research Centre (JRC) and the Resource Management Agency (RMA) who expressed their strong support to this initiative.In the research field, not only assigning a unique identification number is important, but also referring to a Language Resource as an object \textit{per se} (like publications) has now become an obvious requirement. The ISLRN could also become an important parameter to be considered to compute a Language Resource Impact Factor (LRIF) in order to recognize the merits of the producers of Language Resources. Integrating the ISLRN number into a LR-oriented bibliographical reference is thus part of the objective. The idea is to make use of a BibTeX entry that would take into account Language Resources items, including ISLRN.The ISLRN being a requested field within the LREC 2016 submission, we expect that several other LRs will be allocated an ISLRN number by the conference date. With this expansion, this number aims to be a spreadly-used LR citation instrument within works referring to LRs.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1254/
PDF https://www.aclweb.org/anthology/L16-1254
PWC https://paperswithcode.com/paper/language-resource-citation-the-islrn
Repo
Framework

Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene

Title Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene
Authors Izaskun Etxeberria, I{~n}aki Alegria, Larraitz Uria, Mans Hulden
Abstract This paper presents a method for the normalization of historical texts using a combination of weighted finite-state transducers and language models. We have extended our previous work on the normalization of dialectal texts and tested the method against a 17th century literary work in Basque. This preprocessed corpus is made available in the LREC repository. The performance of this method for learning relations between historical and contemporary word forms is evaluated against resources in three languages. The method we present learns to map phonological changes using a noisy channel model. The model is based on techniques commonly used for phonological inference and producing Grapheme-to-Grapheme conversion systems encoded as weighted transducers and produces F-scores above 80{%} in the task for Basque. A wider evaluation shows that the approach performs equally well with all the languages in our evaluation suite: Basque, Spanish and Slovene. A comparison against other methods that address the same task is also provided.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1169/
PDF https://www.aclweb.org/anthology/L16-1169
PWC https://paperswithcode.com/paper/evaluating-the-noisy-channel-model-for-the
Repo
Framework

Continuous Profile Models in ASL Syntactic Facial Expression Synthesis

Title Continuous Profile Models in ASL Syntactic Facial Expression Synthesis
Authors Hernisa Kacorri, Matt Huenerfauth
Abstract
Tasks Machine Translation
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1196/
PDF https://www.aclweb.org/anthology/P16-1196
PWC https://paperswithcode.com/paper/continuous-profile-models-in-asl-syntactic
Repo
Framework

A Corpus and Semantic Parser for Multilingual Natural Language Querying of OpenStreetMap

Title A Corpus and Semantic Parser for Multilingual Natural Language Querying of OpenStreetMap
Authors Carolin Haas, Stefan Riezler
Abstract
Tasks Machine Translation, Semantic Parsing
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1088/
PDF https://www.aclweb.org/anthology/N16-1088
PWC https://paperswithcode.com/paper/a-corpus-and-semantic-parser-for-multilingual
Repo
Framework
comments powered by Disqus