Paper Group NANR 178
When Hyperparameters Help: Beneficial Parameter Combinations in Distributional Semantic Models
Title | When Hyperparameters Help: Beneficial Parameter Combinations in Distributional Semantic Models |
Authors | Alicia Krebs, Denis Paperno |
Abstract | |
Tasks | Hyperparameter Optimization, Word Embeddings |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/papers/S16-2011/s16-2011 |
https://www.aclweb.org/anthology/S16-2011v2 | |
PWC | https://paperswithcode.com/paper/when-hyperparameters-help-beneficial |
Repo | |
Framework | |
Using TEI for textbook research
Title | Using TEI for textbook research |
Authors | Lena-Luise Stahn, Steffen Hennicke, Ernesto William De Luca |
Abstract | This paper describes the first steps in the development of an ontology for the textbook research discipline. The aim of the WorldViews project is to establish a digital edition focussing on views of the world depicted in textbooks. For this purpose, an initial TEI profile has been formalised and tested as a use case to enable the semantic encoding of the resource 'textbook'. This profile shall provide a basic data model describing the major facets of the textbook's structure that are relevant to historians. | |
Tasks | Information Retrieval |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4024/ |
https://www.aclweb.org/anthology/W16-4024 | |
PWC | https://paperswithcode.com/paper/using-tei-for-textbook-research |
Repo | |
Framework | |
DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents
Title | DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents |
Authors | Zhao Yan, Nan Duan, Junwei Bao, Peng Chen, Ming Zhou, Zhoujun Li, Jianshe Zhou |
Abstract | |
Tasks | Chatbot, Community Question Answering, Information Retrieval, Learning-To-Rank, Machine Translation, Question Answering, Short-Text Conversation, Text Generation |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/P16-1049/ |
https://www.aclweb.org/anthology/P16-1049 | |
PWC | https://paperswithcode.com/paper/docchat-an-information-retrieval-approach-for |
Repo | |
Framework | |
Morphological reinflection with conditional random fields and unsupervised features
Title | Morphological reinflection with conditional random fields and unsupervised features |
Authors | Ling Liu, Lingshuang Jack Mao |
Abstract | |
Tasks | |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2006/ |
https://www.aclweb.org/anthology/W16-2006 | |
PWC | https://paperswithcode.com/paper/morphological-reinflection-with-conditional |
Repo | |
Framework | |
Deconvolving Feedback Loops in Recommender Systems
Title | Deconvolving Feedback Loops in Recommender Systems |
Authors | Ayan Sinha, David F. Gleich, Karthik Ramani |
Abstract | Collaborative filtering is a popular technique to infer users’ preferences on new content based on the collective information of all users’ preferences. Recommender systems then use this information to make personalized suggestions to users. When users accept these recommendations, a feedback loop is created in the recommender system, and these loops iteratively influence the collaborative filtering algorithm’s predictions over time. We investigate whether it is possible to identify items affected by these feedback loops. We state sufficient assumptions to deconvolve the feedback loops while keeping the inverse solution tractable. We furthermore develop a metric to unravel the recommender system’s influence on the entire user-item rating matrix. We use this metric on synthetic and real-world datasets to (1) identify the extent to which the recommender system affects the final rating matrix, (2) rank frequently recommended items, and (3) distinguish whether a user’s rated item was recommended or an intrinsic preference. Our results indicate that it is possible to recover the ratings matrix of intrinsic user preferences using a single snapshot of the ratings matrix without any temporal information. |
Tasks | Recommendation Systems |
Published | 2016-12-01 |
URL | http://papers.nips.cc/paper/6283-deconvolving-feedback-loops-in-recommender-systems |
http://papers.nips.cc/paper/6283-deconvolving-feedback-loops-in-recommender-systems.pdf | |
PWC | https://paperswithcode.com/paper/deconvolving-feedback-loops-in-recommender |
Repo | |
Framework | |
Automatic Translation of English Text to Indian Sign Language Synthetic Animations
Title | Automatic Translation of English Text to Indian Sign Language Synthetic Animations |
Authors | Lalit Goyal, Vishal Goyal |
Abstract | |
Tasks | Lemmatization |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-6319/ |
https://www.aclweb.org/anthology/W16-6319 | |
PWC | https://paperswithcode.com/paper/automatic-translation-of-english-text-to |
Repo | |
Framework | |
Event Nugget and Event Coreference Annotation
Title | Event Nugget and Event Coreference Annotation |
Authors | Zhiyi Song, Ann Bies, Stephanie Strassel, Joe Ellis, Teruko Mitamura, Hoa Trang Dang, Yukari Yamakawa, Sue Holm |
Abstract | |
Tasks | Knowledge Base Population |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-1005/ |
https://www.aclweb.org/anthology/W16-1005 | |
PWC | https://paperswithcode.com/paper/event-nugget-and-event-coreference-annotation |
Repo | |
Framework | |
UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing
Title | UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing |
Authors | Milan Straka, Jan Hajič, Jana Straková |
Abstract | Automatic natural language processing of large texts often presents recurring challenges in multiple languages: even for the most advanced tasks, the texts are first processed by basic processing steps, from tokenization to parsing. We present an extremely simple-to-use tool consisting of one binary and one model (per language), which performs these tasks for multiple languages without the need for any other external data. UDPipe, a pipeline processing CoNLL-U-formatted files, performs tokenization, morphological analysis, part-of-speech tagging, lemmatization and dependency parsing for nearly all treebanks of Universal Dependencies 1.2 (namely, the whole pipeline is currently available for 32 out of 37 treebanks). In addition, the pipeline is easily trainable with training data in CoNLL-U format (and in some cases also with additional raw corpora) and requires minimal linguistic knowledge on the users' part. The training code is also released. |
Tasks | Dependency Parsing, Lemmatization, Morphological Analysis, Part-Of-Speech Tagging, Tokenization |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1680/ |
https://www.aclweb.org/anthology/L16-1680 | |
PWC | https://paperswithcode.com/paper/udpipe-trainable-pipeline-for-processing |
Repo | |
Framework | |
The George Washington University System for the Code-Switching Workshop Shared Task 2016
Title | The George Washington University System for the Code-Switching Workshop Shared Task 2016 |
Authors | Mohamed Al-Badrashiny, Mona Diab |
Abstract | |
Tasks | |
Published | 2016-11-01 |
URL | https://www.aclweb.org/anthology/W16-5813/ |
https://www.aclweb.org/anthology/W16-5813 | |
PWC | https://paperswithcode.com/paper/the-george-washington-university-system-for |
Repo | |
Framework | |
DeQue: A Lexicon of Complex Prepositions and Conjunctions in French
Title | DeQue: A Lexicon of Complex Prepositions and Conjunctions in French |
Authors | Carlos Ramisch, Alexis Nasr, André Valli, José Deulofeu |
Abstract | We introduce DeQue, a lexicon covering French complex prepositions (CPRE) like "à partir de" (from) and complex conjunctions (CCONJ) like "bien que" (although). The lexicon includes fine-grained linguistic description based on empirical evidence. We describe the general characteristics of CPRE and CCONJ in French, with special focus on syntactic ambiguity. Then, we list the selection criteria used to build the lexicon and the corpus-based methodology employed to collect entries. Finally, we quantify the ambiguity of each construction by annotating around 100 sentences randomly taken from the FRWaC. In addition to its theoretical value, the resource has many potential practical applications. We intend to employ DeQue for treebank annotation and to train a dependency parser that can take complex constructions into account. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1363/ |
https://www.aclweb.org/anthology/L16-1363 | |
PWC | https://paperswithcode.com/paper/deque-a-lexicon-of-complex-prepositions-and |
Repo | |
Framework | |
IMS HotCoref DE: A Data-driven Co-reference Resolver for German
Title | IMS HotCoref DE: A Data-driven Co-reference Resolver for German |
Authors | Ina Roesiger, Jonas Kuhn |
Abstract | This paper presents a data-driven co-reference resolution system for German that has been adapted from IMS HotCoref, a co-reference resolver for English. It describes the difficulties of resolving co-reference in German text, the adaptation process and the features designed to address the linguistic challenges posed by German. We report performance on the reference dataset TüBa-D/Z and include a post-task SemEval 2010 evaluation, showing that the resolver achieves state-of-the-art performance. We also include ablation experiments that indicate that integrating linguistic features improves results. The paper also describes the steps and the format necessary to use the resolver on new texts. The tool is freely available for download. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1024/ |
https://www.aclweb.org/anthology/L16-1024 | |
PWC | https://paperswithcode.com/paper/ims-hotcoref-de-a-data-driven-co-reference |
Repo | |
Framework | |
Language Resource Citation: the ISLRN Dissemination and Further Developments
Title | Language Resource Citation: the ISLRN Dissemination and Further Developments |
Authors | Valérie Mapelli, Vladimir Popescu, Lin Liu, Khalid Choukri |
Abstract | This article presents the latest dissemination activities and technical developments that were carried out for the International Standard Language Resource Number (ISLRN) service. It also recalls the main principle and submission process for providers to obtain their 13-digit ISLRN identifier. Up to March 2016, 2100 Language Resources were allocated an ISLRN number, not only ELRA's and LDC's catalogued Language Resources, but also the ones from other important organisations like the Joint Research Centre (JRC) and the Resource Management Agency (RMA), who expressed their strong support for this initiative. In the research field, not only is assigning a unique identification number important, but referring to a Language Resource as an object per se (like publications) has now become an obvious requirement. The ISLRN could also become an important parameter to be considered when computing a Language Resource Impact Factor (LRIF) in order to recognize the merits of the producers of Language Resources. Integrating the ISLRN number into an LR-oriented bibliographical reference is thus part of the objective. The idea is to make use of a BibTeX entry that would take into account Language Resource items, including the ISLRN. The ISLRN being a requested field within the LREC 2016 submission, we expect that several other LRs will be allocated an ISLRN number by the conference date. With this expansion, this number aims to be a widely used LR citation instrument within works referring to LRs. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1254/ |
https://www.aclweb.org/anthology/L16-1254 | |
PWC | https://paperswithcode.com/paper/language-resource-citation-the-islrn |
Repo | |
Framework | |
Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene
Title | Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene |
Authors | Izaskun Etxeberria, Iñaki Alegria, Larraitz Uria, Mans Hulden |
Abstract | This paper presents a method for the normalization of historical texts using a combination of weighted finite-state transducers and language models. We have extended our previous work on the normalization of dialectal texts and tested the method against a 17th century literary work in Basque. This preprocessed corpus is made available in the LREC repository. The performance of this method for learning relations between historical and contemporary word forms is evaluated against resources in three languages. The method we present learns to map phonological changes using a noisy channel model. The model is based on techniques commonly used for phonological inference and for producing Grapheme-to-Grapheme conversion systems encoded as weighted transducers, and it produces F-scores above 80% in the task for Basque. A wider evaluation shows that the approach performs equally well with all the languages in our evaluation suite: Basque, Spanish and Slovene. A comparison against other methods that address the same task is also provided. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1169/ |
https://www.aclweb.org/anthology/L16-1169 | |
PWC | https://paperswithcode.com/paper/evaluating-the-noisy-channel-model-for-the |
Repo | |
Framework | |
Continuous Profile Models in ASL Syntactic Facial Expression Synthesis
Title | Continuous Profile Models in ASL Syntactic Facial Expression Synthesis |
Authors | Hernisa Kacorri, Matt Huenerfauth |
Abstract | |
Tasks | Machine Translation |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/P16-1196/ |
https://www.aclweb.org/anthology/P16-1196 | |
PWC | https://paperswithcode.com/paper/continuous-profile-models-in-asl-syntactic |
Repo | |
Framework | |
A Corpus and Semantic Parser for Multilingual Natural Language Querying of OpenStreetMap
Title | A Corpus and Semantic Parser for Multilingual Natural Language Querying of OpenStreetMap |
Authors | Carolin Haas, Stefan Riezler |
Abstract | |
Tasks | Machine Translation, Semantic Parsing |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/N16-1088/ |
https://www.aclweb.org/anthology/N16-1088 | |
PWC | https://paperswithcode.com/paper/a-corpus-and-semantic-parser-for-multilingual |
Repo | |
Framework | |