Articles About Natural Language Processing

Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency

August 30, 2021 By: Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer Abstract We propose a dynamic encoder transducer (DET) for on-device speech recognition. One DET model scales to multiple devices with different computation capacities without retraining or fine-tuning. To trading off accuracy and latency, DET assigns different encoders to decode different parts of an utterance. We apply and compare the layer dropout […]

Read more

Contrast and Classify: Training Robust VQA Models

Abstract Recent Visual Question Answering (VQA) models have shown impressive performance on the VQA benchmark but remain sensitive to small linguistic variations in input questions. Existing approaches address this by augmenting the dataset with question paraphrases from visual question generation models or adversarial perturbations. These approaches use the combined data to learn an answer classifier by minimizing the standard cross-entropy loss. To more effectively leverage augmented data, we build on the recent success in contrastive learning. We propose a novel […]

Read more

Hugging Face – 🤗 NEWSLETTER ISSUE 11 – SUMMER EDITION 😎 – Sep 27th 2021

News Hi there 🤗 Long time no see. Summer is now officially over and these last few months have been quite busy at Hugging Face 😎. In this special edition we share some of our most exciting news! We hope you enjoy it! Spaces Beta Spaces is a simple and free solution to host Machine Learning demo applications using two awesome Python libraries: Gradio and Streamlit.    

Read more

The NLP Cypher | 10.03.21

RAFT is a few-shot classification benchmark that tests language models: – across multiple domains (lit reviews, medical data, tweets, customer interaction, etc.) – on economically valuable classification tasks (someone inherently cares about the task) – with evaluation that mirrors deployment (50 labeled examples per task, info retrieval allowed, hidden test set)  

Read more

Speech Resynthesis from Discrete Disentangled Self-Supervised Representations

Abstract We propose using self-supervised discrete representations for the task of speech resynthesis. To generate disentangled representation, we separately extract low-bitrate representations for speech content, prosodic information, and speaker identity. This allows to synthesize speech in a controllable manner. We analyze various state-of-the-art, self-supervised representation learning methods and shed light on the advantages of each method while considering reconstruction quality and disentanglement properties. Specifically, we evaluate the F0 reconstruction, speaker identification performance (for both resynthesis and voice conversion), recordings’ intelligibility, […]

Read more

Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications

November 7, 2021 By: Shuo Sun, Ahmed El-Kishky, Vishrav Chaudhary, James Cross, Francisco Guzmán, Lucia Specia Abstract Sentence-level Quality Estimation (QE) of machine translation is traditionally formulated as a regression task, and the performance of QE models is typically measured by Pearson correlation with human labels. Recent QE models have achieved previously-unseen levels of correlation with human judgments, but they rely on large multilingual contextualized language models that are computationally expensive and thus infeasible for many real-world applications. In this […]

Read more

Generalising to German Plural Noun Classes, from the Perspective of a Recurrent Neural Network

Abstract Inflectional morphology has since long been a useful testing ground for broader questions about generalization in language and the viability of neural network models as cognitive models of language. Here, in line with that tradition, we explore how recurrent neural networks acquire the complex German plural system and reflect upon how their strategy compares to human generalization and rule-based models of this system. We perform analyses including behavior experiments, diagnostic classification, representation analysis and causal interventions, suggesting that the […]

Read more

Cross-Policy Compliance Detection via Question Answering

Abstract Policy compliance detection is the task of ensuring that a scenario conforms to a policy (e.g. a claim is valid according to government rules or a post in an online platform conforms to community guidelines). This task has been previously instantiated as a form of textual entailment, which results in poor accuracy due to the complexity of the policies. In this paper we propose to address policy compliance detection via decomposing it into question answering, where questions check whether […]

Read more

Evaluation Paradigms in Question Answering

Abstract Question answering (QA) primarily descends from two branches of research: (1) Alan Turing’s investigation of machine intelligence at Manchester University and (2) Cyril Cleverdon’s comparison of library card catalog indices at Cranfield University. This position paper names and distinguishes these paradigms. Despite substantial overlap, subtle but significant distinctions exert an outsize influence on research. While one evaluation paradigm values creating more intelligent QA systems, the other paradigm values building QA systems that appeal to users. By better understanding the […]

Read more
1 4 5 6 7 8 71