Skip to main content

Showing 1–23 of 23 results for author: Raunak, V

  1. arXiv:2310.15987  [pdf, other

    cs.CL cs.AI

    Dissecting In-Context Learning of Translations in GPTs

    Authors: Vikas Raunak, Hany Hassan Awadalla, Arul Menezes

    Abstract: Most of the recent work in leveraging Large Language Models (LLMs) such as GPT-3 for Machine Translation (MT) has focused on selecting the few-shot samples for prompting. In this work, we try to better understand the role of demonstration attributes for the in-context learning of translations through perturbations of high-quality, in-domain demonstrations. We find that asymmetric perturbation of t… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: EMNLP Findings (+ Minor Updates over Camera-Ready)

  2. arXiv:2309.08832  [pdf, other

    cs.CL cs.AI

    SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window

    Authors: Vikas Raunak, Tom Kocmi, Matt Post

    Abstract: Reference-based metrics that operate at the sentence-level typically outperform quality estimation metrics, which have access only to the source and system output. This is unsurprising, since references resolve ambiguities that may be present in the source. In this paper, we investigate whether additional source context can effectively substitute for a reference. We present a metric named SLIDE (S… ▽ More

    Submitted 2 April, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: NAACL 2024

  3. arXiv:2305.16806  [pdf, other

    cs.CL cs.AI

    Do GPTs Produce Less Literal Translations?

    Authors: Vikas Raunak, Arul Menezes, Matt Post, Hany Hassan Awadalla

    Abstract: Large Language Models (LLMs) such as GPT-3 have emerged as general-purpose language models capable of addressing many natural language generation or understanding tasks. On the task of Machine Translation (MT), multiple works have investigated few-shot prompting mechanisms to elicit better translations from LLMs. However, there has been relatively little investigation on how such translations diff… ▽ More

    Submitted 5 June, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  4. arXiv:2305.14878  [pdf, other

    cs.CL cs.AI

    Leveraging GPT-4 for Automatic Translation Post-Editing

    Authors: Vikas Raunak, Amr Sharaf, Yiren Wang, Hany Hassan Awadallah, Arul Menezes

    Abstract: While Neural Machine Translation (NMT) represents the leading approach to Machine Translation (MT), the outputs of NMT models still require translation post-editing to rectify errors and enhance quality under critical settings. In this work, we formalize the task of direct translation post-editing with Large Language Models (LLMs) and explore the use of GPT-4 to automatically post-edit NMT outputs… ▽ More

    Submitted 23 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP Findings 2023

  5. arXiv:2302.09210  [pdf, other

    cs.CL

    How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation

    Authors: Amr Hendy, Mohamed Abdelrehim, Amr Sharaf, Vikas Raunak, Mohamed Gabr, Hitokazu Matsushita, Young Jin Kim, Mohamed Afify, Hany Hassan Awadalla

    Abstract: Generative Pre-trained Transformer (GPT) models have shown remarkable capabilities for natural language generation, but their performance for machine translation has not been thoroughly investigated. In this paper, we present a comprehensive evaluation of GPT models for machine translation, covering various aspects such as quality of different GPT models in comparison with state-of-the-art researc… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  6. arXiv:2212.00006  [pdf, other

    cs.HC cs.CL cs.CV cs.CY

    Operationalizing Specifications, In Addition to Test Sets for Evaluating Constrained Generative Models

    Authors: Vikas Raunak, Matt Post, Arul Menezes

    Abstract: In this work, we present some recommendations on the evaluation of state-of-the-art generative models for constrained generation tasks. The progress on generative models has been rapid in recent years. These large-scale models have had three impacts: firstly, the fluency of generation in both language and vision modalities has rendered common average-case evaluation metrics much less useful in dia… ▽ More

    Submitted 19 November, 2022; originally announced December 2022.

    Comments: NeurIPS 2022 Workshop on Human Evaluation of Generative Models

  7. arXiv:2211.13317  [pdf, other

    cs.CL cs.AI

    Rank-One Editing of Encoder-Decoder Models

    Authors: Vikas Raunak, Arul Menezes

    Abstract: Large sequence to sequence models for tasks such as Neural Machine Translation (NMT) are usually trained over hundreds of millions of samples. However, training is just the origin of a model's life-cycle. Real-world deployments of models require further behavioral adaptations as new requirements emerge or shortcomings become known. Typically, in the space of model behaviors, behavior deletion requ… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: The Second Workshop On Interactive Learning For Natural Language Processing (InterNLP 2022), NeurIPS 2022

  8. arXiv:2211.05100  [pdf, other

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop, :, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major , et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access… ▽ More

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  9. arXiv:2210.12929  [pdf, other

    cs.CL cs.AI cs.LG

    Finding Memo: Extractive Memorization in Constrained Sequence Generation Tasks

    Authors: Vikas Raunak, Arul Menezes

    Abstract: Memorization presents a challenge for several constrained Natural Language Generation (NLG) tasks such as Neural Machine Translation (NMT), wherein the proclivity of neural models to memorize noisy and atypical samples reacts adversely with the noisy (web crawled) datasets. However, previous studies of memorization in constrained NLG tasks have only focused on counterfactual memorization, linking… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: EMNLP Findings 2022

  10. arXiv:2206.11249  [pdf, other

    cs.CL cs.AI cs.LG

    GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

    Authors: Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter , et al. (52 additional authors not shown)

    Abstract: Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an… ▽ More

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  11. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  12. arXiv:2205.09988  [pdf, other

    cs.CL cs.AI

    SALTED: A Framework for SAlient Long-Tail Translation Error Detection

    Authors: Vikas Raunak, Matt Post, Arul Menezes

    Abstract: Traditional machine translation (MT) metrics provide an average measure of translation quality that is insensitive to the long tail of behavioral problems in MT. Examples include translation of numbers, physical units, dropped content and hallucinations. These errors, which occur rarely and unpredictably in Neural Machine Translation (NMT), greatly undermine the reliability of state-of-the-art MT… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

  13. arXiv:2112.02721  [pdf, other

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo , et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split… ▽ More

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  14. arXiv:2105.00573  [pdf, other

    cs.CL eess.AS

    Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks

    Authors: Siddharth Dalmia, Brian Yan, Vikas Raunak, Florian Metze, Shinji Watanabe

    Abstract: End-to-end approaches for sequence tasks are becoming increasingly popular. Yet for complex sequence tasks, like speech translation, systems that cascade several models trained on sub-tasks have shown to be superior, suggesting that the compositionality of cascaded systems simplifies learning and enables sophisticated search capabilities. In this work, we present an end-to-end framework that explo… ▽ More

    Submitted 2 May, 2021; originally announced May 2021.

    Comments: NAACL 2021. All code and models are released as part of the ESPnet toolkit: https://github.com/espnet/espnet

  15. arXiv:2104.06683  [pdf, other

    cs.CL cs.AI cs.LG

    The Curious Case of Hallucinations in Neural Machine Translation

    Authors: Vikas Raunak, Arul Menezes, Marcin Junczys-Dowmunt

    Abstract: In this work, we study hallucinations in Neural Machine Translation (NMT), which lie at an extreme end on the spectrum of NMT pathologies. Firstly, we connect the phenomenon of hallucinations under source perturbation to the Long-Tail theory of Feldman (2020), and present an empirically validated hypothesis that explains hallucinations under source perturbation. Secondly, we consider hallucination… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted to NAACL 2021

  16. arXiv:2102.01672  [pdf, other

    cs.CL cs.AI cs.LG

    The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

    Authors: Sebastian Gehrmann, Tosin Adewumi, Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Aremu Anuoluwapo, Antoine Bosselut, Khyathi Raghavi Chandu, Miruna Clinciu, Dipanjan Das, Kaustubh D. Dhole, Wanyu Du, Esin Durmus, Ondřej Dušek, Chris Emezue, Varun Gangal, Cristina Garbacea, Tatsunori Hashimoto, Yufang Hou, Yacine Jernite, Harsh Jhamtani, Yangfeng Ji, Shailza Jolly, Mihir Kale, Dhruv Kumar, Faisal Ladhak , et al. (31 additional authors not shown)

    Abstract: We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it… ▽ More

    Submitted 1 April, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

  17. arXiv:2010.04924  [pdf, other

    cs.CL cs.AI cs.LG

    On Long-Tailed Phenomena in Neural Machine Translation

    Authors: Vikas Raunak, Siddharth Dalmia, Vivek Gupta, Florian Metze

    Abstract: State-of-the-art Neural Machine Translation (NMT) models struggle with generating low-frequency tokens, tackling which remains a major challenge. The analysis of long-tailed phenomena in the context of structured prediction tasks is further hindered by the added complexities of search during inference. In this work, we quantitatively characterize such long-tailed phenomena at two levels of abstrac… ▽ More

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: Accepted to Findings of EMNLP 2020

  18. arXiv:2008.07688  [pdf, other

    cs.LG cs.AI cs.IR stat.ML

    Ranking Clarification Questions via Natural Language Inference

    Authors: Vaibhav Kumar, Vikas Raunak, Jamie Callan

    Abstract: Given a natural language query, teaching machines to ask clarifying questions is of immense utility in practical natural language processing systems. Such interactions could help in filling information gaps for better machine comprehension of the query. For the task of ranking clarification questions, we hypothesize that determining whether a clarification question pertains to a missing entry in a… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: Accepted at CIKM 2020

  19. arXiv:1911.01497  [pdf, ps, other

    cs.CL cs.LG

    On Compositionality in Neural Machine Translation

    Authors: Vikas Raunak, Vaibhav Kumar, Florian Metze

    Abstract: We investigate two specific manifestations of compositionality in Neural Machine Translation (NMT) : (1) Productivity - the ability of the model to extend its predictions beyond the observed length in training data and (2) Systematicity - the ability of the model to systematically recombine known parts and rules. We evaluate a standard Sequence to Sequence model on tests designed to assess these t… ▽ More

    Submitted 14 December, 2019; v1 submitted 4 November, 2019; originally announced November 2019.

    Comments: Accepted at Context and Compositionality Workshop, NeurIPS 2019

  20. arXiv:1910.02754  [pdf, other

    cs.CL cs.LG

    On Leveraging the Visual Modality for Neural Machine Translation

    Authors: Vikas Raunak, Sang Keun Choe, Quanyang Lu, Yi Xu, Florian Metze

    Abstract: Leveraging the visual modality effectively for Neural Machine Translation (NMT) remains an open problem in computational linguistics. Recently, Caglayan et al. posit that the observed gains are limited mainly due to the very simple, short, repetitive sentences of the Multi30k dataset (the only multimodal MT dataset available at the time), which renders the source text sufficient for context. In th… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: Accepted to INLG 2019

  21. arXiv:1910.02211  [pdf, other

    cs.CL cs.LG

    On Dimensional Linguistic Properties of the Word Embedding Space

    Authors: Vikas Raunak, Vaibhav Kumar, Vivek Gupta, Florian Metze

    Abstract: Word embeddings have become a staple of several natural language processing tasks, yet much remains to be understood about their properties. In this work, we analyze word embeddings in terms of their principal components and arrive at a number of novel and counterintuitive observations. In particular, we characterize the utility of variance explained by the principal components as a proxy for down… ▽ More

    Submitted 20 May, 2020; v1 submitted 5 October, 2019; originally announced October 2019.

    Comments: Published at ACL RepL4NLP 2020

  22. arXiv:1902.06833  [pdf, other

    cs.CL cs.SD eess.AS

    Learned In Speech Recognition: Contextual Acoustic Word Embeddings

    Authors: Shruti Palaskar, Vikas Raunak, Florian Metze

    Abstract: End-to-end acoustic-to-word speech recognition models have recently gained popularity because they are easy to train, scale well to large amounts of training data, and do not require a lexicon. In addition, word models may also be easier to integrate with downstream tasks such as spoken language understanding, because inference (search) is much simplified compared to phoneme, character or any othe… ▽ More

    Submitted 18 February, 2019; originally announced February 2019.

    Comments: Accepted at ICASSP 2019, 5 pages, 1 figure, 3 tables

  23. arXiv:1708.03629  [pdf, other

    cs.CL

    Simple and Effective Dimensionality Reduction for Word Embeddings

    Authors: Vikas Raunak

    Abstract: Word embeddings have become the basic building blocks for several natural language processing and information retrieval tasks. Pre-trained word embeddings are used in several downstream applications as well as for constructing representations for sentences, paragraphs and documents. Recently, there has been an emphasis on further improving the pre-trained word vectors through post-processing algor… ▽ More

    Submitted 21 November, 2017; v1 submitted 11 August, 2017; originally announced August 2017.

    Comments: Accepted at NIPS 2017 LLD Workshop