Showing 1–6 of 6 results for author: Aarohi

  1. arXiv:2404.07304  [pdf, other]

    cs.CL

    We're Calling an Intervention: Exploring the Fundamental Hurdles in Adapting Language Models to Nonstandard Text

    Authors: Aarohi Srivastava, David Chiang

    Abstract: We present a suite of experiments that allow us to understand the underlying challenges of language model adaptation to nonstandard text. We do so by designing interventions that approximate several types of linguistic variation and their interactions with existing biases of language models. Applying our interventions during language model adaptation with varying size and nature of training data,…

    Submitted 15 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: Preprint

  2. arXiv:2403.11009  [pdf, other]

    cs.CL cs.AI

    DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

    Authors: Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos

    Abstract: Language technologies should be judged on their usefulness in real-world use cases. An often overlooked aspect in natural language processing (NLP) research and evaluation is language variation in the form of non-standard dialects or language varieties (hereafter, varieties). Most NLP benchmarks are limited to standard language varieties. To fill this gap, we propose DIALECTBENCH, the first-ever l…

    Submitted 7 July, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: Equal contribution: Fahim Faisal, Orevaoghene Ahia

  3. arXiv:2311.00116  [pdf, other]

    cs.CL

    BERTwich: Extending BERT's Capabilities to Model Dialectal and Noisy Text

    Authors: Aarohi Srivastava, David Chiang

    Abstract: Real-world NLP applications often deal with nonstandard text (e.g., dialectal, informal, or misspelled text). However, language models like BERT deteriorate in the face of dialect variation or noise. How do we push BERT's modeling capabilities to encompass nonstandard text? Fine-tuning helps, but it is designed for specializing a model to a task and does not seem to bring about the deeper, more pe…

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: Accepted for publication in Findings of the ACL: EMNLP 2023

  4. arXiv:2303.17683  [pdf, other]

    cs.CL cs.AI

    Fine-Tuning BERT with Character-Level Noise for Zero-Shot Transfer to Dialects and Closely-Related Languages

    Authors: Aarohi Srivastava, David Chiang

    Abstract: In this work, we induce character-level noise in various forms when fine-tuning BERT to enable zero-shot cross-lingual transfer to unseen dialects and languages. We fine-tune BERT on three sentence-level classification tasks and evaluate our approach on an assortment of unseen dialects and languages. We find that character-level noise can be an extremely effective agent of cross-lingual transfer u…

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted for publication at VarDial 2023

  5. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  6. Trend-Based Networking Driven by Big Data Telemetry for SDN and Traditional Networks

    Authors: Ankur Jain, Arohi Gupta, Ashutosh Gupta, Dewang Gedia, Leidy Pérez, Levi Perigo, Rahil Gandotra, Sanjay Murthy

    Abstract: Organizations face a challenge of accurately analyzing network data and providing automated action based on the observed trend. This trend-based analytics is beneficial to minimize the downtime and improve the performance of the network services, but organizations use different network management tools to understand and visualize the network traffic with limited abilities to dynamically optimize t…

    Submitted 23 April, 2019; originally announced April 2019.

    Journal ref: International Journal of Next-Generation Networks (IJNGN) Vol.11, No.1, March 2019