Skip to main content

Showing 1–16 of 16 results for author: Dhole, K D

  1. arXiv:2405.17658  [pdf, other

    cs.IR cs.CL

    Generative Query Reformulation Using Ensemble Prompting, Document Fusion, and Relevance Feedback

    Authors: Kaustubh D. Dhole, Ramraj Chandradevan, Eugene Agichtein

    Abstract: Query Reformulation (QR) is a set of techniques used to transform a user's original search query to a text that better aligns with the user's intent and improves their search experience. Recently, zero-shot QR has been a promising approach due to its ability to exploit knowledge inherent in large language models. Inspired by the success of ensemble prompting strategies which have benefited other t… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Extended Work of GenQREnsemble: Zero-Shot LLM Ensemble Prompting for Generative Query Reformulation, Dhole and Agichtein, ECIR 2024. arXiv admin note: text overlap with arXiv:2404.03746

    ACM Class: H.3.3; I.2.7

  2. arXiv:2404.02489  [pdf, other

    cs.IR cs.CL

    DUQGen: Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation

    Authors: Ramraj Chandradevan, Kaustubh D. Dhole, Eugene Agichtein

    Abstract: State-of-the-art neural rankers pre-trained on large task-specific training data such as MS-MARCO, have been shown to exhibit strong performance on various ranking tasks without domain adaptation, also called zero-shot. However, zero-shot neural ranking may be sub-optimal, as it does not take advantage of the target domain information. Unfortunately, acquiring sufficiently large and high quality t… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: NAACL 2024 Main Conference

  3. arXiv:2403.15667  [pdf, other

    cs.IR

    QueryExplorer: An Interactive Query Generation Assistant for Search and Exploration

    Authors: Kaustubh D. Dhole, Shivam Bajaj, Ramraj Chandradevan, Eugene Agichtein

    Abstract: Formulating effective search queries remains a challenging task, particularly when users lack expertise in a specific domain or are not proficient in the language of the content. Providing example documents of interest might be easier for a user. However, such query-by-example scenarios are prone to concept drift, and the retrieval effectiveness is highly sensitive to the query generation method,… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: NAACL 2024 Demonstration Track

  4. arXiv:2401.16454  [pdf, other

    cs.HC cs.AI cs.CL cs.IR

    KAUCUS: Knowledge Augmented User Simulators for Training Language Model Assistants

    Authors: Kaustubh D. Dhole

    Abstract: An effective multi-turn instruction-following assistant can be developed by creating a simulator that can generate useful interaction data. Apart from relying on its intrinsic weights, an ideal user simulator should also be able to bootstrap external knowledge rapidly in its raw form to simulate the multifarious diversity of text available over the internet. Previous user simulators generally lack… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Simulation of Conversational Intelligence in Chat, EACL 2024

    ACM Class: I.2.7; H.3.3

  5. arXiv:2311.11226  [pdf, other

    cs.AI cs.IR

    An Interactive Query Generation Assistant using LLM-based Prompt Modification and User Feedback

    Authors: Kaustubh D. Dhole, Ramraj Chandradevan, Eugene Agichtein

    Abstract: While search is the predominant method of accessing information, formulating effective queries remains a challenging task, especially for situations where the users are not familiar with a domain, or searching for documents in other languages, or looking for complex information such as events, which are not easily expressible as queries. Providing example documents or passages of interest, might b… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Intelligence Advanced Research Projects Activity (IARPA) BETTER Research Program

  6. arXiv:2212.09648  [pdf, other

    cs.CL cs.AI

    NusaCrowd: Open Source Initiative for Indonesian NLP Resources

    Authors: Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Fajri Koto, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Ivan Halim Parmonangan, Ika Alfina, Muhammad Satrio Wicaksono, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri , et al. (22 additional authors not shown)

    Abstract: We present NusaCrowd, a collaborative initiative to collect and unify existing resources for Indonesian languages, including opening access to previously non-public resources. Through this initiative, we have brought together 137 datasets and 118 standardized data loaders. The quality of the datasets has been assessed manually and automatically, and their value is demonstrated through multiple exp… ▽ More

    Submitted 21 July, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

  7. arXiv:2211.06740  [pdf, other

    cs.NI cs.CY cs.HC

    Lessons from Digital India for the Right to Internet Access

    Authors: Kaustubh D. Dhole

    Abstract: With only 65% of Indian houses having access to the Internet, digital India faces a significant Internet divide across gender and city types. Rendering essential services inaccessible to almost a third of the population necessitates not only provisioning a fundamental right to Internet access but taking specific constructive steps to assure its simple, affordable and safe accessibility. Establishi… ▽ More

    Submitted 12 November, 2022; originally announced November 2022.

    Comments: Situating Network Infrastructure with People, Practices, and Beyond, CSCW 2022

  8. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  9. arXiv:2206.02849  [pdf, other

    cs.LG cs.AI cs.CL

    A Bird's-Eye Tutorial of Graph Attention Architectures

    Authors: Kaustubh D. Dhole, Carl Yang

    Abstract: Graph Neural Networks (GNNs) have shown tremendous strides in performance for graph-structured problems especially in the domains of natural language processing, computer vision and recommender systems. Inspired by the success of the transformer architecture, there has been an ever-growing body of work on attention variants of GNNs attempting to advance the state of the art in many of these proble… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: 8 pages Tutorial

  10. arXiv:2112.02721  [pdf, other

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo , et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split… ▽ More

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  11. arXiv:2107.03884  [pdf, other

    cs.CL cs.AI

    CANDLE: Decomposing Conditional and Conjunctive Queries for Task-Oriented Dialogue Systems

    Authors: Aadesh Gupta, Kaustubh D. Dhole, Rahul Tarway, Swetha Prabhakar, Ashish Shrivastava

    Abstract: Domain-specific dialogue systems generally determine user intents by relying on sentence level classifiers that mainly focus on single action sentences. Such classifiers are not designed to effectively handle complex queries composed of conditional and sequential clauses that represent multiple actions. We attempt to decompose such queries into smaller single action subqueries that are reasonable… ▽ More

    Submitted 23 November, 2022; v1 submitted 8 July, 2021; originally announced July 2021.

  12. arXiv:2106.09069  [pdf, other

    cs.CL cs.LG

    Automatic Construction of Evaluation Suites for Natural Language Generation Datasets

    Authors: Simon Mille, Kaustubh D. Dhole, Saad Mahamood, Laura Perez-Beltrachini, Varun Gangal, Mihir Kale, Emiel van Miltenburg, Sebastian Gehrmann

    Abstract: Machine learning approaches applied to NLP are often evaluated by summarizing their performance in a single number, for example accuracy. Since most test sets are constructed as an i.i.d. sample from the overall data, this approach overly simplifies the complexity of language and encourages overfitting to the head of the data distribution. As such, rare language phenomena or text about underrepres… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  13. arXiv:2102.01672  [pdf, other

    cs.CL cs.AI cs.LG

    The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

    Authors: Sebastian Gehrmann, Tosin Adewumi, Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Aremu Anuoluwapo, Antoine Bosselut, Khyathi Raghavi Chandu, Miruna Clinciu, Dipanjan Das, Kaustubh D. Dhole, Wanyu Du, Esin Durmus, Ondřej Dušek, Chris Emezue, Varun Gangal, Cristina Garbacea, Tatsunori Hashimoto, Yufang Hou, Yacine Jernite, Harsh Jhamtani, Yangfeng Ji, Shailza Jolly, Mihir Kale, Dhruv Kumar, Faisal Ladhak , et al. (31 additional authors not shown)

    Abstract: We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it… ▽ More

    Submitted 1 April, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

  14. arXiv:2008.07559  [pdf, other

    cs.AI cs.CL cs.LG

    Resolving Intent Ambiguities by Retrieving Discriminative Clarifying Questions

    Authors: Kaustubh D. Dhole

    Abstract: Task oriented Dialogue Systems generally employ intent detection systems in order to map user queries to a set of pre-defined intents. However, user queries appearing in natural language can be easily ambiguous and hence such a direct mapping might not be straightforward harming intent detection and eventually the overall performance of a dialogue system. Moreover, acquiring domain-specific clarif… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

  15. arXiv:2006.00533  [pdf, other

    cs.CL cs.IR cs.LG

    Benchmarking BioRelEx for Entity Tagging and Relation Extraction

    Authors: Abhinav Bhatt, Kaustubh D. Dhole

    Abstract: Extracting relationships and interactions between different biological entities is still an extremely challenging problem but has not received much attention as much as extraction in other generic domains. In addition to the lack of annotated data, low benchmarking is still a major reason for slow progress. In order to fill this gap, we compare multiple existing entity and relation extraction mode… ▽ More

    Submitted 31 May, 2020; originally announced June 2020.

  16. arXiv:2004.08694  [pdf, other

    cs.CL cs.AI

    Syn-QG: Syntactic and Shallow Semantic Rules for Question Generation

    Authors: Kaustubh D. Dhole, Christopher D. Manning

    Abstract: Question Generation (QG) is fundamentally a simple syntactic transformation; however, many aspects of semantics influence what questions are good to form. We implement this observation by developing SynQG, a set of transparent syntactic rules leveraging universal dependencies, shallow semantic parsing, lexical resources, and custom rules which transform declarative sentences into question-answer p… ▽ More

    Submitted 28 November, 2022; v1 submitted 18 April, 2020; originally announced April 2020.

    Comments: Removed Table 5 of earlier version since row 1,4 couldn't be reproduced