Showing 1–50 of 157 results for author: Fung, P

  1. arXiv:2407.03282  [pdf, other]

    cs.CL

    LLM Internal States Reveal Hallucination Risk Faced With a Query

    Authors: Ziwei Ji, Delong Chen, Etsuko Ishii, Samuel Cahyawijaya, Yejin Bang, Bryan Wilie, Pascale Fung

    Abstract: The hallucination problem of Large Language Models (LLMs) significantly limits their reliability and trustworthiness. Humans have a self-awareness process that allows us to recognize what we don't know when faced with queries. Inspired by this, our paper investigates whether LLMs can estimate their own hallucination risk before response generation. We analyze the internal mechanisms of LLMs broadl…

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.19764  [pdf, other]

    cs.CL

    Belief Revision: The Adaptability of Large Language Models Reasoning

    Authors: Bryan Wilie, Samuel Cahyawijaya, Etsuko Ishii, Junxian He, Pascale Fung

    Abstract: The capability to reason from text is crucial for real-world NLP applications. Real-world scenarios often involve incomplete or evolving data. In response, individuals update their beliefs and understandings accordingly. However, most existing evaluations assume that language models (LMs) operate with consistent information. We introduce Belief-R, a new dataset designed to test LMs' belief revisio…

    Submitted 28 June, 2024; originally announced June 2024.

  3. arXiv:2405.00485  [pdf, other]

    cs.CV

    The Pyramid of Captions

    Authors: Delong Chen, Samuel Cahyawijaya, Etsuko Ishii, Ho Shu Chan, Yejin Bang, Pascale Fung

    Abstract: We introduce a formal information-theoretic framework for image captioning by regarding it as a representation learning task. Our framework defines three key objectives: task sufficiency, minimal redundancy, and human interpretability. Building upon this foundation, we propose a novel Pyramid of Captions (PoCa) method, which constructs caption pyramids by generating localized captions for zoomed-i…

    Submitted 1 May, 2024; originally announced May 2024.

  4. arXiv:2404.07900  [pdf, other]

    cs.CL cs.AI

    High-Dimension Human Value Representation in Large Language Models

    Authors: Samuel Cahyawijaya, Delong Chen, Yejin Bang, Leila Khalatbari, Bryan Wilie, Ziwei Ji, Etsuko Ishii, Pascale Fung

    Abstract: The widespread application of Large Language Models (LLMs) across various tasks and fields has necessitated the alignment of these models with human values and preferences. Given various approaches of human value alignment, ranging from Reinforcement Learning with Human Feedback (RLHF), to constitutional learning, etc. there is an urgent need to understand the scope and nature of human values inje…

    Submitted 25 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  5. arXiv:2404.06138  [pdf, other]

    cs.CL

    Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages

    Authors: Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Rifki Afina Putri, Emmanuel Dave, Jhonson Lee, Nuur Shadieq, Wawan Cenggoro, Salsabil Maulana Akbar, Muhammad Ihza Mahendra, Dea Annisayanti Putri, Bryan Wilie, Genta Indra Winata, Alham Fikri Aji, Ayu Purwarianti, Pascale Fung

    Abstract: Large language models (LLMs) show remarkable human-like capability in various domains and languages. However, a notable quality gap arises in low-resource languages, e.g., Indonesian indigenous languages, rendering them ineffective and inefficient in such linguistic contexts. To bridge this quality gap, we introduce Cendol, a collection of Indonesian LLMs encompassing both decoder-only and encoder…

    Submitted 7 July, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Cendol models are released under Apache 2.0 license and will be made publicly available soon

  6. arXiv:2403.18932  [pdf, other]

    cs.CL cs.AI

    Measuring Political Bias in Large Language Models: What Is Said and How It Is Said

    Authors: Yejin Bang, Delong Chen, Nayeon Lee, Pascale Fung

    Abstract: We propose to measure political bias in LLMs by analyzing both the content and style of their generated content regarding political issues. Existing benchmarks and measures focus on gender and racial biases. However, political bias exists in LLMs and can lead to polarization and other harms in downstream applications. In order to provide transparency to users, we advocate that there should be fine…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 16 pages

  7. arXiv:2403.16512  [pdf, other]

    cs.CL cs.AI

    LLMs Are Few-Shot In-Context Low-Resource Language Learners

    Authors: Samuel Cahyawijaya, Holy Lovenia, Pascale Fung

    Abstract: In-context learning (ICL) empowers large language models (LLMs) to perform diverse tasks in underrepresented languages using only short in-context information, offering a crucial avenue for narrowing the gap between high-resource and low-resource languages. Nonetheless, there is only a handful of works explored ICL for low-resource languages with most of them focusing on relatively high-resource l…

    Submitted 25 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  8. arXiv:2402.14327  [pdf, other]

    cs.CV cs.CL

    Subobject-level Image Tokenization

    Authors: Delong Chen, Samuel Cahyawijaya, Jianfeng Liu, Baoyuan Wang, Pascale Fung

    Abstract: Transformer-based vision models typically tokenize images into fixed-size square patches as input units, which lacks the adaptability to image content and overlooks the inherent pixel grouping structure. Inspired by the subword tokenization widely adopted in language models, we propose an image tokenizer at a subobject level, where the subobjects are represented by semantically meaningful image se…

    Submitted 23 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Work in progress

  9. arXiv:2312.04032  [pdf, other]

    cs.CL cs.LG

    RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training

    Authors: Jaehyung Kim, Yuning Mao, Rui Hou, Hanchao Yu, Davis Liang, Pascale Fung, Qifan Wang, Fuli Feng, Lifu Huang, Madian Khabsa

    Abstract: Fine-tuning pre-trained language models (LMs) has become the de facto standard in many NLP tasks. Nevertheless, fine-tuned LMs are still prone to robustness issues, such as adversarial robustness and model calibration. Several perspectives of robustness for LMs have been studied independently, but lacking a unified consideration in multiple perspectives. In this paper, we propose Robustifying LMs…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 33 pages, accepted at EMNLP 2023 Findings

  10. arXiv:2311.12405  [pdf, other]

    cs.CL

    IndoRobusta: Towards Robustness Against Diverse Code-Mixed Indonesian Local Languages

    Authors: Muhammad Farid Adilazuarda, Samuel Cahyawijaya, Genta Indra Winata, Pascale Fung, Ayu Purwarianti

    Abstract: Significant progress has been made on Indonesian NLP. Nevertheless, exploration of the code-mixing phenomenon in Indonesian is limited, despite many languages being frequently mixed with Indonesian in daily conversation. In this work, we explore code-mixing in Indonesian with four embedded languages, i.e., English, Sundanese, Javanese, and Malay; and introduce IndoRobusta, a framework to evaluate…

    Submitted 21 November, 2023; originally announced November 2023.

  11. arXiv:2311.01817  [pdf, other]

    cs.CL

    Mitigating Framing Bias with Polarity Minimization Loss

    Authors: Yejin Bang, Nayeon Lee, Pascale Fung

    Abstract: Framing bias plays a significant role in exacerbating political polarization by distorting the perception of actual events. Media outlets with divergent political stances often use polarized language in their reporting of the same event. We propose a new loss function that encourages the model to minimize the polarity difference between the polarized input articles to reduce framing bias. Specific…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 11 pages, EMNLP2023

  12. arXiv:2310.12467  [pdf, other]

    cs.CL

    Contrastive Learning for Inference in Dialogue

    Authors: Etsuko Ishii, Yan Xu, Bryan Wilie, Ziwei Ji, Holy Lovenia, Willy Chung, Pascale Fung

    Abstract: Inference, especially those derived from inductive processes, is a crucial component in our conversation to complement the information implicitly or explicitly conveyed by a speaker. While recent large language models show remarkable advances in inference tasks, their performance in inductive reasoning, where not all information is present in the context, is far behind deductive reasoning. In this…

    Submitted 12 November, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP2023

  13. arXiv:2310.08885  [pdf, other]

    cs.CL

    InstructTODS: Large Language Models for End-to-End Task-Oriented Dialogue Systems

    Authors: Willy Chung, Samuel Cahyawijaya, Bryan Wilie, Holy Lovenia, Pascale Fung

    Abstract: Large language models (LLMs) have been used for diverse tasks in natural language processing (NLP), yet remain under-explored for task-oriented dialogue systems (TODS), especially for end-to-end TODS. We present InstructTODS, a novel off-the-shelf framework for zero-shot end-to-end task-oriented dialogue systems that can adapt to diverse domains without fine-tuning. By leveraging LLMs, InstructTOD…

    Submitted 13 October, 2023; originally announced October 2023.

  14. arXiv:2310.06271  [pdf, other]

    cs.CL cs.AI

    Towards Mitigating Hallucination in Large Language Models via Self-Reflection

    Authors: Ziwei Ji, Tiezheng Yu, Yan Xu, Nayeon Lee, Etsuko Ishii, Pascale Fung

    Abstract: Large language models (LLMs) have shown promise for generative and knowledge-intensive tasks including question-answering (QA) tasks. However, the practical deployment still faces challenges, notably the issue of "hallucination", where models generate plausible-sounding but unfaithful or nonsensical information. This issue becomes particularly critical in the medical domain due to the uncommon pro…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted by the findings of EMNLP 2023

  15. arXiv:2310.05338  [pdf, other]

    cs.CV cs.CL

    Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models

    Authors: Holy Lovenia, Wenliang Dai, Samuel Cahyawijaya, Ziwei Ji, Pascale Fung

    Abstract: Object hallucination poses a significant challenge in vision-language (VL) models, often leading to the generation of nonsensical or unfaithful responses with non-existent objects. However, the absence of a general measurement for evaluating object hallucination in VL models has hindered our understanding and ability to mitigate this issue. In this work, we present NOPE (Negative Object Presence E…

    Submitted 8 October, 2023; originally announced October 2023.

  16. arXiv:2309.14381  [pdf, other]

    cs.CL cs.AI

    Survey of Social Bias in Vision-Language Models

    Authors: Nayeon Lee, Yejin Bang, Holy Lovenia, Samuel Cahyawijaya, Wenliang Dai, Pascale Fung

    Abstract: In recent years, the rapid advancement of machine learning (ML) models, particularly transformer-based pre-trained models, has revolutionized Natural Language Processing (NLP) and Computer Vision (CV) fields. However, researchers have discovered that these models can inadvertently capture and reinforce social biases present in their training datasets, leading to potential social harms, such as une…

    Submitted 24 September, 2023; originally announced September 2023.

  17. arXiv:2309.10661  [pdf, other]

    cs.CL cs.AI

    NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages

    Authors: Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Dea Adhista, Emmanuel Dave, Sarah Oktavianti, Salsabil Maulana Akbar, Jhonson Lee, Nuur Shadieq, Tjeng Wawan Cenggoro, Hanung Wahyuning Linuwih, Bryan Wilie, Galih Pradipta Muridan, Genta Indra Winata, David Moeljadi, Alham Fikri Aji, Ayu Purwarianti, Pascale Fung

    Abstract: Democratizing access to natural language processing (NLP) technology is crucial, especially for underrepresented and extremely low-resource languages. Previous research has focused on developing labeled and unlabeled corpora for these languages through online scraping and document translation. While these methods have proven effective and cost-efficient, we have identified limitations in the resul…

    Submitted 19 September, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

  18. arXiv:2309.10413  [pdf, other]

    cs.CL

    PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems

    Authors: Bryan Wilie, Yan Xu, Willy Chung, Samuel Cahyawijaya, Holy Lovenia, Pascale Fung

    Abstract: Grounding dialogue response generation on external knowledge is proposed to produce informative and engaging responses. However, current knowledge-grounded dialogue (KGD) systems often fail to align the generated responses with human-preferred qualities due to several issues like hallucination and the lack of coherence. Upon analyzing multiple language model generations, we observe the presence of…

    Submitted 19 September, 2023; originally announced September 2023.

  19. arXiv:2309.02105  [pdf, other]

    cs.CL cs.AI

    Improving Query-Focused Meeting Summarization with Query-Relevant Knowledge

    Authors: Tiezheng Yu, Ziwei Ji, Pascale Fung

    Abstract: Query-Focused Meeting Summarization (QFMS) aims to generate a summary of a given meeting transcript conditioned upon a query. The main challenges for QFMS are the long input text length and sparse query-relevant information in the meeting transcript. In this paper, we propose a knowledge-enhanced two-stage framework called Knowledge-Aware Summarizer (KAS) to tackle the challenges. In the first sta…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: AACL 2023 Findings

  20. arXiv:2306.14517  [pdf, other]

    cs.CL cs.SD eess.AS

    Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition

    Authors: Samuel Cahyawijaya, Holy Lovenia, Willy Chung, Rita Frieske, Zihan Liu, Pascale Fung

    Abstract: Speech emotion recognition plays a crucial role in human-computer interactions. However, most speech emotion recognition research is biased toward English-speaking adults, which hinders its applicability to other demographic groups in different languages and age groups. In this work, we analyze the transferability of emotion recognition across three different languages--English, Mandarin Chinese,…

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted in INTERSPEECH 2023

  21. arXiv:2306.06083  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Improving Fairness and Robustness in End-to-End Speech Recognition through unsupervised clustering

    Authors: Irina-Elena Veliche, Pascale Fung

    Abstract: The challenge of fairness arises when Automatic Speech Recognition (ASR) systems do not perform equally well for all sub-groups of the population. In the past few years there have been many improvements in overall speech recognition quality, but without any particular focus on advancing Equality and Equity for all user groups for whom systems do not perform well. ASR fairness is therefore also a r…

    Submitted 6 June, 2023; originally announced June 2023.

    Journal ref: ICASSP 2023

  22. arXiv:2306.01153  [pdf, other]

    cs.CL

    Diverse and Faithful Knowledge-Grounded Dialogue Generation via Sequential Posterior Inference

    Authors: Yan Xu, Deqian Kong, Dehong Xu, Ziwei Ji, Bo Pang, Pascale Fung, Ying Nian Wu

    Abstract: The capability to generate responses with diversity and faithfulness using factual knowledge is paramount for creating a human-like, trustworthy dialogue system. Common strategies either adopt a two-step paradigm, which optimizes knowledge selection and response generation separately, and may overlook the inherent correlation between these two tasks, or leverage conditional variational method to j…

    Submitted 5 August, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2023

  23. arXiv:2305.13627  [pdf, other]

    cs.CL cs.AI

    InstructAlign: High-and-Low Resource Language Alignment via Continual Crosslingual Instruction Tuning

    Authors: Samuel Cahyawijaya, Holy Lovenia, Tiezheng Yu, Willy Chung, Pascale Fung

    Abstract: Large language models (LLMs) that are tuned with instructions have demonstrated remarkable capabilities in various tasks and languages. However, their ability to generalize to underrepresented languages is limited due to the scarcity of available data. Additionally, directly adapting new languages to instruction-tuned LLMs can result in catastrophic forgetting, which leads to the loss of multitask…

    Submitted 24 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  24. arXiv:2305.06500  [pdf, other]

    cs.CV cs.LG

    InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

    Authors: Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi

    Abstract: Large-scale pre-training and instruction tuning have been successful at creating general-purpose language models with broad competence. However, building general-purpose vision-language models is challenging due to the rich input distributions and task diversity resulting from the additional visual input. Although vision-language pretraining has been widely studied, vision-language instruction tun…

    Submitted 15 June, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: preprint

  25. arXiv:2304.11220  [pdf, other]

    cs.CL

    Learn What NOT to Learn: Towards Generative Safety in Chatbots

    Authors: Leila Khalatbari, Yejin Bang, Dan Su, Willy Chung, Saeed Ghadimi, Hossein Sameti, Pascale Fung

    Abstract: Conversational models that are generative and open-domain are particularly susceptible to generating unsafe content since they are trained on web-based social data. Prior approaches to mitigating this issue have drawbacks, such as disrupting the flow of conversation, limited generalization to unseen toxic input contexts, and sacrificing the quality of the dialogue for the sake of safety. In this p…

    Submitted 25 April, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: 9 pages, 3 tables, 3 figures

  26. arXiv:2302.14680  [pdf, other]

    cs.CL cs.AI cs.CV

    Which One Are You Referring To? Multimodal Object Identification in Situated Dialogue

    Authors: Holy Lovenia, Samuel Cahyawijaya, Pascale Fung

    Abstract: The demand for multimodal dialogue systems has been rising in various domains, emphasizing the importance of interpreting multimodal inputs from conversational and situational contexts. We explore three methods to tackle this problem and evaluate them on the largest situated dialogue dataset, SIMMC 2.1. Our best method, scene-dialogue alignment, improves the performance by ~20% F1-score compared t…

    Submitted 15 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted at EACL SRW 2023

  27. arXiv:2302.04023  [pdf, other]

    cs.CL cs.AI

    A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity

    Authors: Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung

    Abstract: This paper proposes a framework for quantitatively evaluating interactive LLMs such as ChatGPT using publicly available data sets. We carry out an extensive technical evaluation of ChatGPT using 23 data sets covering 8 different common NLP application tasks. We evaluate the multitask, multilingual and multi-modal aspects of ChatGPT based on these data sets and a newly designed multimodal dataset.…

    Submitted 28 November, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: 45 pages, AACL 2023

  28. arXiv:2212.09648  [pdf, other]

    cs.CL cs.AI

    NusaCrowd: Open Source Initiative for Indonesian NLP Resources

    Authors: Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Fajri Koto, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Ivan Halim Parmonangan, Ika Alfina, Muhammad Satrio Wicaksono, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri, et al. (22 additional authors not shown)

    Abstract: We present NusaCrowd, a collaborative initiative to collect and unify existing resources for Indonesian languages, including opening access to previously non-public resources. Through this initiative, we have brought together 137 datasets and 118 standardized data loaders. The quality of the datasets has been assessed manually and automatically, and their value is demonstrated through multiple exp…

    Submitted 21 July, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

  29. arXiv:2212.01588  [pdf, other]

    cs.CL cs.AI

    RHO ($ρ$): Reducing Hallucination in Open-domain Dialogues with Knowledge Grounding

    Authors: Ziwei Ji, Zihan Liu, Nayeon Lee, Tiezheng Yu, Bryan Wilie, Min Zeng, Pascale Fung

    Abstract: Dialogue systems can leverage large pre-trained language models and knowledge to generate fluent and informative responses. However, these models are still prone to produce hallucinated responses not supported by the input source, which greatly hinders their application. The heterogeneity between external knowledge and dialogue context challenges representation learning and source integration, and…

    Submitted 12 May, 2023; v1 submitted 3 December, 2022; originally announced December 2022.

    Comments: accepted by ACL 2023 Findings

  30. arXiv:2211.07713  [pdf, other]

    cs.CL cs.AI

    How Long Is Enough? Exploring the Optimal Intervals of Long-Range Clinical Note Language Modeling

    Authors: Samuel Cahyawijaya, Bryan Wilie, Holy Lovenia, Huan Zhong, MingQian Zhong, Yuk-Yu Nancy Ip, Pascale Fung

    Abstract: Large pre-trained language models (LMs) have been widely adopted in biomedical and clinical domains, introducing many powerful LMs such as bio-lm and BioELECTRA. However, the applicability of these methods to real clinical use cases is hindered, due to the limitation of pre-trained LMs in processing long textual data with thousands of words, which is a common length for a clinical note. In this wo…

    Submitted 25 October, 2022; originally announced November 2022.

  31. arXiv:2211.05809  [pdf, other]

    cs.CV cs.AI cs.CL cs.CY

    Casual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustness

    Authors: Caner Hazirbas, Yejin Bang, Tiezheng Yu, Parisa Assar, Bilal Porgali, Vítor Albiero, Stefan Hermanek, Jacqueline Pan, Emily McReynolds, Miranda Bogen, Pascale Fung, Cristian Canton Ferrer

    Abstract: Developing robust and fair AI systems require datasets with comprehensive set of labels that can help ensure the validity and legitimacy of relevant measurements. Recent efforts, therefore, focus on collecting person-related datasets that have carefully selected labels, including sensitive characteristics, and consent forms in place to use those attributes for model testing and development. Respon…

    Submitted 10 November, 2022; originally announced November 2022.

  32. arXiv:2211.05100  [pdf, other]

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop: Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  33. arXiv:2210.07688  [pdf, other]

    cs.CL cs.CV

    Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training

    Authors: Wenliang Dai, Zihan Liu, Ziwei Ji, Dan Su, Pascale Fung

    Abstract: Large-scale vision-language pre-trained (VLP) models are prone to hallucinate non-existent visual objects when generating text based on visual information. In this paper, we systematically study the object hallucination problem from three aspects. First, we examine recent state-of-the-art VLP models, showing that they still hallucinate frequently, and models achieving better scores on standard met…

    Submitted 9 February, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted at EACL 2023

  34. arXiv:2210.07652  [pdf, other]

    cs.CL cs.AI

    Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values

    Authors: Yejin Bang, Tiezheng Yu, Andrea Madotto, Zhaojiang Lin, Mona Diab, Pascale Fung

    Abstract: Many NLP classification tasks, such as sexism/racism detection or toxicity detection, are based on human values. Yet, human values can vary under diverse cultural conditions. Therefore, we introduce a framework for value-aligned classification that performs prediction based on explicitly written human values in the command. Along with the task, we propose a practical approach that distills value-a…

    Submitted 14 October, 2022; originally announced October 2022.

  35. arXiv:2210.06349  [pdf, other]

    cs.CL cs.AI

    Context Generation Improves Open Domain Question Answering

    Authors: Dan Su, Mostofa Patwary, Shrimai Prabhumoye, Peng Xu, Ryan Prenger, Mohammad Shoeybi, Pascale Fung, Anima Anandkumar, Bryan Catanzaro

    Abstract: Closed-book question answering (QA) requires a model to directly answer an open-domain question without access to any external knowledge. Prior work on closed-book QA either directly finetunes or prompts a pretrained language model (LM) to leverage the stored knowledge. However, they do not fully exploit the parameterized knowledge. To address this issue, we propose a two-stage, closed-book QA fra…

    Submitted 27 April, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: 8 pages; Accepted at EACL2023

  36. arXiv:2209.01638  [pdf, other]

    cs.CL

    Every picture tells a story: Image-grounded controllable stylistic story generation

    Authors: Holy Lovenia, Bryan Wilie, Romain Barraud, Samuel Cahyawijaya, Willy Chung, Pascale Fung

    Abstract: Generating a short story out of an image is arduous. Unlike image captioning, story generation from an image poses multiple challenges: preserving the story coherence, appropriately assessing the quality of the story, steering the generated story into a certain style, and addressing the scarcity of image-story pair reference datasets limiting supervision during training. In this work, we introduce…

    Submitted 11 September, 2022; v1 submitted 4 September, 2022; originally announced September 2022.

    Comments: Accepted in LaTeCH-CLfL 2022 (6th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature), COLING 2022

  37. arXiv:2207.02663  [pdf, other]

    cs.CL cs.SD eess.AS

    Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands

    Authors: Wenliang Dai, Samuel Cahyawijaya, Tiezheng Yu, Elham J Barezi, Pascale Fung

    Abstract: With the rise of deep learning and intelligent vehicles, the smart assistant has become an essential in-car component to facilitate driving and provide extra functionalities. In-car smart assistants should be able to process general as well as car-related commands and perform corresponding actions, which eases driving and improves safety. However, in this research field, most datasets are in major…

    Submitted 6 July, 2022; originally announced July 2022.

  38. arXiv:2206.04624  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Factuality Enhanced Language Models for Open-Ended Text Generation

    Authors: Nayeon Lee, Wei Ping, Peng Xu, Mostofa Patwary, Pascale Fung, Mohammad Shoeybi, Bryan Catanzaro

    Abstract: Pretrained language models (LMs) are susceptible to generate text with nonfactual information. In this work, we measure and improve the factual accuracy of large-scale LMs for open-ended text generation. We design the FactualityPrompts test set and metrics to measure the factuality of LM generations. Based on that, we study the factual accuracy of LMs with parameter sizes ranging from 126M to 530B…

    Submitted 2 March, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022

  39. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  40. arXiv:2205.15960  [pdf, other]

    cs.CL

    NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages

    Authors: Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Rahmad Mahendra, Fajri Koto, Ade Romadhony, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Pascale Fung, Timothy Baldwin, Jey Han Lau, Rico Sennrich, Sebastian Ruder

    Abstract: Natural language processing (NLP) has a significant impact on society via technologies such as machine translation and search engines. Despite its success, NLP technology is only widely available for high-resource languages such as English and Chinese, while it remains inaccessible to many languages due to the unavailability of data resources and benchmarks. In this work, we focus on developing re…

    Submitted 12 April, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: EACL 2023

  41. arXiv:2205.12495  [pdf, other]

    cs.CL

    ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection

    Authors: Badr AlKhamissi, Faisal Ladhak, Srini Iyer, Ves Stoyanov, Zornitsa Kozareva, Xian Li, Pascale Fung, Lambert Mathias, Asli Celikyilmaz, Mona Diab

    Abstract: Hate speech detection is complex; it relies on commonsense reasoning, knowledge of stereotypes, and an understanding of social nuance that differs from one culture to the next. It is also difficult to collect a large-scale hate speech annotated dataset. In this work, we frame this problem as a few-shot learning task, and show significant gains with decomposing the task into its "constituent" parts…

    Submitted 20 May, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted at EMNLP 2022

    Journal ref: In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 2109-2120, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics

  42. arXiv:2205.05989  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Answering Open-ended Ethical Quandary Questions

    Authors: Yejin Bang, Nayeon Lee, Tiezheng Yu, Leila Khalatbari, Yan Xu, Samuel Cahyawijaya, Dan Su, Bryan Wilie, Romain Barraud, Elham J. Barezi, Andrea Madotto, Hayden Kee, Pascale Fung

    Abstract: Considerable advancements have been made in various NLP tasks based on the impressive power of large language models (LLMs) and many NLP applications are deployed in our daily lives. In this work, we challenge the capability of LLMs with the new task of Ethical Quandary Generative Question Answering. Ethical quandary questions are more challenging to address because multiple conflicting answers ma…

    Submitted 1 February, 2023; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: 16 pages

  43. arXiv:2204.06699  [pdf, other]

    cs.LG cs.AI

    SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide Association Study

    Authors: Samuel Cahyawijaya, Tiezheng Yu, Zihan Liu, Tiffany T. W. Mak, Xiaopu Zhou, Nancy Y. Ip, Pascale Fung

    Abstract: Self-supervised pre-training methods have brought remarkable breakthroughs in the understanding of text, image, and speech. Recent developments in genomics have also adopted these pre-training methods for genome understanding. However, they focus only on understanding haploid sequences, which hinders their applicability towards understanding genetic variations, also known as single nucleotide polym…

    Submitted 13 April, 2022; originally announced April 2022.

  44. arXiv:2204.04902  [pdf, other]

    cs.CL

    NeuS: Neutral Multi-News Summarization for Mitigating Framing Bias

    Authors: Nayeon Lee, Yejin Bang, Tiezheng Yu, Andrea Madotto, Pascale Fung

    Abstract: Media news framing bias can increase political polarization and undermine civil society. The need for automatic mitigation methods is therefore growing. We propose a new task, neutral summary generation from multiple news articles of varying political leanings, to facilitate balanced and unbiased news reading. In this paper, we first collect a new dataset, illustrate insights about framing bi…

    Submitted 3 May, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: NAACL2022 Long Paper

  45. Clozer: Adaptable Data Augmentation for Cloze-style Reading Comprehension

    Authors: Holy Lovenia, Bryan Wilie, Willy Chung, Min Zeng, Samuel Cahyawijaya, Su Dan, Pascale Fung

    Abstract: Task-adaptive pre-training (TAPT) alleviates the lack of labelled data and provides a performance lift by adapting unlabelled data to the downstream task. Unfortunately, existing adaptations mainly involve deterministic rules that cannot generalize well. Here, we propose Clozer, a sequence-tagging based cloze answer extraction method used in TAPT that is extendable for adaptation on any cloze-style mach…

    Submitted 12 September, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Journal ref: Proceedings of the 7th Workshop on Representation Learning for NLP (RepL4NLP 2022)

  46. arXiv:2203.06386  [pdf, other]

    cs.CL cs.AI cs.CV

    Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation

    Authors: Wenliang Dai, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Pascale Fung

    Abstract: The recent large-scale vision-language pre-training (VLP) of dual-stream architectures (e.g., CLIP) with a tremendous amount of image-text pair data, has shown its superiority on various multimodal alignment tasks. Despite its success, the resulting models are not capable of multimodal generative tasks due to the weak text encoder. To tackle this problem, we propose to augment the dual-stream VLP…

    Submitted 30 March, 2022; v1 submitted 12 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022

  47. arXiv:2203.00343  [pdf, other]

    cs.CL cs.AI

    Read before Generate! Faithful Long Form Question Answering with Machine Reading

    Authors: Dan Su, Xiaoguang Li, Jindi Zhang, Lifeng Shang, Xin Jiang, Qun Liu, Pascale Fung

    Abstract: Long-form question answering (LFQA) aims to generate a paragraph-length answer for a given question. While current work on LFQA using large pre-trained models for generation is effective at producing fluent and somewhat relevant content, one primary challenge lies in how to generate a faithful answer that has less hallucinated content. We propose a new end-to-end framework that jointly models answ…

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: long paper, accepted to ACL 2022 findings

  48. arXiv:2203.00314  [pdf, other]

    cs.CL

    VScript: Controllable Script Generation with Visual Presentation

    Authors: Ziwei Ji, Yan Xu, I-Tsun Cheng, Samuel Cahyawijaya, Rita Frieske, Etsuko Ishii, Min Zeng, Andrea Madotto, Pascale Fung

    Abstract: In order to offer a customized script tool and inspire professional scriptwriters, we present VScript. It is a controllable pipeline that generates complete scripts, including dialogues and scene descriptions, and presents them visually using video retrieval. With an interactive interface, our system allows users to select genres and input starting words that control the theme and development of…

    Submitted 13 October, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Journal ref: AACL Demo (2022)

  49. arXiv:2202.06538  [pdf, other]

    cs.CL cs.AI

    QA4QG: Using Question Answering to Constrain Multi-Hop Question Generation

    Authors: Dan Su, Peng Xu, Pascale Fung

    Abstract: Multi-hop question generation (MQG) aims to generate complex questions which require reasoning over multiple pieces of information of the input passage. Most existing work on MQG has focused on exploring graph-based networks to equip the traditional Sequence-to-sequence framework with reasoning ability. However, these models do not take full advantage of the constraint between questions and answer…

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: 4 pages, accepted by ICASSP2022

  50. Survey of Hallucination in Natural Language Generation

    Authors: Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu, Etsuko Ishii, Yejin Bang, Delong Chen, Wenliang Dai, Ho Shu Chan, Andrea Madotto, Pascale Fung

    Abstract: Natural Language Generation (NLG) has improved exponentially in recent years thanks to the development of sequence-to-sequence deep learning technologies such as Transformer-based language models. This advancement has led to more fluent and coherent NLG, leading to improved development in downstream tasks such as abstractive summarization, dialogue generation and data-to-text generation. However,…

    Submitted 14 July, 2024; v1 submitted 7 February, 2022; originally announced February 2022.

    ACM Class: A.1

    Journal ref: ACM Computing Surveys (2022)