Skip to main content

Showing 1–50 of 82 results for author: Ray, A

  1. arXiv:2405.00427  [pdf, other

    cs.DS

    Improved linearly ordered colorings of hypergraphs via SDP rounding

    Authors: Anand Louis, Alantha Newman, Arka Ray

    Abstract: We consider the problem of linearly ordered (LO) coloring of hypergraphs. A hypergraph has an LO coloring if there is a vertex coloring, using a set of ordered colors, so that (i) no edge is monochromatic, and (ii) each edge has a unique maximum color. It is an open question as to whether or not a 2-LO colorable 3-uniform hypergraph can be LO colored with 3 colors in polynomial time. Nakajima and… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 19 pages; 13 pages for the main body

  2. arXiv:2403.16143  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    CFAT: Unleashing TriangularWindows for Image Super-resolution

    Authors: Abhisek Ray, Gaurav Kumar, Maheshkumar H. Kolekar

    Abstract: Transformer-based models have revolutionized the field of image super-resolution (SR) by harnessing their inherent ability to capture complex contextual features. The overlapping rectangular shifted window technique used in transformer architecture nowadays is a common practice in super-resolution models to improve the quality and robustness of image upscaling. However, it suffers from distortion… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  3. arXiv:2403.08094  [pdf, other

    cs.RO

    Task and Motion Planning in Hierarchical 3D Scene Graphs

    Authors: Aaron Ray, Christopher Bradley, Luca Carlone, Nicholas Roy

    Abstract: Recent work in the construction of 3D scene graphs has enabled mobile robots to build large-scale hybrid metric-semantic hierarchical representations of the world. These detailed models contain information that is useful for planning, however how to derive a planning domain from a 3D scene graph that enables efficient computation of executable plans is an open question. In this work, we present a… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    MSC Class: 68T40; 68T20 ACM Class: I.2.9; I.2.4; I.2.8

  4. arXiv:2402.14855  [pdf

    cs.CL cs.AI

    An LLM Maturity Model for Reliable and Transparent Text-to-Query

    Authors: Lei Yu, Abir Ray

    Abstract: Recognizing the imperative to address the reliability and transparency issues of Large Language Models (LLM), this work proposes an LLM maturity model tailored for text-to-query applications. This maturity model seeks to fill the existing void in evaluating LLMs in such applications by incorporating dimensions beyond mere correctness or accuracy. Moreover, this work introduces a real-world use cas… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 8 pages, 5 figures

  5. arXiv:2402.04770  [pdf, other

    quant-ph cs.CR

    Continuous-Variable QKD with key rates far above Devetak-Winter

    Authors: Arpan Akash Ray, Boris Skoric

    Abstract: Continuous-Variable Quantum Key Distribution (CVQKD) at large distances has such high noise levels that the employed error-correcting codes must have very low rate. In this regime it becomes feasible to implement random-codebook error correction, which is known to perform close to capacity. We propose a random-codebook reverse reconciliation scheme for CVQKD that is inspired by spread-spectrum wat… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  6. arXiv:2312.15815  [pdf, other

    cs.CL

    Compositional Generalization in Spoken Language Understanding

    Authors: Avik Ray, Yilin Shen, Hongxia Jin

    Abstract: State-of-the-art spoken language understanding (SLU) models have shown tremendous success in benchmark SLU datasets, yet they still fail in many practical scenario due to the lack of model compositionality when trained on limited training data. In this paper, we study two types of compositionality: (a) novel slot combination, and (b) length generalization. We first conduct in-depth analysis, and f… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: Published in INTERSPEECH 2023

    Journal ref: Proceedings of 24th INTERSPEECH Conference (INTERSPEECH 2023), Dublin, Ireland

  7. arXiv:2312.15136  [pdf, other

    physics.comp-ph cs.AI cs.CV

    Towards End-to-End Structure Solutions from Information-Compromised Diffraction Data via Generative Deep Learning

    Authors: Gabe Guo, Judah Goldfeder, Ling Lan, Aniv Ray, Albert Hanming Yang, Boyuan Chen, Simon JL Billinge, Hod Lipson

    Abstract: The revolution in materials in the past century was built on a knowledge of the atomic arrangements and the structure-property relationship. The sine qua non for obtaining quantitative structural information is single crystal crystallography. However, increasingly we need to solve structures in cases where the information content in our input signal is significantly degraded, for example, due to o… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  8. arXiv:2312.15035  [pdf, other

    cs.PL

    Hardcaml: An OCaml Hardware Domain-Specific Language for Efficient and Robust Design

    Authors: Andy Ray, Benjamin Devlin, Fu Yong Quah, Rahul Yesantharao

    Abstract: This paper introduces Hardcaml, an embedded hardware design domain specific language (DSL) implemented in the OCaml programming language. Unlike high level synthesis (HLS), Hardcaml allows for low level control of the underlying hardware for maximum productivity, while abstracting away many of the tedious aspects of traditional hardware definition languages (HDLs) such as Verilog or VHDL. The rich… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    ACM Class: B.6.3

  9. arXiv:2312.12716  [pdf, other

    cs.CV cs.CL cs.LG

    BloomVQA: Assessing Hierarchical Multi-modal Comprehension

    Authors: Yunye Gong, Robik Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran

    Abstract: We propose a novel VQA dataset, BloomVQA, to facilitate comprehensive evaluation of large vision-language models on comprehension tasks. Unlike current benchmarks that often focus on fact-based memorization and simple reasoning tasks without theoretical grounding, we collect multiple-choice samples based on picture stories that reflect different levels of comprehension, as laid out in Bloom's Taxo… ▽ More

    Submitted 10 June, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted by ACL Findings (2024). Dataset available at https://huggingface.co/datasets/ygong/BloomVQA

  10. arXiv:2312.00833  [pdf, other

    cs.CV

    Lasagna: Layered Score Distillation for Disentangled Object Relighting

    Authors: Dina Bashkirova, Arijit Ray, Rupayan Mallick, Sarah Adel Bargal, Jianming Zhang, Ranjay Krishna, Kate Saenko

    Abstract: Professional artists, photographers, and other visual content creators use object relighting to establish their photo's desired effect. Unfortunately, manual tools that allow relighting have a steep learning curve and are difficult to master. Although generative editing methods now enable some forms of image editing, relighting is still beyond today's capabilities; existing methods struggle to kee… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  11. arXiv:2311.00991  [pdf, other

    cs.CV

    IR-UWB Radar-based Situational Awareness System for Smartphone-Distracted Pedestrians

    Authors: Jamsheed Manja Ppallan, Ruchi Pandey, Yellappa Damam, Vijay Narayan Tiwari, Karthikeyan Arunachalam, Antariksha Ray

    Abstract: With the widespread adoption of smartphones, ensuring pedestrian safety on roads has become a critical concern due to smartphone distraction. This paper proposes a novel and real-time assistance system called UWB-assisted Safe Walk (UASW) for obstacle detection and warns users about real-time situations. The proposed method leverages Impulse Radio Ultra-Wideband (IR-UWB) radar embedded in the smar… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  12. arXiv:2310.10380  [pdf, ps, other

    cs.CL

    Contextual Data Augmentation for Task-Oriented Dialog Systems

    Authors: Dustin Axman, Avik Ray, Shubham Garg, Jing Huang

    Abstract: Collection of annotated dialogs for training task-oriented dialog systems have been one of the key bottlenecks in improving current models. While dialog response generation has been widely studied on the agent side, it is not evident if similar generative models can be used to generate a large variety of, and often unexpected, user inputs that real dialog systems encounter in practice. Existing da… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: ECML-PKDD 2023 Workshop on Challenges and Opportunities of Large Language Models in Real-World Machine Learning Applications (COLLM)

  13. arXiv:2308.16741  [pdf, other

    cs.AI cs.CV

    Socratis: Are large multimodal models emotionally aware?

    Authors: Katherine Deng, Arijit Ray, Reuben Tan, Saadia Gabriel, Bryan A. Plummer, Kate Saenko

    Abstract: Existing emotion prediction benchmarks contain coarse emotion labels which do not consider the diversity of emotions that an image and text can elicit in humans due to various reasons. Learning diverse reactions to multimodal content is important as intelligent machines take a central role in generating and delivering content to society. To address this gap, we propose Socratis, a societal reactio… ▽ More

    Submitted 2 November, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: ICCV 2023 WECIA

  14. arXiv:2308.11042  [pdf, other

    cs.CR cs.AR

    Unlocking Hardware Security Assurance: The Potential of LLMs

    Authors: Xingyu Meng, Amisha Srivastava, Ayush Arunachalam, Avik Ray, Pedro Henrique Silva, Rafail Psiakis, Yiorgos Makris, Kanad Basu

    Abstract: System-on-Chips (SoCs) form the crux of modern computing systems. SoCs enable high-level integration through the utilization of multiple Intellectual Property (IP) cores. However, the integration of multiple IP cores also presents unique challenges owing to their inherent vulnerabilities, thereby compromising the security of the entire system. Hence, it is imperative to perform hardware security v… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  15. arXiv:2308.06351  [pdf, other

    cs.RO

    Aggressive Aerial Grasping using a Soft Drone with Onboard Perception

    Authors: Samuel Ubellacker, Aaron Ray, James Bern, Jared Strader, Luca Carlone

    Abstract: Contrary to the stunning feats observed in birds of prey, aerial manipulation and grasping with flying robots still lack versatility and agility. Conventional approaches using rigid manipulators require precise positioning and are subject to large reaction forces at grasp, which limit performance at high speeds. The few reported examples of aggressive aerial grasping rely on motion capture systems… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    MSC Class: 68T40; 70B15; 70E60; 74Pxx; 65D19 ACM Class: I.2.9; G.1.6; I.5.2; I.4.5

  16. arXiv:2305.16724  [pdf, other

    cs.CL cs.AI

    Code-Switched Text Synthesis in Unseen Language Pairs

    Authors: I-Hung Hsu, Avik Ray, Shubham Garg, Nanyun Peng, Jing Huang

    Abstract: Existing efforts on text synthesis for code-switching mostly require training on code-switched texts in the target language pairs, limiting the deployment of the models to cases lacking code-switched data. In this work, we study the problem of synthesizing code-switched texts for language pairs absent from the training data. We introduce GLOSS, a model built on top of a pre-trained multilingual ma… ▽ More

    Submitted 7 July, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Paper accepted by ACL2023 as a Finding paper

  17. arXiv:2305.15376  [pdf, other

    cs.RO

    DeepCollide: Scalable Data-Driven High DoF Configuration Space Modeling using Implicit Neural Representations

    Authors: Gabriel Guo, Judah Goldfeder, Aniv Ray, Tony Dear, Hod Lipson

    Abstract: Collision detection is essential to virtually all robotics applications. However, traditional geometric collision detection methods generally require pre-existing workspace geometry representations; thus, they are unable to infer the collision detection function from sampled data when geometric information is unavailable. Learning-based approaches can overcome this limitation. Following this line… ▽ More

    Submitted 13 September, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  18. arXiv:2305.05826  [pdf, ps, other

    cs.DS math.NA

    Universal Matrix Sparsifiers and Fast Deterministic Algorithms for Linear Algebra

    Authors: Rajarshi Bhattacharjee, Gregory Dexter, Cameron Musco, Archan Ray, Sushant Sachdeva, David P Woodruff

    Abstract: Let $\mathbf S \in \mathbb R^{n \times n}$ satisfy $\|\mathbf 1-\mathbf S\|_2\leεn$, where $\mathbf 1$ is the all ones matrix and $\|\cdot\|_2$ is the spectral norm. It is well-known that there exists such an $\mathbf S$ with just $O(n/ε^2)$ non-zero entries: we can let $\mathbf S$ be the scaled adjacency matrix of a Ramanujan expander graph. We show that such an $\mathbf S$ yields a $universal$… ▽ More

    Submitted 12 January, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 41 pages

    ACM Class: F.2.1; G.1.3; G.1.2; G.4; I.1.2

  19. arXiv:2305.03689  [pdf, other

    cs.CV

    COLA: A Benchmark for Compositional Text-to-image Retrieval

    Authors: Arijit Ray, Filip Radenovic, Abhimanyu Dubey, Bryan A. Plummer, Ranjay Krishna, Kate Saenko

    Abstract: Compositional reasoning is a hallmark of human visual intelligence. Yet, despite the size of large vision-language models, they struggle to represent simple compositions by combining objects with their attributes. To measure this lack of compositional capability, we design Cola, a text-to-image retrieval benchmark to Compose Objects Localized with Attributes. To solve Cola, a model must retrieve i… ▽ More

    Submitted 2 November, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted to NeurIPS 2023. Webpage: https://cs-people.bu.edu/array/research/cola/

  20. arXiv:2304.13487  [pdf, other

    cs.RO

    Hydra-Multi: Collaborative Online Construction of 3D Scene Graphs with Multi-Robot Teams

    Authors: Yun Chang, Nathan Hughes, Aaron Ray, Luca Carlone

    Abstract: 3D scene graphs have recently emerged as an expressive high-level map representation that describes a 3D environment as a layered graph where nodes represent spatial concepts at multiple levels of abstraction (e.g., objects, rooms, buildings) and edges represent relations between concepts (e.g., inclusion, adjacency). This paper describes Hydra-Multi, the first multi-robot spatial perception syste… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 8 pages, 10 figures

  21. arXiv:2304.06901  [pdf, other

    cs.LG cs.CY cs.MA

    Systemic Fairness

    Authors: Arindam Ray, Balaji Padmanabhan, Lina Bouayad

    Abstract: Machine learning algorithms are increasingly used to make or support decisions in a wide range of settings. With such expansive use there is also growing concern about the fairness of such methods. Prior literature on algorithmic fairness has extensively addressed risks and in many cases presented approaches to manage some of them. However, most studies have focused on fairness issues that arise f… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

  22. arXiv:2303.16342  [pdf, other

    cs.CV cs.AI cs.CL

    Language-Guided Audio-Visual Source Separation via Trimodal Consistency

    Authors: Reuben Tan, Arijit Ray, Andrea Burns, Bryan A. Plummer, Justin Salamon, Oriol Nieto, Bryan Russell, Kate Saenko

    Abstract: We propose a self-supervised approach for learning to perform audio source separation in videos based on natural language queries, using only unlabeled video and audio pairs as training data. A key challenge in this task is learning to associate the linguistic description of a sound-emitting object to its visual features and the corresponding components of the audio waveform, all without access to… ▽ More

    Submitted 23 September, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  23. arXiv:2301.09272  [pdf, other

    cs.DS cs.CC cs.CG

    Improved Hardness of Approximation for Geometric Bin Packing

    Authors: Arka Ray, Sai Sandeep

    Abstract: The Geometric Bin Packing (GBP) problem is a generalization of Bin Packing where the input is a set of $d$-dimensional rectangles, and the goal is to pack them into unit $d$-dimensional cubes efficiently. It is NP-Hard to obtain a PTAS for the problem, even when $d=2$. For general $d$, the best-known approximation algorithm has an approximation guarantee exponential in $d$, while the best hardness… ▽ More

    Submitted 2 October, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: 11 pages; bug fixes

  24. arXiv:2212.13406  [pdf, other

    cs.DS math.CO

    Sparse Cuts in Hypergraphs from Random Walks on Simplicial Complexes

    Authors: Anand Louis, Rameesh Paul, Arka Ray

    Abstract: There are a lot of recent works on generalizing the spectral theory of graphs and graph partitioning to hypergraphs. There have been two broad directions toward this goal. One generalizes the notion of graph conductance to hypergraph conductance [LM16, CLTZ18]. In the second approach one can view a hypergraph as a simplicial complex and study its various topological properties [LM06, MW09, DKW16,… ▽ More

    Submitted 3 October, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

    Comments: 27 pages;

  25. arXiv:2211.00004  [pdf, other

    quant-ph cs.CR cs.LG

    Classical ensemble of Quantum-classical ML algorithms for Phishing detection in Ethereum transaction networks

    Authors: Anupama Ray, Sai Sakunthala Guddanti, Vishnu Ajith, Dhinakaran Vinayagamurthy

    Abstract: Ethereum is one of the most valuable blockchain networks in terms of the total monetary value locked in it, and arguably been the most active network where new blockchain innovations in research and applications are demonstrated. But, this also leads to Ethereum network being susceptible to a wide variety of threats and attacks in an attempt to gain unreasonable advantage or to undermine the value… ▽ More

    Submitted 30 October, 2022; originally announced November 2022.

  26. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  27. arXiv:2206.02119  [pdf, other

    cs.CL

    A Multimodal Corpus for Emotion Recognition in Sarcasm

    Authors: Anupama Ray, Shubham Mishra, Apoorva Nunna, Pushpak Bhattacharyya

    Abstract: While sentiment and emotion analysis have been studied extensively, the relationship between sarcasm and emotion has largely remained unexplored. A sarcastic expression may have a variety of underlying emotions. For example, "I love being ignored" belies sadness, while "my mobile is fabulous with a battery backup of only 15 minutes!" expresses frustration. Detecting the emotion behind a sarcastic… ▽ More

    Submitted 5 June, 2022; originally announced June 2022.

  28. Multi-robot Task Assignment for Aerial Tracking with Viewpoint Constraints

    Authors: Aaron Ray, Alyssa Pierson, Hai Zhu, Javier Alonso-Mora, Daniela Rus

    Abstract: We address the problem of assigning a team of drones to autonomously capture a set desired shots of a dynamic target in the presence of obstacles. We present a two-stage planning pipeline that generates offline an assignment of drone to shots and locally optimizes online the viewpoint. Given desired shot parameters, the high-level planner uses a visibility heuristic to predict good times for captu… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Journal ref: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 1515-1522

  29. arXiv:2205.15473  [pdf, other

    cs.RO

    Free-Space Ellipsoid Graphs for Multi-Agent Target Monitoring

    Authors: Aaron Ray, Alyssa Pierson, Daniela Rus

    Abstract: We apply a novel framework for decomposing and reasoning about free space in an environment to a multi-agent persistent monitoring problem. Our decomposition method represents free space as a collection of ellipsoids associated with a weighted connectivity graph. The same ellipsoids used for reasoning about connectivity and distance during high level planning can be used as state constraints in a… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: IEEE Intl. Conf. on Robotics and Automation (ICRA) 2022

  30. arXiv:2203.02155  [pdf, other

    cs.CL cs.AI cs.LG

    Training language models to follow instructions with human feedback

    Authors: Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe

    Abstract: Making language models bigger does not inherently make them better at following a user's intent. For example, large language models can generate outputs that are untruthful, toxic, or simply not helpful to the user. In other words, these models are not aligned with their users. In this paper, we show an avenue for aligning language models with user intent on a wide range of tasks by fine-tuning wi… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

  31. arXiv:2112.09631  [pdf, other

    cs.LG cs.CL

    Sublinear Time Approximation of Text Similarity Matrices

    Authors: Archan Ray, Nicholas Monath, Andrew McCallum, Cameron Musco

    Abstract: We study algorithms for approximating pairwise similarity matrices that arise in natural language processing. Generally, computing a similarity matrix for $n$ data points requires $Ω(n^2)$ similarity computations. This quadratic scaling is a significant bottleneck, especially when similarities are computed via expensive functions, e.g., via transformer models. Approximation methods reduce this qua… ▽ More

    Submitted 27 April, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: 25 pages, 10 figures

    MSC Class: F.2.1

  32. arXiv:2110.06863  [pdf, other

    cs.CV cs.AI cs.HC

    Improving Users' Mental Model with Attention-directed Counterfactual Edits

    Authors: Kamran Alipour, Arijit Ray, Xiao Lin, Michael Cogswell, Jurgen P. Schulze, Yi Yao, Giedrius T. Burachas

    Abstract: In the domain of Visual Question Answering (VQA), studies have shown improvement in users' mental model of the VQA system when they are exposed to examples of how these systems answer certain Image-Question (IQ) pairs. In this work, we show that showing controlled counterfactual image-question examples are more effective at improving the mental model of users as compared to simply showing random e… ▽ More

    Submitted 15 October, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Accepted for publication in Applied AI Letters

  33. arXiv:2110.05448  [pdf, other

    cs.CL cs.AI

    Unsupervised Neural Machine Translation with Generative Language Models Only

    Authors: Jesse Michael Han, Igor Babuschkin, Harrison Edwards, Arvind Neelakantan, Tao Xu, Stanislas Polu, Alex Ray, Pranav Shyam, Aditya Ramesh, Alec Radford, Ilya Sutskever

    Abstract: We show how to derive state-of-the-art unsupervised neural machine translation systems from generatively pre-trained language models. Our method consists of three steps: few-shot amplification, distillation, and backtranslation. We first use the zero-shot translation ability of large pre-trained language models to generate translations for a small set of unlabeled sentences. We then amplify these… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: 10 pages

  34. arXiv:2109.07647  [pdf, other

    cs.DS math.NA

    Sublinear Time Eigenvalue Approximation via Random Sampling

    Authors: Rajarshi Bhattacharjee, Gregory Dexter, Petros Drineas, Cameron Musco, Archan Ray

    Abstract: We study the problem of approximating the eigenspectrum of a symmetric matrix $\mathbf A \in \mathbb{R}^{n \times n}$ with bounded entries (i.e., $\|\mathbf A\|_{\infty} \leq 1$). We present a simple sublinear time algorithm that approximates all eigenvalues of $\mathbf{A}$ up to additive error $\pm εn$ using those of a randomly sampled… ▽ More

    Submitted 21 July, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: 58 pages, 4 figures

    MSC Class: F.2.1; G.1.3; G.1.2; G.4; I.1.2

  35. arXiv:2107.03374  [pdf, other

    cs.LG

    Evaluating Large Language Models Trained on Code

    Authors: Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter , et al. (33 additional authors not shown)

    Abstract: We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J sol… ▽ More

    Submitted 14 July, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: corrected typos, added references, added authors, added acknowledgements

  36. arXiv:2106.14464  [pdf, other

    cs.CL cs.AI

    Enhancing the Generalization for Intent Classification and Out-of-Domain Detection in SLU

    Authors: Yilin Shen, Yen-Chang Hsu, Avik Ray, Hongxia Jin

    Abstract: Intent classification is a major task in spoken language understanding (SLU). Since most models are built with pre-collected in-domain (IND) training utterances, their ability to detect unsupported out-of-domain (OOD) utterances has a critical effect in practical use. Recent works have shown that using extra data and labels can improve the OOD detection performance, yet it could be costly to colle… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

  37. arXiv:2106.13898  [pdf, other

    cs.LG cs.AI cs.NE cs.RO math.DS

    Closed-form Continuous-time Neural Models

    Authors: Ramin Hasani, Mathias Lechner, Alexander Amini, Lucas Liebenwein, Aaron Ray, Max Tschaikowski, Gerald Teschl, Daniela Rus

    Abstract: Continuous-time neural processes are performant sequential decision-makers that are built by differential equations (DE). However, their expressive power when they are deployed on computers is bottlenecked by numerical DE solvers. This limitation has significantly slowed down the scaling and understanding of numerous natural physical phenomena such as the dynamics of nervous systems. Ideally, we w… ▽ More

    Submitted 2 March, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: 40 pages

    Journal ref: Nature Machine Intelligence 4, 992--1003 (2022)

  38. Optimized ensemble deep learning framework for scalable forecasting of dynamics containing extreme events

    Authors: Arnob Ray, Tanujit Chakraborty, Dibakar Ghosh

    Abstract: The remarkable flexibility and adaptability of both deep learning models and ensemble methods have led to the proliferation for their application in understanding many physical phenomena. Traditionally, these two techniques have largely been treated as independent methodologies in practical applications. This study develops an optimized ensemble deep learning (OEDL) framework wherein these two mac… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: 14 pages, 8 figures, any comments are welcome

  39. arXiv:2105.05395  [pdf, other

    cs.AI

    Bayesian Model Averaging for Data Driven Decision Making when Causality is Partially Known

    Authors: Marios Papamichalis, Abhishek Ray, Ilias Bilionis, Karthik Kannan, Rajiv Krishnamurthy

    Abstract: Probabilistic machine learning models are often insufficient to help with decisions on interventions because those models find correlations - not causal relationships. If observational data is only available and experimentation are infeasible, the correct approach to study the impact of an intervention is to invoke Pearl's causality framework. Even that framework assumes that the underlying causal… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

  40. There is no APTAS for 2-dimensional vector bin packing: Revisited

    Authors: Arka Ray

    Abstract: We study the Vector Bin Packing and the Vector Bin Covering problems, multidimensional generalizations of the Bin Packing and the Bin Covering problems, respectively. In the Vector Bin Packing, we are given a set of $d$-dimensional vectors from $[0,1]^d$ and the aim is to partition the set into the minimum number of bins such that for each bin $B$, each component of the sum of the vectors in $B$ i… ▽ More

    Submitted 1 August, 2023; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: 10 pages; omitted proof can be found in the source; changes: improved presentation

    Journal ref: Information Processing Letters 183C (2024) 106430

  41. arXiv:2103.14712  [pdf, other

    cs.CV cs.AI cs.CY cs.HC

    Generating and Evaluating Explanations of Attended and Error-Inducing Input Regions for VQA Models

    Authors: Arijit Ray, Michael Cogswell, Xiao Lin, Kamran Alipour, Ajay Divakaran, Yi Yao, Giedrius Burachas

    Abstract: Attention maps, a popular heatmap-based explanation method for Visual Question Answering (VQA), are supposed to help users understand the model by highlighting portions of the image/question used by the model to infer answers. However, we see that users are often misled by current attention map visualizations that point to relevant regions despite the model producing an incorrect answer. Hence, we… ▽ More

    Submitted 25 October, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Applied AI Letters, Wiley, 25 October 2021

  42. arXiv:2101.02584  [pdf, other

    cs.CE

    Oscillatory Residual Stresses in Steady Angular Channel Extrusion

    Authors: Arunava Ray, Pritam Chakraborty, Anindya Chatterjee

    Abstract: Angular channel extrusion has evolved as processes that can induce significant strengthening of the formed product through grain refinement. However, significant residual stresses are developed in the extruded product whose quantification is necessary for accurate process design and subsequent heat treatment. Experimental evaluation of residual stress provides the through thickness (normal) variat… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 24 pages, 19 figures

    ACM Class: J.2

  43. arXiv:2010.01658  [pdf, other

    cs.CL

    Generating Dialogue Responses from a Semantic Latent Space

    Authors: Wei-Jen Ko, Avik Ray, Yilin Shen, Hongxia Jin

    Abstract: Existing open-domain dialogue generation models are usually trained to mimic the gold response in the training set using cross-entropy loss on the vocabulary. However, a good response does not need to resemble the gold response, since there are multiple possible responses to a given prompt. In this work, we hypothesize that the current models are unable to integrate information from multiple seman… ▽ More

    Submitted 4 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  44. arXiv:2008.12920  [pdf, other

    cs.RO

    Development and Testing of a Novel Automated Insect Capture Module for Sample Collection and Transfer

    Authors: Keran Ye, Gustavo J. Correa, Tom Guda, Hanzhe Teng, Anandasankar Ray, Konstantinos Karydis

    Abstract: There exists an urgent need for efficient tools in disease surveillance to help model and predict the spread of disease. The transmission of insect-borne diseases poses a serious concern to public health officials and the medical and research community at large. In the modeling of this spread, we face bottlenecks in (1) the frequency at which we are able to sample insect vectors in environments th… ▽ More

    Submitted 29 August, 2020; originally announced August 2020.

    Comments: Accepted to IEEE International Conference on Automation Science and Engineering (CASE) 2020

  45. arXiv:2007.00900  [pdf, other

    cs.CV cs.AI cs.HC

    The Impact of Explanations on AI Competency Prediction in VQA

    Authors: Kamran Alipour, Arijit Ray, Xiao Lin, Jurgen P. Schulze, Yi Yao, Giedrius T. Burachas

    Abstract: Explainability is one of the key elements for building trust in AI systems. Among numerous attempts to make AI explainable, quantifying the effect of explanations remains a challenge in conducting human-AI collaborative tasks. Aside from the ability to predict the overall behavior of AI, in many applications, users need to understand an AI agent's competency in different aspects of the task domain… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

    Comments: Submitted to HCCAI 2020

  46. arXiv:2005.12852  [pdf

    q-bio.OT cs.CE

    3D CA model of tumor-induced angiogenesis

    Authors: Monjoy Saha, Amit Kumar Ray, Swapan Kumar Basu

    Abstract: Tumor-induced angiogenesis is the formation of new sprouts from preexisting nearby parent blood vessels. Computationally, tumor-induced angiogenesis can be modeled using cellular automata (CA), partial differential equations, etc. In this present study, a realistic physiological approach has been made to model the process of angiogenesis by using 3D CA model. CA technique uses various neighborhood… ▽ More

    Submitted 24 May, 2020; originally announced May 2020.

    Comments: International Conference on Modeling and Simulation of Diffusive Processes and Applications, 2012, Page 170-174

  47. arXiv:2004.09846  [pdf, other

    cs.LG cs.AI stat.ML

    SIBRE: Self Improvement Based REwards for Adaptive Feedback in Reinforcement Learning

    Authors: Somjit Nath, Richa Verma, Abhik Ray, Harshad Khadilkar

    Abstract: We propose a generic reward shaping approach for improving the rate of convergence in reinforcement learning (RL), called Self Improvement Based REwards, or SIBRE. The approach is designed for use in conjunction with any existing RL algorithm, and consists of rewarding improvement over the agent's own past performance. We prove that SIBRE converges in expectation under the same conditions as the o… ▽ More

    Submitted 21 December, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: 7 pages, 10 figures

  48. arXiv:1911.00089  [pdf, ps, other

    cs.NE cs.LG stat.ML

    A Dynamically Controlled Recurrent Neural Network for Modeling Dynamical Systems

    Authors: Yiwei Fu, Samer Saab Jr, Asok Ray, Michael Hauser

    Abstract: This work proposes a novel neural network architecture, called the Dynamically Controlled Recurrent Neural Network (DCRNN), specifically designed to model dynamical systems that are governed by ordinary differential equations (ODEs). The current state vectors of these types of dynamical systems only depend on their state-space models, along with the respective inputs and initial conditions. Long S… ▽ More

    Submitted 31 October, 2019; originally announced November 2019.

  49. arXiv:1910.07060  [pdf, other

    cs.CL

    Iterative Delexicalization for Improved Spoken Language Understanding

    Authors: Avik Ray, Yilin Shen, Hongxia Jin

    Abstract: Recurrent neural network (RNN) based joint intent classification and slot tagging models have achieved tremendous success in recent years for building spoken language understanding and dialog systems. However, these models suffer from poor performance for slots which often encounter large semantic variability in slot values after deployment (e.g. message texts, partial movie/artist names). While g… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: Published at INTERSPEECH 2019, Graz, Austria

    Journal ref: Proc. Interspeech 2019 (2019): 1183-1187

  50. arXiv:1909.04696  [pdf, other

    cs.CV cs.AI

    Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation

    Authors: Arijit Ray, Karan Sikka, Ajay Divakaran, Stefan Lee, Giedrius Burachas

    Abstract: While models for Visual Question Answering (VQA) have steadily improved over the years, interacting with one quickly reveals that these models lack consistency. For instance, if a model answers "red" to "What color is the balloon?", it might answer "no" if asked, "Is the balloon red?". These responses violate simple notions of entailment and raise questions about how effectively VQA models ground… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019)