Showing 1–50 of 401 results for author: Shashank

  1. arXiv:2407.10250  [pdf, ps, other]

    cs.IT

    Product and Ratio of Two $α-κ-μ$ Shadowed Random Variables and its Application to Wireless Communication

    Authors: Shashank Shekhar, Sheetal Kalyani

    Abstract: This work studies the product and ratio statistics of independent and non-identically distributed (i.n.i.d) $α-κ-μ$ shadowed random variables. We derive the series expression for the probability density function (PDF), cumulative distribution function (CDF), and moment generating function (MGF) of the product and ratio of i.n.i.d $α-κ-μ$ shadowed random variables. We then give the single inte…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2203.15760

  2. arXiv:2407.09809  [pdf, other]

    cs.AI

    Preserving the Privacy of Reward Functions in MDPs through Deception

    Authors: Shashank Reddy Chirra, Pradeep Varakantham, Praveen Paruchuri

    Abstract: Preserving the privacy of preferences (or rewards) of a sequential decision-making agent when decisions are observable is crucial in many physical and cybersecurity domains. For instance, in wildlife monitoring, agents must allocate patrolling resources without revealing animal locations to poachers. This paper addresses privacy preservation in planning over a sequence of actions in MDPs, where th…

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: ECAI 2024

  3. arXiv:2407.08003  [pdf, other]

    cs.LG cs.AI

    Machine Learning for ALSFRS-R Score Prediction: Making Sense of the Sensor Data

    Authors: Ritesh Mehta, Aleksandar Pramov, Shashank Verma

    Abstract: Amyotrophic Lateral Sclerosis (ALS) is characterized as a rapidly progressive neurodegenerative disease that presents individuals with limited treatment options in the realm of medical interventions and therapies. The disease showcases a diverse range of onset patterns and progression trajectories, emphasizing the critical importance of early detection of functional decline to enable tailored care…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Paper submitted to CLEF 2024 CEUR-WS

  4. arXiv:2407.03305  [pdf, other]

    cs.CV

    Advanced Smart City Monitoring: Real-Time Identification of Indian Citizen Attributes

    Authors: Shubham Kale, Shashank Sharma, Abhilash Khuntia

    Abstract: This project focuses on creating a smart surveillance system for Indian cities that can identify and analyze people's attributes in real time. Using advanced technologies like artificial intelligence and machine learning, the system can recognize attributes such as upper body color, what the person is wearing, accessories they are wearing, headgear, etc., and analyze behavior through cameras insta…

    Submitted 5 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: 6 pages, 8 figures; title changed and some alignment issues resolved, other content unchanged

  5. arXiv:2407.02514  [pdf, other]

    cs.LO cs.AI cs.CL

    LOGIC-LM++: Multi-Step Refinement for Symbolic Formulations

    Authors: Shashank Kirtania, Priyanshu Gupta, Arjun Radhakrishna

    Abstract: In this paper we examine the limitations of Large Language Models (LLMs) for complex reasoning tasks. Although recent works have started to employ formal languages as an intermediate representation for reasoning tasks, they often face challenges in accurately generating and refining these formal specifications to ensure correctness. To address these issues, this paper proposes Logic-LM++, an impro…

    Submitted 4 July, 2024; v1 submitted 22 June, 2024; originally announced July 2024.

  6. arXiv:2407.00938  [pdf, other]

    cs.CL cs.CY

    MalAlgoQA: A Pedagogical Approach for Evaluating Counterfactual Reasoning Abilities

    Authors: Naiming Liu, Shashank Sonkar, Myco Le, Richard Baraniuk

    Abstract: This paper introduces MalAlgoQA, a novel dataset designed to evaluate the counterfactual reasoning capabilities of Large Language Models (LLMs) through a pedagogical approach. The dataset comprises mathematics and reading comprehension questions, each accompanied by four answer choices and their corresponding rationales. We focus on the incorrect answer rationales, termed "malgorithms", which high…

    Submitted 30 June, 2024; originally announced July 2024.

  7. arXiv:2406.19626  [pdf, other]

    cs.AI

    Safety through feedback in Constrained RL

    Authors: Shashank Reddy Chirra, Pradeep Varakantham, Praveen Paruchuri

    Abstract: In safety-critical RL settings, the inclusion of an additional cost function is often favoured over the arduous task of modifying the reward function to ensure the agent's safe behaviour. However, designing or evaluating such a cost function can be prohibitively expensive. For instance, in the domain of self-driving, designing a cost function that encompasses all unsafe behaviours (e.g. aggressive…

    Submitted 27 June, 2024; originally announced June 2024.

  8. arXiv:2406.15578  [pdf, other]

    cs.RO

    Neural Moving Horizon Estimation: A Systematic Literature Review

    Authors: Surrayya Mobeen, Jann Cristobal, Shashank Singoji, Basaam Rassas, Mohammadreza Izadi, Zeinab Shayan, Amin Yazdanshenas, Harneet Kaur, Robert Barnsley, Lana Elliott, Reza Faieghi

    Abstract: The neural moving horizon estimator (NMHE) is a relatively new and powerful state estimator that combines the strengths of neural networks (NNs) and model-based state estimation techniques. Various approaches exist for constructing NMHEs, each with its unique advantages and limitations. However, a comprehensive literature review that consolidates existing knowledge, outlines design guidelines and…

    Submitted 21 June, 2024; originally announced June 2024.

  9. arXiv:2406.10520  [pdf, ps, other]

    cs.CV eess.IV eess.SP

    Full reference point cloud quality assessment using support vector regression

    Authors: Ryosuke Watanabe, Shashank N. Sridhara, Haoran Hong, Eduardo Pavez, Keisuke Nonaka, Tatsuya Kobayashi, Antonio Ortega

    Abstract: Point clouds are a general format for representing realistic 3D objects in diverse 3D applications. Since point clouds have large data sizes, developing efficient point cloud compression methods is crucial. However, excessive compression leads to various distortions, which deteriorates the point cloud quality perceived by end users. Thus, establishing reliable point cloud quality assessment (PCQA)…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Source code: https://github.com/STAC-USC/FRSVR-PCQA

  10. arXiv:2406.07435  [pdf, other]

    cs.CV cs.LG eess.IV

    Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration

    Authors: Shashank Agnihotri, Julia Grabinski, Janis Keuper, Margret Keuper

    Abstract: Image restoration networks are usually comprised of an encoder and a decoder, responsible for aggregating image content from noisy, distorted data and to restore clean, undistorted images, respectively. Data aggregation as well as high-resolution image generation both usually come at the risk of involving aliases, i.e., standard architectures put their ability to reconstruct the model input in jeop…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Tags: Adversarial attack, image restoration, image deblurring, frequency sampling

  11. arXiv:2406.06448  [pdf, other]

    cs.HC

    How is the Pilot Doing: VTOL Pilot Workload Estimation by Multimodal Machine Learning on Psycho-physiological Signals

    Authors: Jong Hoon Park, Lawrence Chen, Ian Higgins, Zhaobo Zheng, Shashank Mehrotra, Kevin Salubre, Mohammadreza Mousaei, Steven Willits, Blain Levedahl, Timothy Buker, Eliot Xing, Teruhisa Misu, Sebastian Scherer, Jean Oh

    Abstract: Vertical take-off and landing (VTOL) aircraft do not require a prolonged runway, thus allowing them to land almost anywhere. In recent years, their flexibility has made them popular in development, research, and operation. When compared to traditional fixed-wing aircraft and rotorcraft, VTOLs bring unique challenges as they combine many maneuvers from both types of aircraft. Pilot workload is a cr…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 8 pages, 7 figures

  12. arXiv:2405.17475  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    How Culturally Aware are Vision-Language Models?

    Authors: Olena Burda-Lassen, Aman Chadha, Shashank Goswami, Vinija Jain

    Abstract: An image is often said to be worth a thousand words, and certain images can tell rich and insightful stories. Can these stories be told via image captioning? Images from folklore genres, such as mythology, folk dance, cultural signs, and symbols, are vital to every culture. Our research compares the performance of four popular vision-language models (GPT-4V, Gemini Pro Vision, LLaVA, and OpenFlami…

    Submitted 24 May, 2024; originally announced May 2024.

  13. arXiv:2405.13009  [pdf, other]

    cs.CL cs.AI

    METAREFLECTION: Learning Instructions for Language Agents using Past Reflections

    Authors: Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Sherry Shi, Gustavo Soares

    Abstract: Despite the popularity of Large Language Models (LLMs), crafting specific prompts for LLMs to perform particular tasks remains challenging. Users often engage in multiple conversational turns with an LLM-based agent to accomplish their intended task. Recent studies have demonstrated that linguistic feedback, in the form of self-reflections generated by the model, can work as reinforcement during t…

    Submitted 13 May, 2024; originally announced May 2024.

  14. arXiv:2405.10345  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Machine Learning Driven Biomarker Selection for Medical Diagnosis

    Authors: Divyagna Bavikadi, Ayushi Agarwal, Shashank Ganta, Yunro Chung, Lusheng Song, Ji Qiu, Paulo Shakarian

    Abstract: Recent advances in experimental methods have enabled researchers to collect data on thousands of analytes simultaneously. This has led to correlational studies that associated molecular measurements with diseases such as Alzheimer's, Liver, and Gastric Cancer. However, the use of thousands of biomarkers selected from the analytes is not practical for real-world medical diagnosis and is likely unde…

    Submitted 15 May, 2024; originally announced May 2024.

  15. arXiv:2405.09566  [pdf, ps, other]

    eess.SP cs.LG

    Detection of Sleep Oxygen Desaturations from Electroencephalogram Signals

    Authors: Shashank Manjunath, Aarti Sathyanarayana

    Abstract: In this work, we leverage machine learning techniques to identify potential biomarkers of oxygen desaturation during sleep exclusively from electroencephalogram (EEG) signals in pediatric patients with sleep apnea. Development of a machine learning technique which can successfully identify EEG signals from patients with sleep apnea as well as identify latent EEG signals which come from subjects wh…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 4 Pages

  16. arXiv:2405.08134  [pdf, other]

    cs.CL

    Many-Shot Regurgitation (MSR) Prompting

    Authors: Shashank Sonkar, Richard G. Baraniuk

    Abstract: We introduce Many-Shot Regurgitation (MSR) prompting, a new black-box membership inference attack framework for examining verbatim content reproduction in large language models (LLMs). MSR prompting involves dividing the input text into multiple segments and creating a single prompt that includes a series of faux conversation rounds between a user and a language model to elicit verbatim regurgitat…

    Submitted 13 May, 2024; originally announced May 2024.

  17. arXiv:2405.05757  [pdf, other]

    cs.ET eess.SY

    Design and Implementation of Energy-Efficient Wireless Tire Sensing System with Delay Analysis for Intelligent Vehicles

    Authors: Shashank Mishra, Jia-Ming Liang

    Abstract: The growing prevalence of Internet of Things (IoT) technologies has led to a rise in the popularity of intelligent vehicles that incorporate a range of sensors to monitor various aspects, such as driving speed, fuel usage, distance proximity and tire anomalies. Nowadays, real-time tire sensing systems play important roles for intelligent vehicles in increasing mileage, reducing fuel consumption, i…

    Submitted 27 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  18. arXiv:2405.05736  [pdf, other]

    cs.LG cs.IR

    Optimal Baseline Corrections for Off-Policy Contextual Bandits

    Authors: Shashank Gupta, Olivier Jeunen, Harrie Oosterhuis, Maarten de Rijke

    Abstract: The off-policy learning paradigm allows for recommender systems and general ranking applications to be framed as decision-making problems, where we aim to learn decision policies that optimize an unbiased offline estimate of an online reward metric. With unbiasedness comes potentially high variance, and prevalent methods exist to reduce estimation variance. These methods typically make use of cont…

    Submitted 9 May, 2024; originally announced May 2024.

  19. arXiv:2405.01852  [pdf]

    cs.DC cs.CR cs.ET

    Tokenization of Real Estate Assets Using Blockchain

    Authors: Shashank Joshi, Arhan Choudhury

    Abstract: Blockchain technology is one of the key technologies that have revolutionized various facets of society, such as the banking, healthcare, and other critical ecosystems. One area that can harness the usage of blockchain is the real estate sector. The most lucrative long-term investment is real estate, followed by gold, equities, mutual funds, and savings accounts. Nevertheless, it has administrativ…

    Submitted 3 May, 2024; originally announced May 2024.

    Journal ref: IJIIT vol.18, no.3 2022: pp.1-12.

  20. arXiv:2405.01573  [pdf, other]

    cs.SE cs.AI

    Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository

    Authors: Ajinkya Deshpande, Anmol Agarwal, Shashank Shet, Arun Iyer, Aditya Kanade, Ramakrishna Bairi, Suresh Parthasarathy

    Abstract: LLMs have demonstrated significant potential in code generation tasks, achieving promising results at the function or statement level across various benchmarks. However, the complexities associated with creating code artifacts like classes, particularly within the context of real-world software repositories, remain underexplored. Prior research treats class-level generation as an isolated task, ne…

    Submitted 5 June, 2024; v1 submitted 21 April, 2024; originally announced May 2024.

    Comments: Preprint with additional experiments

  21. arXiv:2405.00554  [pdf, other]

    cs.IR

    A First Look at Selection Bias in Preference Elicitation for Recommendation

    Authors: Shashank Gupta, Harrie Oosterhuis, Maarten de Rijke

    Abstract: Preference elicitation explicitly asks users what kind of recommendations they would like to receive. It is a popular technique for conversational recommender systems to deal with cold-starts. Previous work has studied selection bias in implicit feedback, e.g., clicks, and in some forms of explicit feedback, i.e., ratings on items. Despite the fact that the extreme sparsity of preference elicitati…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted at the CONSEQUENCES'23 workshop at RecSys '23

  22. arXiv:2405.00250  [pdf, other]

    cs.CV cs.RO

    SemVecNet: Generalizable Vector Map Generation for Arbitrary Sensor Configurations

    Authors: Narayanan Elavathur Ranganatha, Hengyuan Zhang, Shashank Venkatramani, Jing-Yan Liao, Henrik I. Christensen

    Abstract: Vector maps are essential in autonomous driving for tasks like localization and planning, yet their creation and maintenance are notably costly. While recent advances in online vector map generation for autonomous vehicles are promising, current models lack adaptability to different sensor configurations. They tend to overfit to specific sensor poses, leading to decreased performance and higher re…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 8 pages, 6 figures, Accepted to IV 2024

  23. arXiv:2404.19630  [pdf, other]

    cs.LG

    Analyzing and Exploring Training Recipes for Large-Scale Transformer-Based Weather Prediction

    Authors: Jared D. Willard, Peter Harrington, Shashank Subramanian, Ankur Mahesh, Travis A. O'Brien, William D. Collins

    Abstract: The rapid rise of deep learning (DL) in numerical weather prediction (NWP) has led to a proliferation of models which forecast atmospheric variables with comparable or superior skill than traditional physics-based NWP. However, among these leading DL models, there is a wide variance in both the training settings and architecture used. Further, the lack of thorough ablation studies makes it hard to…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures

    MSC Class: 68T07; 86A10 ACM Class: J.2; I.2.6

    Journal ref: 23rd Conference on Artificial Intelligence for Environmental Science. Jan 2024. Abstract #437874

  24. arXiv:2404.18400  [pdf, other]

    cs.LG cs.AI cs.CL cs.NE

    LLM-SR: Scientific Equation Discovery via Programming with Large Language Models

    Authors: Parshin Shojaee, Kazem Meidani, Shashank Gupta, Amir Barati Farimani, Chandan K Reddy

    Abstract: Mathematical equations have been unreasonably effective in describing complex natural phenomena across various scientific disciplines. However, discovering such insightful equations from data presents significant challenges due to the necessity of navigating extremely high-dimensional combinatorial and nonlinear hypothesis spaces. Traditional methods of equation discovery, commonly known as symbol…

    Submitted 2 June, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

  25. Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection

    Authors: Farzad Nozarian, Shashank Agarwal, Farzaneh Rezaeianaran, Danish Shahzad, Atanas Poibrenski, Christian Müller, Philipp Slusallek

    Abstract: Semi-supervised 3D object detection can benefit from the promising pseudo-labeling technique when labeled data is limited. However, recent approaches have overlooked the impact of noisy pseudo-labels during training, despite efforts to enhance pseudo-label quality through confidence-based filtering. In this paper, we examine the impact of noisy pseudo-labels on IoU-based target assignment and prop…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR Workshop L3D-IVU 2023. Code: https://github.com/fnozarian/ReliableStudent

  26. arXiv:2404.16687  [pdf, other]

    cs.CV

    NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng, et al. (89 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Conte…

    Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  27. arXiv:2404.15923  [pdf, other]

    cs.AI cs.CL

    KGValidator: A Framework for Automatic Validation of Knowledge Graph Construction

    Authors: Jack Boylan, Shashank Mangla, Dominic Thorn, Demian Gholipour Ghalandari, Parsa Ghaffari, Chris Hokamp

    Abstract: This study explores the use of Large Language Models (LLMs) for automatic evaluation of knowledge graph (KG) completion models. Historically, validating information in KGs has been a challenging task, requiring large-scale human annotation at prohibitive cost. With the emergence of general-purpose generative AI and LLMs, it is now plausible that human-in-the-loop validation could be replaced by a…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Text2KG 2024, ESWC 2024

  28. arXiv:2404.15156  [pdf, other]

    cs.CL

    Regressive Side Effects of Training Language Models to Mimic Student Misconceptions

    Authors: Shashank Sonkar, Naiming Liu, Richard G. Baraniuk

    Abstract: This paper presents a novel exploration into the regressive side effects of training Large Language Models (LLMs) to mimic student misconceptions for personalized education. We highlight the problem that as LLMs are trained to more accurately mimic student misconceptions, there is a compromise in the factual integrity and reasoning ability of the models. Our work involved training an LLM on a stud…

    Submitted 23 April, 2024; originally announced April 2024.

  29. arXiv:2404.14316  [pdf, other]

    cs.CL

    Automated Long Answer Grading with RiceChem Dataset

    Authors: Shashank Sonkar, Kangqi Ni, Lesa Tran Lu, Kristi Kincaid, John S. Hutchinson, Richard G. Baraniuk

    Abstract: We introduce a new area of study in the field of educational Natural Language Processing: Automated Long Answer Grading (ALAG). Distinguishing itself from Automated Short Answer Grading (ASAG) and Automated Essay Grading (AEG), ALAG presents unique challenges due to the complexity and multifaceted nature of fact-based long answers. To study ALAG, we introduce RiceChem, a dataset derived from a col…

    Submitted 22 April, 2024; originally announced April 2024.

  30. arXiv:2404.14301  [pdf, other]

    cs.CL

    Marking: Visual Grading with Highlighting Errors and Annotating Missing Bits

    Authors: Shashank Sonkar, Naiming Liu, Debshila B. Mallick, Richard G. Baraniuk

    Abstract: In this paper, we introduce "Marking", a novel grading task that enhances automated grading systems by performing an in-depth analysis of student responses and providing students with visual highlights. Unlike traditional systems that provide binary scores, "marking" identifies and categorizes segments of the student response as correct, incorrect, or irrelevant and detects omissions from gold ans…

    Submitted 22 April, 2024; originally announced April 2024.

  31. arXiv:2404.14167  [pdf, other]

    cs.RO

    A multi-robot system for the detection of explosive devices

    Authors: Ken Hasselmann, Mario Malizia, Rafael Caballero, Fabio Polisano, Shashank Govindaraj, Jakob Stigler, Oleksii Ilchenko, Milan Bajic, Geert De Cubber

    Abstract: In order to clear the world of the threat posed by landmines and other explosive devices, robotic systems can play an important role. However, the development of such field robots that need to operate in hazardous conditions requires the careful consideration of multiple aspects related to the perception, mobility, and collaboration capabilities of the system. In the framework of a European challe…

    Submitted 22 April, 2024; originally announced April 2024.

    Journal ref: IEEE ICRA Workshop on Field Robotics 2024

  32. arXiv:2404.09067  [pdf, other]

    cs.CV cs.AI

    Exploring Explainability in Video Action Recognition

    Authors: Avinab Saha, Shashank Gupta, Sravan Kumar Ankireddy, Karl Chahine, Joydeep Ghosh

    Abstract: Image Classification and Video Action Recognition are perhaps the two most foundational tasks in computer vision. Consequently, explaining the inner workings of trained deep neural networks is of prime importance. While numerous efforts focus on explaining the decisions of trained deep neural networks in image classification, exploration in the domain of its temporal version, video action recognit…

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 6 pages, 10 figures, Accepted to the 3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024

  33. arXiv:2404.01234  [pdf, other]

    cs.CL math.LO

    GFLean: An Autoformalisation Framework for Lean via GF

    Authors: Shashank Pathak

    Abstract: We present an autoformalisation framework for the Lean theorem prover, called GFLean. GFLean uses a high-level grammar writing tool called Grammatical Framework (GF) for parsing and linearisation. GFLean is implemented in Haskell. We explain the functionalities of GFLean, its inner working and discuss its limitations. We also discuss how we can use neural network based translation programs and rul…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 19 Pages, 3 Figures

    ACM Class: I.2.7

  34. Should I Help a Delivery Robot? Cultivating Prosocial Norms through Observations

    Authors: Vivienne Bihe Chi, Shashank Mehrotra, Teruhisa Misu, Kumar Akash

    Abstract: We propose leveraging prosocial observations to cultivate new social norms to encourage prosocial behaviors toward delivery robots. With an online experiment, we quantitatively assess updates in norm beliefs regarding human-robot prosocial behaviors through observational learning. Results demonstrate the initially perceived normativity of helping robots is influenced by familiarity with delivery r…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted as a Late Breaking Work at CHI'24

  35. arXiv:2403.14795  [pdf, other]

    cs.CE

    Advanced Deep Operator Networks to Predict Multiphysics Solution Fields in Materials Processing and Additive Manufacturing

    Authors: Shashank Kushwaha, Jaewan Park, Seid Koric, Junyan He, Iwona Jasiuk, Diab Abueidda

    Abstract: Unlike classical artificial neural networks, which require retraining for each new set of parametric inputs, the Deep Operator Network (DeepONet), a lately introduced deep learning framework, approximates linear and nonlinear solution operators by taking parametric functions (infinite-dimensional objects) as inputs and mapping them to complete solution fields. In this paper, two newly devised Deep…

    Submitted 21 March, 2024; originally announced March 2024.

  36. arXiv:2403.07389  [pdf, other]

    cs.CV cs.AI eess.IV

    Auxiliary CycleGAN-guidance for Task-Aware Domain Translation from Duplex to Monoplex IHC Images

    Authors: Nicolas Brieu, Nicolas Triltsch, Philipp Wortmann, Dominik Winter, Shashank Saran, Marlon Rebelatto, Günter Schmidt

    Abstract: Generative models enable the translation from a source image domain where readily trained models are available to a target domain unseen during training. While Cycle Generative Adversarial Networks (GANs) are well established, the associated cycle consistency constraint relies on that an invertible mapping exists between the two domains. This is, however, not the case for the translation between im…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 4 pages, 2 figures

    MSC Class: I.2.10; J.3; I.4.6

  37. arXiv:2403.05749  [pdf, other]

    eess.SY cs.DM

    Characterizing Flow Complexity in Transportation Networks using Graph Homology

    Authors: Shashank A Deshpande, Hamsa Balakrishnan

    Abstract: Series-parallel network topologies generally exhibit simplified dynamical behavior and avoid high combinatorial complexity. A comprehensive analysis of how flow complexity emerges with a graph's deviation from series-parallel topology is therefore of fundamental interest. We introduce the notion of a robust $k$-path on a directed acyclic graph, with increasing values of the length $k$ reflecting…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 7 pages, 3 figures, letter

  38. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  39. arXiv:2403.02651  [pdf, other]

    eess.SP cs.AI

    Learning at the Speed of Wireless: Online Real-Time Learning for AI-Enabled MIMO in NextG

    Authors: Jiarui Xu, Shashank Jere, Yifei Song, Yi-Hung Kao, Lizhong Zheng, Lingjia Liu

    Abstract: Integration of artificial intelligence (AI) and machine learning (ML) into the air interface has been envisioned as a key technology for next-generation (NextG) cellular networks. At the air interface, multiple-input multiple-output (MIMO) and its variants such as multi-user MIMO (MU-MIMO) and massive/full-dimension MIMO have been key enablers across successive generations of cellular networks wit…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 7 pages, 4 figures, 1 table, magazine paper

  40. arXiv:2402.19450  [pdf, other]

    cs.AI cs.CL

    Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap

    Authors: Saurabh Srivastava, Annarose M B, Anto P V, Shashank Menon, Ajay Sukumar, Adwaith Samod T, Alan Philipose, Stevin Prince, Sooraj Thomas

    Abstract: We propose a framework for robust evaluation of reasoning capabilities of language models, using functional variants of benchmarks. Models that solve a reasoning test should exhibit no difference in performance over the static version of a problem compared to a snapshot of the functional variant. We have rewritten the relevant fragment of the MATH benchmark into its functional variant MATH(), with…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 37 pages, 10 figures

  41. arXiv:2402.18729  [pdf, other]

    physics.flu-dyn cs.LG physics.data-an

    A Priori Uncertainty Quantification of Reacting Turbulence Closure Models using Bayesian Neural Networks

    Authors: Graham Pash, Malik Hassanaly, Shashank Yellapantula

    Abstract: While many physics-based closure model forms have been posited for the sub-filter scale (SFS) in large eddy simulation (LES), vast amounts of data available from direct numerical simulation (DNS) create opportunities to leverage data-driven modeling techniques. Albeit flexible, data-driven models still depend on the dataset and the functional form of the model chosen. Increased adoption of such mo…

    Submitted 28 February, 2024; originally announced February 2024.

  42. arXiv:2402.15734  [pdf, other]

    cs.LG stat.ML

    Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context Learning

    Authors: Wuyang Chen, Jialin Song, Pu Ren, Shashank Subramanian, Dmitriy Morozov, Michael W. Mahoney

    Abstract: Recent years have witnessed the promise of coupling machine learning methods and physical domain-specific insights for solving scientific problems based on partial differential equations (PDEs). However, being data-intensive, these methods still require a large amount of PDE data. This reintroduces the need for expensive numerical PDE solutions, partially undermining the original goal of avoiding t…

    Submitted 13 June, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  43. arXiv:2402.05000  [pdf, other]

    cs.CL

    Pedagogical Alignment of Large Language Models

    Authors: Shashank Sonkar, Kangqi Ni, Sapana Chaudhary, Richard G. Baraniuk

    Abstract: In this paper, we introduce the novel concept of pedagogically aligned Large Language Models (LLMs) that signifies a transformative shift in the application of LLMs within educational contexts. Rather than providing direct responses to user queries, pedagogically-aligned LLMs function as scaffolding tools, breaking complex problems into manageable subproblems and guiding students towards the final…

    Submitted 12 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  44. arXiv:2402.04699  [pdf, other]

    cs.CV cs.AI cs.LG cs.NE

    Breaking Free: How to Hack Safety Guardrails in Black-Box Diffusion Models!

    Authors: Shashank Kotyan, Po-Yuan Mao, Pin-Yu Chen, Danilo Vasconcellos Vargas

    Abstract: Deep neural networks can be exploited using natural adversarial samples, which do not impact human perception. Current approaches often rely on deep neural networks' white-box nature to generate these adversarial samples or synthetically alter the distribution of adversarial samples compared to the training distribution. In contrast, we propose EvoSeed, a novel evolutionary strategy-based algorith…

    Submitted 22 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  45. arXiv:2401.17883  [pdf, other]

    cs.CV

    Reimagining Reality: A Comprehensive Survey of Video Inpainting Techniques

    Authors: Shreyank N Gowda, Yash Thakre, Shashank Narayana Gowda, Xiaobo Jin

    Abstract: This paper offers a comprehensive analysis of recent advancements in video inpainting techniques, a critical subset of computer vision and artificial intelligence. As a process that restores or fills in missing or corrupted portions of video sequences with plausible content, video inpainting has evolved significantly with the advent of deep learning methodologies. Despite the plethora of existing…

    Submitted 31 January, 2024; originally announced January 2024.

  46. arXiv:2401.17711  [pdf]

    cs.HC cs.AI

    Prediction of multitasking performance post-longitudinal tDCS via EEG-based functional connectivity and machine learning methods

    Authors: Akash K Rao, Shashank Uttrani, Vishnu K Menon, Darshil Shah, Arnav Bhavsar, Shubhajit Roy Chowdhury, Varun Dutt

    Abstract: Predicting and understanding the changes in cognitive performance, especially after a longitudinal intervention, is a fundamental goal in neuroscience. Longitudinal brain stimulation-based interventions like transcranial direct current stimulation (tDCS) induce short-term changes in the resting membrane potential and influence cognitive processes. However, very little research has been conducted o…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 16 pages, presented at the 30th International Conference on Neural Information Processing (ICONIP2023), Changsha, China, November 2023

  47. arXiv:2401.17700  [pdf]

    cs.HC cs.AI

    Classification of executive functioning performance post-longitudinal tDCS using functional connectivity and machine learning methods

    Authors: Akash K Rao, Vishnu K Menon, Shashank Uttrani, Ayushman Dixit, Dipanshu Verma, Varun Dutt

    Abstract: Executive functioning is a cognitive process that enables humans to plan, organize, and regulate their behavior in a goal-directed manner. Understanding and classifying the changes in executive functioning after longitudinal interventions (like transcranial direct current stimulation (tDCS)) has not been explored in the literature. This study employs functional connectivity and machine learning al…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 7 pages, presented at the IEEE 20th India Council International Conference (INDICON 2023), Hyderabad, India, December 2023

  48. CUI@CHI 2024: Building Trust in CUIs-From Design to Deployment

    Authors: Smit Desai, Christina Wei, Jaisie Sin, Mateusz Dubiel, Nima Zargham, Shashank Ahire, Martin Porcheron, Anastasia Kuzminykh, Minha Lee, Heloisa Candello, Joel Fischer, Cosmin Munteanu, Benjamin R Cowan

    Abstract: Conversational user interfaces (CUIs) have become an everyday technology for people the world over, as well as a booming area of research. Advances in voice synthesis and the emergence of chatbots powered by large language models (LLMs), notably ChatGPT, have pushed CUIs to the forefront of human-computer interaction (HCI) research and practice. Now that these technologies enable an elemental leve…

    Submitted 25 January, 2024; originally announced January 2024.

  49. arXiv:2312.14418  [pdf, other]

    math.NA cs.LG

    Sharp error estimates for target measure diffusion maps with applications to the committor problem

    Authors: Shashank Sule, Luke Evans, Maria Cameron

    Abstract: We obtain asymptotically sharp error estimates for the consistency error of the Target Measure Diffusion map (TMDmap) (Banisch et al. 2020), a variant of diffusion maps featuring importance sampling and hence allowing input data drawn from an arbitrary density. The derived error estimates include the bias error and the variance error. The resulting convergence rates are consistent with the approxi…

    Submitted 21 December, 2023; originally announced December 2023.

  50. arXiv:2312.13938  [pdf, other]

    cs.DC cs.CY

    How Does Stake Distribution Influence Consensus? Analyzing Blockchain Decentralization

    Authors: Shashank Motepalli, Hans-Arno Jacobsen

    Abstract: In the PoS blockchain landscape, the challenge of achieving full decentralization is often hindered by a disproportionate concentration of staked tokens among a few validators. This study analyses this challenge by first formalizing decentralization metrics for weighted consensus mechanisms. An empirical analysis across ten permissionless blockchains uncovers significant weight concentration among…

    Submitted 20 May, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in ICBC 2024