Skip to main content

Showing 1–50 of 86 results for author: Wang, Z J

  1. arXiv:2407.01972  [pdf, other

    cs.IR cs.AI cs.HC cs.LG

    MeMemo: On-device Retrieval Augmentation for Private and Personalized Text Generation

    Authors: Zijie J. Wang, Duen Horng Chau

    Abstract: Retrieval-augmented text generation (RAG) addresses the common limitations of large language models (LLMs), such as hallucination, by retrieving information from an updatable external knowledge base. However, existing approaches often require dedicated backend servers for data storage and retrieval, thereby limiting their applicability in use cases that require strict data privacy, such as persona… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to SIGIR 2024. 6 pages, 2 figures. For a live demo, visit https://poloclub.github.io/mememo/. Code is open-source at https://github.com/poloclub/mememo

  2. arXiv:2405.03546  [pdf, other

    cs.CV cs.LG

    CCDM: Continuous Conditional Diffusion Models for Image Generation

    Authors: Xin Ding, Yongwei Wang, Kao Zhang, Z. Jane Wang

    Abstract: Continuous Conditional Generative Modeling (CCGM) aims to estimate the distribution of high-dimensional data, typically images, conditioned on scalar continuous variables known as regression labels. While Continuous conditional Generative Adversarial Networks (CcGANs) were initially designed for this task, their adversarial training mechanism remains vulnerable to extremely sparse or imbalanced da… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  3. arXiv:2404.16069  [pdf, other

    cs.HC cs.AI

    Interactive Visual Learning for Stable Diffusion

    Authors: Seongmin Lee, Benjamin Hoover, Hendrik Strobelt, Zijie J. Wang, ShengYun Peng, Austin Wright, Kevin Li, Haekyu Park, Haoyang Yang, Polo Chau

    Abstract: Diffusion-based generative models' impressive ability to create convincing images has garnered global attention. However, their complex internal structures and operations often pose challenges for non-experts to grasp. We introduce Diffusion Explainer, the first interactive visualization tool designed to elucidate how Stable Diffusion transforms text prompts into images. It tightly integrates a vi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 4 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2305.03509

  4. arXiv:2404.01361  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    LLM Attributor: Interactive Visual Attribution for LLM Generation

    Authors: Seongmin Lee, Zijie J. Wang, Aishwarya Chakravarthy, Alec Helbling, ShengYun Peng, Mansi Phute, Duen Horng Chau, Minsuk Kahng

    Abstract: While large language models (LLMs) have shown remarkable capability to generate convincing text across diverse domains, concerns around its potential risks have highlighted the importance of understanding the rationale behind text generation. We present LLM Attributor, a Python library that provides interactive visualizations for training data attribution of an LLM's text generation. Our library o… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures, For a video demo, see https://youtu.be/mIG2MDQKQxM

  5. arXiv:2403.19754  [pdf, other

    cs.CL

    GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data Generation

    Authors: Mohsen Gholami, Mohammad Akbari, Cindy Hu, Vaden Masrani, Z. Jane Wang, Yong Zhang

    Abstract: Knowledge distillation from LLMs is essential for the efficient deployment of language models. Prior works have proposed data generation using LLMs for preparing distilled models. We argue that generating data with LLMs is prone to sampling mainly from the center of original content distribution. This limitation hinders the distilled model from learning the true underlying data distribution and to… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  6. arXiv:2402.15350  [pdf, other

    cs.HC cs.AI cs.CY cs.LG

    Farsight: Fostering Responsible AI Awareness During AI Application Prototyping

    Authors: Zijie J. Wang, Chinmay Kulkarni, Lauren Wilcox, Michael Terry, Michael Madaio

    Abstract: Prompt-based interfaces for Large Language Models (LLMs) have made prototyping and building AI-powered applications easier than ever before. However, identifying potential harms that may arise from AI applications remains a challenge, particularly during prompt-based prototyping. To address this, we present Farsight, a novel in situ interactive tool that helps people identify potential harms from… ▽ More

    Submitted 2 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted to CHI 2024 (Best Paper, Honorable Mention). 40 pages, 19 figures, 5 tables. For a demo video, see https://youtu.be/BlSFbGkOlHk. For a live demo, visit https://PAIR-code.github.io/farsight. The source code is available at https://github.com/PAIR-code/farsight

  7. arXiv:2401.14447  [pdf, other

    cs.HC cs.AI cs.CL cs.LG

    Wordflow: Social Prompt Engineering for Large Language Models

    Authors: Zijie J. Wang, Aishwarya Chakravarthy, David Munechika, Duen Horng Chau

    Abstract: Large language models (LLMs) require well-crafted prompts for effective use. Prompt engineering, the process of designing prompts, is challenging, particularly for non-experts who are less familiar with AI technologies. While researchers have proposed techniques and tools to assist LLM users in prompt design, these works primarily target AI application developers rather than non-experts. To addres… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 8 pages, 7 figures. Wordflow is available at: https://poloclub.github.io/wordflow. The code is available at: https://github.com/poloclub/wordflow/. For a demo video, see: https://youtu.be/3dOcVuofGVo

  8. arXiv:2401.10029  [pdf

    cs.CE q-bio.TO

    Cardiac Digital Twin Pipeline for Virtual Therapy Evaluation

    Authors: Julia Camps, Zhinuo Jenny Wang, Ruben Doste, Maxx Holmes, Brodie Lawson, Jakub Tomek, Kevin Burrage, Alfonso Bueno-Orovio, Blanca Rodriguez

    Abstract: Cardiac digital twins are computational tools capturing key functional and anatomical characteristics of patient hearts for investigating disease phenotypes and predicting responses to therapy. When paired with large-scale computational resources and large clinical datasets, digital twin technology can enable virtual clinical trials on virtual cohorts to fast-track therapy development. Here, we pr… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  9. arXiv:2312.14915  [pdf, other

    cs.CV

    PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF

    Authors: Mohsen Gholami, Rabab Ward, Z. Jane Wang

    Abstract: This paper proposes an end-to-end framework for generating 3D human pose datasets using Neural Radiance Fields (NeRF). Public datasets generally have limited diversity in terms of human poses and camera viewpoints, largely due to the resource-intensive nature of collecting 3D human pose data. As a result, pose estimators trained on public datasets significantly underperform when applied to unseen… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  10. arXiv:2311.13196  [pdf, other

    cs.IT eess.SP stat.ME

    Optimal Time of Arrival Estimation for MIMO Backscatter Channels

    Authors: Chen He, Luyang Han, Z. Jane Wang

    Abstract: In this paper, we propose a novel time of arrival (TOA) estimator for multiple-input-multiple-output (MIMO) backscatter channels in closed form. The proposed estimator refines the estimation precision from the topological structure of the MIMO backscatter channels, and can considerably enhance the estimation accuracy. Particularly, we show that for the general $M \times N$ bistatic topology, the m… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  11. arXiv:2310.12347  [pdf, other

    cs.HC

    VisGrader: Automatic Grading of D3 Visualizations

    Authors: Matthew Hull, Vivian Pednekar, Hannah Murray, Nimisha Roy, Emmanuel Tung, Susanta Routray, Connor Guerin, Justin Chen, Zijie J. Wang, Seongmin Lee, Mahdi Roozbahani, Duen Horng Chau

    Abstract: Manually grading D3 data visualizations is a challenging endeavor, and is especially difficult for large classes with hundreds of students. Grading an interactive visualization requires a combination of interactive, quantitative, and qualitative evaluation that are conventionally done manually and are difficult to scale up as the visualization complexity, data size, and number of students increase… ▽ More

    Submitted 19 October, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  12. arXiv:2310.12243  [pdf, other

    cs.LG cs.CV

    REVAMP: Automated Simulations of Adversarial Attacks on Arbitrary Objects in Realistic Scenes

    Authors: Matthew Hull, Zijie J. Wang, Duen Horng Chau

    Abstract: Deep Learning models, such as those used in an autonomous vehicle are vulnerable to adversarial attacks where an attacker could place an adversarial object in the environment, leading to mis-classification. Generating these adversarial objects in the digital space has been extensively studied, however successfully transferring these attacks from the digital realm to the physical realm has proven c… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  13. arXiv:2310.05123  [pdf, other

    cs.AI

    Distribution-Based Trajectory Clustering

    Authors: Zi Jing Wang, Ye Zhu, Kai Ming Ting

    Abstract: Trajectory clustering enables the discovery of common patterns in trajectory data. Current methods of trajectory clustering rely on a distance measure between two points in order to measure the dissimilarity between two trajectories. The distance measures employed have two challenges: high computational cost and low fidelity. Independent of the distance measure employed, existing clustering algori… ▽ More

    Submitted 30 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  14. arXiv:2306.09328  [pdf, other

    cs.LG cs.CL cs.CV cs.HC

    WizMap: Scalable Interactive Visualization for Exploring Large Machine Learning Embeddings

    Authors: Zijie J. Wang, Fred Hohman, Duen Horng Chau

    Abstract: Machine learning models often learn latent embedding representations that capture the domain semantics of their training data. These embedding representations are valuable for interpreting trained models, building new models, and analyzing new datasets. However, interpreting and using embeddings can be challenging due to their opaqueness, high dimensionality, and the large size of modern datasets.… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 8 pages, 8 figures, Accepted to ACL 2023. For a demo video, see https://youtu.be/8fJG87QVceQ. For a live demo, see https://poloclub.github.io/wizmap. Code is available at https://github.com/poloclub/wizmap

  15. arXiv:2305.03509  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion

    Authors: Seongmin Lee, Benjamin Hoover, Hendrik Strobelt, Zijie J. Wang, ShengYun Peng, Austin Wright, Kevin Li, Haekyu Park, Haoyang Yang, Duen Horng Chau

    Abstract: Diffusion-based generative models' impressive ability to create convincing images has captured global attention. However, their complex internal structures and operations often make them difficult for non-experts to understand. We present Diffusion Explainer, the first interactive visualization tool that explains how Stable Diffusion transforms text prompts into images. Diffusion Explainer tightly… ▽ More

    Submitted 8 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: 5 pages, 5 figures

  16. SuperNOVA: Design Strategies and Opportunities for Interactive Visualization in Computational Notebooks

    Authors: Zijie J. Wang, David Munechika, Seongmin Lee, Duen Horng Chau

    Abstract: Computational notebooks, such as Jupyter Notebook, have become data scientists' de facto programming environments. Many visualization researchers and practitioners have developed interactive visualization tools that support notebooks, yet little is known about the appropriate design of these tools. To address this critical research gap, we investigate the design strategies in this space by analyzi… ▽ More

    Submitted 28 March, 2024; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted at CHI 2024 (Late-Breaking Work). 17 pages, 11 figures, 1 table. SuperNOVA is available at: http://poloclub.github.io/supernova/. The code is available at: https://github.com/poloclub/supernova

  17. arXiv:2304.05967  [pdf, other

    cs.HC cs.AI cs.CL cs.LG

    Angler: Helping Machine Translation Practitioners Prioritize Model Improvements

    Authors: Samantha Robertson, Zijie J. Wang, Dominik Moritz, Mary Beth Kery, Fred Hohman

    Abstract: Machine learning (ML) models can fail in unexpected ways in the real world, but not all model failures are equal. With finite time and resources, ML practitioners are forced to prioritize their model debugging and improvement efforts. Through interviews with 13 ML practitioners at Apple, we found that practitioners construct small targeted test sets to estimate an error's nature, scope, and impact… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted to CHI 2023. 20 pages, 6 figures

  18. Queer In AI: A Case Study in Community-Led Participatory AI

    Authors: Organizers Of QueerInAI, :, Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherland, Davide Locatelli, Eva Breznik, Filip Klubička, Hang Yuan, Hetvi J, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, A Pranav , et al. (26 additional authors not shown)

    Abstract: We present Queer in AI as a case study for community-led participatory design in AI. We examine how participatory design and intersectional tenets started and shaped this community's programs over the years. We discuss different challenges that emerged in the process, look at ways this organization has fallen short of operationalizing participatory and intersectional principles, and then assess th… ▽ More

    Submitted 8 June, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: To appear at FAccT 2023

    Journal ref: 2023 ACM Conference on Fairness, Accountability, and Transparency

  19. arXiv:2303.09545  [pdf, other

    cs.LG cs.AI cs.HC

    WebSHAP: Towards Explaining Any Machine Learning Models Anywhere

    Authors: Zijie J. Wang, Duen Horng Chau

    Abstract: As machine learning (ML) is increasingly integrated into our everyday Web experience, there is a call for transparent and explainable web-based ML. However, existing explainability techniques often require dedicated backend servers, which limit their usefulness as the Web community moves toward in-browser ML for lower latency and greater privacy. To address the pressing need for a client-side expl… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 5 pages, 4 figures. Accepted at the ACM Web Conference 2023 (WWW 2023). For a live demo, visit https://poloclub.github.io/webshap/. Code is open-source at https://github.com/poloclub/webshap

  20. arXiv:2302.14165  [pdf, other

    cs.LG cs.AI cs.HC

    GAM Coach: Towards Interactive and User-centered Algorithmic Recourse

    Authors: Zijie J. Wang, Jennifer Wortman Vaughan, Rich Caruana, Duen Horng Chau

    Abstract: Machine learning (ML) recourse techniques are increasingly used in high-stakes domains, providing end users with actions to alter ML predictions, but they assume ML developers understand what input variables can be changed. However, a recourse plan's actionability is subjective and unlikely to match developers' expectations completely. We present GAM Coach, a novel open-source system that adapts i… ▽ More

    Submitted 28 February, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted to CHI 2023. 20 pages, 12 figures. For a demo video, see https://youtu.be/ubacP34H9XE. For a live demo, visit https://poloclub.github.io/gam-coach/

  21. arXiv:2212.04029  [pdf, other

    cs.CV

    Occlusion-Robust FAU Recognition by Mining Latent Space of Masked Autoencoders

    Authors: Minyang Jiang, Yongwei Wang, Martin J. McKeown, Z. Jane Wang

    Abstract: Facial action units (FAUs) are critical for fine-grained facial expression analysis. Although FAU detection has been actively studied using ideally high quality images, it was not thoroughly studied under heavily occluded conditions. In this paper, we propose the first occlusion-robust FAU recognition method to maintain FAU detection performance under heavy occlusions. Our novel approach takes adv… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

  22. arXiv:2211.04020  [pdf, other

    q-bio.QM cs.LG q-bio.GN q-bio.TO

    Generating counterfactual explanations of tumor spatial proteomes to discover effective strategies for enhancing immune infiltration

    Authors: Zitong Jerry Wang, Alexander M. Xu, Aman Bhargava, Matt W. Thomson

    Abstract: The tumor microenvironment (TME) significantly impacts cancer prognosis due to its immune composition. While therapies for altering the immune composition, including immunotherapies, have shown exciting results for treating hematological cancers, they are less effective for immunologically-cold, solid tumors. Spatial omics technologies capture the spatial organization of the TME with unprecedented… ▽ More

    Submitted 13 October, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

  23. arXiv:2210.14896  [pdf, other

    cs.CV cs.AI cs.HC cs.LG

    DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models

    Authors: Zijie J. Wang, Evan Montoya, David Munechika, Haoyang Yang, Benjamin Hoover, Duen Horng Chau

    Abstract: With recent advancements in diffusion models, users can generate high-quality images by writing text prompts in natural language. However, generating images with desired details requires proper prompts, and it is often unclear how a model reacts to different prompts or what the best prompts are. To help researchers tackle these critical challenges, we introduce DiffusionDB, the first large-scale t… ▽ More

    Submitted 6 July, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Accepted to ACL 2023 (nominated for best paper, top 1.6% of submissions, oral presentation). 17 pages, 11 figures. The dataset is available at https://huggingface.co/datasets/poloclub/diffusiondb. The code is at https://github.com/poloclub/diffusiondb. The interactive visualization demo is at https://poloclub.github.io/diffusiondb/explorer/

  24. arXiv:2210.00160  [pdf, other

    cs.SI cs.CR cs.CY cs.HC

    Explaining Website Reliability by Visualizing Hyperlink Connectivity

    Authors: Seongmin Lee, Sadia Afroz, Haekyu Park, Zijie J. Wang, Omar Shaikh, Vibhor Sehgal, Ankit Peshin, Duen Horng Chau

    Abstract: As the information on the Internet continues growing exponentially, understanding and assessing the reliability of a website is becoming increasingly important. Misinformation has far-ranging repercussions, from sowing mistrust in media to undermining democratic elections. While some research investigates how to alert people to misinformation on the web, much less research has been conducted on ex… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

    Comments: Accepted at IEEE VIS 2022, 5 pages, 4 figures, For a live demo, visit https://poloclub.github.io/MisVis

  25. TimberTrek: Exploring and Curating Sparse Decision Trees with Interactive Visualization

    Authors: Zijie J. Wang, Chudi Zhong, Rui Xin, Takuya Takagi, Zhi Chen, Duen Horng Chau, Cynthia Rudin, Margo Seltzer

    Abstract: Given thousands of equally accurate machine learning (ML) models, how can users choose among them? A recent ML technique enables domain experts and data scientists to generate a complete Rashomon set for sparse decision trees--a huge set of almost-optimal interpretable ML models. To help ML practitioners identify models with desirable properties from this Rashomon set, we develop TimberTrek, the f… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted at IEEE VIS 2022. 5 pages, 6 figures. For a demo video, see https://youtu.be/3eGqTmsStJM. For a live demo, visit https://poloclub.github.io/timbertrek

  26. arXiv:2209.04966  [pdf, other

    cs.CV cs.RO

    Multi-modal Streaming 3D Object Detection

    Authors: Mazen Abdelfattah, Kaiwen Yuan, Z. Jane Wang, Rabab Ward

    Abstract: Modern autonomous vehicles rely heavily on mechanical LiDARs for perception. Current perception methods generally require 360° point clouds, collected sequentially as the LiDAR scans the azimuth and acquires consecutive wedge-shaped slices. The acquisition latency of a full scan (~ 100ms) may lead to outdated perception which is detrimental to safe operation. Recent streaming perception works prop… ▽ More

    Submitted 11 September, 2022; originally announced September 2022.

  27. arXiv:2206.15465  [pdf, other

    cs.LG cs.AI cs.HC

    Interpretability, Then What? Editing Machine Learning Models to Reflect Human Knowledge and Values

    Authors: Zijie J. Wang, Alex Kale, Harsha Nori, Peter Stella, Mark E. Nunnally, Duen Horng Chau, Mihaela Vorvoreanu, Jennifer Wortman Vaughan, Rich Caruana

    Abstract: Machine learning (ML) interpretability techniques can reveal undesirable patterns in data that models exploit to make predictions--potentially causing harms once deployed. However, how to take action to address these patterns is not always clear. In a collaboration between ML and human-computer interaction researchers, physicians, and data scientists, we develop GAM Changer, the first interactive… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Accepted at KDD 2022. 11 pages, 19 figures. For a demo video, see https://youtu.be/D6whtfInqTc. For a live demo, visit https://interpret.ml/gam-changer

  28. arXiv:2206.13801  [pdf, other

    cs.IT eess.SP

    Joint Precoding for Active Intelligent Transmitting Surface Empowered Outdoor-to-Indoor Communication in mmWave Cellular Networks

    Authors: Xie Xie, Chen He, Feifei Gao, Zhu Han, Z. Jane Wang

    Abstract: Outdoor-to-indoor communications in millimeter-wave (mmWave) cellular networks have been one challenging research problem due to the severe attenuation and the high penetration loss caused by the propagation characteristics of mmWave signals. We propose a viable solution to implement the outdoor-to-indoor mmWave communication system with the aid of an active intelligent transmitting surface (activ… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 30 pages, 8 figures

  29. arXiv:2206.12540  [pdf, other

    cs.HC cs.LG

    Visual Auditor: Interactive Visualization for Detection and Summarization of Model Biases

    Authors: David Munechika, Zijie J. Wang, Jack Reidy, Josh Rubin, Krishna Gade, Krishnaram Kenthapadi, Duen Horng Chau

    Abstract: As machine learning (ML) systems become increasingly widespread, it is necessary to audit these systems for biases prior to their deployment. Recent research has developed algorithms for effectively identifying intersectional bias in the form of interpretable, underperforming subsets (or slices) of the data. However, these solutions and their insights are limited without a tool for visually unders… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

  30. arXiv:2206.05375  [pdf, other

    cs.CV

    Generalizable Neural Radiance Fields for Novel View Synthesis with Transformer

    Authors: Dan Wang, Xinrui Cui, Septimiu Salcudean, Z. Jane Wang

    Abstract: We propose a Transformer-based NeRF (TransNeRF) to learn a generic neural radiance field conditioned on observed-view images for the novel view synthesis task. By contrast, existing MLP-based NeRFs are not able to directly receive observed views with an arbitrary number and require an auxiliary pooling-based operation to fuse source-view information, resulting in the missing of complicated relatio… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  31. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  32. arXiv:2205.09744  [pdf, other

    cs.LG cs.CY cs.MM

    Overcoming Language Disparity in Online Content Classification with Multimodal Learning

    Authors: Gaurav Verma, Rohit Mujumdar, Zijie J. Wang, Munmun De Choudhury, Srijan Kumar

    Abstract: Advances in Natural Language Processing (NLP) have revolutionized the way researchers and practitioners address crucial societal problems. Large language models are now the standard to develop state-of-the-art solutions for text detection and classification tasks. However, the development of advanced computational techniques and resources is disproportionately focused on the English language, side… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at ICWSM 2022 as a full paper

  33. arXiv:2205.03963  [pdf, other

    cs.HC

    NOVA: A Practical Method for Creating Notebook-Ready Visual Analytics

    Authors: Zijie J. Wang, David Munechika, Seongmin Lee, Duen Horng Chau

    Abstract: How can we develop visual analytics (VA) tools that can be easily adopted? Visualization researchers have developed a large number of web-based VA tools to help data scientists in a wide range of tasks. However, adopting these standalone systems can be challenging, as they require data scientists to create new workflows to streamline the VA processes. Recent surveys suggest computational notebooks… ▽ More

    Submitted 15 May, 2023; v1 submitted 8 May, 2022; originally announced May 2022.

    Comments: Accepted to IEEE VIS 2022 (poster). 2 pages, 1 figure. For a live demo, visit https://poloclub.github.io/nova. For method application examples, see https://github.com/poloclub/nova

  34. arXiv:2204.05899  [pdf, other

    cs.CV cs.HC cs.LG

    VisCUIT: Visual Auditor for Bias in CNN Image Classifier

    Authors: Seongmin Lee, Zijie J. Wang, Judy Hoffman, Duen Horng Chau

    Abstract: CNN image classifiers are widely used, thanks to their efficiency and accuracy. However, they can suffer from biases that impede their practical applications. Most existing bias investigation techniques are either inapplicable to general image classification tasks or require significant user efforts in perusing all data subgroups to manually specify which data attributes to inspect. We present Vis… ▽ More

    Submitted 13 April, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: 9 pages, 4 figures

  35. arXiv:2203.11490  [pdf, other

    cs.CV

    SSD-KD: A Self-supervised Diverse Knowledge Distillation Method for Lightweight Skin Lesion Classification Using Dermoscopic Images

    Authors: Yongwei Wang, Yuheng Wang, Tim K. Lee, Chunyan Miao, Z. Jane Wang

    Abstract: Skin cancer is one of the most common types of malignancy, affecting a large population and causing a heavy economic burden worldwide. Over the last few years, computer-aided diagnosis has been rapidly developed and make great progress in healthcare and medical practices due to the advances in artificial intelligence. However, most studies in skin cancer detection keep pursuing high prediction acc… ▽ More

    Submitted 29 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: 14 pages, 5 figures

  36. arXiv:2203.08176  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    SemiPFL: Personalized Semi-Supervised Federated Learning Framework for Edge Intelligence

    Authors: Arvin Tashakori, Wenwen Zhang, Z. Jane Wang, Peyman Servati

    Abstract: Recent advances in wearable devices and Internet-of-Things (IoT) have led to massive growth in sensor data generated in edge devices. Labeling such massive data for classification tasks has proven to be challenging. In addition, data generated by different users bear various personal attributes and edge heterogeneity, rendering it impractical to develop a global model that adapts well to all users… ▽ More

    Submitted 19 November, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

  37. StickyLand: Breaking the Linear Presentation of Computational Notebooks

    Authors: Zijie J. Wang, Katie Dai, W. Keith Edwards

    Abstract: How can we better organize code in computational notebooks? Notebooks have become a popular tool among data scientists, as they seamlessly weave text and code together, supporting users to rapidly iterate and document code experiments. However, it is often challenging to organize code in notebooks, partially because there is a mismatch between the linear presentation of code and the non-linear pro… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: Accepted at CHI 2022 (Late-Breaking Work). 7 pages, 6 figures. For a demo video, see https://youtu.be/OKaPmEBzEX0. For a live demo, visit https://zijie.wang/#stickyland-demo

  38. arXiv:2201.09685  [pdf, other

    cs.IT eess.SP

    Robust Joint Design for Intelligent Reflecting Surfaces Assisted Cell-Free Networks

    Authors: Xie Xie, Chen He, Xiaoya Li, Zhu Han, Z. Jane Wang

    Abstract: Intelligent reflecting surfaces (IRSs) have emerged as a promising economical solution to implement cell-free networks. However, the performance gains achieved by IRSs critically depend on smartly tuned passive beamforming based on the assumption that the accurate channel state information (CSI) knowledge is available, which is practically impossible. Thus, in this paper, we investigate the impact… ▽ More

    Submitted 20 February, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: 30 pages

  39. arXiv:2112.11593  [pdf, other

    cs.CV

    AdaptPose: Cross-Dataset Adaptation for 3D Human Pose Estimation by Learnable Motion Generation

    Authors: Mohsen Gholami, Bastian Wandt, Helge Rhodin, Rabab Ward, Z. Jane Wang

    Abstract: This paper addresses the problem of cross-dataset generalization of 3D human pose estimation models. Testing a pre-trained 3D pose estimator on a new dataset results in a major performance drop. Previous methods have mainly addressed this problem by improving the diversity of the training data. We argue that diversity alone is not sufficient and that the characteristics of the training data need t… ▽ More

    Submitted 15 March, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

  40. arXiv:2112.06654  [pdf, other

    eess.SP cs.HC cs.LG

    Toward Open-World Electroencephalogram Decoding Via Deep Learning: A Comprehensive Survey

    Authors: Xun Chen, Chang Li, Aiping Liu, Martin J. McKeown, Ruobing Qian, Z. Jane Wang

    Abstract: Electroencephalogram (EEG) decoding aims to identify the perceptual, semantic, and cognitive content of neural processing based on non-invasively measured brain activity. Traditional EEG decoding methods have achieved moderate success when applied to data acquired in static, well-controlled lab environments. However, an open-world environment is a more realistic setting, where situations affecting… ▽ More

    Submitted 16 December, 2021; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted by the IEEE Signal Processing Magazine

  41. arXiv:2112.03245  [pdf, other

    cs.LG cs.AI cs.HC

    GAM Changer: Editing Generalized Additive Models with Interactive Visualization

    Authors: Zijie J. Wang, Alex Kale, Harsha Nori, Peter Stella, Mark Nunnally, Duen Horng Chau, Mihaela Vorvoreanu, Jennifer Wortman Vaughan, Rich Caruana

    Abstract: Recent strides in interpretable machine learning (ML) research reveal that models exploit undesirable patterns in the data to make predictions, which potentially causes harms in deployment. However, it is unclear how we can fix these models. We present our ongoing work, GAM Changer, an open-source interactive system to help data scientists and domain experts easily and responsibly edit their Gener… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 7 pages, 15 figures, accepted to the Research2Clinics workshop at NeurIPS 2021. For a demo video, see https://youtu.be/2gVSoPoSeJ8. For a live demo, visit https://interpret.ml/gam-changer/

  42. arXiv:2112.02721  [pdf, other

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo , et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split… ▽ More

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  43. Rethinking Crowdsourcing Annotation: Partial Annotation with Salient Labels for Multi-Label Image Classification

    Authors: Jianzhe Lin, Tianze Yu, Z. Jane Wang

    Abstract: Annotated images are required for both supervised model training and evaluation in image classification. Manually annotating images is arduous and expensive, especially for multi-labeled images. A recent trend for conducting such laboursome annotation tasks is through crowdsourcing, where images are annotated by volunteers or paid workers online (e.g., workers of Amazon Mechanical Turk) from scrat… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

  44. SCIDA: Self-Correction Integrated Domain Adaptation from Single- to Multi-label Aerial Images

    Authors: Tianze Yu, Jianzhe Lin, Lichao Mou, Yuansheng Hua, Xiaoxiang Zhu, Z. Jane Wang

    Abstract: Most publicly available datasets for image classification are with single labels, while images are inherently multi-labeled in our daily life. Such an annotation gap makes many pre-trained single-label classification models fail in practical scenarios. This annotation issue is more concerned for aerial images: Aerial data collected from sensors naturally cover a relatively large land area with mul… ▽ More

    Submitted 29 November, 2021; v1 submitted 15 August, 2021; originally announced August 2021.

  45. arXiv:2108.00180  [pdf, other

    cs.CV

    Delving into Deep Image Prior for Adversarial Defense: A Novel Reconstruction-based Defense Framework

    Authors: Li Ding, Yongwei Wang, Xin Ding, Kaiwen Yuan, Ping Wang, Hua Huang, Z. Jane Wang

    Abstract: Deep learning based image classification models are shown vulnerable to adversarial attacks by injecting deliberately crafted noises to clean images. To defend against adversarial attacks in a training-free and attack-agnostic manner, this work proposes a novel and effective reconstruction-based defense framework by delving into deep image prior (DIP). Fundamentally different from existing reconst… ▽ More

    Submitted 31 July, 2021; originally announced August 2021.

    Comments: To be publish in ACM MM 2021

  46. arXiv:2105.14545  [pdf, other

    cs.IT eess.SP

    A Joint Power Splitting, Active and Passive Beamforming Optimization Framework for IRS Assisted MIMO SWIPT System

    Authors: Chen He, Xie Xie, Kun Yang, Z. Jane Wang

    Abstract: This paper considers an intelligent reflecting surface (IRS) assisted multi-input multi-output (MIMO) power splitting (PS) based simultaneous wireless information and power transfer (SWIPT) system with multiple PS receivers (PSRs). The objective is to maximize the achievable data rate of the system by jointly optimizing the PS ratios at the PSRs, the active transmit beamforming (ATB) at the access… ▽ More

    Submitted 30 May, 2021; originally announced May 2021.

    Comments: 13 pages, 7 figures

  47. arXiv:2105.06599  [pdf, other

    cs.CV

    TriPose: A Weakly-Supervised 3D Human Pose Estimation via Triangulation from Video

    Authors: Mohsen Gholami, Ahmad Rezaei, Helge Rhodin, Rabab Ward, Z. Jane Wang

    Abstract: Estimating 3D human poses from video is a challenging problem. The lack of 3D human pose annotations is a major obstacle for supervised training and for generalization to unseen datasets. In this work, we address this problem by proposing a weakly-supervised training scheme that does not require 3D annotations or calibrated cameras. The proposed method relies on temporal information and triangulat… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

  48. Distilling and Transferring Knowledge via cGAN-generated Samples for Image Classification and Regression

    Authors: Xin Ding, Yongwei Wang, Zuheng Xu, Z. Jane Wang, William J. Welch

    Abstract: Knowledge distillation (KD) has been actively studied for image classification tasks in deep learning, aiming to improve the performance of a student based on the knowledge from a teacher. However, applying KD in image regression with a scalar response variable has been rarely studied, and there exists no KD method applicable to both classification and regression tasks yet. Moreover, existing KD m… ▽ More

    Submitted 26 December, 2022; v1 submitted 7 April, 2021; originally announced April 2021.

  49. arXiv:2103.14625  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Dodrio: Exploring Transformer Models with Interactive Visualization

    Authors: Zijie J. Wang, Robert Turko, Duen Horng Chau

    Abstract: Why do large pre-trained transformer-based models perform so well across a wide variety of NLP tasks? Recent research suggests the key may lie in multi-headed attention mechanism's ability to learn and represent linguistic information. Understanding how these models represent both syntactic and semantic knowledge is vital to investigate why they succeed and fail, what they have learned, and how th… ▽ More

    Submitted 5 June, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: 10 pages, 8 figures, Accepted to ACL 2021. For a demo video, see https://youtu.be/qB-T9j7UTgE . For a live demo, see https://poloclub.github.io/dodrio/

  50. arXiv:2103.12957  [pdf, ps, other

    cs.CV

    Multi-view 3D Reconstruction with Transformer

    Authors: Dan Wang, Xinrui Cui, Xun Chen, Zhengxia Zou, Tianyang Shi, Septimiu Salcudean, Z. Jane Wang, Rabab Ward

    Abstract: Deep CNN-based methods have so far achieved the state of the art results in multi-view 3D object reconstruction. Despite the considerable progress, the two core modules of these methods - multi-view feature extraction and fusion, are usually investigated separately, and the object relations in different views are rarely explored. In this paper, inspired by the recent great success in self-attentio… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.