Showing 1–48 of 48 results for author: Lyu, Q

  1. arXiv:2406.10952  [pdf, other]

    cs.CL

    Avoiding Copyright Infringement via Machine Unlearning

    Authors: Guangyao Dou, Zheyuan Liu, Qing Lyu, Kaize Ding, Eric Wong

    Abstract: Pre-trained Large Language Models (LLMs) have demonstrated remarkable capabilities but also pose risks by learning and generating copyrighted material, leading to significant legal and ethical concerns. To address these issues, it is critical for model owners to be able to unlearn copyrighted content at various time steps. We explore the setting of sequential unlearning, where copyrighted content…

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2405.10550  [pdf, other]

    eess.IV cs.CV

    LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion

    Authors: Tong Chen, Qingcheng Lyu, Long Bai, Erjian Guo, Huxin Gao, Xiaoxiao Yang, Hongliang Ren, Luping Zhou

    Abstract: Advances in endoscopy use in surgeries face challenges like inadequate lighting. Deep learning, notably the Denoising Diffusion Probabilistic Model (DDPM), holds promise for low-light image enhancement in the medical field. However, DDPMs are computationally demanding and slow, limiting their practical medical applications. To bridge this gap, we propose a lightweight DDPM, dubbed LighTDiff. It ad…

    Submitted 17 May, 2024; originally announced May 2024.

  3. arXiv:2405.00332  [pdf, other]

    cs.CL cs.AI cs.LG

    A Careful Examination of Large Language Model Performance on Grade School Arithmetic

    Authors: Hugh Zhang, Jeff Da, Dean Lee, Vaughn Robinson, Catherine Wu, Will Song, Tiffany Zhao, Pranav Raja, Dylan Slack, Qin Lyu, Sean Hendryx, Russell Kaplan, Michele Lunati, Summer Yue

    Abstract: Large language models (LLMs) have achieved impressive success on many benchmarks for mathematical reasoning. However, there is growing concern that some of this performance actually reflects dataset contamination, where data closely resembling benchmark questions leaks into the training data, instead of true reasoning ability. To investigate this claim rigorously, we commission Grade School Math 1…

    Submitted 3 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  4. arXiv:2404.10775  [pdf, other]

    cs.CV cs.AI cs.MA

    COMBO: Compositional World Models for Embodied Multi-Agent Cooperation

    Authors: Hongxin Zhang, Zeyuan Wang, Qiushi Lyu, Zheyuan Zhang, Sunli Chen, Tianmin Shu, Yilun Du, Chuang Gan

    Abstract: In this paper, we investigate the problem of embodied multi-agent cooperation, where decentralized agents must cooperate given only partial egocentric views of the world. To effectively plan in this setting, in contrast to learning world dynamics in a single-agent scenario, we must simulate world dynamics conditioned on an arbitrary number of agents' actions given only partial egocentric visual ob…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 23 pages. The first three authors contributed equally

  5. arXiv:2402.13904  [pdf, other]

    cs.CL

    Calibrating Large Language Models with Sample Consistency

    Authors: Qing Lyu, Kumar Shridhar, Chaitanya Malaviya, Li Zhang, Yanai Elazar, Niket Tandon, Marianna Apidianaki, Mrinmaya Sachan, Chris Callison-Burch

    Abstract: Accurately gauging the confidence level of Large Language Models' (LLMs) predictions is pivotal for their reliable application. However, LLMs are often uncalibrated inherently and elude conventional calibration techniques due to their proprietary nature and massive scale. In this work, we explore the potential of deriving confidence from the distribution of multiple randomly sampled model generati…

    Submitted 21 February, 2024; originally announced February 2024.

  6. arXiv:2312.13752  [pdf]

    eess.IV cs.AI cs.CV

    Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

    Authors: Yang Nan, Xiaodan Xing, Shiyi Wang, Zeyu Tang, Federico N Felder, Sheng Zhang, Roberta Eufrasia Ledda, Xiaoliu Ding, Ruiqi Yu, Weiping Liu, Feng Shi, Tianyang Sun, Zehong Cao, Minghui Zhang, Yun Gu, Hanxiao Zhang, Jian Gao, Pingyu Wang, Wen Tang, Pengxin Yu, Han Kang, Junqiang Chen, Xing Lu, Boyu Zhang, Michail Mamalakis, et al. (16 additional authors not shown)

    Abstract: Airway-related quantitative imaging biomarkers are crucial for examination, diagnosis, and prognosis in pulmonary diseases. However, the manual delineation of airway trees remains prohibitively time-consuming. While significant efforts have been made towards enhancing airway modelling, current public-available datasets concentrate on lung diseases with moderate morphological variations. The intric…

    Submitted 16 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 19 pages

  7. arXiv:2311.17286  [pdf, other]

    cs.CV

    LEOD: Label-Efficient Object Detection for Event Cameras

    Authors: Ziyi Wu, Mathias Gehrig, Qing Lyu, Xudong Liu, Igor Gilitschenski

    Abstract: Object detection with event cameras benefits from the sensor's low latency and high dynamic range. However, it is costly to fully label event streams for supervised training due to their high temporal resolution. To reduce this cost, we present LEOD, the first method for label-efficient event-based detection. Our approach unifies weakly- and semi-supervised object detection with a self-training me…

    Submitted 25 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: CVPR 2024. Code: https://github.com/Wuziyi616/LEOD

  8. arXiv:2310.19660  [pdf, other]

    cs.CL

    Interpretable-by-Design Text Understanding with Iteratively Generated Concept Bottleneck

    Authors: Josh Magnus Ludan, Qing Lyu, Yue Yang, Liam Dugan, Mark Yatskar, Chris Callison-Burch

    Abstract: Black-box deep neural networks excel in text classification, yet their application in high-stakes domains is hindered by their lack of interpretability. To address this, we propose Text Bottleneck Models (TBM), an intrinsically interpretable text classification framework that offers both global and local explanations. Rather than directly predicting the output label, TBM predicts categorical value…

    Submitted 3 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

  9. arXiv:2309.00962  [pdf, other]

    cs.RO cs.CV

    NTU4DRadLM: 4D Radar-centric Multi-Modal Dataset for Localization and Mapping

    Authors: Jun Zhang, Huayang Zhuge, Yiyao Liu, Guohao Peng, Zhenyu Wu, Haoyuan Zhang, Qiyang Lyu, Heshan Li, Chunyang Zhao, Dogan Kircali, Sanat Mharolkar, Xun Yang, Su Yi, Yuanzhe Wang, Danwei Wang

    Abstract: Simultaneous Localization and Mapping (SLAM) is moving towards a robust perception age. However, LiDAR- and visual- SLAM may easily fail in adverse conditions (rain, snow, smoke and fog, etc.). In comparison, SLAM based on 4D Radar, thermal camera and IMU can work robustly. But only a few literature can be found. A major reason is the lack of related datasets, which seriously hinders the research.…

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: 2023 IEEE International Intelligent Transportation Systems Conference (ITSC 2023)

  10. arXiv:2306.06584  [pdf, other]

    cs.CV

    Compositional Prototypical Networks for Few-Shot Classification

    Authors: Qiang Lyu, Weiqiang Wang

    Abstract: It is assumed that pre-training provides the feature extractor with strong class transferability and that high novel class generalization can be achieved by simply reusing the transferable feature extractor. In this work, our motivation is to explicitly learn some fine-grained and transferable meta-knowledge so that feature reusability can be further improved. Concretely, inspired by the fact that…

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted by AAAI 2023

  11. arXiv:2305.18657  [pdf, other]

    cs.CL

    Representation Of Lexical Stylistic Features In Language Models' Embedding Space

    Authors: Qing Lyu, Marianna Apidianaki, Chris Callison-Burch

    Abstract: The representation space of pretrained Language Models (LMs) encodes rich information about words and their relationships (e.g., similarity, hypernymy, polysemy) as well as abstract semantic notions (e.g., intensity). In this paper, we demonstrate that lexical stylistic notions such as complexity, formality, and figurativeness, can also be identified in this space. We show that it is possible to d…

    Submitted 31 May, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted at *SEM 2023

  12. arXiv:2305.10263  [pdf, other]

    cs.CL

    M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models

    Authors: Chuang Liu, Renren Jin, Yuqi Ren, Linhao Yu, Tianyu Dong, Xiaohan Peng, Shuting Zhang, Jianxiang Peng, Peiyi Zhang, Qingqing Lyu, Xiaowen Su, Qun Liu, Deyi Xiong

    Abstract: Large language models have recently made tremendous progress in a variety of aspects, e.g., cross-task generalization, instruction following. Comprehensively evaluating the capability of large language models in multiple tasks is of great importance. In this paper, we propose M3KE, a Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark, which is developed to measure knowledge acquired…

    Submitted 20 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  13. arXiv:2305.04990  [pdf, other]

    cs.CL cs.LG

    Explanation-based Finetuning Makes Models More Robust to Spurious Cues

    Authors: Josh Magnus Ludan, Yixuan Meng, Tai Nguyen, Saurabh Shah, Qing Lyu, Marianna Apidianaki, Chris Callison-Burch

    Abstract: Large Language Models (LLMs) are so powerful that they sometimes learn correlations between labels and features that are irrelevant to the task, leading to poor generalization on out-of-distribution data. We propose explanation-based finetuning as a general approach to mitigate LLMs' reliance on spurious correlations. Unlike standard finetuning where the model only predicts the answer given the in…

    Submitted 6 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

  14. arXiv:2304.02649  [pdf, other]

    eess.IV cs.AI cs.CV

    Specialty-Oriented Generalist Medical AI for Chest CT Screening

    Authors: Chuang Niu, Qing Lyu, Christopher D. Carothers, Parisa Kaviani, Josh Tan, Pingkun Yan, Mannudeep K. Kalra, Christopher T. Whitlow, Ge Wang

    Abstract: Modern medical records include a vast amount of multimodal free text clinical data and imaging data from radiology, cardiology, and digital pathology. Fully mining such big data requires multitasking; otherwise, occult but important aspects may be overlooked, adversely affecting clinical management and population healthcare. Despite remarkable successes of AI in individual tasks with single-modal…

    Submitted 24 April, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

  15. arXiv:2303.09038  [pdf, other]

    cs.CL cs.AI physics.med-ph

    Translating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potential

    Authors: Qing Lyu, Josh Tan, Michael E. Zapadka, Janardhana Ponnatapura, Chuang Niu, Kyle J. Myers, Ge Wang, Christopher T. Whitlow

    Abstract: The large language model called ChatGPT has drawn extensively attention because of its human-like expression and reasoning abilities. In this study, we investigate the feasibility of using ChatGPT in experiments on using ChatGPT to translate radiology reports into plain language for patients and healthcare providers so that they are educated for improved healthcare. Radiology reports from 62 low-d…

    Submitted 28 March, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  16. arXiv:2301.13379  [pdf, other]

    cs.CL

    Faithful Chain-of-Thought Reasoning

    Authors: Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-Burch

    Abstract: While Chain-of-Thought (CoT) prompting boosts Language Models' (LM) performance on a gamut of complex reasoning tasks, the generated reasoning chain does not necessarily reflect how the model arrives at the answer (aka. faithfulness). We propose Faithful CoT, a reasoning framework involving two stages: Translation (Natural Language query $\rightarrow$ symbolic reasoning chain) and Problem Solving…

    Submitted 20 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: IJCNLP-AACL 2023 camera-ready version

  17. arXiv:2210.07532  [pdf, other]

    cs.LG eess.SP stat.ML

    Provable Subspace Identification Under Post-Nonlinear Mixtures

    Authors: Qi Lyu, Xiao Fu

    Abstract: Unsupervised mixture learning (UML) aims at identifying linearly or nonlinearly mixed latent components in a blind manner. UML is known to be challenging: Even learning linear mixtures requires highly nontrivial analytical tools, e.g., independent component analysis or nonnegative matrix factorization. In this work, the post-nonlinear (PNL) mixture model -- where unknown element-wise nonlinear fun…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022, 21 pages, 2 figures

  18. arXiv:2209.15136  [pdf, other]

    eess.IV cs.LG eess.SP physics.med-ph

    Low-Dose CT Using Denoising Diffusion Probabilistic Model for 20$\times$ Speedup

    Authors: Wenjun Xia, Qing Lyu, Ge Wang

    Abstract: Low-dose computed tomography (LDCT) is an important topic in the field of radiology over the past decades. LDCT reduces ionizing radiation-induced patient health risks but it also results in a low signal-to-noise ratio (SNR) and a potential compromise in the diagnostic performance. In this paper, to improve the LDCT denoising performance, we introduce the conditional denoising diffusion probabilis…

    Submitted 29 September, 2022; originally announced September 2022.

  19. arXiv:2209.12104  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Conversion Between CT and MRI Images Using Diffusion and Score-Matching Models

    Authors: Qing Lyu, Ge Wang

    Abstract: MRI and CT are most widely used medical imaging modalities. It is often necessary to acquire multi-modality images for diagnosis and treatment such as radiotherapy planning. However, multi-modality imaging is not only costly but also introduces misalignment between MRI and CT images. To address this challenge, computational conversion is a viable approach between MRI and CT images, especially from…

    Submitted 29 September, 2022; v1 submitted 24 September, 2022; originally announced September 2022.

  20. arXiv:2209.11326  [pdf, other]

    cs.CL

    Towards Faithful Model Explanation in NLP: A Survey

    Authors: Qing Lyu, Marianna Apidianaki, Chris Callison-Burch

    Abstract: End-to-end neural Natural Language Processing (NLP) models are notoriously difficult to understand. This has given rise to numerous efforts towards model explainability in recent years. One desideratum of model explanation is faithfulness, i.e. an explanation should accurately represent the reasoning process behind the model's prediction. In this survey, we review over 110 model explanation method…

    Submitted 12 January, 2024; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: Added acknowledgements; Accepted to the Computational Linguistics Journal (June 2024 issue)

  21. arXiv:2206.06593  [pdf, other]

    cs.LG stat.ML

    On Finite-Sample Identifiability of Contrastive Learning-Based Nonlinear Independent Component Analysis

    Authors: Qi Lyu, Xiao Fu

    Abstract: Nonlinear independent component analysis (nICA) aims at recovering statistically independent latent components that are mixed by unknown nonlinear functions. Central to nICA is the identifiability of the latent components, which had been elusive until very recently. Specifically, Hyvärinen et al. have shown that the nonlinearly mixed latent components are identifiable (up to often inconsequential…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Accepted to ICML 2022, 19 pages, 4 figures

  22. arXiv:2206.04911  [pdf, ps, other]

    cs.CR

    NSSIA: A New Self-Sovereign Identity Scheme with Accountability

    Authors: Qiuyun Lyu, Shaopeng Cheng, Hao Li, Junliang Liu, Yanzhao Shen, Zhen Wang

    Abstract: Self-Sovereign Identity (SSI) is a new distributed method for identity management, commonly used to address the problem that users are lack of control over their identities. However, the excessive pursuit of self-sovereignty in the most existing SSI schemes hinders sanctions against attackers. To deal with the malicious behavior, a few SSI schemes introduce accountability mechanisms, but they sacr…

    Submitted 10 June, 2022; originally announced June 2022.

  23. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  24. arXiv:2203.07264  [pdf, other]

    cs.CL cs.AI

    Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data

    Authors: Shuyan Zhou, Li Zhang, Yue Yang, Qing Lyu, Pengcheng Yin, Chris Callison-Burch, Graham Neubig

    Abstract: Procedures are inherently hierarchical. To "make videos", one may need to "purchase a camera", which in turn may require one to "set a budget". While such hierarchical knowledge is critical for reasoning about complex procedures, most existing work has treated procedures as shallow structures without modeling the parent-child relation. In this work, we attempt to construct an open-domain hierarchi…

    Submitted 17 March, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  25. arXiv:2202.07173  [pdf, other]

    eess.IV cs.CV

    To what extent can Plug-and-Play methods outperform neural networks alone in low-dose CT reconstruction

    Authors: Qifan Xu, Qihui Lyu, Dan Ruan, Ke Sheng

    Abstract: The Plug-and-Play (PnP) framework was recently introduced for low-dose CT reconstruction to leverage the interpretability and the flexibility of model-based methods to incorporate various plugins, such as trained deep learning (DL) neural networks. However, the benefits of PnP vs. state-of-the-art DL methods have not been clearly demonstrated. In this work, we proposed an improved PnP framework to…

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted to IEEE ISBI 2022

  26. arXiv:2201.08418  [pdf, other]

    eess.IV cs.AI cs.CV physics.med-ph

    SoftDropConnect (SDC) -- Effective and Efficient Quantification of the Network Uncertainty in Deep MR Image Analysis

    Authors: Qing Lyu, Christopher T. Whitlow, Ge Wang

    Abstract: Recently, deep learning has achieved remarkable successes in medical image analysis. Although deep neural networks generate clinically important predictions, they have inherent uncertainty. Such uncertainty is a major barrier to report these predictions with confidence. In this paper, we propose a novel yet simple Bayesian inference approach called SoftDropConnect (SDC) to quantify the network unc…

    Submitted 1 June, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

  27. arXiv:2112.08326  [pdf, other]

    cs.CL

    Is "My Favorite New Movie" My Favorite Movie? Probing the Understanding of Recursive Noun Phrases

    Authors: Qing Lyu, Hua Zheng, Daoxin Li, Li Zhang, Marianna Apidianaki, Chris Callison-Burch

    Abstract: Recursive noun phrases (NPs) have interesting semantic properties. For example, "my favorite new movie" is not necessarily my favorite movie, whereas "my new favorite movie" is. This is common sense to humans, yet it is unknown whether language models have such knowledge. We introduce the Recursive Noun Phrase Challenge (RNPC), a dataset of three textual inference tasks involving textual entailmen…

    Submitted 8 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: NAACL 2022

  28. arXiv:2110.03588  [pdf]

    eess.IV cs.CV physics.med-ph

    A transformer-based deep learning approach for classifying brain metastases into primary organ sites using clinical whole brain MRI

    Authors: Qing Lyu, Sanjeev V. Namjoshi, Emory McTyre, Umit Topaloglu, Richard Barcus, Michael D. Chan, Christina K. Cramer, Waldemar Debinski, Metin N. Gurcan, Glenn J. Lesser, Hui-Kuan Lin, Reginald F. Munden, Boris C. Pasche, Kiran Kumar Solingapuram Sai, Roy E. Strowd, Stephen B. Tatter, Kounosuke Watabe, Wei Zhang, Ge Wang, Christopher T. Whitlow

    Abstract: Treatment decisions for brain metastatic disease rely on knowledge of the primary organ site, and currently made with biopsy and histology. Here we develop a novel deep learning approach for accurate non-invasive digital histology with whole-brain MRI data. Our IRB-approved single-site retrospective study was comprised of patients (n=1,399) referred for MRI treatment-planning and gamma knife radio…

    Submitted 20 April, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

  29. arXiv:2107.13189  [pdf, other]

    cs.CL

    Goal-Oriented Script Construction

    Authors: Qing Lyu, Li Zhang, Chris Callison-Burch

    Abstract: The knowledge of scripts, common chains of events in stereotypical scenarios, is a valuable asset for task-oriented natural language understanding systems. We propose the Goal-Oriented Script Construction task, where a model produces a sequence of steps to accomplish a given goal. We pilot our task on the first multilingual script learning dataset supporting 18 languages collected from wikiHow, a…

    Submitted 31 August, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: INLG2021 (14th International Conference on Natural Language Generation)

  30. arXiv:2106.09070  [pdf, other]

    cs.LG eess.SP stat.ML

    Identifiability-Guaranteed Simplex-Structured Post-Nonlinear Mixture Learning via Autoencoder

    Authors: Qi Lyu, Xiao Fu

    Abstract: This work focuses on the problem of unraveling nonlinearly mixed latent components in an unsupervised manner. The latent components are assumed to reside in the probability simplex, and are transformed by an unknown post-nonlinear mixing system. This problem finds various applications in signal and data analytics, e.g., nonlinear hyperspectral unmixing, image embedding, and nonlinear clustering. L…

    Submitted 16 June, 2021; originally announced June 2021.

  31. arXiv:2106.07115  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Understanding Latent Correlation-Based Multiview Learning and Self-Supervision: An Identifiability Perspective

    Authors: Qi Lyu, Xiao Fu, Weiran Wang, Songtao Lu

    Abstract: Multiple views of data, both naturally acquired (e.g., image and audio) and artificially produced (e.g., via adding different noise to data samples), have proven useful in enhancing representation learning. Natural views are often handled by multiview analysis tools, e.g., (deep) canonical correlation analysis [(D)CCA], while the artificial ones are frequently used in self-supervised learning (SSL…

    Submitted 8 April, 2022; v1 submitted 13 June, 2021; originally announced June 2021.

    Comments: Accepted to ICLR 2022 Spotlight, 37 pages, 11 figures

  32. arXiv:2104.05845  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG cs.MM

    Visual Goal-Step Inference using wikiHow

    Authors: Yue Yang, Artemis Panagopoulou, Qing Lyu, Li Zhang, Mark Yatskar, Chris Callison-Burch

    Abstract: Understanding what sequence of steps are needed to complete a goal can help artificial intelligence systems reason about human activities. Past work in NLP has examined the task of goal-step inference for text. We introduce the visual analogue. We propose the Visual Goal-Step Inference (VGSI) task, where a model is given a textual goal and must choose which of four images represents a plausible st…

    Submitted 9 September, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

  33. arXiv:2102.08430  [pdf]

    cs.LG eess.SY

    Multi-Stage Transmission Line Flow Control Using Centralized and Decentralized Reinforcement Learning Agents

    Authors: Xiumin Shang, Jinping Yang, Bingquan Zhu, Lin Ye, Jing Zhang, Jianping Xu, Qin Lyu, Ruisheng Diao

    Abstract: Planning future operational scenarios of bulk power systems that meet security and economic constraints typically requires intensive labor efforts in performing massive simulations. To automate this process and relieve engineers' burden, a novel multi-stage control approach is presented in this paper to train centralized and decentralized reinforcement learning agents that can automatically adjust…

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: This work is accepted by NeurIPS ML4Eng workshop 2020, please refer to https://ml4eng.github.io/camera_readys/56.pdf

  34. arXiv:2101.09441  [pdf, ps, other]

    cs.DB

    DBL: Efficient Reachability Queries on Dynamic Graphs (Complete Version)

    Authors: Qiuyi Lyu, Yuchen Li, Bingsheng He, Bin Gong

    Abstract: Reachability query is a fundamental problem on graphs, which has been extensively studied in academia and industry. Since graphs are subject to frequent updates in many applications, it is essential to support efficient graph updates while offering good performance in reachability queries. Existing solutions compress the original graph with the Directed Acyclic Graph (DAG) and propose efficient qu…

    Submitted 15 April, 2021; v1 submitted 23 January, 2021; originally announced January 2021.

  35. arXiv:2011.03384  [pdf, other]

    cs.LG cs.CV eess.IV

    Suppression of Correlated Noise with Similarity-based Unsupervised Deep Learning

    Authors: Chuang Niu, Mengzhou Li, Fenglei Fan, Weiwen Wu, Xiaodong Guo, Qing Lyu, Ge Wang

    Abstract: Image denoising is a prerequisite for downstream tasks in many fields. Low-dose and photon-counting computed tomography (CT) denoising can optimize diagnostic performance at minimized radiation dose. Supervised deep denoising methods are popular but require paired clean or noisy samples that are often unavailable in practice. Limited by the independent noise assumption, current unsupervised denois…

    Submitted 5 January, 2022; v1 submitted 6 November, 2020; originally announced November 2020.

  36. Reasoning about Goals, Steps, and Temporal Ordering with WikiHow

    Authors: Li Zhang, Qing Lyu, Chris Callison-Burch

    Abstract: We propose a suite of reasoning tasks on two types of relations between procedural events: goal-step relations ("learn poses" is a step in the larger goal of "doing yoga") and step-step temporal relations ("buy a yoga mat" typically precedes "learn poses"). We introduce a dataset targeting these two relations based on wikiHow, a website of instructional how-to articles. Our human-validated test se…

    Submitted 12 December, 2020; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: In EMNLP 2020

    Journal ref: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2020) 4630-4639

  37. arXiv:2009.05781  [pdf, other]

    cs.CL

    Intent Detection with WikiHow

    Authors: Li Zhang, Qing Lyu, Chris Callison-Burch

    Abstract: Modern task-oriented dialog systems need to reliably understand users' intents. Intent detection is most challenging when moving to new domains or new languages, since there is little annotated data. To address this challenge, we present a suite of pretrained intent detection models. Our models are able to predict a broad range of intended goals from many actions because they are trained on wikiHo…

    Submitted 12 December, 2020; v1 submitted 12 September, 2020; originally announced September 2020.

    Comments: In AACL-IJCNLP 2020

    Journal ref: Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (2020) 328-333

  38. arXiv:2008.04759  [pdf, ps, other]

    cs.CL

    Hybrid Ranking Network for Text-to-SQL

    Authors: Qin Lyu, Kaushik Chakrabarti, Shobhit Hathi, Souvik Kundu, Jianwen Zhang, Zheng Chen

    Abstract: In this paper, we study how to leverage pre-trained language models in Text-to-SQL. We argue that previous approaches under utilize the base language models by concatenating all columns together with the NL question and feeding them into the base language model in the encoding stage. We propose a neat approach called Hybrid Ranking Network (HydraNet) which breaks down the problem into column-wise…

    Submitted 11 August, 2020; originally announced August 2020.

  39. arXiv:2008.00682  [pdf, ps, other]

    cs.LG cs.AI

    Discovering indicators of dark horse of soccer games by deep learning from sequential trading data

    Authors: Liyao Lu, Qiang Lyu

    Abstract: It is not surprise for machine learning models to provide decent prediction accuracy of soccer games outcomes based on various objective metrics. However, the performance is not that decent in terms of predicting difficult and valuable matches. A deep learning model is designed and trained on a real sequential trading data from the real prediction market, with the assumption that such trading data…

    Submitted 3 August, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

  40. arXiv:2006.12700  [pdf, other]

    eess.IV cs.LG physics.med-ph

    Cine Cardiac MRI Motion Artifact Reduction Using a Recurrent Neural Network

    Authors: Qing Lyu, Hongming Shan, Yibin Xie, Debiao Li, Ge Wang

    Abstract: Cine cardiac magnetic resonance imaging (MRI) is widely used for diagnosis of cardiac diseases thanks to its ability to present cardiovascular features in excellent contrast. As compared to computed tomography (CT), MRI, however, requires a long scan time, which inevitably induces motion artifacts and causes patients' discomfort. Thus, there has been a strong clinical motivation to develop techniq…

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: 10 pages, 11 figures

  41. arXiv:1910.09455  [pdf, other]

    cs.CV

    Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks

    Authors: Yihui He, Jianing Qian, Jianren Wang, Cindy X. Le, Congrui Hetang, Qi Lyu, Wenping Wang, Tianwei Yue

    Abstract: Very deep convolutional neural networks (CNNs) have been firmly established as the primary methods for many computer vision tasks. However, most state-of-the-art CNNs are large, which results in high inference latency. Recently, depth-wise separable convolution has been proposed for image recognition tasks on computationally limited platforms such as robotics and self-driving cars. Though it is mu…

    Submitted 23 September, 2023; v1 submitted 21 October, 2019; originally announced October 2019.

  42. Nonlinear Multiview Analysis: Identifiability and Neural Network-assisted Implementation

    Authors: Qi Lyu, Xiao Fu

    Abstract: Multiview analysis aims at extracting shared latent components from data samples that are acquired in different domains, e.g., image, text, and audio. Classic multiview analysis, e.g., canonical correlation analysis (CCA), tackles this problem via matching the linearly transformed views in a certain latent domain. More recently, powerful nonlinear learning tools such as kernel methods and neural n…

    Submitted 30 March, 2020; v1 submitted 19 September, 2019; originally announced September 2019.

    Comments: accepted version; IEEE transactions on signal processing

  43. arXiv:1908.01612  [pdf, other]

    eess.IV cs.LG physics.med-ph

    Multi-Contrast Super-Resolution MRI Through a Progressive Network

    Authors: Qing Lyu, Hongming Shan, Ge Wang

    Abstract: Magnetic resonance imaging (MRI) is widely used for screening, diagnosis, image-guided therapy, and scientific research. A significant advantage of MRI over other imaging modalities such as computed tomography (CT) and nuclear imaging is that it clearly shows soft tissues in multi-contrasts. Compared with other medical image super-resolution (SR) methods that are in a single contrast, multi-contra…

    Submitted 6 August, 2019; v1 submitted 5 August, 2019; originally announced August 2019.

    Comments: 10 figures, 5 tables, 11 pages

    Journal ref: IEEE Transactions on Medical Imaging, early access, 2020

  44. arXiv:1907.03063  [pdf]

    eess.IV cs.LG physics.med-ph

    MRI Super-Resolution with Ensemble Learning and Complementary Priors

    Authors: Qing Lyu, Hongming Shan, Ge Wang

    Abstract: Magnetic resonance imaging (MRI) is a widely used medical imaging modality. However, due to the limitations in hardware, scan time, and throughput, it is often clinically challenging to obtain high-quality MR images. The super-resolution approach is potentially promising to improve MR image quality without any hardware upgrade. In this paper, we propose an ensemble learning and deep learning frame…

    Submitted 5 July, 2019; originally announced July 2019.

    Journal ref: IEEE Transactions on Computational Imaging, vol. 6, pp. 615-624, 2020

  45. arXiv:1810.10286  [pdf, other]

    cs.CV

    Learning color space adaptation from synthetic to real images of cirrus clouds

    Authors: Qing Lyu, Minghao Chen, Xiang Chen

    Abstract: Cloud segmentation plays a crucial role in image analysis for climate modeling. Manually labeling the training data for cloud segmentation is time-consuming and error-prone. We explore to train segmentation networks with synthetic data due to the natural acquisition of pixel-level labels. Nevertheless, the domain gap between synthetic and real images significantly degrades the performance of the t…

    Submitted 16 November, 2020; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: 12 pages, 13 figures

  46. arXiv:1802.00810  [pdf, ps, other]

    q-bio.GN cs.LG

    Deep Learning for Genomics: A Concise Overview

    Authors: Tianwei Yue, Yuanxin Wang, Longxiang Zhang, Chunming Gu, Haoru Xue, Wenping Wang, Qi Lyu, Yujie Dun

    Abstract: Advancements in genomic research such as high-throughput sequencing techniques have driven modern genomic studies into "big data" disciplines. This data explosion is constantly challenging conventional methods used in genomics. In parallel with the urgent demand for robust algorithms, deep learning has succeeded in a variety of fields such as vision, speech, and text processing. Yet genomics entai…

    Submitted 4 October, 2023; v1 submitted 2 February, 2018; originally announced February 2018.

  47. arXiv:1710.07941  [pdf, other]

    cs.HC cs.CR stat.ML

    WristAuthen: A Dynamic Time Wrapping Approach for User Authentication by Hand-Interaction through Wrist-Worn Devices

    Authors: Qi Lyu, Zhifeng Kong, Chao Shen, Tianwei Yue

    Abstract: The growing trend of using wearable devices for context-aware computing and pervasive sensing systems has raised its potentials for quick and reliable authentication techniques. Since personal writing habitats differ from each other, it is possible to realize user authentication through writing. This is of great significance as sensible information is easily collected by these devices. This paper…

    Submitted 22 October, 2017; originally announced October 2017.

    Comments: 11 pages, 12 figures

  48. arXiv:1611.00873  [pdf, ps, other]

    cs.AI cs.LG

    Extracting Actionability from Machine Learning Models by Sub-optimal Deterministic Planning

    Authors: Qiang Lyu, Yixin Chen, Zhaorong Li, Zhicheng Cui, Ling Chen, Xing Zhang, Haihua Shen

    Abstract: A main focus of machine learning research has been improving the generalization accuracy and efficiency of prediction models. Many models such as SVM, random forest, and deep neural nets have been proposed and achieved great success. However, what emerges as missing in many applications is actionability, i.e., the ability to turn prediction results into actions. For example, in applications such a…

    Submitted 2 November, 2016; originally announced November 2016.

    Comments: 16 pages, 4 figures