Search Results for author: Shu Yang

Found 28 papers, 10 papers with code

MedDr: Diagnosis-Guided Bootstrapping for Large-Scale Medical Vision-Language Learning

no code implementations • 23 Apr 2024 • Sunan He, Yuxiang Nie, Zhixuan Chen, Zhiyuan Cai, Hongmei Wang, Shu Yang, Hao Chen

The rapid advancement of large-scale vision-language models has showcased remarkable capabilities across various tasks.

Medical Diagnosis Medical Report Generation +3

Paper
Add Code

Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs

no code implementations • 30 Mar 2024 • Shu Yang, Jiayuan Su, Han Jiang, Mengdi Li, Keyuan Cheng, Muhammad Asif Ali, Lijie Hu, Di Wang

With the rise of large language models (LLMs), ensuring they embody the principles of being helpful, honest, and harmless (3H), known as Human Alignment, becomes crucial.

knowledge editing Navigate +1

Paper
Add Code

PROMPT-SAW: Leveraging Relation-Aware Graphs for Textual Prompt Compression

no code implementations • 30 Mar 2024 • Muhammad Asif Ali, ZhengPing Li, Shu Yang, Keyuan Cheng, Yang Cao, Tianhao Huang, Lijie Hu, Lu Yu, Di Wang

Large language models (LLMs) have shown exceptional abilities for multiple different natural language processing tasks.

GSM8K Relation

Paper
Add Code

MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology

1 code implementation • 11 Mar 2024 • Shu Yang, Yihui Wang, Hao Chen

Multiple Instance Learning (MIL) has emerged as a dominant paradigm to extract discriminative feature representations within Whole Slide Images (WSIs) in computational pathology.

Multiple Instance Learning whole slide images

Paper
Code

SDiT: Spiking Diffusion Model with Transformer

no code implementations • 18 Feb 2024 • Shu Yang, Hanzhi Ma, Chengting Yu, Aili Wang, Er-Ping Li

In this paper, we explore a novel diffusion model architecture within spiking neural networks.

Image Generation

Paper
Add Code

MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning

no code implementations • 17 Feb 2024 • Shu Yang, Muhammad Asif Ali, Cheng-Long Wang, Lijie Hu, Di Wang

Adapting large language models (LLMs) to new domains/tasks and enabling them to be efficient lifelong learners is a pivotal challenge.

Paper
Add Code

MONAL: Model Autophagy Analysis for Modeling Human-AI Interactions

no code implementations • 17 Feb 2024 • Shu Yang, Muhammad Asif Ali, Lu Yu, Lijie Hu, Di Wang

The increasing significance of large models and their multi-modal variants in societal information processing has ignited debates on social safety and ethics.

Ethics

Paper
Add Code

Mixed Matrix Completion in Complex Survey Sampling under Heterogeneous Missingness

no code implementations • 6 Feb 2024 • Xiaojun Mao, Hengfang Wang, Zhonglei Wang, Shu Yang

Modern surveys with large sample sizes and growing mixed-type questionnaires require robust and scalable analysis methods.

Matrix Completion Nutrition +1

Paper
Add Code

A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions

1 code implementation • 23 Oct 2023 • Junchao Wu, Shu Yang, Runzhe Zhan, Yulin Yuan, Derek F. Wong, Lidia S. Chao

In this survey, we collate recent research breakthroughs in this area and underscore the pressing need to bolster detector research.

LLM-generated Text Detection Text Detection

120

Paper
Code

Positivity-free Policy Learning with Observational Data

1 code implementation • 10 Oct 2023 • Pan Zhao, Antoine Chambaz, Julie Josse, Shu Yang

Policy learning utilizing observational data is pivotal across various domains, with the objective of learning the optimal treatment assignment policy while adhering to specific constraints such as fairness, budget, and simplicity.

Fairness

Paper
Code

Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation

no code implementations • 4 Apr 2023 • Tao Fang, Shu Yang, Kaixin Lan, Derek F. Wong, Jinpeng Hu, Lidia S. Chao, Yue Zhang

To showcase its capabilities in GEC, we design zero-shot chain-of-thought (CoT) and few-shot CoT settings using in-context learning for ChatGPT.

Grammatical Error Correction In-Context Learning +2

Paper
Add Code

Efficient and robust transfer learning of optimal individualized treatment regimes with right-censored survival data

1 code implementation • 13 Jan 2023 • Pan Zhao, Julie Josse, Shu Yang

We present an efficient and robust transfer learning framework for estimating the optimal ITR with right-censored survival data that generalizes well to the target population.

counterfactual Transfer Learning

Paper
Code

A Bayesian Semiparametric Method For Estimating Causal Quantile Effects

no code implementations • 3 Nov 2022 • Steven G. Xu, Shu Yang, Brian J. Reich

We adopt a semiparametric conditional distribution regression model that allows inference on any functionals of counterfactual distributions, including PDFs and multiple QTEs.

Causal Inference counterfactual

Paper
Add Code

Self-supervised Denoising via Low-rank Tensor Approximated Convolutional Neural Network

no code implementations • 26 Sep 2022 • Chenyin Gao, Shu Yang, Anru R. Zhang

With the proposed design, we are able to characterize our denoiser with fewer parameters and train it based on a single image, which considerably improves the model generalizability and reduces the cost of data acquisition.

Image Denoising

Paper
Add Code

Faith: An Efficient Framework for Transformer Verification on GPUs

1 code implementation • 23 Sep 2022 • Boyuan Feng, Tianqi Tang, yuke wang, Zhaodong Chen, Zheng Wang, Shu Yang, Yuan Xie, Yufei Ding

In this paper, we propose Faith, an efficient framework for transformer verification on GPUs.

Sentence

Paper
Code

UMSNet: An Universal Multi-sensor Network for Human Activity Recognition

no code implementations • 24 May 2022 • Jialiang Wang, Haotian Wei, Yi Wang, Shu Yang, Chi Li

Human activity recognition (HAR) based on multimodal sensors has become a rapidly growing branch of biometric recognition and artificial intelligence.

Human Activity Recognition Time Series +2

Paper
Add Code

Combining Doubly Robust Methods and Machine Learning for Estimating Average Treatment Effects for Observational Real-world Data

1 code implementation • 23 Apr 2022 • Xiaoqing Tan, Shu Yang, Wenyu Ye, Douglas E. Faries, Ilya Lipkovich, Zbigniew Kadziola

Recently, various doubly robust methods have been proposed for average treatment effect estimation by combining the treatment model and the outcome model via different vehicles, such as matching, weighting, and regression.

regression

Paper
Code

Targeted Optimal Treatment Regime Learning Using Summary Statistics

no code implementations • 17 Jan 2022 • Jianing Chu, Wenbin Lu, Shu Yang

We consider the problem of treatment regime estimation when the source and target populations may be heterogeneous, individual-level data is available from the source population, and only the summary information of covariates, such as moments, is accessible from the target population.

Decision Making

Paper
Add Code

Learning Motion-Appearance Co-Attention for Zero-Shot Video Object Segmentation

1 code implementation • ICCV 2021 • Shu Yang, Lu Zhang, Jinqing Qi, Huchuan Lu, Shuo Wang, Xiaoxing Zhang

How to make the appearance and motion information interact effectively to accommodate complex scenarios is a fundamental issue in flow-based zero-shot video object segmentation.

Ranked #4 on Unsupervised Video Object Segmentation on YouTube-Objects

Semantic Segmentation Unsupervised Video Object Segmentation +2

Paper
Code

A spectral adjustment for spatial confounding

no code implementations • 22 Dec 2020 • Yawen Guan, Garritt L. Page, Brian J Reich, Massimo Ventrucci, Shu Yang

We show that this assumption in the spectral domain is equivalent to adjusting for global-scale confounding in the spatial domain by adding a spatially smoothed version of the treatment variable to the mean of the response variable.

Methodology

Paper
Add Code

A Distributed Training Algorithm of Generative Adversarial Networks with Quantized Gradients

no code implementations • 26 Oct 2020 • Xiaojun Chen, Shu Yang, Li Shen, Xuanrong Pang

In this paper, we propose a {distributed GANs training algorithm with quantized gradient, dubbed DQGAN,} which is the first distributed training method with quantized gradient for GANs.

Paper
Add Code

SGQuant: Squeezing the Last Bit on Graph Neural Networks with Specialized Quantization

no code implementations • 9 Jul 2020 • Boyuan Feng, yuke wang, Xu Li, Shu Yang, Xueqiao Peng, Yufei Ding

With the increasing popularity of graph-based learning, Graph Neural Networks (GNNs) win lots of attention from the research and industry field because of their high accuracy.

Quantization

Paper
Add Code

Double score matching estimators of average and quantile treatment effects

1 code implementation • 16 Jan 2020 • Shu Yang, Yunshu Zhang

Propensity score matching has a long tradition for handling confounding in causal inference.

Methodology

Paper
Code

Multi-cause causal inference with unmeasured confounding and binary outcome

no code implementations • 31 Jul 2019 • Dehan Kong, Shu Yang, Linbo Wang

Unobserved confounding presents a major threat to causal inference in observational studies.

Methodology

Paper
Add Code

PCNN: Environment Adaptive Model Without Finetuning

no code implementations • ICLR 2019 • Boyuan Feng, Kun Wan, Shu Yang, Yufei Ding

Convolutional Neural Networks (CNNs) have achieved tremendous success for many computer vision tasks, which shows a promising perspective of deploying CNNs on mobile platforms.

Transfer Learning

Paper
Add Code

Penetrating the Fog: the Path to Efficient CNN Models

no code implementations • ICLR 2019 • Kun Wan, Boyuan Feng, Shu Yang, Yufei Ding

In this paper, we are the first in the field to consider how to craft an effective sparse kernel design by eliminating the large design space.

Paper
Add Code

Domain-Adversarial Multi-Task Framework for Novel Therapeutic Property Prediction of Compounds

1 code implementation • 28 Sep 2018 • Lingwei Xie, Song He, Shu Yang, Boyuan Feng, Kun Wan, Zhongnan Zhang, Xiaochen Bo, Yufei Ding

In this paper, we propose a novel domain-adversarial multi-task framework for integrating shared knowledge from multiple domains.

Property Prediction

Paper
Code

Semiparametric estimation of structural failure time model in continuous-time processes

1 code implementation • 20 Aug 2018 • Shu Yang, Karen Pieper, Frank Cools

We propose a class of continuous-time structural failure time models and semiparametric estimators, which do not restrict to regularly spaced data.

Methodology

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.