Search Results for author: Shu Yang

Found 28 papers, 10 papers with code

Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs

no code implementations30 Mar 2024 Shu Yang, Jiayuan Su, Han Jiang, Mengdi Li, Keyuan Cheng, Muhammad Asif Ali, Lijie Hu, Di Wang

With the rise of large language models (LLMs), ensuring they embody the principles of being helpful, honest, and harmless (3H), known as Human Alignment, becomes crucial.

knowledge editing Navigate +1

PROMPT-SAW: Leveraging Relation-Aware Graphs for Textual Prompt Compression

no code implementations30 Mar 2024 Muhammad Asif Ali, ZhengPing Li, Shu Yang, Keyuan Cheng, Yang Cao, Tianhao Huang, Lijie Hu, Lu Yu, Di Wang

Large language models (LLMs) have shown exceptional abilities for multiple different natural language processing tasks.

GSM8K Relation

MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology

1 code implementation11 Mar 2024 Shu Yang, Yihui Wang, Hao Chen

Multiple Instance Learning (MIL) has emerged as a dominant paradigm to extract discriminative feature representations within Whole Slide Images (WSIs) in computational pathology.

Multiple Instance Learning whole slide images

SDiT: Spiking Diffusion Model with Transformer

no code implementations18 Feb 2024 Shu Yang, Hanzhi Ma, Chengting Yu, Aili Wang, Er-Ping Li

In this paper, we explore a novel diffusion model architecture within spiking neural networks.

Image Generation

MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning

no code implementations17 Feb 2024 Shu Yang, Muhammad Asif Ali, Cheng-Long Wang, Lijie Hu, Di Wang

Adapting large language models (LLMs) to new domains/tasks and enabling them to be efficient lifelong learners is a pivotal challenge.

MONAL: Model Autophagy Analysis for Modeling Human-AI Interactions

no code implementations17 Feb 2024 Shu Yang, Muhammad Asif Ali, Lu Yu, Lijie Hu, Di Wang

The increasing significance of large models and their multi-modal variants in societal information processing has ignited debates on social safety and ethics.

Ethics

Mixed Matrix Completion in Complex Survey Sampling under Heterogeneous Missingness

no code implementations6 Feb 2024 Xiaojun Mao, Hengfang Wang, Zhonglei Wang, Shu Yang

Modern surveys with large sample sizes and growing mixed-type questionnaires require robust and scalable analysis methods.

Matrix Completion Nutrition +1

A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions

1 code implementation23 Oct 2023 Junchao Wu, Shu Yang, Runzhe Zhan, Yulin Yuan, Derek F. Wong, Lidia S. Chao

In this survey, we collate recent research breakthroughs in this area and underscore the pressing need to bolster detector research.

LLM-generated Text Detection Text Detection

Positivity-free Policy Learning with Observational Data

1 code implementation10 Oct 2023 Pan Zhao, Antoine Chambaz, Julie Josse, Shu Yang

Policy learning utilizing observational data is pivotal across various domains, with the objective of learning the optimal treatment assignment policy while adhering to specific constraints such as fairness, budget, and simplicity.

Fairness

Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation

no code implementations4 Apr 2023 Tao Fang, Shu Yang, Kaixin Lan, Derek F. Wong, Jinpeng Hu, Lidia S. Chao, Yue Zhang

To showcase its capabilities in GEC, we design zero-shot chain-of-thought (CoT) and few-shot CoT settings using in-context learning for ChatGPT.

Grammatical Error Correction In-Context Learning +2

Efficient and robust transfer learning of optimal individualized treatment regimes with right-censored survival data

1 code implementation13 Jan 2023 Pan Zhao, Julie Josse, Shu Yang

We present an efficient and robust transfer learning framework for estimating the optimal ITR with right-censored survival data that generalizes well to the target population.

counterfactual Transfer Learning

A Bayesian Semiparametric Method For Estimating Causal Quantile Effects

no code implementations3 Nov 2022 Steven G. Xu, Shu Yang, Brian J. Reich

We adopt a semiparametric conditional distribution regression model that allows inference on any functionals of counterfactual distributions, including PDFs and multiple QTEs.

Causal Inference counterfactual

Self-supervised Denoising via Low-rank Tensor Approximated Convolutional Neural Network

no code implementations26 Sep 2022 Chenyin Gao, Shu Yang, Anru R. Zhang

With the proposed design, we are able to characterize our denoiser with fewer parameters and train it based on a single image, which considerably improves the model generalizability and reduces the cost of data acquisition.

Image Denoising

Faith: An Efficient Framework for Transformer Verification on GPUs

1 code implementation23 Sep 2022 Boyuan Feng, Tianqi Tang, yuke wang, Zhaodong Chen, Zheng Wang, Shu Yang, Yuan Xie, Yufei Ding

In this paper, we propose Faith, an efficient framework for transformer verification on GPUs.

Sentence

UMSNet: An Universal Multi-sensor Network for Human Activity Recognition

no code implementations24 May 2022 Jialiang Wang, Haotian Wei, Yi Wang, Shu Yang, Chi Li

Human activity recognition (HAR) based on multimodal sensors has become a rapidly growing branch of biometric recognition and artificial intelligence.

Human Activity Recognition Time Series +2

Combining Doubly Robust Methods and Machine Learning for Estimating Average Treatment Effects for Observational Real-world Data

1 code implementation23 Apr 2022 Xiaoqing Tan, Shu Yang, Wenyu Ye, Douglas E. Faries, Ilya Lipkovich, Zbigniew Kadziola

Recently, various doubly robust methods have been proposed for average treatment effect estimation by combining the treatment model and the outcome model via different vehicles, such as matching, weighting, and regression.

regression

Targeted Optimal Treatment Regime Learning Using Summary Statistics

no code implementations17 Jan 2022 Jianing Chu, Wenbin Lu, Shu Yang

We consider the problem of treatment regime estimation when the source and target populations may be heterogeneous, individual-level data is available from the source population, and only the summary information of covariates, such as moments, is accessible from the target population.

Decision Making

A spectral adjustment for spatial confounding

no code implementations22 Dec 2020 Yawen Guan, Garritt L. Page, Brian J Reich, Massimo Ventrucci, Shu Yang

We show that this assumption in the spectral domain is equivalent to adjusting for global-scale confounding in the spatial domain by adding a spatially smoothed version of the treatment variable to the mean of the response variable.

Methodology

A Distributed Training Algorithm of Generative Adversarial Networks with Quantized Gradients

no code implementations26 Oct 2020 Xiaojun Chen, Shu Yang, Li Shen, Xuanrong Pang

In this paper, we propose a {distributed GANs training algorithm with quantized gradient, dubbed DQGAN,} which is the first distributed training method with quantized gradient for GANs.

SGQuant: Squeezing the Last Bit on Graph Neural Networks with Specialized Quantization

no code implementations9 Jul 2020 Boyuan Feng, yuke wang, Xu Li, Shu Yang, Xueqiao Peng, Yufei Ding

With the increasing popularity of graph-based learning, Graph Neural Networks (GNNs) win lots of attention from the research and industry field because of their high accuracy.

Quantization

Double score matching estimators of average and quantile treatment effects

1 code implementation16 Jan 2020 Shu Yang, Yunshu Zhang

Propensity score matching has a long tradition for handling confounding in causal inference.

Methodology

Multi-cause causal inference with unmeasured confounding and binary outcome

no code implementations31 Jul 2019 Dehan Kong, Shu Yang, Linbo Wang

Unobserved confounding presents a major threat to causal inference in observational studies.

Methodology

PCNN: Environment Adaptive Model Without Finetuning

no code implementations ICLR 2019 Boyuan Feng, Kun Wan, Shu Yang, Yufei Ding

Convolutional Neural Networks (CNNs) have achieved tremendous success for many computer vision tasks, which shows a promising perspective of deploying CNNs on mobile platforms.

Transfer Learning

Penetrating the Fog: the Path to Efficient CNN Models

no code implementations ICLR 2019 Kun Wan, Boyuan Feng, Shu Yang, Yufei Ding

In this paper, we are the first in the field to consider how to craft an effective sparse kernel design by eliminating the large design space.

Domain-Adversarial Multi-Task Framework for Novel Therapeutic Property Prediction of Compounds

1 code implementation28 Sep 2018 Lingwei Xie, Song He, Shu Yang, Boyuan Feng, Kun Wan, Zhongnan Zhang, Xiaochen Bo, Yufei Ding

In this paper, we propose a novel domain-adversarial multi-task framework for integrating shared knowledge from multiple domains.

Property Prediction

Semiparametric estimation of structural failure time model in continuous-time processes

1 code implementation20 Aug 2018 Shu Yang, Karen Pieper, Frank Cools

We propose a class of continuous-time structural failure time models and semiparametric estimators, which do not restrict to regularly spaced data.

Methodology

Cannot find the paper you are looking for? You can Submit a new open access paper.