Search Results for author: Yao Liu

Found 34 papers, 7 papers with code

InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment

1 code implementation • 13 Feb 2024 • Jianing Wang, Junda Wu, Yupeng Hou, Yao Liu, Ming Gao, Julian McAuley

In this paper, we propose InstructGraph, a framework that empowers LLMs with graph reasoning and generation abilities through instruction tuning and preference alignment.

Hallucination
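
The "preference alignment" step in this entry can be read as a standard preference-optimization objective. For orientation only, one common choice is the DPO loss over preferred/dispreferred outputs (y_w, y_l); whether InstructGraph uses exactly this form is an assumption here:

```latex
\mathcal{L}_{\mathrm{DPO}}(\theta)
= -\,\mathbb{E}_{(x,\, y_w,\, y_l)}
\left[ \log \sigma\!\left(
  \beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)}
- \beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)} \right) \right]
```

Here pi_ref is the (instruction-tuned) reference model and beta a temperature; preferred outputs are pushed up relative to dispreferred, e.g. hallucinated, ones.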

Precipitation Prediction Using an Ensemble of Lightweight Learners

1 code implementation • 30 Nov 2023 • Xinzhe Li, Sun Rui, Yiming Niu, Yao Liu

Specifically, the framework consists of a precipitation predictor with multiple lightweight heads (learners) and a controller that combines the outputs from these heads.

Ensemble Learning
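
The heads-plus-controller design described in this entry lends itself to a compact implementation. Below is a minimal, illustrative PyTorch sketch, not the paper's code: module names, layer sizes, and the softmax-weighted combination are assumptions. A shared backbone feeds several lightweight heads, and a controller produces per-head weights to combine their outputs.

```python
import torch
import torch.nn as nn

class EnsemblePredictor(nn.Module):
    """Illustrative sketch: shared backbone, lightweight heads (learners),
    and a controller that softmax-weights the heads' outputs."""

    def __init__(self, in_ch=1, feat_ch=32, num_heads=4):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(in_ch, feat_ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(feat_ch, feat_ch, 3, padding=1), nn.ReLU(),
        )
        # Each head is a small (lightweight) convolution producing one prediction map.
        self.heads = nn.ModuleList(
            [nn.Conv2d(feat_ch, 1, 3, padding=1) for _ in range(num_heads)]
        )
        # Controller maps pooled backbone features to one weight per head.
        self.controller = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(feat_ch, num_heads),
        )

    def forward(self, x):
        feats = self.backbone(x)
        preds = torch.stack([h(feats) for h in self.heads], dim=1)  # (B, H, 1, h, w)
        w = torch.softmax(self.controller(feats), dim=1)            # (B, H)
        return (w[:, :, None, None, None] * preds).sum(dim=1)       # weighted combination
```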

Parrot-Trained Adversarial Examples: Pushing the Practicality of Black-Box Audio Attacks against Speaker Recognition Models

no code implementations • 13 Nov 2023 • Rui Duan, Zhe Qu, Leah Ding, Yao Liu, Zhuo Lu

Motivated by recent advancements in voice conversion (VC), we propose to use knowledge from one short sentence of the target speaker to generate synthetic speech samples that sound like the target speaker, called parrot speech.

Sentence Speaker Recognition +1

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models

no code implementations • 9 Oct 2023 • Zuxin Liu, Jesse Zhang, Kavosh Asadi, Yao Liu, Ding Zhao, Shoham Sabach, Rasool Fakoor

Inspired by recent advancements in parameter-efficient fine-tuning in language domains, we explore efficient fine-tuning techniques -- e.g., Bottleneck Adapters, P-Tuning, and Low-Rank Adaptation (LoRA) -- in TAIL to adapt large pretrained models for new tasks with limited demonstration data.

Continual Learning Imitation Learning
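
Of the techniques this entry lists, LoRA is the easiest to illustrate. The following is a minimal, generic PyTorch sketch of a LoRA-adapted linear layer, not the TAIL implementation; the wrapper name, rank, and scaling are assumptions. The pretrained weight stays frozen while a trainable low-rank update B·A is added.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Generic LoRA sketch: y = W x + (alpha / r) * B A x, with the pretrained
    W frozen and only the low-rank factors A, B trained."""

    def __init__(self, base: nn.Linear, r=8, alpha=16):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)      # freeze pretrained weight
        if self.base.bias is not None:
            self.base.bias.requires_grad_(False)    # freeze pretrained bias
        self.lora_A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: no change at start
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.lora_A.T @ self.lora_B.T)

# Usage: wrap an existing layer, e.g. LoRALinear(nn.Linear(768, 768), r=8)
```

Because lora_B starts at zero, the wrapped layer initially behaves exactly like the pretrained one, which is the usual LoRA design choice.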

A Novel Convolutional Neural Network Architecture with a Continuous Symmetry

1 code implementation • 3 Aug 2023 • Yao Liu, Hang Shao, Bing Bai

This paper introduces a new Convolutional Neural Network (ConvNet) architecture inspired by a class of partial differential equations (PDEs) called quasi-linear hyperbolic systems.

Image Classification

Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition

no code implementations • 22 Jul 2023 • Yao Liu, Gangfeng Cui, Jiahui Luo, Lina Yao, Xiaojun Chang

Subsequently, a frame features learning module and a two-stream multi-level feature aggregation module extract global and partial features from the sampled frames, effectively representing the local-region spatial information, appearance information, and motion information related to the interactions.

Action Recognition Temporal Action Localization

Uncertainty-Aware Pedestrian Trajectory Prediction via Distributional Diffusion

no code implementations • 15 Mar 2023 • Yao Liu, Zesheng Ye, Binghao Li, Lina Yao

In this work, we propose to model these two factors separately: we implicitly derive a flexible distribution that describes pedestrians' complex movements, while incorporating each individual's predictive uncertainty through explicit density functions over their future locations.

Denoising Pedestrian Trajectory Prediction +1

Perception-Aware Attack: Creating Adversarial Music via Reverse-Engineering Human Perception

no code implementations • 26 Jul 2022 • Rui Duan, Zhe Qu, Shangqing Zhao, Leah Ding, Yao Liu, Zhuo Lu

In this work, we formulate the adversarial attack against music signals as a new perception-aware attack framework, which integrates human study into adversarial attack design.

Adversarial Attack Speaker Recognition +2

Towards Adaptive Unknown Authentication for Universal Domain Adaptation by Classifier Paradox

no code implementations • 10 Jul 2022 • Yunyun Wang, Yao Liu, Songcan Chen

In this paper, we propose a new UniDA method with adaptive Unknown Authentication by Classifier Paradox (UACP), considering that samples with paradoxical predictions are probably unknowns belonging to none of the source classes.

Universal Domain Adaptation Unsupervised Domain Adaptation

Offline Policy Optimization with Eligible Actions

1 code implementation • 1 Jul 2022 • Yao Liu, Yannis Flet-Berliac, Emma Brunskill

Offline policy optimization could have a large impact on many real-world decision-making problems, as online learning may be infeasible in many applications.

Continuous Control Decision Making

Generalized Federated Learning via Sharpness Aware Minimization

no code implementations • 6 Jun 2022 • Zhe Qu, Xingyu Li, Rui Duan, Yao Liu, Bo Tang, Zhuo Lu

Therefore, in this paper, we revisit the solutions to the distribution shift problem in FL with a focus on local learning generality.

Federated Learning Privacy Preserving
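
For reference, sharpness-aware minimization, the building block the title refers to, replaces the plain training loss with a min-max objective. This is the generic SAM formulation together with its usual first-order approximation of the inner maximization; it is not necessarily the exact variant used in this paper:

```latex
\min_{w}\; \max_{\|\epsilon\|_2 \le \rho} L(w + \epsilon),
\qquad
\hat{\epsilon}(w) = \rho \,\frac{\nabla_w L(w)}{\|\nabla_w L(w)\|_2}
```

In practice each update computes the perturbation epsilon-hat and then takes a gradient step evaluated at w + epsilon-hat, steering local training toward flat minima that generalize across clients' shifted distributions.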

Provably Sample-Efficient RL with Side Information about Latent Dynamics

no code implementations • 27 May 2022 • Yao Liu, Dipendra Misra, Miro Dudík, Robert E. Schapire

We study reinforcement learning (RL) in settings where observations are high-dimensional, but where an RL agent has access to abstract knowledge about the structure of the state space, as is the case, for example, when a robot is tasked to go to a specific room in a building using observations from its own camera, while having access to the floor plan.

reinforcement-learning Reinforcement Learning (RL) +1

Tiny Object Tracking: A Large-scale Dataset and A Baseline

1 code implementation • 11 Feb 2022 • Yabin Zhu, Chenglong Li, Yao Liu, Xiao Wang, Jin Tang, Bin Luo, Zhixiang Huang

Tiny objects, frequently appearing in practical applications, have weak appearance and features, and receive increasing interest in many vision tasks, such as object detection and segmentation.

Attribute Knowledge Distillation +4

IoTGAN: GAN Powered Camouflage Against Machine Learning Based IoT Device Identification

no code implementations • 10 Jan 2022 • Tao Hou, Tao Wang, Zhuo Lu, Yao Liu, Yalin Sagduyu

In this research, we propose a novel attack strategy named IoTGAN to manipulate an IoT device's traffic such that it can evade machine learning based IoT device identification.

BIG-bench Machine Learning

LoMar: A Local Defense Against Poisoning Attack on Federated Learning

no code implementations • 8 Jan 2022 • Xingyu Li, Zhe Qu, Shangqing Zhao, Bo Tang, Zhuo Lu, Yao Liu

Federated learning (FL) provides a highly efficient decentralized machine learning framework, where the training data remains distributed at remote clients in a network.

Density Estimation Edge-computing +2

Context-Aware Online Client Selection for Hierarchical Federated Learning

no code implementations • 2 Dec 2021 • Zhe Qu, Rui Duan, Lixing Chen, Jie Xu, Zhuo Lu, Yao Liu

In addition, client selection for HFL faces more challenges than conventional FL, e.g., the time-varying connection of client-ES pairs and the limited budget of the Network Operator (NO).

Federated Learning

Avoiding Overfitting to the Importance Weights in Offline Policy Optimization

no code implementations • 29 Sep 2021 • Yao Liu, Emma Brunskill

Offline policy optimization has a critical impact on many real-world decision-making problems, as online learning is costly and concerning in many applications.

Decision Making

Applying VertexShuffle Toward 360-Degree Video Super-Resolution on Focused-Icosahedral-Mesh

no code implementations • 21 Jun 2021 • Na Li, Yao Liu

We further apply our proposed methods to super-resolution, proposing the first spherical super-resolution model that directly operates on a mesh representation of spherical pixels of 360-degree data.

Video Super-Resolution

Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration

no code implementations • NeurIPS 2020 • Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

Doing batch RL in a way that yields a reliable new policy in large domains is challenging: a new decision policy may visit states and actions outside the support of the batch data, and function approximation and optimization with limited samples can further increase the potential of learning policies with overly optimistic estimates of their future performance.

reinforcement-learning Reinforcement Learning (RL)

Provably Good Batch Reinforcement Learning Without Great Exploration

1 code implementation • 16 Jul 2020 • Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

Doing batch RL in a way that yields a reliable new policy in large domains is challenging: a new decision policy may visit states and actions outside the support of the batch data, and function approximation and optimization with limited samples can further increase the potential of learning policies with overly optimistic estimates of their future performance.

reinforcement-learning Reinforcement Learning (RL)

Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions

no code implementations • ICML 2020 • Omer Gottesman, Joseph Futoma, Yao Liu, Sonali Parbhoo, Leo Anthony Celi, Emma Brunskill, Finale Doshi-Velez

Off-policy evaluation in reinforcement learning offers the chance of using observational data to improve future outcomes in domains such as healthcare and education, but safe deployment in high stakes settings requires ways of assessing its validity.

Off-policy evaluation reinforcement-learning

SSAH: Semi-supervised Adversarial Deep Hashing with Self-paced Hard Sample Generation

no code implementations • 20 Nov 2019 • Sheng Jin, Shangchen Zhou, Yao Liu, Chao Chen, Xiaoshuai Sun, Hongxun Yao, Xian-Sheng Hua

In this paper, we propose a novel Semi-supervised Self-paced Adversarial Hashing method, named SSAH, to solve the above problems in a unified framework.

Deep Hashing Generative Adversarial Network

All-Action Policy Gradient Methods: A Numerical Integration Approach

no code implementations • 21 Oct 2019 • Benjamin Petit, Loren Amdahl-Culleton, Yao Liu, Jimmy Smith, Pierre-Luc Bacon

While often stated as an instance of the likelihood ratio trick [Rubinstein, 1989], the original policy gradient theorem [Sutton, 1999] involves an integral over the action space.

Continuous Control Numerical Integration +1
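
The distinction this entry draws can be written out directly. In standard notation, the policy gradient admits both the single-sample likelihood-ratio form and the all-action integral form over the action space:

```latex
\nabla_\theta J(\theta)
= \mathbb{E}_{s \sim d^{\pi_\theta},\, a \sim \pi_\theta(\cdot \mid s)}
  \big[ \nabla_\theta \log \pi_\theta(a \mid s)\, Q^{\pi_\theta}(s, a) \big]
= \mathbb{E}_{s \sim d^{\pi_\theta}}
  \Big[ \int_{\mathcal{A}} \nabla_\theta \pi_\theta(a \mid s)\, Q^{\pi_\theta}(s, a)\, \mathrm{d}a \Big]
```

As the title suggests, the paper's "all-action" estimators approximate the inner integral with numerical integration rather than a single sampled action.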

Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling

no code implementations • ICML 2020 • Yao Liu, Pierre-Luc Bacon, Emma Brunskill

Surprisingly, we find that in finite-horizon MDPs there is no strict variance reduction from per-decision importance sampling or stationary importance sampling compared with vanilla importance sampling.

Off-policy evaluation
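
To fix notation for the comparison in this entry: with per-step importance ratios rho_t = pi_e(a_t | s_t) / pi_b(a_t | s_t), the vanilla (trajectory-wise) and per-decision importance sampling estimators of a return are, in their standard definitions:

```latex
\hat{V}_{\mathrm{IS}} = \Big( \prod_{t=0}^{H-1} \rho_t \Big) \sum_{t=0}^{H-1} \gamma^t r_t,
\qquad
\hat{V}_{\mathrm{PDIS}} = \sum_{t=0}^{H-1} \gamma^t \Big( \prod_{t'=0}^{t} \rho_{t'} \Big) r_t
```

PDIS weights each reward only by the ratios up to that step; the paper's finding is that this reweighting need not strictly reduce variance in finite-horizon MDPs.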

Off-Policy Policy Gradient with State Distribution Correction

no code implementations • 17 Apr 2019 • Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

We study the problem of off-policy policy optimization in Markov decision processes, and develop a novel off-policy policy gradient method.
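
Schematically, the "state distribution correction" in the title refers to reweighting by the ratio of the target policy's discounted state distribution to that of the behavior data. One standard way to write the corrected gradient (notation assumed here, not copied from the paper) is:

```latex
\nabla_\theta J(\theta)
\approx \mathbb{E}_{s \sim d^{\beta},\, a \sim \beta(\cdot \mid s)}
\Big[ \frac{d^{\pi_\theta}(s)}{d^{\beta}(s)} \,
      \frac{\pi_\theta(a \mid s)}{\beta(a \mid s)} \,
      \nabla_\theta \log \pi_\theta(a \mid s)\, Q^{\pi_\theta}(s, a) \Big]
```

where beta is the behavior policy and d^pi denotes a discounted state distribution; without the state ratio, off-policy gradients are biased toward states the behavior policy visits.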

Aurora Guard: Real-Time Face Anti-Spoofing via Light Reflection

no code implementations • 27 Feb 2019 • Yao Liu, Ying Tai, Jilin Li, Shouhong Ding, Chengjie Wang, Feiyue Huang, Dongyang Li, Wenshuai Qi, Rongrong Ji

In this paper, we propose a light reflection based face anti-spoofing method named Aurora Guard (AG), which is fast, simple yet effective that has already been deployed in real-world systems serving for millions of users.

Face Anti-Spoofing General Classification

Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters

no code implementations • 3 Jul 2018 • Aniruddh Raghu, Omer Gottesman, Yao Liu, Matthieu Komorowski, Aldo Faisal, Finale Doshi-Velez, Emma Brunskill

In this work, we consider the problem of estimating a behaviour policy for use in Off-Policy Policy Evaluation (OPE) when the true behaviour policy is unknown.
