Search Results for author: Shuo Yang

Found 83 papers, 29 papers with code

DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection

no code implementations11 Sep 2014 Wanli Ouyang, Ping Luo, Xingyu Zeng, Shi Qiu, Yonglong Tian, Hongsheng Li, Shuo Yang, Zhe Wang, Yuanjun Xiong, Chen Qian, Zhenyao Zhu, Ruohui Wang, Chen-Change Loy, Xiaogang Wang, Xiaoou Tang

In the proposed new deep architecture, a new deformation constrained pooling (def-pooling) layer models the deformation of object parts with geometric constraint and penalty.

Object object-detection +1

From Facial Parts Responses to Face Detection: A Deep Learning Approach

1 code implementation ICCV 2015 Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang

In this paper, we propose a novel deep convolutional network (DCN) that achieves outstanding performance on FDDB, PASCAL Face, and AFW.

Face Detection

Application of Statistical Relational Learning to Hybrid Recommendation Systems

no code implementations4 Jul 2016 Shuo Yang, Mohammed Korayem, Khalifeh Aljadda, Trey Grainger, Sriraam Natarajan

In this paper, we proposed a way to adapt the state-of-the-art in SRL learning approaches to construct a real hybrid recommendation system.

Collaborative Filtering Feature Engineering +2

Joint Hand Detection and Rotation Estimation by Using CNN

no code implementations8 Dec 2016 Xiaoming Deng, Ye Yuan, Yinda Zhang, Ping Tan, Liang Chang, Shuo Yang, Hongan Wang

Hand detection is essential for many hand related tasks, e. g. parsing hand pose, understanding gesture, which are extremely useful for robotics and human-computer interaction.

General Classification Hand Detection +2

Faceness-Net: Face Detection through Deep Facial Part Responses

no code implementations29 Jan 2017 Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang

We propose a deep convolutional neural network (CNN) for face detection leveraging on facial attributes based supervision.

Face Detection

Hand3D: Hand Pose Estimation using 3D Neural Network

no code implementations7 Apr 2017 Xiaoming Deng, Shuo Yang, yinda zhang, Ping Tan, Liang Chang, Hongan Wang

We propose a novel 3D neural network architecture for 3D hand pose estimation from a single depth image.

3D Hand Pose Estimation

Residual Attention Network for Image Classification

21 code implementations CVPR 2017 Fei Wang, Mengqing Jiang, Chen Qian, Shuo Yang, Cheng Li, Honggang Zhang, Xiaogang Wang, Xiaoou Tang

In this work, we propose "Residual Attention Network", a convolutional neural network using attention mechanism which can incorporate with state-of-art feed forward network architecture in an end-to-end training fashion.

General Classification Image Classification +1

Face Detection through Scale-Friendly Deep Convolutional Networks

no code implementations9 Jun 2017 Shuo Yang, Yuanjun Xiong, Chen Change Loy, Xiaoou Tang

Specifically, our method achieves 76. 4 average precision on the challenging WIDER FACE dataset and 96% recall rate on the FDDB dataset with 7 frames per second (fps) for 900 * 1300 input image.

Face Detection

Optimizing Video Object Detection via a Scale-Time Lattice

1 code implementation CVPR 2018 Kai Chen, Jiaqi Wang, Shuo Yang, Xingcheng Zhang, Yuanjun Xiong, Chen Change Loy, Dahua Lin

High-performance object detection relies on expensive convolutional networks to compute features, often leading to significant challenges in applications, e. g. those that require detecting objects from video streams in real time.

Object object-detection +1

Look at Boundary: A Boundary-Aware Face Alignment Algorithm

2 code implementations CVPR 2018 Wayne Wu, Chen Qian, Shuo Yang, Quan Wang, Yici Cai, Qiang Zhou

By utilising boundary information of 300-W dataset, our method achieves 3. 92% mean error with 0. 39% failure rate on COFW dataset, and 1. 25% mean error on AFLW-Full dataset.

Ranked #4 on Face Alignment on AFLW-19 (using extra training data)

Face Alignment Facial Landmark Detection

Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling

no code implementations ICLR 2018 Kenny J. Young, Richard S. Sutton, Shuo Yang

We suggest one advantage of this particular type of memory is the ability to easily assign credit to a specific state when remembered information is found to be useful.

reinforcement-learning Reinforcement Learning (RL)

Effective Learning of Probabilistic Models for Clinical Predictions from Longitudinal Data

no code implementations2 Nov 2018 Shuo Yang

It also demonstrates the outstanding performance of these proposed models as well as other state of the art machine learning models when applied to medical research problems and other real-world large-scale systems, reveals the great potential of statistical relational learning for exploring the structured health-related data to facilitate medical research.

BIG-bench Machine Learning Medical Diagnosis +1

Region Proposal by Guided Anchoring

1 code implementation CVPR 2019 Jiaqi Wang, Kai Chen, Shuo Yang, Chen Change Loy, Dahua Lin

State-of-the-art detectors mostly rely on a dense anchoring scheme, where anchors are sampled uniformly over the spatial domain with a predefined set of scales and aspect ratios.

object-detection Object Detection +1

Learning Actions from Human Demonstration Video for Robotic Manipulation

no code implementations10 Sep 2019 Shuo Yang, Wei zhang, Weizhi Lu, Hesheng Wang, Yibin Li

However, the general video captioning methods focus more on the understanding of the full frame, lacking of consideration on the specific object of interests in robotic manipulations.

Video Captioning

Interaction Hard Thresholding: Consistent Sparse Quadratic Regression in Sub-quadratic Time and Space

no code implementations NeurIPS 2019 Shuo Yang, Yanyao Shen, Sujay Sanghavi

In this paper, we provide a new algorithm - Interaction Hard Thresholding (IntHT) which is the first one to provably accurately solve this problem in sub-quadratic time and space.

regression

Single-View 3D Object Reconstruction from Shape Priors in Memory

no code implementations CVPR 2021 Shuo Yang, Min Xu, Haozhe Xie, Stuart Perry, Jiahao Xia

Inspired by this, we propose a novel method, named Mem3D, that explicitly constructs shape priors to supplement the missing information in the image.

3D Object Reconstruction 3D Reconstruction +4

Adaptive Semantic-Visual Tree for Hierarchical Embeddings

no code implementations8 Mar 2020 Shuo Yang, Wei Yu, Ying Zheng, Hongxun Yao, Tao Mei

To solve this new problem, we propose a hierarchical adaptive semantic-visual tree (ASVT) to depict the architecture of merchandise categories, which evaluates semantic similarities between different semantic levels and visual similarities within the same semantic class simultaneously.

Image Retrieval Retrieval

High-Order Information Matters: Learning Relation and Topology for Occluded Person Re-Identification

2 code implementations CVPR 2020 Guan'an Wang, Shuo Yang, Huanyu Liu, Zhicheng Wang, Yang Yang, Shuliang Wang, Gang Yu, Erjin Zhou, Jian Sun

When aligning two groups of local features from two images, we view it as a graph matching problem and propose a cross-graph embedded-alignment (CGEA) layer to jointly learn and embed topology information to local features, and straightly predict similarity score.

Graph Matching Person Re-Identification +1

SMOT: Single-Shot Multi Object Tracking

1 code implementation30 Oct 2020 Wei Li, Yuanjun Xiong, Shuo Yang, Siqi Deng, Wei Xia

We combine this scheme with SSD detectors by proposing a novel tracking anchor assignment module.

Multi-Object Tracking Object

Positive-Congruent Training: Towards Regression-Free Model Updates

no code implementations CVPR 2021 Sijie Yan, Yuanjun Xiong, Kaustav Kundu, Shuo Yang, Siqi Deng, Meng Wang, Wei Xia, Stefano Soatto

Reducing inconsistencies in the behavior of different versions of an AI system can be as important in practice as reducing its overall error.

Image Classification regression

The ANTARES Astronomical Time-Domain Event Broker

no code implementations24 Nov 2020 Thomas Matheson, Carl Stubens, Nicholas Wolf, Chien-Hsiu Lee, Gautham Narayan, Abhijit Saha, Adam Scott, Monika Soraisam, Adam S. Bolton, Benjamin Hauger, David R. Silva, John Kececioglu, Carlos Scheidegger, Richard Snodgrass, Patrick D. Aleo, Eric Evans-Jacquez, Navdeep Singh, Zhe Wang, Shuo Yang, Zhenge Zhao

We describe the Arizona-NOIRLab Temporal Analysis and Response to Events System (ANTARES), a software instrument designed to process large-scale streams of astronomical time-domain alerts.

Instrumentation and Methods for Astrophysics

Real-space construction of crystalline topological superconductors and insulators in 2D interacting fermionic systems

no code implementations31 Dec 2020 Jian-Hao Zhang, Shuo Yang, Yang Qi, Zheng-Cheng Gu

The construction and classification of crystalline symmetry protected topological (SPT) phases in interacting bosonic and fermionic systems have been intensively studied in the past few years.

Strongly Correlated Electrons Mesoscale and Nanoscale Physics Mathematical Physics Mathematical Physics

Unsupervised Word Alignment via Cross-Lingual Contrastive Learning

no code implementations1 Jan 2021 Di wu, Liang Ding, Shuo Yang, DaCheng Tao

Recently, the performance of the neural word alignment models has exceeded that of statistical models.

Contrastive Learning Translation +1

Speeding up Deep Learning Training by Sharing Weights and Then Unsharing

no code implementations1 Jan 2021 Shuo Yang, Le Hou, Xiaodan Song, Qiang Liu, Denny Zhou

It has been widely observed that increasing deep learning model sizes often leads to significant performance improvements on a variety of natural language processing and computer vision tasks.

Free Lunch for Few-shot Learning: Distribution Calibration

6 code implementations ICLR 2021 Shuo Yang, Lu Liu, Min Xu

In this paper, we calibrate the distribution of these few-sample classes by transferring statistics from the classes with sufficient examples, then an adequate number of examples can be sampled from the calibrated distribution to expand the inputs to the classifier.

Few-Shot Learning

Combinatorial Bandits without Total Order for Arms

no code implementations3 Mar 2021 Shuo Yang, Tongzheng Ren, Inderjit S. Dhillon, Sujay Sanghavi

Specifically, we focus on a challenging setting where 1) the reward distribution of an arm depends on the set $s$ it is part of, and crucially 2) there is \textit{no total order} for the arms in $\mathcal{A}$.

Linear Bandit Algorithms with Sublinear Time Complexity

no code implementations3 Mar 2021 Shuo Yang, Tongzheng Ren, Sanjay Shakkottai, Eric Price, Inderjit S. Dhillon, Sujay Sanghavi

For sufficiently large $K$, our algorithms have sublinear per-step complexity and $\tilde O(\sqrt{T})$ regret.

Movie Recommendation

An Efficient Multitask Neural Network for Face Alignment, Head Pose Estimation and Face Tracking

no code implementations13 Mar 2021 Jiahao Xia, Haimin Zhang, Shiping Wen, Shuo Yang, Min Xu

Moreover, we generate a cheap heatmap based on the face alignment result and fuse it with features to improve the performance of the other two tasks.

Face Alignment Face Detection +1

Compatibility-aware Heterogeneous Visual Search

no code implementations CVPR 2021 Rahul Duggal, Hao Zhou, Shuo Yang, Yuanjun Xiong, Wei Xia, Zhuowen Tu, Stefano Soatto

Existing systems use the same embedding model to compute representations (embeddings) for the query and gallery images.

Neural Architecture Search Retrieval

Estimating Instance-dependent Bayes-label Transition Matrix using a Deep Neural Network

no code implementations27 May 2021 Shuo Yang, Erkun Yang, Bo Han, Yang Liu, Min Xu, Gang Niu, Tongliang Liu

Motivated by that classifiers mostly output Bayes optimal labels for prediction, in this paper, we study to directly model the transition from Bayes optimal labels to noisy labels (i. e., Bayes-label transition matrix (BLTM)) and learn a classifier to predict Bayes optimal labels.

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

no code implementations6 Jul 2021 Wei Li, Yuanjun Xiong, Shuo Yang, Mingze Xu, Yongxin Wang, Wei Xia

We design a new instance-to-track matching objective to learn appearance embedding that compares a candidate detection to the embedding of the tracks persisted in the tracker.

Multiple Object Tracking Object +1

Theoretical Analysis of Consistency Regularization with Limited Augmented Data

no code implementations29 Sep 2021 Shuo Yang, Yijun Dong, Rachel Ward, Inderjit S Dhillon, Sujay Sanghavi, Qi Lei

Data augmentation is popular in the training of large neural networks; currently, however, there is no clear theoretical comparison between different algorithmic choices on how to use augmented data.

Data Augmentation Generalization Bounds +1

Objects in Semantic Topology

no code implementations ICLR 2022 Shuo Yang, Peize Sun, Yi Jiang, Xiaobo Xia, Ruiheng Zhang, Zehuan Yuan, Changhu Wang, Ping Luo, Min Xu

A more realistic object detection paradigm, Open-World Object Detection, has arisen increasing research interests in the community recently.

Incremental Learning Language Modelling +3

Speeding up Deep Model Training by Sharing Weights and Then Unsharing

no code implementations8 Oct 2021 Shuo Yang, Le Hou, Xiaodan Song, Qiang Liu, Denny Zhou

Our approach exploits the special structure of BERT that contains a stack of repeated modules (i. e., transformer encoders).

Does Preprocessing Help Training Over-parameterized Neural Networks?

no code implementations NeurIPS 2021 Zhao Song, Shuo Yang, Ruizhe Zhang

The classical training method requires paying $\Omega(mnd)$ cost for both forward computation and backward computation, where $m$ is the width of the neural network, and we are given $n$ training points in $d$-dimensional space.

PartImageNet: A Large, High-Quality Dataset of Parts

1 code implementation2 Dec 2021 Ju He, Shuo Yang, Shaokang Yang, Adam Kortylewski, Xiaoding Yuan, Jie-Neng Chen, Shuai Liu, Cheng Yang, Qihang Yu, Alan Yuille

To help address this problem, we propose PartImageNet, a large, high-quality dataset with part segmentation annotations.

Activity Recognition Few-Shot Learning +6

Semantically Contrastive Learning for Low-light Image Enhancement

1 code implementation13 Dec 2021 Dong Liang, Ling Li, Mingqiang Wei, Shuo Yang, Liyan Zhang, Wenhan Yang, Yun Du, Huiyu Zhou

Low-light image enhancement (LLE) remains challenging due to the unfavorable prevailing low-contrast and weak-visibility problems of single RGB images.

Contrastive Learning Low-Light Image Enhancement +1

Sample Efficiency of Data Augmentation Consistency Regularization

no code implementations24 Feb 2022 Shuo Yang, Yijun Dong, Rachel Ward, Inderjit S. Dhillon, Sujay Sanghavi, Qi Lei

Data augmentation is popular in the training of large neural networks; currently, however, there is no clear theoretical comparison between different algorithmic choices on how to use augmented data.

Data Augmentation Generalization Bounds

An Effective Graph Learning based Approach for Temporal Link Prediction: The First Place of WSDM Cup 2022

1 code implementation1 Mar 2022 Qian Zhao, Shuo Yang, Binbin Hu, Zhiqiang Zhang, Yakun Wang, Yusong Chen, Jun Zhou, Chuan Shi

Temporal link prediction, as one of the most crucial work in temporal graphs, has attracted lots of attention from the research area.

Attribute Graph Learning +1

CAFE: Learning to Condense Dataset by Aligning Features

2 code implementations CVPR 2022 Kai Wang, Bo Zhao, Xiangyu Peng, Zheng Zhu, Shuo Yang, Shuo Wang, Guan Huang, Hakan Bilen, Xinchao Wang, Yang You

Dataset condensation aims at reducing the network training effort through condensing a cumbersome training set into a compact synthetic one.

Dataset Condensation

Reliable Label Correction is a Good Booster When Learning with Extremely Noisy Labels

1 code implementation30 Apr 2022 Kai Wang, Xiangyu Peng, Shuo Yang, Jianfei Yang, Zheng Zhu, Xinchao Wang, Yang You

This paradigm, however, is prone to significant degeneration under heavy label noise, as the number of clean samples is too small for conventional methods to behave well.

Learning with noisy labels

ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training

no code implementations12 May 2022 Yue Zhao, Yantao Shen, Yuanjun Xiong, Shuo Yang, Wei Xia, Zhuowen Tu, Bernt Schiele, Stefano Soatto

We present a method to train a classification system that achieves paragon performance in both error rate and NFR, at the inference cost of a single model.

Entity-aware and Motion-aware Transformers for Language-driven Action Localization in Videos

1 code implementation12 May 2022 Shuo Yang, Xinxiao wu

Language-driven action localization in videos is a challenging task that involves not only visual-linguistic matching but also action boundary prediction.

Action Localization Representation Learning

Dataset Pruning: Reducing Training Data by Examining Generalization Influence

no code implementations19 May 2022 Shuo Yang, Zeke Xie, Hanyu Peng, Min Xu, Mingming Sun, Ping Li

To answer these, we propose dataset pruning, an optimization-based sample selection method that can (1) examine the influence of removing a particular set of training samples on model's generalization ability with theoretical guarantee, and (2) construct the smallest subset of training data that yields strictly constrained generalization gap.

Toward Understanding Privileged Features Distillation in Learning-to-Rank

no code implementations19 Sep 2022 Shuo Yang, Sujay Sanghavi, Holakou Rahmanian, Jan Bakus, S. V. N. Vishwanathan

Such features naturally arise in merchandised recommendation systems; for instance, "user clicked this item" as a feature is predictive of "user purchased this item" in the offline data, but is clearly not available during online serving.

Learning-To-Rank Recommendation Systems

Differentiable Safe Controller Design through Control Barrier Functions

no code implementations20 Sep 2022 Shuo Yang, Shaoru Chen, Victor M. Preciado, Rahul Mangharam

Learning-based controllers, such as neural network (NN) controllers, can show high empirical performance but lack formal safety guarantees.

Towards Regression-Free Neural Networks for Diverse Compute Platforms

no code implementations27 Sep 2022 Rahul Duggal, Hao Zhou, Shuo Yang, Jun Fang, Yuanjun Xiong, Wei Xia

With the shift towards on-device deep learning, ensuring a consistent behavior of an AI service across diverse compute platforms becomes tremendously important.

Neural Architecture Search regression

You Don't Know When I Will Arrive: Unpredictable Controller Synthesis for Temporal Logic Tasks

no code implementations23 Nov 2022 Yu Chen, Shuo Yang, Rahul Mangharam, Xiang Yin

This problem is particularly challenging since future information is involved in the synthesis process.

Robot Task Planning

Learning Imbalanced Data with Vision Transformers

1 code implementation CVPR 2023 Zhengzhuo Xu, Ruikang Liu, Shuo Yang, Zenghao Chai, Chun Yuan

In this paper, we systematically investigate the ViTs' performance in LTR and propose LiVT to train ViTs from scratch only with LT data.

Long-tail Learning

CPMLHO:Hyperparameter Tuning via Cutting Plane and Mixed-Level Optimization

no code implementations11 Dec 2022 Shuo Yang, Yang Jiao, Shaoyu Dou, Mana Zheng, Chen Zhu

The bilevel optimization is used to automatically update the hyperparameter, and the gradient of the hyperparameter is the approximate gradient based on the best response function.

Bilevel Optimization Hyperparameter Optimization

Improving Lens Flare Removal with General-Purpose Pipeline and Multiple Light Sources Recovery

1 code implementation ICCV 2023 Yuyan Zhou, Dong Liang, Songcan Chen, Sheng-Jun Huang, Shuo Yang, Chongyi Li

In this paper, we propose a solution to improve the performance of lens flare removal by revisiting the ISP and remodeling the principle of automatic exposure in the synthesis pipeline and design a more reliable light sources recovery strategy.

Flare Removal Tone Mapping

PPR: Physically Plausible Reconstruction from Monocular Videos

no code implementations ICCV 2023 Gengshan Yang, Shuo Yang, John Z. Zhang, Zachary Manchester, Deva Ramanan

Given monocular videos, we build 3D models of articulated objects and environments whose 3D configurations satisfy dynamics and contact constraints.

Rethink Long-tailed Recognition with Vision Transformers

no code implementations28 Feb 2023 Zhengzhuo Xu, Shuo Yang, Xingjun Wang, Chun Yuan

Hence, we propose to adopt unsupervised learning to utilize long-tailed data.

MEGA-DAgger: Imitation Learning with Multiple Imperfect Experts

no code implementations1 Mar 2023 Xiatao Sun, Shuo Yang, Rahul Mangharam

Imitation learning has been widely applied to various autonomous systems thanks to recent development in interactive algorithms that address covariate shift and compounding errors induced by traditional approaches like behavior cloning.

Imitation Learning

BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity Consistency

1 code implementation CVPR 2023 Shuo Yang, Zhaopan Xu, Kai Wang, Yang You, Hongxun Yao, Tongliang Liu, Min Xu

As one of the most fundamental techniques in multimodal learning, cross-modal matching aims to project various sensory modalities into a shared feature space.

Image-text matching Text Matching

Safe Perception-Based Control under Stochastic Sensor Uncertainty using Conformal Prediction

no code implementations1 Apr 2023 Shuo Yang, George J. Pappas, Rahul Mangharam, Lars Lindemann

However, these perception maps are not perfect and result in state estimation errors that can lead to unsafe system behavior.

Conformal Prediction valid

Building Neural Networks on Matrix Manifolds: A Gyrovector Space Approach

no code implementations8 May 2023 Xuan Son Nguyen, Shuo Yang

Recently, by applying the theory of gyrogroups and gyrovector spaces that is a powerful framework for studying hyperbolic geometry, some works have attempted to build principled generalizations of Euclidean neural networks on matrix manifolds.

Action Recognition Knowledge Graph Completion +1

Large-scale Dataset Pruning with Dynamic Uncertainty

1 code implementation8 Jun 2023 Muyang He, Shuo Yang, Tiejun Huang, Bo Zhao

The state of the art of many learning tasks, e. g., image classification, is advanced by collecting larger datasets and then training larger models on them.

Image Classification

Multi-Agent Reinforcement Learning Guided by Signal Temporal Logic Specifications

no code implementations11 Jun 2023 Jiangwei Wang, Shuo Yang, Ziyan An, Songyang Han, Zhili Zhang, Rahul Mangharam, Meiyi Ma, Fei Miao

The STL requirements are designed to include both task specifications according to the objective of each agent and safety specifications, and the robustness values of the STL specifications are leveraged to generate rewards.

Multi-agent Reinforcement Learning reinforcement-learning

Improving Lens Flare Removal with General Purpose Pipeline and Multiple Light Sources Recovery

1 code implementation31 Aug 2023 Yuyan Zhou, Dong Liang, Songcan Chen, Sheng-Jun Huang, Shuo Yang, Chongyi Li

In this paper, we propose a solution to improve the performance of lens flare removal by revisiting the ISP and remodeling the principle of automatic exposure in the synthesis pipeline and design a more reliable light sources recovery strategy.

Flare Removal Tone Mapping

Learning Adaptive Safety for Multi-Agent Systems

1 code implementation19 Sep 2023 Luigi Berducci, Shuo Yang, Rahul Mangharam, Radu Grosu

Ensuring safety in dynamic multi-agent systems is challenging due to limited information about the other agents.

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

1 code implementation6 Nov 2023 Ying Sheng, Shiyi Cao, Dacheng Li, Coleman Hooper, Nicholas Lee, Shuo Yang, Christopher Chou, Banghua Zhu, Lianmin Zheng, Kurt Keutzer, Joseph E. Gonzalez, Ion Stoica

To capitalize on these opportunities, we present S-LoRA, a system designed for the scalable serving of many LoRA adapters.

Rethinking Benchmark and Contamination for Language Models with Rephrased Samples

1 code implementation8 Nov 2023 Shuo Yang, Wei-Lin Chiang, Lianmin Zheng, Joseph E. Gonzalez, Ion Stoica

Many have raised concerns about the trustworthiness of public benchmarks due to potential contamination in pre-training or fine-tuning datasets.

Not All Negatives Are Worth Attending to: Meta-Bootstrapping Negative Sampling Framework for Link Prediction

no code implementations8 Dec 2023 Yakun Wang, Binbin Hu, Shuo Yang, Meiqi Zhu, Zhiqiang Zhang, Qiyang Zhang, Jun Zhou, Guo Ye, Huimei He

In particular, we elaborately devise a Meta-learning Supported Teacher-student GNN (MST-GNN) that is not only built upon teacher-student architecture for alleviating the migration between "easy" and "hard" samples but also equipped with a meta learning based sample re-weighting module for helping the student GNN distinguish "hard" samples in a fine-grained manner.

Link Prediction Meta-Learning

Learning Local Control Barrier Functions for Safety Control of Hybrid Systems

1 code implementation26 Jan 2024 Shuo Yang, Yu Chen, Xiang Yin, Rahul Mangharam

Our approach is computationally efficient, minimally invasive to any reference controller, and applicable to large-scale systems.

Model Predictive Control

Data-efficient Fine-tuning for LLM-based Recommendation

no code implementations30 Jan 2024 Xinyu Lin, Wenjie Wang, Yongqi Li, Shuo Yang, Fuli Feng, Yinwei Wei, Tat-Seng Chua

To pursue the two objectives, we propose a novel data pruning method based on two scores, i. e., influence score and effort score, to efficiently identify the influential samples.

Peer-review-in-LLMs: Automatic Evaluation Method for LLMs in Open-environment

1 code implementation2 Feb 2024 Kun-Peng Ning, Shuo Yang, Yu-Yang Liu, Jia-Yu Yao, Zhen-Hui Liu, Yu Wang, Ming Pang, Li Yuan

Existing large language models (LLMs) evaluation methods typically focus on testing the performance on some closed-environment and domain-specific benchmarks with human annotations.

Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization

no code implementations28 Feb 2024 Shuo Yang, Gjergji Kasneci

This research significantly reduces training costs of proximal policy-guided models and demonstrates the potential for self-correction of language models.

Language Modelling

MENTOR: Multi-level Self-supervised Learning for Multimodal Recommendation

1 code implementation29 Feb 2024 Jinfeng Xu, Zheyu Chen, Shuo Yang, Jinze Li, Hewei Wang, Edith C. -H. Ngai

It utilizes multimodal information to alleviate the data sparsity problem in recommendation systems, thus improving recommendation accuracy.

Multimodal Recommendation Self-Supervised Learning

Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning

no code implementations2 Mar 2024 Shuo Yang, Zirui Shang, Yongqi Wang, Derong Deng, Hongwei Chen, Qiyuan Cheng, Xinxiao wu

This paper proposes a novel framework for multi-label image recognition without any training data, called data-free framework, which uses knowledge of pre-trained Large Language Model (LLM) to learn prompts to adapt pretrained Vision-Language Model (VLM) like CLIP to multilabel classification.

Language Modelling Large Language Model

Conformal Off-Policy Prediction for Multi-Agent Systems

no code implementations25 Mar 2024 Tom Kuipers, Renukanandan Tumu, Shuo Yang, Milad Kazemi, Rahul Mangharam, Nicola Paoletti

In this work, we introduce MA-COPP, the first conformal prediction method to solve OPP problems involving multi-agent systems, deriving joint prediction regions for all agents' trajectories when one or more "ego" agents change their policies.

Conformal Prediction

Enhancing Content-based Recommendation via Large Language Model

no code implementations30 Mar 2024 Wentao Xu, Qianqian Xie, Shuo Yang, Jiangxia Cao, Shuchao Pang

However, they still neglect the following two points: (1) The content semantic is a universal world knowledge; how do we extract the multi-aspect semantic information to empower different domains?

Language Modelling Large Language Model +1

Cannot find the paper you are looking for? You can Submit a new open access paper.