Search Results for author: Yu Yang

Found 114 papers, 38 papers with code

AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration

no code implementations20 Mar 2025 Andy Zhou, Kevin Wu, Francesco Pinto, Zhaorun Chen, Yi Zeng, Yu Yang, Shuang Yang, Sanmi Koyejo, James Zou, Bo Li

As large language models (LLMs) become increasingly capable, security and safety evaluation are crucial.

Red Teaming

Dynamic Load Balancing for EV Charging Stations Using Reinforcement Learning and Demand Prediction

no code implementations9 Mar 2025 Hesam Mosalli, Saba Sanami, Yu Yang, Hen-Geul Yeh, Amir G. Aghdam

This paper presents a method for load balancing and dynamic pricing in electric vehicle (EV) charging networks, utilizing reinforcement learning (RL) to enhance network performance.

Graph Neural Network Reinforcement Learning (RL)

Demand Forecasting for Electric Vehicle Charging Stations using Multivariate Time-Series Analysis

no code implementations22 Feb 2025 Saba Sanami, Hesam Mosalli, Yu Yang, Hen-Geul Yeh, Amir G. Aghdam

As the number of electric vehicles (EVs) continues to grow, the demand for charging stations is also increasing, leading to challenges such as long wait times and insufficient infrastructure.

Decision Making Demand Forecasting +2

Modeling Behavior Change for Multi-model At-Risk Students Early Prediction (extended version)

no code implementations19 Feb 2025 Jiabei Cheng, Zhen-Qun Yang, Jiannong Cao, Yu Yang, Kai Cheung Franky Poon, Daniel Lai

In the educational domain, identifying students at risk of dropping out is essential for allowing educators to intervene effectively, improving both academic outcomes and overall student well-being.

Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency

no code implementations7 Feb 2025 Qixin Zhang, Zongqi Wan, Yu Yang, Li Shen, DaCheng Tao

Coordinating multiple agents to collaboratively maximize submodular functions in unpredictable environments is a critical task with numerous applications in machine learning, robot planning and control.

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

1 code implementation7 Feb 2025 Yihe Deng, Yu Yang, Junkai Zhang, Wei Wang, Bo Li

While substantial safety data exist in English, multilingual guardrail modeling remains underexplored due to the scarcity of open-source safety data in other languages.

Reinforcement Learning (RL) Synthetic Data Generation

Combinatorial Optimization Perspective based Framework for Multi-behavior Recommendation

1 code implementation4 Feb 2025 Chenhao Zhai, Chang Meng, Yu Yang, Kexin Zhang, Xuhao Zhao, Xiu Li

To address these problems, we propose a novel multi-behavior recommendation framework based on the combinatorial optimization perspective, named COPF.

Combinatorial Optimization Multi-Task Learning +1

Data-Efficient Model for Psychological Resilience Prediction based on Neurological Data

no code implementations3 Feb 2025 Zhi Zhang, Yan Liu, Mengxia Gao, Yu Yang, Jiannong Cao, Wai Kai Hou, Shirley Li, Sonata Yau, Yun Kwok Wing, Tatia M. C. Lee

In the test stage, a new noise-informed inference algorithm is proposed to address the low signal-to-noise ratio of the neurological data.

Kolmogorov-Arnold Networks

Pre-train and Fine-tune: Recommenders as Large Models

no code implementations24 Jan 2025 Zhenhao Jiang, Chenghao Chen, Hao Feng, Yu Yang, Jin Liu, Jie Zhang, Jia Jia, Ning Hu

We first propose the theory of the information bottleneck for fine-tuning and present an explanation for the fine-tuning technique in recommenders.

Recommendation Systems

Collaborative Imputation of Urban Time Series through Cross-city Meta-learning

no code implementations20 Jan 2025 Tong Nie, Wei Ma, Jian Sun, Yu Yang, Jiannong Cao

We then introduce a cross-city collaborative learning scheme through model-agnostic meta learning, incorporating hierarchical modulation and normalization techniques to accommodate multiscale representations and reduce variance in response to heterogeneity.

Imputation Meta-Learning +1

AllRestorer: All-in-One Transformer for Image Restoration under Composite Degradations

no code implementations16 Nov 2024 Jiawei Mao, Yu Yang, Xuesong Yin, Ling Shao, Hao Tang

Specifically, we introduce an All-in-One Transformer Block (AiOTB), which adaptively removes all degradations present in a given image by modeling the relationships between all degradations and the image embedding in latent space.

All Image Restoration

Mixture of Knowledge Minigraph Agents for Literature Review Generation

no code implementations9 Nov 2024 Zhi Zhang, Yan Liu, Sheng-hua Zhong, Gong Chen, Yu Yang, Jiannong Cao

A novel prompt-based algorithm, the knowledge minigraph construction agent (KMCA), is designed to identify relations between concepts from academic literature and automatically constructs knowledge minigraphs.

Review Generation

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

1 code implementation4 Nov 2024 Zehan Qi, Xiao Liu, Iat Long Iong, Hanyu Lai, Xueqiao Sun, Wenyi Zhao, Yu Yang, Xinyue Yang, Jiadai Sun, Shuntian Yao, Tianjie Zhang, Wei Xu, Jie Tang, Yuxiao Dong

Specifically, WebRL incorporates 1) a self-evolving curriculum that generates new tasks from unsuccessful attempts, 2) a robust outcome-supervised reward model (ORM), and 3) adaptive reinforcement learning strategies to ensure consistent improvements.

Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning

no code implementations30 Oct 2024 Ruhan Wang, Yu Yang, Zhishuai Liu, Dongruo Zhou, Pan Xu

Previous works tackle the dynamics shift problem by augmenting the reward in the trajectory from the source domain to match the optimal trajectory in the target domain.

D4RL reinforcement-learning +2

Personality Analysis from Online Short Video Platforms with Multi-domain Adaptation

1 code implementation26 Oct 2024 Sixu An, Xiangguo Sun, Yicong Li, Yu Yang, Guandong Xu

Personality analysis from online short videos has gained prominence due to its applications in personalized recommendation systems, sentiment analysis, and human-computer interaction.

Domain Adaptation Recommendation Systems +1

FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs

no code implementations22 Oct 2024 Haoran Lin, Xianzhi Yu, Kang Zhao, Lu Hou, Zongyuan Zhan, Stanislav Kamenev, Han Bao, Ting Hu, Mingkai Wang, Qixin Chang, Siyue Sui, Weihao Sun, Jiaxin Hu, Jun Yao, Zekun Yin, Cheng Qian, Ying Zhang, Yinfei Pan, Yu Yang, Weiguo Liu

In this work, we propose FastAttention which pioneers the adaptation of FlashAttention series for NPUs and low-resource GPUs to boost LLM inference efficiency.

Graph Neural Patching for Cold-Start Recommendations

no code implementations18 Oct 2024 Hao Chen, Yu Yang, Yuanchen Bei, Zefan Wang, Yue Xu, Feiran Huang

To this end, we introduce Graph Neural Patching for Cold-Start Recommendations (GNP), a customized GNN framework with dual functionalities: GWarmer for modeling collaborative signal on existing warm users/items and Patching Networks for simulating and enhancing GWarmer's performance on cold-start recommendations.

Recommendation Systems

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

no code implementations14 Oct 2024 Yu Yang, Yuzhou Nie, Zhun Wang, Yuheng Tang, Wenbo Guo, Bo Li, Dawn Song

Our methodology ensures the data quality while enabling large-scale generation.

Language-centered Human Activity Recognition

no code implementations12 Sep 2024 Hua Yan, Heng Tan, Yi Ding, Pengfei Zhou, Vinod Namboodiri, Yu Yang

To address this, we propose LanHAR, a novel system that leverages Large Language Models (LLMs) to generate semantic interpretations of sensor readings and activity labels for cross-dataset HAR.

Human Activity Recognition

DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes

1 code implementation6 Sep 2024 Jianbiao Mei, Xuemeng Yang, Licheng Wen, Tao Hu, Yu Yang, Tiantian Wei, Yukai Ma, Min Dou, Botian Shi, Yong liu

Recent advances in diffusion models have improved controllable streetscape generation and supported downstream perception and planning tasks.

Video Generation

DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries

no code implementations28 Aug 2024 Yu Yang, Jianbiao Mei, Liang Liu, Siliang Du, Yilin Xiao, Jongwon Ra, Yong liu, Xiao Xu, Huifeng Wu

To this end, we propose a novel framework dubbed DQFormer to implement semantic and instance segmentation in a unified workflow.

Decoder Instance Segmentation +2

Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving

no code implementations26 Aug 2024 Yu Yang, Jianbiao Mei, Yukai Ma, Siliang Du, Wenqing Chen, Yijie Qian, Yuxiang Feng, Yong liu

Unlike the aforementioned prior works, we propose Drive-OccWorld, which adapts a vision-centric 4D forecasting world model to end-to-end planning for autonomous driving.

Autonomous Driving Decoder

An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation

no code implementations15 Aug 2024 Jun Wang, Likang Wu, Qi Liu, Yu Yang

However, previous studies mainly focus on discrete action and policy spaces, which might have difficulties in handling dramatically growing items efficiently.

continuous-control Continuous Control +1

Experimental evaluation of offline reinforcement learning for HVAC control in buildings

1 code implementation15 Aug 2024 Jun Wang, Linyan Li, Qi Liu, Yu Yang

In summary, this paper presents our well-structured investigations and new findings when applying offline reinforcement learning to building HVAC systems.

Offline RL Reinforcement Learning (RL)

HMDN: Hierarchical Multi-Distribution Network for Click-Through Rate Prediction

no code implementations2 Aug 2024 Xingyu Lou, Yu Yang, Kuiyao Dong, Heyuan Huang, Wenyi Yu, Ping Wang, Xiu Li, Jun Wang

As the recommendation service needs to address increasingly diverse distributions, such as multi-population, multi-scenario, multitarget, and multi-interest, more and more recent works have focused on multi-distribution modeling and achieved great progress.

Click-Through Rate Prediction Mixture-of-Experts +1

Mini-batch Coresets for Memory-efficient Training of Large Language Models

no code implementations28 Jul 2024 Dang Nguyen, Wenhan Yang, Rathul Anand, Yu Yang, Baharan Mirzasoleiman

However, this approach becomes infeasible and ineffective for LLMs, due to the highly imbalanced nature of the sources in language data, use of the Adam optimizer, and the very large gradient dimensionality of LLMs.

Network Pruning

AIR-Bench 2024: A Safety Benchmark Based on Risk Categories from Regulations and Policies

no code implementations11 Jul 2024 Yi Zeng, Yu Yang, Andy Zhou, Jeffrey Ziwei Tan, Yuheng Tu, Yifan Mai, Kevin Klyman, Minzhou Pan, Ruoxi Jia, Dawn Song, Percy Liang, Bo Li

However, existing public benchmarks often define safety categories based on previous literature, intuitions, or common sense, leading to disjointed sets of categories for risks specified in recent regulations and policies, which makes it challenging to evaluate and compare FMs across these benchmarks.

Common Sense Reasoning

Brevity is the soul of wit: Pruning long files for code generation

no code implementations29 Jun 2024 Aaditya K. Singh, Yu Yang, Kushal Tirumala, Mostafa Elhoushi, Ari S. Morcos

Specifically, many have shown that de-duplicating data, or sub-selecting higher quality data, can lead to efficiency or performance improvements.

Code Generation HumanEval +1

Self-assessment, Exhibition, and Recognition: a Review of Personality in Large Language Models

no code implementations25 Jun 2024 Zhiyuan Wen, Yu Yang, Jiannong Cao, Haoming Sun, Ruosong Yang, Shuaiqi Liu

By presenting a clear taxonomy, in-depth analysis, promising future directions, and extensive resource collections, we aim to provide a better understanding and facilitate further advancements in this emerging field.

AI Risk Categorization Decoded (AIR 2024): From Government Regulations to Corporate Policies

no code implementations25 Jun 2024 Yi Zeng, Kevin Klyman, Andy Zhou, Yu Yang, Minzhou Pan, Ruoxi Jia, Dawn Song, Percy Liang, Bo Li

We present a comprehensive AI risk taxonomy derived from eight government policies from the European Union, United States, and China and 16 company policies worldwide, making a significant step towards establishing a unified language for generative AI safety evaluation.

A Benchmark Study of Deep-RL Methods for Maximum Coverage Problems over Graphs

1 code implementation20 Jun 2024 Zhicheng Liang, Yu Yang, Xiangyu Ke, Xiaokui Xiao, Yunjun Gao

Our benchmark study sheds light on potential challenges in current deep reinforcement learning research for solving combinatorial optimization problems.

Combinatorial Optimization Deep Reinforcement Learning +1

Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing

no code implementations20 Jun 2024 Xinbo Zhao, Yingxue Zhang, Xin Zhang, Yu Yang, Yiqun Xie, Yanhua Li, Jun Luo

MODA addresses the challenges of data scarcity and heterogeneity in a multi-task urban setting through Contrastive Data Sharing among tasks.

Autonomous Driving Data Augmentation +5

Toward Structure Fairness in Dynamic Graph Embedding: A Trend-aware Dual Debiasing Approach

1 code implementation19 Jun 2024 Yicong Li, Yu Yang, Jiannong Cao, Shuaiqi Liu, Haoran Tang, Guandong Xu

We first identify biased structural evolutions in a dynamic graph based on the evolving trend of vertex degree and then propose FairDGE, the first structurally Fair Dynamic Graph Embedding algorithm.

Dynamic graph embedding Fairness +1

More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling

1 code implementation18 Jun 2024 Haque Ishfaq, Yixin Tan, Yu Yang, Qingfeng Lan, Jianfeng Lu, A. Rupam Mahmood, Doina Precup, Pan Xu

Empirically, we show that in tasks where deep exploration is necessary, our proposed algorithms that combine FGTS and approximate sampling perform significantly better compared to other strong baselines.

reinforcement-learning Reinforcement Learning +2

GLINT-RU: Gated Lightweight Intelligent Recurrent Units for Sequential Recommender Systems

no code implementations6 Jun 2024 Sheng Zhang, Maolin Wang, Wanyu Wang, Jingtong Gao, Xiangyu Zhao, Yu Yang, Xuetao Wei, Zitao Liu, Tong Xu

Meanwhile, existing efficient SRS approaches struggle to embed high-quality semantic and positional information into latent representations.

Sequential Recommendation

LLaMA-Reg: Using LLaMA 2 for Unsupervised Medical Image Registration

no code implementations29 May 2024 Mingrui Ma, Yu Yang

We find that using the pretrained large language model to encode deep features of the medical images in the registration model can effectively improve image registration accuracy, indicating the great potential of the large language model in medical image registration tasks.

Decoder Image Registration +5

Dual-Camera Smooth Zoom on Mobile Phones

1 code implementation7 Apr 2024 Renlong Wu, Zhilu Zhang, Yu Yang, WangMeng Zuo

In this work, we introduce a new task, ie, dual-camera smooth zoom (DCSZ) to achieve a smooth zoom preview.

Affective-NLI: Towards Accurate and Interpretable Personality Recognition in Conversation

1 code implementation3 Apr 2024 Zhiyuan Wen, Jiannong Cao, Yu Yang, Ruosong Yang, Shuaiqi Liu

To utilize affectivity within dialog content for accurate personality recognition, we fine-tuned a pre-trained language model specifically for emotion recognition in conversations, facilitating real-time affective annotations for utterances.

Emotion Recognition Language Modelling +2

SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models

1 code implementation12 Mar 2024 Yu Yang, Siddhartha Mishra, Jeffrey N Chiang, Baharan Mirzasoleiman

In clinical text summarization on the MIMIC-III dataset (Johnson et al., 2016), S2L again outperforms training on the full dataset using only 50% of the data.

Math Mathematical Problem-Solving +1

Heterogeneity-aware Cross-school Electives Recommendation: a Hybrid Federated Approach

no code implementations19 Feb 2024 Chengyi Ju, Jiannong Cao, Yu Yang, Zhen-Qun Yang, Ho Man Lee

In response, we propose HFRec, a heterogeneity-aware hybrid federated recommender system designed for cross-school elective course recommendations.

Diversity Recommendation Systems

Boosting Gradient Ascent for Continuous DR-submodular Maximization

no code implementations16 Jan 2024 Qixin Zhang, Zongqi Wan, Zengde Deng, Zaiyi Chen, Xiaoming Sun, Jialin Zhang, Yu Yang

The fundamental idea of our boosting technique is to exploit non-oblivious search to derive a novel auxiliary function $F$, whose stationary points are excellent approximations to the global maximum of the original DR-submodular objective $f$.

Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations

no code implementations18 Dec 2023 Zilin Wang, Haolin Zhuang, Lu Li, Yinmin Zhang, Junjie Zhong, Jun Chen, Yu Yang, Boshi Tang, Zhiyong Wu

This paper presents an Exploratory 3D Dance generation framework, E3D2, designed to address the exploration capability deficiency in existing music-conditioned 3D dance generation models.

Camera-based 3D Semantic Scene Completion with Sparse Guidance Network

1 code implementation10 Dec 2023 Jianbiao Mei, Yu Yang, Mengmeng Wang, Junyu Zhu, Jongwon Ra, Yukai Ma, Laijian Li, Yong liu

In this paper, we adopt the dense-sparse-dense design and propose a one-stage camera-based SSC framework, termed SGN, to propagate semantics from the semantic-aware seed voxels to the whole scene based on spatial geometry cues.

3D Semantic Scene Completion Autonomous Driving +2

Moving Sampling Physics-informed Neural Networks induced by Moving Mesh PDE

1 code implementation14 Nov 2023 Yu Yang, Qihong Yang, Yangtao Deng, Qiaolin He

In this work, we propose an end-to-end adaptive sampling neural network (MMPDE-Net) based on the moving mesh method, which can adaptively generate new sampling points by solving the moving mesh PDE.

Training A Multi-stage Deep Classifier with Feedback Signals

no code implementations12 Nov 2023 Chao Xu, Yu Yang, Rongzhao Wang, Guan Wang, Bojia Lin

Multi-Stage Classifier (MSC) - several classifiers working sequentially in an arranged order and classification decision is partially made at each step - is widely used in industrial applications for various resource limitation reasons.

Binary Classification

Boosting Summarization with Normalizing Flows and Aggressive Training

1 code implementation1 Nov 2023 Yu Yang, Xiaotong Shen

This paper presents FlowSUM, a normalizing flows-based variational encoder-decoder framework for Transformer-based summarization.

Decoder Knowledge Distillation +1

Bayes-enhanced Multi-view Attention Networks for Robust POI Recommendation

no code implementations1 Nov 2023 Jiangnan Xia, Yu Yang, Senzhang Wang, Hongzhi Yin, Jiannong Cao, Philip S. Yu

To this end, we investigate a novel problem of robust POI recommendation by considering the uncertainty factors of the user check-ins, and proposes a Bayes-enhanced Multi-view Attention Network.

Data Augmentation Representation Learning

Optimal Batched Best Arm Identification

no code implementations21 Oct 2023 Tianyuan Jin, Yu Yang, Jing Tang, Xiaokui Xiao, Pan Xu

Based on Tri-BBAI, we further propose the almost optimal batched best arm identification (Opt-BBAI) algorithm, which is the first algorithm that achieves the near-optimal sample and batch complexity in the non-asymptotic setting (i. e., $\delta>0$ is arbitrarily fixed), while enjoying the same batch and sample complexity as Tri-BBAI when $\delta$ tends to zero.

Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality

no code implementations10 Oct 2023 Xuxi Chen, Yu Yang, Zhangyang Wang, Baharan Mirzasoleiman

Dataset distillation aims to minimize the time and memory needed for training deep networks on large datasets, by creating a small set of synthetic images that has a similar generalization performance to that of the full dataset.

Dataset Distillation

On the Stability of Expressive Positional Encodings for Graphs

2 code implementations4 Oct 2023 Yinan Huang, William Lu, Joshua Robinson, Yu Yang, Muhan Zhang, Stefanie Jegelka, Pan Li

Despite many attempts to address non-uniqueness, most methods overlook stability, leading to poor generalization on unseen graph structures.

Molecular Property Prediction Out-of-Distribution Generalization +1

Sieve: Multimodal Dataset Pruning Using Image Captioning Models

1 code implementation CVPR 2024 Anas Mahmoud, Mostafa Elhoushi, Amro Abbas, Yu Yang, Newsha Ardalani, Hugh Leather, Ari Morcos

We propose a pruning signal, Sieve, that employs synthetic captions generated by image-captioning models pretrained on small, diverse, and well-aligned image-text pairs to evaluate the alignment of noisy image-text pairs.

Diversity Image Captioning +2

Non-Uniform Sampling Reconstruction for Symmetrical NMR Spectroscopy by Exploiting Inherent Symmetry

no code implementations24 Sep 2023 Enping Lin, Ze Fang, Yuqing Huang, Yu Yang, Zhong Chen

Symmetrical NMR spectroscopy constitutes a vital branch of multidimensional NMR spectroscopy, providing a powerful tool for the structural elucidation of biological macromolecules.

compressed sensing

Parallel Knowledge Enhancement based Framework for Multi-behavior Recommendation

2 code implementations9 Aug 2023 Chang Meng, Chenhao Zhai, Yu Yang, Hengyu Zhang, Xiu Li

In the fusion step, advanced neural networks are used to model the hierarchical correlations between user behaviors.

Multi-Task Learning Prediction

SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion

1 code implementation27 Jun 2023 Jianbiao Mei, Yu Yang, Mengmeng Wang, Tianxin Huang, Xuemeng Yang, Yong liu

However, how to effectively exploit the relationships between the semantic context in semantic segmentation and geometric structure in scene completion remains under exploration.

Autonomous Driving Scene Understanding +1

PANet: LiDAR Panoptic Segmentation with Sparse Instance Proposal and Aggregation

1 code implementation27 Jun 2023 Jianbiao Mei, Yu Yang, Mengmeng Wang, Xiaojun Hou, Laijian Li, Yong liu

Firstly, we propose a non-learning Sparse Instance Proposal (SIP) module with the ``sampling-shifting-grouping" scheme to directly group thing points into instances from the raw point cloud efficiently.

Autonomous Driving Instance Segmentation +2

Challenges and Opportunities in Improving Worst-Group Generalization in Presence of Spurious Features

1 code implementation21 Jun 2023 Siddharth Joshi, Yu Yang, Yihao Xue, Wenhan Yang, Baharan Mirzasoleiman

Through this, we highlight how existing group inference methods struggle in the presence of spurious features that are learned later in training.

Benchmarking Model Selection

Towards Sustainable Learning: Coresets for Data-efficient Deep Learning

1 code implementation2 Jun 2023 Yu Yang, Hao Kang, Baharan Mirzasoleiman

To improve the efficiency and sustainability of learning deep models, we propose CREST, the first scalable framework with rigorous theoretical guarantees to identify the most valuable examples for training non-convex models, particularly deep networks.

Deep Learning

Identifying Spurious Biases Early in Training through the Lens of Simplicity Bias

no code implementations30 May 2023 Yu Yang, Eric Gan, Gintare Karolina Dziugaite, Baharan Mirzasoleiman

In this work, we provide the first theoretical analysis of the effect of simplicity bias on learning spurious correlations.

Inductive Bias

Few-shot Adaptation to Distribution Shifts By Mixing Source and Target Embeddings

no code implementations23 May 2023 Yihao Xue, Ali Payani, Yu Yang, Baharan Mirzasoleiman

Pretrained machine learning models need to be adapted to distribution shifts when deployed in new target environments.

Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning

no code implementations8 Apr 2023 Yu Yang, Besmira Nushi, Hamid Palangi, Baharan Mirzasoleiman

Spurious correlations that degrade model generalization or lead the model to be right for the wrong reasons are one of the main robustness concerns for real-world deployments.

Attribute

CleanCLIP: Mitigating Data Poisoning Attacks in Multimodal Contrastive Learning

1 code implementation ICCV 2023 Hritik Bansal, Nishad Singhi, Yu Yang, Fan Yin, Aditya Grover, Kai-Wei Chang

Multimodal contrastive pretraining has been used to train multimodal representation models, such as CLIP, on large amounts of paired image-text data.

Backdoor Attack Contrastive Learning +1

Battery Valuation and Management for Battery Swapping Station with an Intertemporal Framework

no code implementations28 Feb 2023 Xinjiang Chen, Yu Yang, Jianxiao Wang, Jie Song, Guannan He

Battery swapping as a business model for battery energy storage (BES) has great potential in future integrated low-carbon energy and transportation systems.

Management

Machine Learning for Smart and Energy-Efficient Buildings

no code implementations27 Nov 2022 Hari Prasanna Das, Yu-Wen Lin, Utkarsha Agwan, Lucas Spangher, Alex Devonport, Yu Yang, Jan Drgona, Adrian Chong, Stefano Schiavon, Costas J. Spanos

In this work, we review the ways in which machine learning has been leveraged to make buildings smart and energy-efficient.

ILSGAN: Independent Layer Synthesis for Unsupervised Foreground-Background Segmentation

1 code implementation25 Nov 2022 Qiran Zou, Yu Yang, Wing Yin Cheung, Chang Liu, Xiangyang Ji

Unsupervised foreground-background segmentation aims at extracting salient objects from cluttered backgrounds, where Generative Adversarial Network (GAN) approaches, especially layered GANs, show great promise.

Generative Adversarial Network Image Generation +4

Distilling Representations from GAN Generator via Squeeze and Span

1 code implementation6 Nov 2022 Yu Yang, Xiaotian Cheng, Chang Liu, Hakan Bilen, Xiangyang Ji

In recent years, generative adversarial networks (GANs) have been an actively studied topic and shown to successfully produce high-quality realistic images in various domains.

Representation Learning

Learning to Annotate Part Segmentation with Gradient Matching

1 code implementation ICLR 2022 Yu Yang, Xiaotian Cheng, Hakan Bilen, Xiangyang Ji

The success of state-of-the-art deep neural networks heavily relies on the presence of large-scale labelled datasets, which are extremely expensive and time-consuming to annotate.

Segmentation

Local Manifold Augmentation for Multiview Semantic Consistency

no code implementations5 Nov 2022 Yu Yang, Wing Yin Cheung, Chang Liu, Xiangyang Ji

Multiview self-supervised representation learning roots in exploring semantic consistency across data of complex intra-class variation.

Representation Learning Self-Supervised Learning

Not All Poisons are Created Equal: Robust Training against Data Poisoning

2 code implementations18 Oct 2022 Yu Yang, Tian Yu Liu, Baharan Mirzasoleiman

Data poisoning causes misclassification of test time target examples by injecting maliciously crafted samples in the training data.

All Data Poisoning

Neural Networks Based on Power Method and Inverse Power Method for Solving Linear Eigenvalue Problems

1 code implementation22 Sep 2022 Qihong Yang, Yangtao Deng, Yu Yang, Qiaolin He, Shiquan Zhang

In this article, we propose two kinds of neural networks inspired by power method and inverse power method to solve linear eigenvalue problems.

Sequence-to-Set Generative Models

1 code implementation19 Sep 2022 Longtao Tang, Ying Zhou, Yu Yang

We present GRU2Set, which is an instance of our sequence-to-set method and employs the famous GRU model as the sequence generative model.

Deep Feature Selection for Anomaly Detection Based on Pretrained Network and Gaussian Discriminative Analysis

1 code implementation IEEE Open Journal of Instrumentation and Measurement (Volume: 1) 2022 Jie Lin, Song Chen, Enping Lin, Yu Yang

Deep learning neural network serves as a powerful tool for visual anomaly detection (AD) and fault diagnosis, attributed to its strong abstractive interpretation ability in the representation domain.

Anomaly Detection Fault Diagnosis +1

Communication-Efficient Decentralized Online Continuous DR-Submodular Maximization

no code implementations18 Aug 2022 Qixin Zhang, Zengde Deng, Xiangru Jian, Zaiyi Chen, Haoyuan Hu, Yu Yang

Maximizing a monotone submodular function is a fundamental task in machine learning, economics, and statistics.

Online Learning for Non-monotone Submodular Maximization: From Full Information to Bandit Feedback

no code implementations16 Aug 2022 Qixin Zhang, Zengde Deng, Zaiyi Chen, Kuangqi Zhou, Haoyuan Hu, Yu Yang

In this paper, we revisit the online non-monotone continuous DR-submodular maximization problem over a down-closed convex set, which finds wide real-world applications in the domain of machine learning, economics, and operations research.

Friendly Noise against Adversarial Noise: A Powerful Defense against Data Poisoning Attacks

1 code implementation14 Aug 2022 Tian Yu Liu, Yu Yang, Baharan Mirzasoleiman

We make the key observation that attacks introduce local sharp regions of high training loss, which when minimized, results in learning the adversarial perturbations and makes the attack successful.

Data Poisoning

Towards Better Dermoscopic Image Feature Representation Learning for Melanoma Classification

1 code implementation15 Jul 2022 Chenghui Yu, Mingkang Tang, ShengGe Yang, Mingqing Wang, Zhe Xu, Jiangpeng Yan, HanMo Chen, Yu Yang, Xiao-jun Zeng, Xiu Li

Deep learning-based melanoma classification with dermoscopic images has recently shown great potential in automatic early-stage melanoma diagnosis.

Data Augmentation Denoising +2

Time-aware Dynamic Graph Embedding for Asynchronous Structural Evolution

no code implementations1 Jul 2022 Yu Yang, Hongzhi Yin, Jiannong Cao, Tong Chen, Quoc Viet Hung Nguyen, Xiaofang Zhou, Lei Chen

Meanwhile, we treat each edge sequence as a whole and embed its ToV of the first vertex to further encode the time-sensitive information.

Dynamic graph embedding Graph Mining

MNL-Bandits under Inventory and Limited Switches Constraints

no code implementations22 Apr 2022 Hongbin Zhang, Yu Yang, Feng Wu, Qixin Zhang

Optimizing the assortment of products to display to customers is a key to increasing revenue for both offline and online retailers.

Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention

1 code implementation CVPR 2022 Yu Yang, Seungbae Kim, Jungseock Joo

We also demonstrate a novel application of our method for unsupervised dataset bias analysis which allows us to automatically discover hidden biases in datasets or compare different subsets without using additional labels.

Feature Construction and Selection for PV Solar Power Modeling

no code implementations13 Feb 2022 Yu Yang, Jia Mao, Richard Nguyen, Annas Tohmeh, Hen-Geul Yeh

A machine learning framework for 1-hour ahead solar power prediction is developed in this paper based on the historical data.

BIG-bench Machine Learning feature selection +2

An Exact Method for the Daily Package Shipment Problem with Outsourcing

no code implementations8 Feb 2022 Zhuolin Wang, Rongping Zhu, Jian-Ya Ding, Yu Yang, Keyou You

The package shipment problem requires to optimally co-design paths for both packages and a heterogeneous fleet in a transit center network (TCN).

Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

1 code implementation24 Jan 2022 Bo Li, Qiulin Wang, JiQuan Pei, Yu Yang, Xiangyang Ji

First, we propose a novel approach to disentangle latent subspace semantics by exploiting existing face analysis models, e. g., face parsers and face landmark detectors.

counterfactual Counterfactual Explanation +3

Stochastic Continuous Submodular Maximization: Boosting via Non-oblivious Function

no code implementations3 Jan 2022 Qixin Zhang, Zengde Deng, Zaiyi Chen, Haoyuan Hu, Yu Yang

In the online setting, for the first time we consider the adversarial delays for stochastic gradient feedback, under which we propose a boosting online gradient algorithm with the same non-oblivious function $F$.

Towards Transactive Energy: An Analysis of Information-related Practical Issues

no code implementations28 Dec 2021 Yue Chen, Yu Yang, Xiaoyuan Xu

The development of distributed energy resources, such as rooftop photovoltaic (PV) panels, batteries, and electric vehicles (EVs), has decentralized our power system operation, where transactive energy markets empower local energy exchanges.

Privacy Preserving

Value Activation for Bias Alleviation: Generalized-activated Deep Double Deterministic Policy Gradients

1 code implementation21 Dec 2021 Jiafei Lyu, Yu Yang, Jiangpeng Yan, Xiu Li

It is vital to accurately estimate the value function in Deep Reinforcement Learning (DRL) such that the agent could execute proper actions instead of suboptimal ones.

continuous-control Continuous Control +1

Improving Cooperative Game Theory-based Data Valuation via Data Utility Learning

1 code implementation13 Jul 2021 Tianhao Wang, Yu Yang, Ruoxi Jia

The Shapley value (SV) and Least core (LC) are classic methods in cooperative game theory for cost/profit sharing problems.

Active Learning Data Valuation

Language Scaling for Universal Suggested Replies Model

no code implementations NAACL 2021 Qianlan Ying, Payal Bajaj, Budhaditya Deb, Yu Yang, Wei Wang, Bojia Lin, Milad Shokouhi, Xia Song, Yang Yang, Daxin Jiang

Faced with increased compute requirements and low resources for language expansion, we build a single universal model for improving the quality and reducing run-time costs of our production system.

Continual Learning Cross-Lingual Transfer +1

A Dimension-Insensitive Algorithm for Stochastic Zeroth-Order Optimization

no code implementations22 Apr 2021 Hongcheng Liu, Yu Yang

This paper concerns a convex, stochastic zeroth-order optimization (S-ZOO) problem.

Learning Foreground-Background Segmentation from Improved Layered GANs

no code implementations1 Apr 2021 Yu Yang, Hakan Bilen, Qiran Zou, Wing Yin Cheung, Xiangyang Ji

Deep learning approaches heavily rely on high-quality human supervision which is nonetheless expensive, time-consuming, and error-prone, especially for image segmentation task.

Generative Adversarial Network Image Segmentation +3

Comprehensible Counterfactual Explanation on Kolmogorov-Smirnov Test

no code implementations1 Nov 2020 Zicun Cong, Lingyang Chu, Yu Yang, Jian Pei

One challenge remained untouched is how we can obtain an explanation on why a test set fails the KS test.

Anomaly Detection Astronomy +2

Optimal Sharing and Fair Cost Allocation of Community Energy Storage

no code implementations29 Oct 2020 Yu Yang, Guoqiang Hu, Costas J. Spanos

Further, we demonstrate both the building-wise and community-wise economic benefits are enhanced with the ES sharing model over the individual ES (IES) model.

Fairness Computer Science and Game Theory

EPARS: Early Prediction of At-risk Students with Online and Offline Learning Behaviors

no code implementations6 Jun 2020 Yu Yang, Zhiyuan Wen, Jiannong Cao, Jiaxing Shen, Hongzhi Yin, Xiaofang Zhou

We propose a novel algorithm (EPARS) that could early predict STAR in a semester by modeling online and offline learning behaviors.

Management Network Embedding

A Generative Model for Sampling High-Performance and Diverse Weights for Neural Networks

no code implementations7 May 2019 Lior Deutsch, Erik Nijkamp, Yu Yang

Recent work on mode connectivity in the loss landscape of deep neural networks has demonstrated that the locus of (sub-)optimal weight vectors lies on continuous paths.

Computational Efficiency Diversity

Unsupervised Learning of Neural Networks to Explain Neural Networks (extended abstract)

no code implementations21 Jan 2019 Quanshi Zhang, Yu Yang, Ying Nian Wu

This paper presents an unsupervised method to learn a neural network, namely an explainer, to interpret a pre-trained convolutional neural network (CNN), i. e., the explainer uses interpretable visual concepts to explain features in middle conv-layers of a CNN.

Knowledge Distillation Object

Network Transplanting (extended abstract)

no code implementations21 Jan 2019 Quanshi Zhang, Yu Yang, Qian Yu, Ying Nian Wu

This paper focuses on a new task, i. e., transplanting a category-and-task-specific neural network to a generic, modular network without strong supervision.

Explaining AlphaGo: Interpreting Contextual Effects in Neural Networks

no code implementations8 Jan 2019 Zenan Ling, Haotian Ma, Yu Yang, Robert C. Qiu, Song-Chun Zhu, Quanshi Zhang

In this paper, we propose to disentangle and interpret contextual effects that are encoded in a pre-trained deep neural network.

Unsupervised Learning of Neural Networks to Explain Neural Networks

no code implementations18 May 2018 Quanshi Zhang, Yu Yang, Yuchen Liu, Ying Nian Wu, Song-Chun Zhu

Given feature maps of a certain conv-layer of the CNN, the explainer performs like an auto-encoder, which first disentangles the feature maps into object-part features and then inverts object-part features back to features of higher conv-layers of the CNN.

Disentanglement Object

Network Transplanting

no code implementations26 Apr 2018 Quanshi Zhang, Yu Yang, Qian Yu, Ying Nian Wu

This paper focuses on a new task, i. e., transplanting a category-and-task-specific neural network to a generic, modular network without strong supervision.

Dynamic Filtering with Large Sampling Field for ConvNets

no code implementations ECCV 2018 Jialin Wu, Dai Li, Yu Yang, Chandrajit Bajaj, Xiangyang Ji

We propose a dynamic filtering strategy with large sampling field for ConvNets (LS-DFN), where the position-specific kernels learn from not only the identical position but also multiple sampled neighbor regions.

object-detection Object Detection +3

Accelerating E-Commerce Search Engine Ranking by Contextual Factor Selection

no code implementations14 Mar 2018 Zhan Yusen, Da Qing, Xiao Fei, Zeng An-xiang, Yu Yang

Solving the problem by reinforcement learning, we propose the RankCFS, which has been assessed in an off-line environment as well as a real-world on-line environment (Taobao. com).

Combinatorial Optimization Decision Making +2

Interpreting CNNs via Decision Trees

no code implementations CVPR 2019 Quanshi Zhang, Yu Yang, Haotian Ma, Ying Nian Wu

We propose to learn a decision tree, which clarifies the specific reason for each prediction made by the CNN at the semantic level.

Object Prediction

Finding Theme Communities from Database Networks

no code implementations23 Sep 2017 Lingyang Chu, Zhefeng Wang, Jian Pei, Yanyan Zhang, Yu Yang, Enhong Chen

Given a database network where each vertex is associated with a transaction database, we are interested in finding theme communities.

PIEFA: Personalized Incremental and Ensemble Face Alignment

no code implementations ICCV 2015 Xi Peng, Shaoting Zhang, Yu Yang, Dimitris N. Metaxas

Face alignment, especially on real-time or large-scale sequential images, is a challenging task with broad applications.

Face Alignment Incremental Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.