Search Results for author: Junge Zhang

Found 44 papers, 24 papers with code

Counter-Contrastive Learning for Language GANs

no code implementations Findings (EMNLP) 2021 Yekun Chai, Haidong Zhang, Qiyue Yin, Junge Zhang

Generative Adversarial Networks (GANs) have achieved great success in image synthesis, but have proven to be difficult to generate natural language.

Contrastive Learning Image Generation

Rethinking Generalizability and Discriminability of Self-Supervised Learning from Evolutionary Game Theory Perspective

1 code implementation30 Nov 2024 Jiangmeng Li, Zehua Zang, Qirui Ji, Chuxiong Sun, Wenwen Qiang, Junge Zhang, Changwen Zheng, Fuchun Sun, Hui Xiong

Thus, to enhance the methodological generalization, we propose a novel self-supervised learning method that leverages advancements in reinforcement learning to jointly benefit from the general guidance of EGT and sequentially optimize the model to chase the consistent improvement of generalizability and discriminability for specific target domains during pre-training.

Self-Supervised Learning

Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown

no code implementations1 Oct 2024 Xingzhou Lou, Dong Yan, Wei Shen, Yuzi Yan, Jian Xie, Junge Zhang

Reward models (RM) play a critical role in aligning generations of large language models (LLM) to human expectations.

Uncertainty Quantification

Recent Advances in Attack and Defense Approaches of Large Language Models

no code implementations5 Sep 2024 Jing Cui, Yishi Xu, Zhewei Huang, Shuchang Zhou, Jianbin Jiao, Junge Zhang

Given the extensive research in the field of LLM security, we believe that summarizing the current state of affairs will help the research community better understand the present landscape and inform future developments.

Position: Foundation Agents as the Paradigm Shift for Decision Making

1 code implementation27 May 2024 Xiaoqian Liu, Xingzhou Lou, Jianbin Jiao, Junge Zhang

Decision making demands intricate interplay between perception, memory, and reasoning to discern optimal policies.

Decision Making Position

SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling

1 code implementation21 May 2024 Xingzhou Lou, Junge Zhang, Jian Xie, Lifeng Liu, Dong Yan, Kaiqi Huang

Human preference alignment is critical in building powerful and reliable large language models (LLMs).

S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation

no code implementations3 Feb 2024 Yurui Chen, Junge Zhang, Ziyang Xie, Wenye Li, Feihu Zhang, Jiachen Lu, Li Zhang

Autonomous driving simulation system plays a crucial role in enhancing self-driving data and simulating complex and rare traffic scenarios, ensuring navigation safety.

Autonomous Driving

Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models

no code implementations15 Jan 2024 Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, Yali Du

Through the use of pre-trained LMs and the elimination of the need for a ground-truth cost, our method enhances safe policy learning under a diverse set of human-derived free-form natural language constraints.

Reinforcement Learning (RL) Safe Reinforcement Learning

Self-supervised Pretraining for Decision Foundation Model: Formulation, Pipeline and Challenges

no code implementations29 Dec 2023 Xiaoqian Liu, Jianbin Jiao, Junge Zhang

Decision-making is a dynamic process requiring perception, memory, and reasoning to make choices and find optimal policies.

Decision Making Few-Shot Learning

TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient

1 code implementation25 Dec 2023 Xingzhou Lou, Junge Zhang, Timothy J. Norman, Kaiqi Huang, Yali Du

We propose Topology-based multi-Agent Policy gradiEnt (TAPE) for both stochastic and deterministic MAPG methods.

BadRL: Sparse Targeted Backdoor Attack Against Reinforcement Learning

1 code implementation19 Dec 2023 Jing Cui, Yufei Han, Yuzhe ma, Jianbin Jiao, Junge Zhang

Our algorithm, BadRL, strategically chooses state observations with high attack values to inject triggers during training and testing, thereby reducing the chances of detection.

Backdoor Attack reinforcement-learning +2

Benchmarking Continual Learning from Cognitive Perspectives

no code implementations6 Dec 2023 Xiaoqian Liu, Junge Zhang, Mingyi Zhang, Peipei Yang

To address these issues, we propose to integrate model cognitive capacities and evaluation metrics into a unified evaluation paradigm.

Benchmarking Continual Learning

Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork

1 code implementation19 Jun 2023 Yonggang Jin, Chenxu Wang, Tianyu Zheng, Liuyu Xiang, Yaodong Yang, Junge Zhang, Jie Fu, Zhaofeng He

Deep reinforcement learning algorithms are usually impeded by sampling inefficiency, heavily depending on multiple interactions with the environment to acquire accurate decision-making capabilities.

Decision Making Deep Reinforcement Learning +3

NeRF-LiDAR: Generating Realistic LiDAR Point Clouds with Neural Radiance Fields

1 code implementation28 Apr 2023 Junge Zhang, Feihu Zhang, Shaochen Kuang, Li Zhang

We verify the effectiveness of our NeRF-LiDAR by training different 3D segmentation models on the generated LiDAR point clouds.

Autonomous Driving Novel View Synthesis +2

S-NeRF: Neural Radiance Fields for Street Views

no code implementations1 Mar 2023 Ziyang Xie, Junge Zhang, Wenye Li, Feihu Zhang, Li Zhang

Specifically, we improve the scene parameterization function and the camera poses for learning better neural representations from street views.

Novel View Synthesis Self-Driving Cars

Improved Training of Mixture-of-Experts Language GANs

no code implementations23 Feb 2023 Yekun Chai, Qiyue Yin, Junge Zhang

In this work, we (1) first empirically show that the mixture-of-experts approach is able to enhance the representation capacity of the generator for language GANs and (2) harness the Feature Statistics Alignment (FSA) paradigm to render fine-grained learning signals to advance the generator training.

Adversarial Text Image Generation +1

PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination

1 code implementation16 Jan 2023 Xingzhou Lou, Jiaxian Guo, Junge Zhang, Jun Wang, Kaiqi Huang, Yali Du

We conduct experiments on the Overcooked environment, and evaluate the zero-shot human-AI coordination performance of our method with both behavior-cloned human proxies and real humans.

Diversity

Delving into Transformer for Incremental Semantic Segmentation

no code implementations18 Nov 2022 Zekai Xu, Mingyi Zhang, Jiayue Hou, Xing Gong, Chuan Wen, Chengjie Wang, Junge Zhang

In contrast, a Transformer based method has a natural advantage in curbing catastrophic forgetting due to its ability to model both long-term and short-term tasks.

Diversity Segmentation +1

Softmax-free Linear Transformers

1 code implementation5 Jul 2022 Jiachen Lu, Junge Zhang, Xiatian Zhu, Jianfeng Feng, Tao Xiang, Li Zhang

With linear complexity, much longer token sequences are permitted by SOFT, resulting in superior trade-off between accuracy and complexity.

Computational Efficiency

Coordinated Proximal Policy Optimization

1 code implementation NeurIPS 2021 Zifan Wu, Chao Yu, Deheng Ye, Junge Zhang, Haiyin Piao, Hankz Hankui Zhuo

We present Coordinated Proximal Policy Optimization (CoPPO), an algorithm that extends the original Proximal Policy Optimization (PPO) to the multi-agent setting.

Starcraft Starcraft II

SOFT: Softmax-free Transformer with Linear Complexity

2 code implementations NeurIPS 2021 Jiachen Lu, Jinghan Yao, Junge Zhang, Xiatian Zhu, Hang Xu, Weiguo Gao, Chunjing Xu, Tao Xiang, Li Zhang

Crucially, with a linear complexity, much longer token sequences are permitted in SOFT, resulting in superior trade-off between accuracy and complexity.

Computational Efficiency

Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning

no code implementations9 Apr 2021 Wenzhen Huang, Qiyue Yin, Junge Zhang, Kaiqi Huang

More specifically, we evaluate the effect of an imaginary transition by calculating the change of the loss computed on the real samples when we use the transition to train the action-value and policy functions.

Model-based Reinforcement Learning reinforcement-learning +2

Improving Sequence Generative Adversarial Networks with Feature Statistics Alignment

no code implementations1 Jan 2021 Yekun Chai, Qiyue Yin, Junge Zhang

Generative Adversarial Networks (GAN) are facing great challenges in synthesizing sequences of discrete elements, such as mode dropping and unstable training.

Binary Classification

Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning

no code implementations24 Oct 2020 Xiyao Wang, Junge Zhang, Wenzhen Huang, Qiyue Yin

We give an upper bound of the trajectory reward estimation error and point out that increasing the agent's exploration ability is the key to reduce trajectory reward estimation error, thereby alleviating dynamics bottleneck dilemma.

continuous-control Continuous Control +5

Point Cloud Super Resolution with Adversarial Residual Graph Networks

1 code implementation arXiv:1908.02111 2019 Huikai Wu, Junge Zhang, Kaiqi Huang

The key idea of the proposed network is to exploit the local similarity of point cloud and the analogy between LR input and HR output.

Graphics Image and Video Processing

SparseMask: Differentiable Connectivity Learning for Dense Image Prediction

1 code implementation ICCV 2019 Huikai Wu, Junge Zhang, Kaiqi Huang

In this paper, we aim at automatically searching an efficient network architecture for dense image prediction.

Decoder

FastFCN: Rethinking Dilated Convolution in the Backbone for Semantic Segmentation

12 code implementations28 Mar 2019 Huikai Wu, Junge Zhang, Kaiqi Huang, Kongming Liang, Yizhou Yu

Modern approaches for semantic segmentation usually employ dilated convolutions in the backbone to extract high-resolution feature maps, which brings heavy computation complexity and memory footprint.

Semantic Segmentation

Transductive Zero-Shot Learning with Visual Structure Constraint

1 code implementation NeurIPS 2019 Zi-Yu Wan, Dong-Dong Chen, Yan Li, Xingguang Yan, Junge Zhang, Yizhou Yu, Jing Liao

Based on the observation that visual features of test instances can be separated into different clusters, we propose a new visual structure constraint on class centers for transductive ZSL, to improve the generality of the projection function (i. e. alleviate the above domain shift problem).

Zero-Shot Learning

Discriminative Learning of Latent Features for Zero-Shot Recognition

1 code implementation CVPR 2018 Yan Li, Junge Zhang, Jian-Guo Zhang, Kaiqi Huang

In this work, we retrospect existing methods and demonstrate the necessity to learn discriminative representations for both visual and semantic instances of ZSL.

Zero-Shot Learning

Fast End-to-End Trainable Guided Filter

1 code implementation CVPR 2018 Huikai Wu, Shuai Zheng, Junge Zhang, Kaiqi Huang

To address the problem, we present a novel building block for FCNs, namely guided filtering layer, which is designed for efficiently generating a high-resolution output given the corresponding low-resolution one and a high-resolution guidance map.

Mixed Supervised Object Detection with Robust Objectness Transfer

no code implementations27 Feb 2018 Yan Li, Junge Zhang, Kaiqi Huang, Jian-Guo Zhang

Different from previous MSD methods that directly transfer the pre-trained object detectors from existing categories to new categories, we propose a more reasonable and robust objectness transfer approach for MSD.

Multiple Instance Learning Object +2

MSC: A Dataset for Macro-Management in StarCraft II

2 code implementations9 Oct 2017 Huikai Wu, Yanqi Zong, Junge Zhang, Kaiqi Huang

We also split MSC into training, validation and test set for the convenience of evaluation and comparison.

Management Starcraft +1

GP-GAN: Towards Realistic High-Resolution Image Blending

2 code implementations21 Mar 2017 Huikai Wu, Shuai Zheng, Junge Zhang, Kaiqi Huang

Concretely, we propose Gaussian-Poisson Equation to formulate the high-resolution image blending problem, which is a joint optimization constrained by the gradient and color information.

Conditional Image Generation Generative Adversarial Network +1

GRSA: Generalized Range Swap Algorithm for the Efficient Optimization of MRFs

no code implementations CVPR 2015 Kangwei Liu, Junge Zhang, Peipei Yang, Kaiqi Huang

al propose the range move algorithms, which are one of the most successful solvers to this problem.

Cannot find the paper you are looking for? You can Submit a new open access paper.