no code implementations • Findings (EMNLP) 2021 • Yekun Chai, Haidong Zhang, Qiyue Yin, Junge Zhang
Generative Adversarial Networks (GANs) have achieved great success in image synthesis, but have proven to be difficult to generate natural language.
1 code implementation • 30 Nov 2024 • Jiangmeng Li, Zehua Zang, Qirui Ji, Chuxiong Sun, Wenwen Qiang, Junge Zhang, Changwen Zheng, Fuchun Sun, Hui Xiong
Thus, to enhance the methodological generalization, we propose a novel self-supervised learning method that leverages advancements in reinforcement learning to jointly benefit from the general guidance of EGT and sequentially optimize the model to chase the consistent improvement of generalizability and discriminability for specific target domains during pre-training.
no code implementations • 1 Oct 2024 • Xingzhou Lou, Dong Yan, Wei Shen, Yuzi Yan, Jian Xie, Junge Zhang
Reward models (RM) play a critical role in aligning generations of large language models (LLM) to human expectations.
no code implementations • 5 Sep 2024 • Jing Cui, Yishi Xu, Zhewei Huang, Shuchang Zhou, Jianbin Jiao, Junge Zhang
Given the extensive research in the field of LLM security, we believe that summarizing the current state of affairs will help the research community better understand the present landscape and inform future developments.
1 code implementation • 27 May 2024 • Xiaoqian Liu, Xingzhou Lou, Jianbin Jiao, Junge Zhang
Decision making demands intricate interplay between perception, memory, and reasoning to discern optimal policies.
1 code implementation • 21 May 2024 • Xingzhou Lou, Junge Zhang, Jian Xie, Lifeng Liu, Dong Yan, Kaiqi Huang
Human preference alignment is critical in building powerful and reliable large language models (LLMs).
no code implementations • 18 Mar 2024 • Junge Zhang, Qihang Zhang, Li Zhang, Ramana Rao Kompella, Gaowen Liu, Bolei Zhou
Generating unbounded 3D scenes is crucial for large-scale scene understanding and simulation.
no code implementations • 3 Feb 2024 • Yurui Chen, Junge Zhang, Ziyang Xie, Wenye Li, Feihu Zhang, Jiachen Lu, Li Zhang
Autonomous driving simulation system plays a crucial role in enhancing self-driving data and simulating complex and rare traffic scenarios, ensuring navigation safety.
no code implementations • 15 Jan 2024 • Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, Yali Du
Through the use of pre-trained LMs and the elimination of the need for a ground-truth cost, our method enhances safe policy learning under a diverse set of human-derived free-form natural language constraints.
no code implementations • 29 Dec 2023 • Xiaoqian Liu, Jianbin Jiao, Junge Zhang
Decision-making is a dynamic process requiring perception, memory, and reasoning to make choices and find optimal policies.
2 code implementations • 26 Dec 2023 • Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin
Designing better deep networks and better reinforcement learning (RL) algorithms are both important for deep RL.
1 code implementation • 25 Dec 2023 • Xingzhou Lou, Junge Zhang, Timothy J. Norman, Kaiqi Huang, Yali Du
We propose Topology-based multi-Agent Policy gradiEnt (TAPE) for both stochastic and deterministic MAPG methods.
1 code implementation • 19 Dec 2023 • Jing Cui, Yufei Han, Yuzhe ma, Jianbin Jiao, Junge Zhang
Our algorithm, BadRL, strategically chooses state observations with high attack values to inject triggers during training and testing, thereby reducing the chances of detection.
no code implementations • 6 Dec 2023 • Xiaoqian Liu, Junge Zhang, Mingyi Zhang, Peipei Yang
To address these issues, we propose to integrate model cognitive capacities and evaluation metrics into a unified evaluation paradigm.
no code implementations • 22 Aug 2023 • Ceyao Zhang, Kaijie Yang, Siyi Hu, ZiHao Wang, Guanghe Li, Yihang Sun, Cheng Zhang, Zhaowei Zhang, Anji Liu, Song-Chun Zhu, Xiaojun Chang, Junge Zhang, Feng Yin, Yitao Liang, Yaodong Yang
Building agents with adaptive behavior in cooperative tasks stands as a paramount goal in the realm of multi-agent systems.
1 code implementation • 19 Jun 2023 • Yonggang Jin, Chenxu Wang, Tianyu Zheng, Liuyu Xiang, Yaodong Yang, Junge Zhang, Jie Fu, Zhaofeng He
Deep reinforcement learning algorithms are usually impeded by sampling inefficiency, heavily depending on multiple interactions with the environment to acquire accurate decision-making capabilities.
1 code implementation • 28 Apr 2023 • Junge Zhang, Feihu Zhang, Shaochen Kuang, Li Zhang
We verify the effectiveness of our NeRF-LiDAR by training different 3D segmentation models on the generated LiDAR point clouds.
no code implementations • 1 Mar 2023 • Ziyang Xie, Junge Zhang, Wenye Li, Feihu Zhang, Li Zhang
Specifically, we improve the scene parameterization function and the camera poses for learning better neural representations from street views.
no code implementations • 23 Feb 2023 • Yekun Chai, Qiyue Yin, Junge Zhang
In this work, we (1) first empirically show that the mixture-of-experts approach is able to enhance the representation capacity of the generator for language GANs and (2) harness the Feature Statistics Alignment (FSA) paradigm to render fine-grained learning signals to advance the generator training.
1 code implementation • 16 Jan 2023 • Xingzhou Lou, Jiaxian Guo, Junge Zhang, Jun Wang, Kaiqi Huang, Yali Du
We conduct experiments on the Overcooked environment, and evaluate the zero-shot human-AI coordination performance of our method with both behavior-cloned human proxies and real humans.
1 code implementation • 30 Dec 2022 • Hangyu Mao, Rui Zhao, Hao Chen, Jianye Hao, Yiqun Chen, Dong Li, Junge Zhang, Zhen Xiao
Recent methods combine the Transformer with these modules for better performance.
no code implementations • 18 Nov 2022 • Zekai Xu, Mingyi Zhang, Jiayue Hou, Xing Gong, Chuan Wen, Chengjie Wang, Junge Zhang
In contrast, a Transformer based method has a natural advantage in curbing catastrophic forgetting due to its ability to model both long-term and short-term tasks.
1 code implementation • 5 Jul 2022 • Jiachen Lu, Junge Zhang, Xiatian Zhu, Jianfeng Feng, Tao Xiang, Li Zhang
With linear complexity, much longer token sequences are permitted by SOFT, resulting in superior trade-off between accuracy and complexity.
no code implementations • 2 Jun 2022 • Hao Chen, Guangkai Yang, Junge Zhang, Qiyue Yin, Kaiqi Huang
Specifically, these methods do not explicitly utilize the relationship between agents and cannot adapt to different sizes of inputs.
1 code implementation • 27 Jan 2022 • Qibin Zhou, Dongdong Bai, Junge Zhang, Fuqing Duan, Kaiqi Huang
It is more common in life than perfect-information game.
1 code implementation • NeurIPS 2021 • Zifan Wu, Chao Yu, Deheng Ye, Junge Zhang, Haiyin Piao, Hankz Hankui Zhuo
We present Coordinated Proximal Policy Optimization (CoPPO), an algorithm that extends the original Proximal Policy Optimization (PPO) to the multi-agent setting.
2 code implementations • NeurIPS 2021 • Jiachen Lu, Jinghan Yao, Junge Zhang, Xiatian Zhu, Hang Xu, Weiguo Gao, Chunjing Xu, Tao Xiang, Li Zhang
Crucially, with a linear complexity, much longer token sequences are permitted in SOFT, resulting in superior trade-off between accuracy and complexity.
no code implementations • 9 Apr 2021 • Wenzhen Huang, Qiyue Yin, Junge Zhang, Kaiqi Huang
More specifically, we evaluate the effect of an imaginary transition by calculating the change of the loss computed on the real samples when we use the transition to train the action-value and policy functions.
Model-based Reinforcement Learning reinforcement-learning +2
no code implementations • 1 Jan 2021 • Yekun Chai, Qiyue Yin, Junge Zhang
Generative Adversarial Networks (GAN) are facing great challenges in synthesizing sequences of discrete elements, such as mode dropping and unstable training.
no code implementations • 24 Oct 2020 • Xiyao Wang, Junge Zhang, Wenzhen Huang, Qiyue Yin
We give an upper bound of the trajectory reward estimation error and point out that increasing the agent's exploration ability is the key to reduce trajectory reward estimation error, thereby alleviating dynamics bottleneck dilemma.
1 code implementation • 10 Sep 2019 • Zihao Li, Shu Zhang, Junge Zhang, Kaiqi Huang, Yizhou Wang, Yizhou Yu
In this paper, we propose to incorporate domain knowledge in clinical practice into the model design of universal lesion detectors.
Ranked #8 on Medical Object Detection on DeepLesion
1 code implementation • arXiv:1908.02111 2019 • Huikai Wu, Junge Zhang, Kaiqi Huang
The key idea of the proposed network is to exploit the local similarity of point cloud and the analogy between LR input and HR output.
Ranked #2 on Point Cloud Super Resolution on SHREC15
Graphics Image and Video Processing
1 code implementation • ICCV 2019 • Huikai Wu, Junge Zhang, Kaiqi Huang
In this paper, we aim at automatically searching an efficient network architecture for dense image prediction.
12 code implementations • 28 Mar 2019 • Huikai Wu, Junge Zhang, Kaiqi Huang, Kongming Liang, Yizhou Yu
Modern approaches for semantic segmentation usually employ dilated convolutions in the backbone to extract high-resolution feature maps, which brings heavy computation complexity and memory footprint.
Ranked #40 on Semantic Segmentation on PASCAL Context
1 code implementation • NeurIPS 2019 • Zi-Yu Wan, Dong-Dong Chen, Yan Li, Xingguang Yan, Junge Zhang, Yizhou Yu, Jing Liao
Based on the observation that visual features of test instances can be separated into different clusters, we propose a new visual structure constraint on class centers for transductive ZSL, to improve the generality of the projection function (i. e. alleviate the above domain shift problem).
1 code implementation • CVPR 2018 • Yan Li, Junge Zhang, Jian-Guo Zhang, Kaiqi Huang
In this work, we retrospect existing methods and demonstrate the necessity to learn discriminative representations for both visual and semantic instances of ZSL.
1 code implementation • CVPR 2018 • Huikai Wu, Shuai Zheng, Junge Zhang, Kaiqi Huang
To address the problem, we present a novel building block for FCNs, namely guided filtering layer, which is designed for efficiently generating a high-resolution output given the corresponding low-resolution one and a high-resolution guidance map.
no code implementations • 27 Feb 2018 • Yan Li, Junge Zhang, Kaiqi Huang, Jian-Guo Zhang
Different from previous MSD methods that directly transfer the pre-trained object detectors from existing categories to new categories, we propose a more reasonable and robust objectness transfer approach for MSD.
2 code implementations • 9 Oct 2017 • Huikai Wu, Yanqi Zong, Junge Zhang, Kaiqi Huang
We also split MSC into training, validation and test set for the convenience of evaluation and comparison.
3 code implementations • CVPR 2018 • Debang Li, Huikai Wu, Junge Zhang, Kaiqi Huang
Image cropping aims at improving the aesthetic quality of images by adjusting their composition.
2 code implementations • 21 Mar 2017 • Huikai Wu, Shuai Zheng, Junge Zhang, Kaiqi Huang
Concretely, we propose Gaussian-Poisson Equation to formulate the high-resolution image blending problem, which is a joint optimization constrained by the gradient and color information.
Conditional Image Generation Generative Adversarial Network +1
no code implementations • ICCV 2015 • Lianrui Fu, Junge Zhang, Kaiqi Huang
Occlusion is a main challenge for human pose estimation, which is largely ignored in popular tree structure models.
no code implementations • CVPR 2015 • Kangwei Liu, Junge Zhang, Peipei Yang, Kaiqi Huang
al propose the range move algorithms, which are one of the most successful solvers to this problem.
no code implementations • CVPR 2014 • Kangwei Liu, Junge Zhang, Kaiqi Huang, Tieniu Tan
The MRF energy function is derived from the deformation decomposition model.