no code implementations • ECCV 2020 • Fang Zhao, Shengcai Liao, Guo-Sen Xie, Jian Zhao, Kaihao Zhang, Ling Shao
On the other hand, mutual instance selection further selects reliable and informative instances for training according to the peer-confidence and relationship disagreement of the networks.
no code implementations • 29 Dec 2024 • Daiheng Gao, Shilin Lu, Shaw Walters, Wenbo Zhou, Jiaming Chu, Jie Zhang, Bang Zhang, Mengxi Jia, Jian Zhao, Zhaoxin Fan, Weiming Zhang
Removing unwanted concepts from large-scale text-to-image (T2I) diffusion models while maintaining their overall generative quality remains an open challenge.
no code implementations • 28 Dec 2024 • Shuo Wang, Wanting Li, Yongcai Wang, Zhaoxin Fan, Zhe Huang, Xudong Cai, Jian Zhao, Deying Li
To address this challenge, this paper proposes MambaVO, which conducts robust initialization, Mamba-based sequential matching refinement, and smoothed training to enhance the matching quality and improve the pose estimation in deep visual odometry.
1 code implementation • 23 Dec 2024 • Yue Deng, Yan Yu, Weiyu Ma, ZiRui Wang, Wenhui Zhu, Jian Zhao, Yin Zhang
SMAC-HARD supports customizable opponent strategies, randomization of adversarial policies, and interfaces for MARL self-play, enabling agents to generalize to varying opponent behaviors and improve model stability.
1 code implementation • 16 Dec 2024 • Junjie Lin, Jian Zhao, Lin Liu, Yue Deng, Youpeng Zhao, Lanxiao Huang, Xia Lin, Wengang Zhou, Houqiang Li
After iterative refinements, our curling AI based on the decision tree ranks first on the Jidi platform among 34 curling AIs in total, which demonstrates that LLMs can significantly enhance the robustness and adaptability of decision trees, representing a substantial advancement in the field of Game AI.
no code implementations • 12 Dec 2024 • Hui Dou, Xinyu Mu, Mengjun Yi, Feng Han, Jian Zhao, Furao Shen
Convolutional Neural Networks (CNNs) have demonstrated remarkable prowess in the field of computer vision.
no code implementations • 26 Nov 2024 • Zhiming Xu, Suorong Yang, Baile Xu, Jian Zhao, Furao Shen
This paper proposes the Dual Prototype network for Task-wise Adaption (DPTA) of PTM-based CIL.
no code implementations • 25 Nov 2024 • Guangzhao Dai, Jian Zhao, Yuantao Chen, Yusen Qin, Hao Zhao, GuoSen Xie, Yazhou Yao, Xiangbo Shu, Xuelong Li
Vision-and-Language Navigation (VLN), where an agent follows instructions to reach a target destination, has recently seen significant advancements.
no code implementations • 14 Nov 2024 • Dunwei Tu, Huiyu Yi, Tieyi Zhang, Ruotong Li, Furao Shen, Jian Zhao
FSCIL combines the characteristics and challenges of class-incremental learning and few-shot learning: (i) Current classes occupy the entire feature space, which is detrimental to learning new classes.
class-incremental learning Few-Shot Class-Incremental Learning +2
no code implementations • 5 Nov 2024 • Xiaoliang Liu, Furao Shen, Jian Zhao
The Segment Anything Model (SAM) is a cornerstone of image segmentation, demonstrating exceptional performance across various applications, particularly in autonomous driving and medical imaging, where precise segmentation is crucial.
no code implementations • 31 Oct 2024 • Suhan Guo, Jiahong Deng, Yi Wei, Hui Dou, Furao Shen, Jian Zhao
Attention-based architectures have become ubiquitous in time series forecasting tasks, including spatio-temporal (STF) and long-term time series forecasting (LTSF).
no code implementations • 28 Oct 2024 • Reuben Luera, Ryan A. Rossi, Alexa Siu, Franck Dernoncourt, Tong Yu, Sungchul Kim, Ruiyi Zhang, Xiang Chen, Hanieh Salehy, Jian Zhao, Samyadeep Basu, Puneet Mathur, Nedim Lipka
The applications of generative AI have become extremely impressive, and the interplay between users and AI is even more so.
1 code implementation • 21 Oct 2024 • Yue Deng, Weiyu Ma, Yuxin Fan, Yin Zhang, Haifeng Zhang, Jian Zhao
StarCraft Multi-Agent Challenge (SMAC) is one of the most commonly used experimental environments in multi-agent reinforcement learning (MARL), where the specific task is to control a set number of allied units to defeat enemy forces.
1 code implementation • 9 Oct 2024 • Yushen Chen, Zhikang Niu, Ziyang Ma, Keqi Deng, Chunhui Wang, Jian Zhao, Kai Yu, Xie Chen
This sampling strategy for flow step can be easily applied to existing flow matching based models without retraining.
1 code implementation • 9 Oct 2024 • Yuhan Kang, Qingpeng Li, Leyuan Fang, Jian Zhao, Xuelong Li
In this paper, considering that the surrounding environment information can be well utilized to identify the concealed objects, and thus, we propose a novel deep Surrounding-Aware Network, namely SurANet, for COD tasks, which introduces surrounding information into feature extraction and loss function to improve the discrimination.
no code implementations • 29 Sep 2024 • Yifan Duan, Jian Zhao, Pengcheng, Junyuan Mao, Hao Wu, Jingyu Xu, Shilong Wang, Caoyuan Ma, Kai Wang, Kun Wang, Xuelong Li
To this end, we establish a causal framework for ST predictions, termed CaPaint, which targets to identify causal regions in data and endow model with causal reasoning ability in a two-stage process.
1 code implementation • 10 Sep 2024 • Suorong Yang, Furao Shen, Jian Zhao
In addition, to further enhance the efficacy of EntAugment, we introduce a novel entropy regularization term, EntLoss, which complements the EntAugment approach.
no code implementations • 10 Sep 2024 • Chang Zeng, Chunhui Wang, Xiaoxiao Miao, Jian Zhao, Zhonglin Jiang, Yong Chen
Subsequently, the HN module is connected with an extended WaveNet by an UNet-based module, which transforms the output of the HN module to a latent variable sequence containing essential periodic and aperiodic information.
no code implementations • 25 Jun 2024 • Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao
Previous works on adversarial attacks have primarily focused on white-box attacks that directly perturb the states or actions of victim agents, often in scenarios with a limited number of attacks.
no code implementations • 15 Jun 2024 • Ying Fu, Yu Li, ShaoDi You, Boxin Shi, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Shengping Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou, Cong Li, Senyan Xu, Yunkang Zhang, Siyuan Jiang, Xiaoqiang Lu, Licheng Jiao, Fang Liu, Xu Liu, Lingling Li, Wenping Ma, Shuyuan Yang, Haiyang Xie, Jian Zhao, Shihua Huang, Peng Cheng, Xi Shen, Zheng Wang, Shuai An, Caizhi Zhu, Xuelong Li, Tao Zhang, Liang Li, Yu Liu, Chenggang Yan, Gengchen Zhang, Linyan Jiang, Bingyi Song, Zhuoyu An, Haibo Lei, Qing Luo, Jie Song, YuAn Liu, Haoyuan Zhang, Lingfeng Wang, Wei Chen, Aling Luo, Cheng Li, Jun Cao, Shu Chen, Zifei Dou, Xinyu Liu, Jing Zhang, Kexin Zhang, Yuting Yang, Xuejian Gou, Qinliang Wang, Yang Liu, Shizhan Zhao, Yanzhao Zhang, Libo Yan, Yuwei Guo, Guoxin Li, Qiong Gao, Chenyue Che, Long Sun, Xiang Chen, Hao Li, Jinshan Pan, Chuanlong Xie, Hongming Chen, Mingrui Li, Tianchen Deng, Jingwei Huang, Yufeng Li, Fei Wan, Bingxin Xu, Jian Cheng, Hongzhe Liu, Cheng Xu, Yuxiang Zou, Weiguo Pan, Songyin Dai, Sen Jia, Junpei Zhang, Puhua Chen, Qihang Li
The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies.
1 code implementation • 6 Jun 2024 • Lin Liu, Jian Zhao, Cheng Hu, Zhengtao Cao, Youpeng Zhao, Zhenbin Ye, Meng Meng, Wenjun Wang, Zhaofeng He, Houqiang Li, Xia Lin, Lanxiao Huang
To address these issues, we introduce the first publicly available map editor for the popular mobile game Honor of Kings and design a lightweight environment, Mini Honor of Kings (Mini HoK), for researchers to conduct experiments.
no code implementations • 3 Jun 2024 • Chen Zhang, Qiang He, Zhou Yuan, Elvis S. Liu, Hong Wang, Jian Zhao, Yang Wang
Sh\=ukai quantifies the state to enhance generalizability, introducing Heterogeneous League Training (HELT) to achieve balanced competence, generalizability, and training efficiency.
no code implementations • 27 May 2024 • Jian Zhao, Lei Jin, Jianshu Li, Zheng Zhu, Yinglei Teng, Jiaojiao Zhao, Sadaf Gulshad, Zheng Wang, Bo Zhao, Xiangbo Shu, Yunchao Wei, Xuecheng Nie, Xiaojie Jin, Xiaodan Liang, Shin'ichi Satoh, Yandong Guo, Cewu Lu, Junliang Xing, Jane Shen Shengmei
The SkatingVerse Workshop & Challenge aims to encourage research in developing novel and accurate methods for human action understanding.
no code implementations • 19 May 2024 • Suorong Yang, Peijia Li, Xin Xiong, Furao Shen, Jian Zhao
The policy network optimizes the variability within the augmented data, while the target network utilizes the adaptively augmented samples for training.
no code implementations • 9 May 2024 • Feifei Li, Suhan Guo, Feng Han, Jian Zhao, Furao Shen
Accurate forecasting of long-term time series has important applications for decision making and planning.
no code implementations • CVPR 2024 • Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu, Zheng Zhu, Lei Jin, Jianshu Li, Yulan Guo, Junliang Xing, Liping Jing, Yiming Nie, Bin Dai
In this paper, we address this challenge by introducing a world model-based autonomous driving 4D representation learning framework, dubbed \emph{DriveWorld}, which is capable of pre-training from multi-camera driving videos in a spatio-temporal fashion.
1 code implementation • 6 May 2024 • Zheng Zhu, XiaoFeng Wang, Wangbo Zhao, Chen Min, Nianchen Deng, Min Dou, Yuqi Wang, Botian Shi, Kai Wang, Chi Zhang, Yang You, Zhaoxiang Zhang, Dawei Zhao, Liang Xiao, Jian Zhao, Jiwen Lu, Guan Huang
General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems.
1 code implementation • 1 May 2024 • Runyi Yang, Zhenxin Zhu, Zhou Jiang, Baijun Ye, Xiaoxue Chen, Yifei Zhang, Yuantao Chen, Jian Zhao, Hao Zhao
However, this comes with high memory consumption, e. g., a well-trained Gaussian field may utilize three million Gaussian primitives and over 700 MB of memory.
no code implementations • 26 Apr 2024 • Xinpeng Li, Teng Wang, Jian Zhao, Shuyi Mao, Jinbao Wang, Feng Zheng, Xiaojiang Peng, Xuelong Li
Emotion recognition aims to discern the emotional state of subjects within an image, relying on subject-centric and contextual visual cues.
1 code implementation • 23 Apr 2024 • Ziqi Xie, Weidong Zhao, Xianhui Liu, Jian Zhao, Ning Jia
This paper proposes the Simple and Robust Stitcher (SRStitcher), which revolutionizes the image stitching pipeline by simplifying the fusion and rectangling stages into a unified inpainting model, requiring no model training or fine-tuning.
no code implementations • 23 Apr 2024 • Fan Zhang, Zhi-Qi Cheng, Jian Zhao, Xiaojiang Peng, Xuelong Li
LEAF introduces a hierarchical expression-aware aggregation strategy that operates at three levels: semantic, instance, and category.
Facial Expression Recognition Facial Expression Recognition (FER)
no code implementations • 11 Mar 2024 • Zijian Chen, Mei Wang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Xingchen Cui, Jian Zhao
2D face recognition encounters challenges in unconstrained environments due to varying illumination, occlusion, and pose.
no code implementations • 28 Feb 2024 • Haoyu Xie, Changqi Wang, Jian Zhao, Yang Liu, Jun Dan, Chong Fu, Baigui Sun
To address this issue, we propose a robust contrastive-based S4 framework, termed the Probabilistic Representation Contrastive Learning (PRCL) framework to enhance the robustness of the unsupervised training process.
no code implementations • 11 Feb 2024 • Pengcheng An, Jiawen Zhu, Zibo Zhang, Yifei Yin, Qingyuan Ma, Che Yan, Linghao Du, Jian Zhao
We introduce EmoWear, a smartwatch voice messaging system enabling users to apply 30 animation teasers on message bubbles to reflect emotions.
1 code implementation • CVPR 2024 • Tao Wang, Lei Jin, Zheng Wang, Jianshu Li, Liang Li, Fang Zhao, Yu Cheng, Li Yuan, Li Zhou, Junliang Xing, Jian Zhao
To leverage this quality information we propose a motion refinement network termed SynSP to achieve a Synergy of Smoothness and Precision in the sequence refinement tasks.
no code implementations • 9 Dec 2023 • Suorong Yang, Hongchao Yang, Suhan Guo, Furao Shen, Jian Zhao
AdaPruner can still significantly enhance model performance even after pruning up to 10-30\% of the training data.
1 code implementation • 5 Dec 2023 • Youpeng Zhao, Yudong Lu, Jian Zhao, Wengang Zhou, Houqiang Li
The utilization of artificial intelligence (AI) in card games has been a well-explored subject within AI research for an extensive period.
no code implementations • 29 Nov 2023 • Xiaoliang Liu, Furao Shen, Jian Zhao, Changhai Nie
RADAP employs innovative techniques, such as FCutout and F-patch, which use Fourier space sampling masks to improve the occlusion robustness of the FR model and the performance of the patch segmenter.
no code implementations • 29 Nov 2023 • Xiaoliang Liu, Furao Shen, Feng Han, Jian Zhao, Changhai Nie
Face recognition (FR) technology plays a crucial role in various applications, but its vulnerability to adversarial attacks poses significant security concerns.
no code implementations • 27 Nov 2023 • Suorong Yang, Geng Zhang, Jian Zhao, Furao Shen
Interpolation methodologies have been widely used within the domain of indoor positioning systems.
no code implementations • 22 Nov 2023 • Yuzhe You, Jarvis Tse, Jian Zhao
Adversarial machine learning (AML) studies attacks that can fool machine learning algorithms into generating incorrect outcomes as well as the defenses against worst-case attacks to strengthen model robustness.
1 code implementation • 13 Oct 2023 • Jiaming Chu, Lei Jin, Junliang Xing, Jian Zhao
Multi-human parsing is an image segmentation task necessitating both instance-level and fine-grained category-level information.
Ranked #1 on Multi-Human Parsing on MHP v2.0
no code implementations • 25 Sep 2023 • Ping Li, Yu Zhang, Li Yuan, Jian Zhao, Xianghua Xu, Xiaoqin Zhang
Particularly, the gradients from the segmentation model are exploited to discover the easily confused region, in which it is difficult to identify the pixel-wise objects from the background in a frame.
1 code implementation • ICCV 2023 • Chengliang Zhong, Yuhang Zheng, Yupeng Zheng, Hao Zhao, Li Yi, Xiaodong Mu, Ling Wang, Pengfei Li, Guyue Zhou, Chao Yang, Xinliang Zhang, Jian Zhao
To address this issue, the Transporter method was introduced for 2D data, which reconstructs the target frame from the source frame to incorporate both spatial and temporal information.
no code implementations • 29 Aug 2023 • Zhuo Zhou, Wenxuan Liu, Danni Xu, Zheng Wang, Jian Zhao
HID presents a unique challenge in that hidden intentions lack the obvious visual representations to distinguish them from normal intentions.
1 code implementation • 26 Aug 2023 • Jianqiang Xia, Dianxi Shi, Ke Song, Linna Song, Xiaolei Wang, Songchang Jin, Li Zhou, Yu Cheng, Lei Jin, Zheng Zhu, Jianan Li, Gang Wang, Junliang Xing, Jian Zhao
With this structure, the network can extract fusion features of the template and search region under the mutual interaction of modalities.
Ranked #4 on Rgb-T Tracking on GTOT
1 code implementation • 21 Aug 2023 • Jianyang Gu, Hao Luo, Kai Wang, Wei Jiang, Yang You, Jian Zhao
In this work, we propose a Color Prompting (CoP) method for data-free continual unsupervised domain adaptive person Re-ID.
Domain Adaptive Person Re-Identification Person Re-Identification +1
no code implementations • 3 Aug 2023 • Jiazheng Xing, Chao Xu, Mengmeng Wang, Guang Dai, Baigui Sun, Yong liu, Jingdong Wang, Jian Zhao
To tackle these issues, we introduce MA-FSAR, a framework that employs the Parameter-Efficient Fine-Tuning (PEFT) technique to enhance the CLIP visual encoder in terms of action-related temporal and semantic representations.
no code implementations • 11 Jul 2023 • Sihan Song, Furao Shen, Jian Zhao
Data augmentation has been widely used in low-resource NER tasks to tackle the problem of data sparsity.
1 code implementation • 27 Jun 2023 • Xue-Feng Zhu, Tianyang Xu, Jian Zhao, Jia-Wei Liu, Kai Wang, Gang Wang, Jianan Li, Qiang Wang, Lei Jin, Zheng Zhu, Junliang Xing, Xiao-Jun Wu
Still, previous works have simplified such an anti-UAV task as a tracking problem, where the prior information of UAVs is always provided; such a scheme fails in real-world anti-UAV tasks (i. e. complex scenes, indeterminate-appear and -reappear UAVs, and real-time UAV surveillance).
no code implementations • 27 May 2023 • Xin Xiong, Furao Shen, Xiangyu Wang, Jian Zhao
Many GCL methods with automated data augmentation face the risk of insufficient information as they fail to preserve the essential information necessary for the downstream task.
no code implementations • 14 May 2023 • Xiaowen Tao, Pengxiang Meng, Bing Zhu, Jian Zhao
Autonomous driving has spurred the development of sensor fusion techniques, which combine data from multiple sensors to improve system performance.
no code implementations • 12 May 2023 • Jian Zhao, Jianan Li, Lei Jin, Jiaming Chu, Zhihao Zhang, Jun Wang, Jiangqiang Xia, Kai Wang, Yang Liu, Sadaf Gulshad, Jiaojiao Zhao, Tianyang Xu, XueFeng Zhu, Shihan Liu, Zheng Zhu, Guibo Zhu, Zechao Li, Zheng Wang, Baigui Sun, Yandong Guo, Shin ichi Satoh, Junliang Xing, Jane Shen Shengmei
Second, we set up two tracks for the first time, i. e., Anti-UAV Tracking and Anti-UAV Detection & Tracking.
1 code implementation • 22 Apr 2023 • Jiaming Chu, Lei Jin, Junliang Xing, Jian Zhao
We instead present a high-performance Single-stage Multi-human Parsing (SMP) deep architecture that decouples the multi-human parsing problem into two fine-grained sub-problems, i. e., locating the human body and parts.
Ranked #2 on Multi-Human Parsing on MHP v2.0
no code implementations • 5 Apr 2023 • Linzhi Huang, Mei Wang, Jiahao Liang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Jian Zhao
Specifically, we use the gradient attention map (GAM) of the face recognition network to track the sensitive facial regions and make the GAMs of different races tend to be consistent through adversarial learning.
1 code implementation • CVPR 2023 • Jianyang Gu, Kai Wang, Hao Luo, Chen Chen, Wei Jiang, Yuqiang Fang, Shanghang Zhang, Yang You, Jian Zhao
Neural Architecture Search (NAS) has been increasingly appealing to the society of object Re-Identification (ReID), for that task-specific architectures significantly improve the retrieval performance.
Ranked #8 on Vehicle Re-Identification on VehicleID Large
no code implementations • 7 Mar 2023 • Emily Kuang, Ehsan Jahangirzadeh Soure, Mingming Fan, Jian Zhao, Kristen Shinohara
Those who used the text assistant asked more questions, but the question lengths were similar.
no code implementations • 18 Feb 2023 • Yuanjie Yan, Jian Zhao, Furao Shen
We analyse the gradients layer by layer on the style space.
no code implementations • ICCV 2023 • Rui Li, Baopeng Zhang, Jun Liu, Wei Liu, Jian Zhao, Zhu Teng
HD-AMOT defines the diversified informative representation by encoding the geometric and semantic information, and formulates the frame inference strategy as a Markov decision process to learn an optimal sampling policy based on the designed informative representation.
no code implementations • 29 Nov 2022 • Suorong Yang, Jinqiao Li, Jian Zhao, Furao Shen
The experimental results on various datasets and CNN models verify that the proposed method outperforms other previous data augmentation methods in image classification tasks.
no code implementations • CVPR 2023 • Ruili Feng, Kecheng Zheng, Kai Zhu, Yujun Shen, Jian Zhao, Yukun Huang, Deli Zhao, Jingren Zhou, Michael Jordan, Zheng-Jun Zha
Through investigating the properties of the problem solution, we confirm that neural dependency is guaranteed by a redundant logit covariance matrix, which condition is easily met given massive categories, and that neural dependency is highly sparse, implying that one category correlates to only a few others.
no code implementations • 31 Oct 2022 • Yudong Lu, Jian Zhao, Youpeng Zhao, Wengang Zhou, Houqiang Li
We compare it with 8 baseline AI programs which are based on heuristic rules and the results reveal the outstanding performance of DanZero.
1 code implementation • 8 Oct 2022 • Yabo Xiao, Xiaojuan Wang, Dongdong Yu, Kai Su, Lei Jin, Mei Song, Shuicheng Yan, Jian Zhao
With the proposed body representation, we further deliver a compact single-stage multi-person pose regression network, termed as AdaptivePose.
no code implementations • 21 Jul 2022 • Xiaoliang Liu, Furao Shen, Jian Zhao, Changhai Nie
In this paper, we propose a new data processing and training method, called AugRmixAT, which can simultaneously improve the generalization ability and multiple robustness of neural network models.
3 code implementations • 14 Jul 2022 • Pengfei Chen, Xuehui Yu, Xumeng Han, Najmul Hassan, Kai Wang, Jiachen Li, Jian Zhao, Humphrey Shi, Zhenjun Han, Qixiang Ye
However, the performance gap between point supervised object detection (PSOD) and bounding box supervised detection remains large.
no code implementations • 25 Jun 2022 • Xiaoliang Liu, Furao Shen, Jian Zhao, Changhai Nie
Furthermore, we propose a random meta-optimization strategy for ensembling several pre-trained face models to generate more general adversarial masks.
no code implementations • 21 Jun 2022 • Yuanjie Yan, Suorong Yang, Yan Wang, Jian Zhao, Furao Shen
From the perspective of this framework, we review those subtasks and give a unified interpretation of various scenarios.
no code implementations • 18 May 2022 • Xiaoliang Liu, Furao Shen, Jian Zhao, Changhai Nie
Data augmentation plays a crucial role in enhancing the robustness and performance of machine learning models across various domains.
no code implementations • 7 May 2022 • Zheng Chen, Jian Zhao, Mingyu Yang, Wengang Zhou, Houqiang Li
In this work, we are dedicated to multi-target active object tracking (AOT), where there are multiple targets as well as multiple cameras in the environment.
no code implementations • 5 May 2022 • Mingyu Yang, Jian Zhao, Xunhan Hu, Wengang Zhou, Jiangcheng Zhu, Houqiang Li
In this way, agents dealing with the same subtask share their learning of specific abilities and different subtasks correspond to different specific abilities.
Multi-agent Reinforcement Learning reinforcement-learning +3
1 code implementation • 21 Apr 2022 • Anjul Tyagi, Jian Zhao, Pushkar Patel, Swasti Khurana, Klaus Mueller
With the help of designers, we propose a semi-automated infographic framework for general structured and flow-based infographic design generation.
no code implementations • 19 Apr 2022 • Suorong Yang, Weikang Xiao, Mengchen Zhang, Suhan Guo, Jian Zhao, Furao Shen
By improving the quantity and diversity of training data, data augmentation has become an inevitable part of deep learning model training with image data.
1 code implementation • 6 Apr 2022 • Youpeng Zhao, Jian Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li
Recent years have witnessed the great breakthrough of deep reinforcement learning (DRL) in various perfect and imperfect information games.
1 code implementation • CVPR 2022 • Jian Zhao, HUI ZHANG
Firstly, we propose thin-plate spline motion estimation to produce a more flexible optical flow, which warps the feature maps of the source image to the feature domain of the driving image.
no code implementations • 18 Mar 2022 • Jinqiao Li, Xiaotao Liu, Jian Zhao, Furao Shen
A special branch of adversarial examples, namely sparse adversarial examples, can fool the target DNNs by perturbing only a few pixels.
1 code implementation • 16 Mar 2022 • Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li
To the best of our knowledge, this work is the first to study the unexpected crashes in the multi-agent system.
Multi-agent Reinforcement Learning reinforcement-learning +4
1 code implementation • 16 Mar 2022 • Jian Zhao, Xunhan Hu, Mingyu Yang, Wengang Zhou, Jiangcheng Zhu, Houqiang Li
In this way, CTDS balances the full utilization of global observation during training and the feasibility of decentralized execution for online inference.
Multi-agent Reinforcement Learning reinforcement-learning +3
2 code implementations • CVPR 2022 • Shiming Chen, Ziming Hong, Guo-Sen Xie, Wenhan Yang, Qinmu Peng, Kai Wang, Jian Zhao, Xinge You
Prior works either simply align the global features of an image with its associated class semantic vector or utilize unidirectional attention to learn the limited latent semantic representations, which could not effectively discover the intrinsic semantic knowledge e. g., attribute semantics) between visual and attribute features.
1 code implementation • 22 Feb 2022 • Zeyu Fang, Jian Zhao, Mingyu Yang, Wengang Zhou, Zhenbo Lu, Houqiang Li
In our approach, we regard each camera as an agent and address AMOT with a multi-agent reinforcement learning solution.
1 code implementation • 21 Feb 2022 • Jian Zhao, Mingyu Yang, Youpeng Zhao, Xunhan Hu, Wengang Zhou, Jiangcheng Zhu, Houqiang Li
Specifically, we model both individual Q-values and global Q-value with categorical distribution.
no code implementations • 9 Feb 2022 • Jian Zhao, Yue Zhang, Xunhan Hu, Weixun Wang, Wengang Zhou, Jianye Hao, Jiangcheng Zhu, Houqiang Li
In cooperative multi-agent systems, agents jointly take actions and receive a team reward instead of individual rewards.
no code implementations • CVPR 2022 • Lei Jin, Chenyang Xu, Xiaojuan Wang, Yabo Xiao, Yandong Guo, Xuecheng Nie, Jian Zhao
The existing multi-person absolute 3D pose estimation methods are mainly based on two-stage paradigm, i. e., top-down or bottom-up, leading to redundant pipelines with high computation cost.
1 code implementation • 16 Dec 2021 • Shiming Chen, Ziming Hong, Wenjin Hou, Guo-Sen Xie, Yibing Song, Jian Zhao, Xinge You, Shuicheng Yan, Ling Shao
Analogously, VAT uses the similar feature augmentation encoder to refine the visual features, which are further applied in visual$\rightarrow$attribute decoder to learn visual-based attribute features.
no code implementations • 15 Dec 2021 • Xingjun Li, Yizhi Zhang, Justin Leung, Chengnian Sun, Jian Zhao
This paper presents EDAssistant, a JupyterLab extension that supports EDA with in-situ search of example notebooks and recommendation of useful APIs, powered by novel interactive visualization of search results.
no code implementations • 29 Sep 2021 • Qibin Li, Nianmin Yao, Jian Zhao, Yanan Zhang
Based on the traditional attention mechanism, multi-scale fusion self attention extracts phrase information at different scales by setting convolution kernels at different levels, and calculates the corresponding attention matrix at different scales, so that the model can better extract phrase level information.
no code implementations • 26 Aug 2021 • Anjul Tyagi, Jian Zhao, Pushkar Patel, Swasti Khurana, Klaus Mueller
Based on the framework, we also propose an interactive tool, \name{}, for assisting novice designers with creating high-quality infographics from an input in a markdown format by offering recommendations of different design components of infographics.
no code implementations • 23 Aug 2021 • Jian Zhao, Gang Wang, Jianan Li, Lei Jin, Nana Fan, Min Wang, Xiaojuan Wang, Ting Yong, Yafeng Deng, Yandong Guo, Shiming Ge, Guodong Guo
The 2nd Anti-UAV Workshop \& Challenge aims to encourage research in developing novel and accurate methods for multi-scale object tracking.
1 code implementation • 19 Jul 2021 • Qingzhong Wang, Pengfei Zhang, Haoyi Xiong, Jian Zhao
In this paper, we develop face. evoLVe -- a comprehensive library that collects and implements a wide range of popular deep learning-based methods for face recognition.
2 code implementations • 7 Jul 2021 • Xumeng Han, Xuehui Yu, Guorong Li, Jian Zhao, Gang Pan, Qixiang Ye, Jianbin Jiao, Zhenjun Han
While extensive research has focused on the framework design and loss function, this paper shows that sampling strategy plays an equally important role.
1 code implementation • 29 Jun 2021 • Takanori Fujiwara, Xinhai Wei, Jian Zhao, Kwan-Liu Ma
However, existing DR methods provide limited capability and flexibility for such comparative analysis as each method is designed only for a narrow analysis target, such as identifying factors that most differentiate groups.
no code implementations • 13 Jun 2021 • Hua Yan, Feng Han, Junyi An, Weikang Xiao, Jian Zhao, Furao Shen
The F1 score of SASICMBERT, whose pretrained model is BERT, is 65. 12%, which is 0. 75% higher than that of SASICMg.
no code implementations • 31 May 2021 • Xiaoguang Tu, Yingtian Zou, Jian Zhao, Wenjie Ai, Jian Dong, Yuan YAO, Zhikang Wang, Guodong Guo, Zhifeng Li, Wei Liu, Jiashi Feng
Video generation from a single face image is an interesting problem and usually tackled by utilizing Generative Adversarial Networks (GANs) to integrate information from the input face image and a sequence of sparse facial landmarks.
no code implementations • 12 May 2021 • Xiaoguang Tu, Jian Zhao, Qiankun Liu, Wenjie Ai, Guodong Guo, Zhifeng Li, Wei Liu, Jiashi Feng
First, MDFR is a well-designed encoder-decoder architecture which extracts feature representation from an input face image with arbitrary low-quality factors and restores it to a high-quality counterpart.
no code implementations • 7 May 2021 • Shaokui Jiang, Baile Xu, Jian Zhao, Furao Shen
With the development of the deep network and the release for a series of large scale datasets for single object tracking, siamese networks have been proposed and perform better than most of the traditional methods.
no code implementations • 6 Feb 2021 • Junyi An, Fengshan Liu, Jian Zhao, Furao Shen
Inspired by the elastic collision model in physics, we present a general structure which can be integrated into the existing CNNs to improve their performance.
1 code implementation • 21 Jan 2021 • Nan Jiang, Kuiran Wang, Xiaoke Peng, Xuehui Yu, Qiang Wang, Junliang Xing, Guorong Li, Jian Zhao, Guodong Guo, Zhenjun Han
The releasing of such a large-scale dataset could be a useful initial step in research of tracking UAVs.
no code implementations • 23 Nov 2020 • Junyi An, Fengshan Liu, Jian Zhao, Furao Shen
We believe that the IC neuron can be a basic unit to build network structures.
no code implementations • 4 Nov 2020 • Yuqi Gong, Xuehui Yu, Yao Ding, Xiaoke Peng, Jian Zhao, Zhenjun Han
We propose a novel concept, fusion factor, to control information that deep layers deliver to shallow layers, for adapting FPN to tiny object detection.
1 code implementation • 5 Oct 2020 • Zejiang Shen, Jian Zhao, Melissa Dell, YaoLiang Yu, Weining Li
Document images often have intricate layout structures, with numerous content regions (e. g. texts, figures, tables) densely arranged on each page.
1 code implementation • 16 Sep 2020 • Xuehui Yu, Zhenjun Han, Yuqi Gong, Nan Jiang, Jian Zhao, Qixiang Ye, Jie Chen, Yuan Feng, Bin Zhang, Xiaodi Wang, Ying Xin, Jingwei Liu, Mingyuan Mao, Sheng Xu, Baochang Zhang, Shumin Han, Cheng Gao, Wei Tang, Lizuo Jin, Mingbo Hong, Yuchao Yang, Shuiwang Li, Huan Luo, Qijun Zhao, Humphrey Shi
The 1st Tiny Object Detection (TOD) Challenge aims to encourage research in developing novel and accurate methods for tiny object detection in images which have wide views, with a current focus on tiny person detection.
no code implementations • 1 Aug 2020 • Takanori Fujiwara, Jian Zhao, Francine Chen, Kwan-Liu Ma
A common network analysis task is comparison of two networks to identify unique characteristics in one network with respect to the other.
3 code implementations • 25 May 2020 • Takanori Fujiwara, Jian Zhao, Francine Chen, Yao-Liang Yu, Kwan-Liu Ma
This analysis task could be greatly assisted by contrastive learning, which is an emerging analysis approach to discover salient patterns in one dataset relative to another.
no code implementations • 7 Apr 2020 • Zhecan Wang, Jian Zhao, Cheng Lu, Han Huang, Fan Yang, Lianji Li, Yandong Guo
To better demonstrate the advantage of our methods, we further propose a new benchmark dataset with the most rich distribution of head-gaze combination reflecting real-world scenarios.
1 code implementation • 28 Feb 2020 • Hongyan Hao, Yan Wang, Siqiao Xue, Yudi Xia, Jian Zhao, Furao Shen
So we propose an exploratory architecture referred to Temporal Convolutional Attention-based Network (TCAN) which combines temporal convolutional network and attention mechanism.
no code implementations • 19 Nov 2019 • Junyi An, Fengshan Liu, Jian Zhao, Furao Shen
We term this structure the "Inter-layer Collision" (IC) structure.
no code implementations • 18 Nov 2019 • Yahui Liu, Furao Shen, Jian Zhao
PIGAT introduces the attention mechanism to consider the importance of each interacted user/item to both the user and the item, which captures user interests, item attractions and their influence on the recommendation context.
no code implementations • 13 Sep 2019 • Cheonbok Park, Inyoup Na, Yongjang Jo, Sungbok Shin, Jaehyo Yoo, Bum Chul Kwon, Jian Zhao, Hyungjong Noh, Yeonsoo Lee, Jaegul Choo
Attention networks, a deep neural network architecture inspired by humans' attention mechanism, have seen significant success in image captioning, machine translation, and many other applications.
1 code implementation • 29 May 2019 • Yang Yao, Xu Zhang, Baile Xu, Furao Shen, Jian Zhao
Recent studies have demonstrated that the convolutional networks heavily rely on the quality and quantity of generated features.
no code implementations • 26 May 2019 • Hanyang Kong, Jian Zhao, Xiaoguang Tu, Junliang Xing, ShengMei Shen, Jiashi Feng
Recent deep learning based face recognition methods have achieved great performance, but it still remains challenging to recognize very low-resolution query face like 28x28 pixels when CCTV camera is far from the captured subject.
no code implementations • 20 May 2019 • Xu Zhang, Yang Yao, Baile Xu, Lekun Mao, Furao Shen, Jian Zhao, QIngwei Lin
In this paper, it is the first time to discuss the difficulty without support of old classes in class incremental learning, which is called as softmax suppression problem.
no code implementations • 14 May 2019 • Xingqian Li, Chenwei Lou, Jian Zhao, HuaPeng Wei, Hongwei Zhao
The consequent urban autism problem has become more and more serious.
4 code implementations • 2 Apr 2019 • Yi Yang, Baile Xu, Furao Shen, Jian Zhao
Many deep models are proposed to automatically learn high-order feature interactions.
2 code implementations • 22 Mar 2019 • Xiaoguang Tu, Jian Zhao, Zi-Hang Jiang, Yao Luo, Mei Xie, Yang Zhao, Linxiao He, Zheng Ma, Jiashi Feng
3D face reconstruction from a single 2D image is a challenging problem with broad applications.
Ranked #7 on Face Alignment on AFLW2000-3D
no code implementations • 13 Feb 2019 • Jian Zhao, Jianshu Li, Xiaoguang Tu, Fang Zhao, Yuan Xin, Junliang Xing, Hengzhu Liu, Shuicheng Yan, Jiashi Feng
In this paper, we study the challenging unconstrained set-based face recognition problem where each subject face is instantiated by a set of media (images and videos) instead of a single image.
1 code implementation • 17 Jan 2019 • Xiaoguang Tu, Jian Zhao, Mei Xie, Guodong Du, Hengsheng Zhang, Jianshu Li, Zheng Ma, Jiashi Feng
Face anti-spoofing (a. k. a presentation attack detection) has drawn growing attention due to the high-security demand in face authentication systems.
Ranked #2 on Face Anti-Spoofing on MSU-MFSD
1 code implementation • 2 Sep 2018 • Jian Zhao, Yu Cheng, Yi Cheng, Yang Yang, Haochong Lan, Fang Zhao, Lin Xiong, Yan Xu, Jianshu Li, Sugiri Pranata, ShengMei Shen, Junliang Xing, Hengzhu Liu, Shuicheng Yan, Jiashi Feng
Benchmarking our model on one of the most popular unconstrained face recognition datasets IJB-C additionally verifies the promising generalizability of AIM in recognizing faces in the wild.
Ranked #1 on Age-Invariant Face Recognition on MORPH Album2
no code implementations • ECCV 2018 • Fang Zhao, Jian Zhao, Shuicheng Yan, Jiashi Feng
This paper proposes a novel Dynamic Conditional Convolutional Network (DCCN) to handle conditional few-shot learning, i. e, only a few training samples are available for each condition.
no code implementations • 16 Jul 2018 • Li Zhou, Jian Zhao, Jianshu Li, Li Yuan, Jiashi Feng
Detecting the relations among objects, such as "cat on sofa" and "person ride horse", is a crucial task in image understanding, and beneficial to bridging the semantic gap between images and natural language.
no code implementations • CVPR 2018 • Fang Zhao, Jianshu Li, Jian Zhao, Jiashi Feng
In this paper, we propose a novel weakly supervised model, Multi-scale Anchored Transformer Network (MATN), to accurately localize free-form textual phrases with only image-level supervision.
no code implementations • CVPR 2018 • Jian Zhao, Yu Cheng, Yan Xu, Lin Xiong, Jianshu Li, Fang Zhao, Karlekar Jayashree, Sugiri Pranata, ShengMei Shen, Junliang Xing, Shuicheng Yan, Jiashi Feng
To this end, we propose a Pose Invariant Model (PIM) for face recognition in the wild, with three distinct novelties.
2 code implementations • 10 Apr 2018 • Jian Zhao, Jianshu Li, Yu Cheng, Li Zhou, Terence Sim, Shuicheng Yan, Jiashi Feng
Despite the noticeable progress in perceptual tasks like detection, instance segmentation and human parsing, computers still perform unsatisfactorily on visually understanding humans in crowded scenes, such as group behavior analysis, person re-identification and autonomous driving, etc.
Ranked #1 on Multi-Human Parsing on PASCAL-Part
no code implementations • NeurIPS 2017 • Jian Zhao, Lin Xiong, Panasonic Karlekar Jayashree, Jianshu Li, Fang Zhao, Zhecan Wang, Panasonic Sugiri Pranata, Panasonic Shengmei Shen, Shuicheng Yan, Jiashi Feng
In particular, we employ an off-the-shelf 3D face model as a simulator to generate profile face images with varying poses.
Ranked #1 on Face Verification on IJB-A
no code implementations • 16 Nov 2017 • Jianshu Li, Shengtao Xiao, Fang Zhao, Jian Zhao, Jianan Li, Jiashi Feng, Shuicheng Yan, Terence Sim
Specifically, iFAN achieves an overall F-score of 91. 15% on the Helen dataset for face parsing, a normalized mean error of 5. 81% on the MTFL dataset for facial landmark localization and an accuracy of 45. 73% on the BNU dataset for emotion recognition with a single model.
2 code implementations • 19 May 2017 • Jianshu Li, Jian Zhao, Yunchao Wei, Congyan Lang, Yidong Li, Terence Sim, Shuicheng Yan, Jiashi Feng
To address the multi-human parsing problem, we introduce a new multi-human parsing (MHP) dataset and a novel multi-human parsing model named MH-Parser.
Ranked #3 on Multi-Human Parsing on MHP v1.0
1 code implementation • 3 Apr 2017 • Lin Xiong, Jayashree Karlekar, Jian Zhao, Yi Cheng, Yan Xu, Jiashi Feng, Sugiri Pranata, ShengMei Shen
In this paper, we propose a unified learning framework named Transferred Deep Feature Fusion (TDFF) targeting at the new IARPA Janus Benchmark A (IJB-A) face recognition dataset released by NIST face challenge.
no code implementations • 27 Dec 2016 • Fang Zhao, Jiashi Feng, Jian Zhao, Wenhan Yang, Shuicheng Yan
The first one, named multi-scale spatial LSTM encoder, reads facial patches of various scales sequentially to output a latent representation, and occlusion-robustness is achieved owing to the fact that the influence of occlusion is only upon some of the patches.
no code implementations • 30 Oct 2016 • De Zhao, Teng Wang, Jian Zhao, Dianjie Li, Zhili Lin, Zeyan Chen, Qi Ouyang, Hong Qian, Yu V. Fu, Fangting Li
A living cell is an open, nonequilibrium biochemical system where ATP hydrolysis serves as the energy source for a wide range of intracellular processes, possibly including the assurance for decision-making.