Search Results for author: Feng Zhao

Found 66 papers, 22 papers with code

Disentangle Your Dense Object Detector

2 code implementations 7 Jul 2021 Zehui Chen, Chenhongyi Yang, Qiaofei Li, Feng Zhao, Zheng-Jun Zha, Feng Wu

Extensive experiments on the MS COCO benchmark show that our approach can lead to 2.0 mAP, 2.4 mAP, and 2.2 mAP absolute improvements on RetinaNet, FCOS, and ATSS baselines with negligible extra overhead.

Disentanglement Object +2

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

1 code implementation 21 Nov 2023 Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Conghui He, Jiaqi Wang, Feng Zhao, Dahua Lin

In the realm of large multi-modal models (LMMs), efficient modality alignment is crucial yet often constrained by the scarcity of high-quality image-text data.

Descriptive visual instruction following +2

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

1 code implementation 19 Mar 2024 Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao

Open-sourced Large Language Models (LLMs) have achieved great success in various NLP tasks; however, they are still far inferior to API-based models when acting as agents.

Hallucination

BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection

1 code implementation 17 Nov 2022 Zehui Chen, Zhenyu Li, Shiquan Zhang, Liangji Fang, Qinhong Jiang, Feng Zhao

Instead of directly training a depth prediction network, we unify the image and LiDAR features in the Bird-Eye-View (BEV) space and adaptively transfer knowledge across non-homogenous representations in a teacher-student paradigm.

3D Object Detection Depth Estimation +4

Are We on the Right Way for Evaluating Large Vision-Language Models?

1 code implementation 29 Mar 2024 Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao

We evaluate 16 leading LVLMs on MMStar to assess their multi-modal capabilities, and on 7 benchmarks with the proposed metrics to investigate their data leakage and actual multi-modal gain.

World Knowledge

MMNet: Muscle motion-guided network for micro-expression recognition

1 code implementation 14 Jan 2022 Hanting Li, Mingzhe Sui, Zhaoqing Zhu, Feng Zhao

By adding the position embeddings of the face generated by PC module at the end of the two branches, the PC module can help to add position information to facial muscle motion pattern features for the MER.

Micro Expression Recognition Micro-Expression Recognition +1

Towards Fine-grained Large Object Segmentation 1st Place Solution to 3D AI Challenge 2020 -- Instance Segmentation Track

1 code implementation 10 Sep 2020 Zehui Chen, Qiaofei Li, Feng Zhao

This technical report introduces our solutions of Team 'FineGrainedSeg' for Instance Segmentation track in 3D AI Challenge 2020.

Instance Segmentation Semantic Segmentation

Bijective Mapping Network for Shadow Removal

2 code implementations CVPR 2022 Yurui Zhu, Jie Huang, Xueyang Fu, Feng Zhao, Qibin Sun, Zheng-Jun Zha

Shadow removal, which aims to restore the background in the shadow regions, is challenging due to the highly ill-posed nature.

Shadow Removal

Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel Benchmark

1 code implementation CVPR 2023 Deyi Ji, Feng Zhao, Hongtao Lu, Mingyuan Tao, Jieping Ye

With the increasing interest and rapid development of methods for Ultra-High Resolution (UHR) segmentation, a large-scale benchmark covering a wide range of scenes with full fine-grained dense annotations is urgently needed to facilitate the field.

Land Cover Classification Semantic Segmentation

Domain-Unified Prompt Representations for Source-Free Domain Generalization

1 code implementation 29 Sep 2022 Hongjing Niu, Hanting Li, Feng Zhao, Bin Li

The proposed scheme generates diverse prompts from a domain bank that contains many more diverse domains than existing DG datasets.

Source-free Domain Generalization

Empowering Low-Light Image Enhancer through Customized Learnable Priors

1 code implementation ICCV 2023 Naishan Zheng, Man Zhou, Yanmeng Dong, Xiangyu Rui, Jie Huang, Chongyi Li, Feng Zhao

In this work, we propose a paradigm for low-light image enhancement that explores the potential of customized learnable priors to improve the transparency of the deep unfolding paradigm.

Low-Light Image Enhancement

Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild

1 code implementation 19 Aug 2022 Hanting Li, Hongjing Niu, Zhaoqing Zhu, Feng Zhao

One of the main reasons is that video sequences often contain frames with different expression intensities, especially for the facial expressions in the real-world scenarios, while the images in SFER frequently present uniform and high expression intensities.

Dynamic Facial Expression Recognition Facial Expression Recognition +1

PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety

1 code implementation 22 Jan 2024 Zaibin Zhang, Yongting Zhang, Lijun Li, Hongzhi Gao, Lijun Wang, Huchuan Lu, Feng Zhao, Yu Qiao, Jing Shao

In this paper, we explore these concerns through the innovative lens of agent psychology, revealing that the dark psychological states of agents constitute a significant threat to safety.

Ingredient-Oriented Multi-Degradation Learning for Image Restoration

1 code implementation CVPR 2023 Jinghao Zhang, Jie Huang, Mingde Yao, Zizheng Yang, Hu Yu, Man Zhou, Feng Zhao

Learning to leverage the relationship among diverse image restoration tasks is quite beneficial for unraveling the intrinsic ingredients behind the degradation.

Image Restoration

Unleashing the Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification

1 code implementation 1 Dec 2021 Zizheng Yang, Xin Jin, Kecheng Zheng, Feng Zhao

During the pre-training, we attempt to address two critical issues for learning fine-grained ReID features: (1) the augmentations in CL pipeline may distort the discriminative clues in person images.

Contrastive Learning Person Re-Identification +2

High-quality Image Dehazing with Diffusion Model

1 code implementation 23 Aug 2023 Hu Yu, Jie Huang, Kaiwen Zheng, Feng Zhao

The latter stage exploits the strong generation ability of DDPM to compensate for the haze-induced huge information loss, by working in conjunction with the physical modelling.

Denoising Image Dehazing

Deep Fourier Up-Sampling

1 code implementation 11 Oct 2022 Man Zhou, Hu Yu, Jie Huang, Feng Zhao, Jinwei Gu, Chen Change Loy, Deyu Meng, Chongyi Li

Existing convolutional neural networks widely adopt spatial down-/up-sampling for multi-scale modeling.

Image Dehazing Image Segmentation +4

Structural Learning for Template-free Protein Folding

no code implementations 6 Nov 2013 Feng Zhao

The thesis aims to solve the template-free protein folding problem by tackling two important components: efficient sampling in vast conformation space, and design of knowledge-based potentials with high accuracy.

Protein Folding

MVT: Mask Vision Transformer for Facial Expression Recognition in the wild

no code implementations 8 Jun 2021 Hanting Li, Mingzhe Sui, Feng Zhao, ZhengJun Zha, Feng Wu

Facial Expression Recognition (FER) in the wild is an extremely challenging task in computer vision due to variant backgrounds, low-quality facial images, and the subjectiveness of annotators.

Facial Expression Recognition Facial Expression Recognition (FER)

Performance-Guaranteed ODE Solvers with Complexity-Informed Neural Networks

no code implementations NeurIPS Workshop DLDE 2021 Feng Zhao, Xiang Chen, Jun Wang, Zuoqiang Shi, Shao-Lun Huang

Traditionally, we provide technical parameters for ODE solvers, such as the order, the stepsize and the local error threshold.

Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth

no code implementations 3 Feb 2022 Guangkai Xu, Wei Yin, Hao Chen, Chunhua Shen, Kai Cheng, Feng Wu, Feng Zhao

However, in some video-based scenarios such as video depth estimation and 3D scene reconstruction from a video, the unknown scale and shift residing in per-frame prediction may cause the depth inconsistency.

3D Scene Reconstruction Depth Completion +1

AFNet-M: Adaptive Fusion Network with Masks for 2D+3D Facial Expression Recognition

no code implementations 24 May 2022 Mingzhe Sui, Hanting Li, Zhaoqing Zhu, Feng Zhao

2D+3D facial expression recognition (FER) can effectively cope with illumination changes and pose variations by simultaneously merging 2D texture and more robust 3D depth information.

3D Facial Expression Recognition Facial Expression Recognition

Unleashing Potential of Unsupervised Pre-Training With Intra-Identity Regularization for Person Re-Identification

no code implementations CVPR 2022 Zizheng Yang, Xin Jin, Kecheng Zheng, Feng Zhao

During the pre-training, we attempt to address two critical issues for learning fine-grained ReID features: (1) the augmentations in CL pipeline may distort the discriminative clues in person images.

Contrastive Learning Person Re-Identification +2

Mutual Information-Driven Pan-Sharpening

no code implementations CVPR 2022 Man Zhou, Keyu Yan, Jie Huang, Zihe Yang, Xueyang Fu, Feng Zhao

Despite the remarkable progress, existing state-of-the-art Pan-sharpening methods don't explicitly enforce the complementary information learning between two modalities of PAN and MS images.

Exposure Normalization and Compensation for Multiple-Exposure Correction

no code implementations CVPR 2022 Jie Huang, Yajing Liu, Xueyang Fu, Man Zhou, Yang Wang, Feng Zhao, Zhiwei Xiong

However, the procedures of correcting underexposure and overexposure to normal exposures are much different from each other, leading to large discrepancies for the network in correcting multiple exposures, thus resulting in poor performance.

Image Enhancement

Underdetermined 2D-DOD and 2D-DOA Estimation for Bistatic Coprime EMVS-MIMO Radar: From the Difference Coarray Perspective

no code implementations 6 Jun 2022 Qianpeng Xie, Yihang Du, He Wang, Xiaoyi Pan, Feng Zhao

Firstly, a 5-D tensor model was constructed by using the multi-dimensional space-time characteristics of the received data.

8D Parameters Estimation for Bistatic EMVS-MIMO Radar via the nested PARAFAC

no code implementations 4 Jun 2022 Qianpeng Xie, He Wang, Yihang Du, Xiaoyi Pan, Feng Zhao

Firstly, the outer part PARAFAC algorithm was carried out to estimate the receive spatial response matrix and its first way factor matrix.

NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition

no code implementations 10 Jun 2022 Hanting Li, Mingzhe Sui, Zhaoqing Zhu, Feng Zhao

Dynamic facial expression recognition (DFER) in the wild is an extremely challenging task, due to a large number of noisy frames in the video sequences.

Dynamic Facial Expression Recognition Facial Expression Recognition +1

Source-Free Domain Adaptation for Real-world Image Dehazing

no code implementations 14 Jul 2022 Hu Yu, Jie Huang, Yajing Liu, Qi Zhu, Man Zhou, Feng Zhao

Although certain Domain Adaptation (DA) dehazing methods have been presented, they inevitably require access to the source dataset to reduce the gap between the source synthetic and target real domains.

Image Dehazing Source-Free Domain Adaptation +1

CNSNet: A Cleanness-Navigated-Shadow Network for Shadow Removal

no code implementations 6 Sep 2022 Qianhao Yu, Naishan Zheng, Jie Huang, Feng Zhao

The key to shadow removal is recovering the contents of the shadow regions with the guidance of the non-shadow regions.

Long-range modeling Shadow Removal

KSG: Knowledge and Skill Graph

no code implementations 13 Sep 2022 Feng Zhao, Ziqi Zhang, Donglin Wang

This is the first study that we are aware of that looks into dynamic KSG for skill retrieval and learning.

Attribute Knowledge Graphs +2

A similarity measurement for time series and its application to the stock market

no code implementations Expert Systems with Applications 2021 Feng Zhao, Yating Gao, Xinning Li, Zhiyong An, Shiyu Ge, Caiming Zhang

In this paper, to accurately describe the similarity between a pair of time series, a novel similarity measurement is proposed, named the dynamic multi-perspective personalized similarity measurement (DMPSM).

Dynamic Time Warping Time Series +1

Panchromatic and Multispectral Image Fusion via Alternating Reverse Filtering Network

no code implementations 15 Oct 2022 Keyu Yan, Man Zhou, Jie Huang, Feng Zhao, Chengjun Xie, Chongyi Li, Danfeng Hong

Panchromatic (PAN) and multi-spectral (MS) image fusion, named Pan-sharpening, refers to super-resolving low-resolution (LR) MS images in the spatial domain to generate the expected high-resolution (HR) MS images, conditioned on the corresponding high-resolution PAN images.

Towards Domain Generalization for Multi-view 3D Object Detection in Bird-Eye-View

no code implementations CVPR 2023 Shuo Wang, Xinhai Zhao, Hai-Ming Xu, Zehui Chen, Dameng Yu, Jiahao Chang, Zhen Yang, Feng Zhao

Based on the covariate shift assumption, we find that the gap is mainly attributable to the feature distribution of BEV, which is determined by the quality of both the depth estimation and the 2D image's feature representation.

3D Object Detection Depth Estimation +3

Novel Quality Measure and Efficient Resolution of Convex Hull Pricing for Unit Commitment

no code implementations 17 Apr 2023 Mikhail A. Bragin, Farhan Hyder, Bing Yan, Peter B. Luh, Jinye Zhao, Feng Zhao, Dane A. Schiro, Tongxin Zheng

Several CH pricing methods have been presented, and a feasible cost has been used as a quality measure for the CH price.

Learning Sample Relationship for Exposure Correction

no code implementations CVPR 2023 Jie Huang, Feng Zhao, Man Zhou, Jie Xiao, Naishan Zheng, Kaiwen Zheng, Zhiwei Xiong

The exposure correction task aims to correct underexposed and, conversely, overexposed images to normal exposure in a single network.

Task 2

Cooperative IoT Data Sharing with Heterogeneity of Participants Based on Electricity Retail

no code implementations 31 May 2023 Bohong Wang, Qinglai Guo, Tian Xia, Qiang Li, Di Liu, Feng Zhao

With the development of Internet of Things (IoT) and big data technology, the data value is increasingly explored in multiple practical scenarios, including electricity transactions.

Data Valuation Fairness

Guided Patch-Grouping Wavelet Transformer with Spatial Congruence for Ultra-High Resolution Segmentation

no code implementations 3 Jul 2023 Deyi Ji, Feng Zhao, Hongtao Lu

For the sake of high inference speed and low computation complexity, $\mathcal{T}$ partitions the original UHR image into patches and groups them dynamically, then learns the low-level local details with the lightweight multi-head Wavelet Transformer (WFormer) network.

Coverage Enhancement Strategy in WMSNs Based on a Novel Swarm Intelligence Algorithm: Army Ant Search Optimizer

no code implementations 3 Jul 2023 Yindi Yao, Qin Wen, Yanpeng Cui, Feng Zhao, Bozhan Zhao, Yaoping Zeng

As one of the most crucial scenarios of the Internet of Things (IoT), wireless multimedia sensor networks (WMSNs) pay more attention to the information-intensive data (e.g., audio, video, image) for remote environments.

Decomposition Ascribed Synergistic Learning for Unified Image Restoration

no code implementations 1 Aug 2023 Jinghao Zhang, Feng Zhao

Learning to restore multiple image degradations within a single model is quite beneficial for real-world applications.

Deblurring Image Deblurring +5

Debias the Training of Diffusion Models

no code implementations 12 Oct 2023 Hu Yu, Li Shen, Jie Huang, Man Zhou, Hongsheng Li, Feng Zhao

Diffusion models have demonstrated compelling generation quality by optimizing the variational lower bound through a simple denoising score matching loss.

Denoising

RSG: Fast Learning Adaptive Skills for Quadruped Robots by Skill Graph

no code implementations 10 Nov 2023 Hongyin Zhang, Diyuan Shi, Zifeng Zhuang, Han Zhao, Zhenyu Wei, Feng Zhao, Sibo Gai, Shangke Lyu, Donglin Wang

Developing robotic intelligent systems that can adapt quickly to unseen wild situations is one of the critical challenges in pursuing autonomous robotics.

Implicit Relations

ChangeNet: Multi-Temporal Asymmetric Change Detection Dataset

no code implementations 29 Dec 2023 Deyi Ji, Siqi Gao, Mingyuan Tao, Hongtao Lu, Feng Zhao

The ChangeNet dataset is suitable for both binary change detection (BCD) and semantic change detection (SCD) tasks.

Change Detection

Coverage Control Algorithm for DSNs Based on Improved Gravitational Search

no code implementations IEEE Sensors Journal 2022 Yindi Yao, Huanmin Liao, Xiong Li, Feng Zhao, Xuan Yang, Shanshan Hu

In directional sensor networks (DSNs), coverage control is an important way to ensure efficient communication and reliable data transmission.

Position

Stream Query Denoising for Vectorized HD Map Construction

no code implementations 17 Jan 2024 Shuo Wang, Fan Jia, Yingfei Liu, Yucheng Zhao, Zehui Chen, Tiancai Wang, Chi Zhang, Xiangyu Zhang, Feng Zhao

This paper introduces the Stream Query Denoising (SQD) strategy as a novel approach for temporal modeling in high-definition map (HD-map) construction.

Autonomous Driving Denoising

Prompt Learning on Temporal Interaction Graphs

no code implementations 9 Feb 2024 Xi Chen, Siwei Zhang, Yun Xiong, Xixi Wu, Jiawei Zhang, Xiangguo Sun, Yao Zhang, Feng Zhao, Yulin Kang

In detail, we propose a temporal prompt generator to offer temporally-aware prompts for different tasks.

Representation Learning

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track

no code implementations 27 Feb 2024 Zehui Chen, Qiuchen Wang, Zhenyu Li, Jiaming Liu, Shanghang Zhang, Feng Zhao

In this report, we present our solution to the multi-task robustness track of the 1st Visual Continual Learning (VCL) Challenge at ICCV 2023 Workshop.

3D Object Detection Continual Learning +5

View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV

no code implementations 16 Mar 2024 Deyi Ji, Siqi Gao, Lanyun Zhu, Yiru Zhao, Peng Xu, Hongtao Lu, Feng Zhao

In this paper, we address the challenge of multi-object tracking (MOT) in moving Unmanned Aerial Vehicle (UAV) scenarios, where irregular flight trajectories, such as hovering, turning left/right, and moving up/down, lead to significantly greater complexity compared to fixed-camera MOT.

Homography Estimation Multi-Object Tracking +1

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection

no code implementations 22 Mar 2024 Hongzhi Gao, Zheng Chen, Zehui Chen, Lin Chen, Jiaming Liu, Shanghang Zhang, Feng Zhao

Training high-accuracy 3D detectors necessitates massive labeled 3D annotations with 7 degrees of freedom, which is laborious and time-consuming.

3D Object Detection object-detection +2

GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling

no code implementations 28 Mar 2024 BoWen Zhang, Yiji Cheng, Jiaolong Yang, Chunyu Wang, Feng Zhao, Yansong Tang, Dong Chen, Baining Guo

To address the problem, we introduce GaussianCube, a structured GS representation that is both powerful and efficient for generative modeling.

Uncovering the Text Embedding in Text-to-Image Diffusion Models

no code implementations 1 Apr 2024 Hu Yu, Hao Luo, Fan Wang, Feng Zhao

The correspondence between input text and the generated image exhibits opacity, wherein minor textual modifications can induce substantial deviations in the generated image.
