Search Results for author: Feng Zhao

Found 83 papers, 32 papers with code

Beyond Tree Models: A Hybrid Model of KAN and gMLP for Large-Scale Financial Tabular Data

no code implementations3 Dec 2024 Mingming Zhang, Jiahao Hu, Pengfei Shi, Ningtao Wang, Ruizhe Gao, Guandong Sun, Feng Zhao, Yulin kang, Xing Fu, Weiqiang Wang, Junbo Zhao

However, financial datasets in the industry often encounter some challenges, such as data heterogeneity, the predominance of numerical features and the large scale of the data, which can range from tens of millions to hundreds of millions of records.

Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes

no code implementations2 Dec 2024 Lihan Jiang, Kerui Ren, Mulin Yu, Linning Xu, Junting Dong, Tao Lu, Feng Zhao, Dahua Lin, Bo Dai

Seamless integration of both aerial and street view images remains a significant challenge in neural scene reconstruction and rendering.

Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding

no code implementations13 Nov 2024 Deyi Ji, Lanyun Zhu, Siqi Gao, Peng Xu, Hongtao Lu, Jieping Ye, Feng Zhao

The ubiquity and value of tables as semi-structured data across various domains necessitate advanced methods for understanding their complexity and vast amounts of information.

Natural Language Understanding

AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status

no code implementations26 Sep 2024 Jinghao Zhang, Wen Qian, Hao Luo, Fan Wang, Feng Zhao

Diffusion models have made compelling progress on facilitating high-throughput daily production.

Denoising Image Generation

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

1 code implementation29 Jul 2024 Zehui Chen, Kuikun Liu, Qiuchen Wang, Jiangning Liu, Wenwei Zhang, Kai Chen, Feng Zhao

Inspired by the cognitive process when humans solve these problems, we introduce MindSearch to mimic the human minds in web information seeking and integration, which can be instantiated by a simple yet effective LLM-based multi-agent framework.

2D Semantic Segmentation task 1 (8 classes) graph construction +1

Prototype Clustered Diffusion Models for Versatile Inverse Problems

no code implementations13 Jul 2024 Jinghao Zhang, Zizheng Yang, Qi Zhu, Feng Zhao

To address this obstacle, we show that the measurement-based likelihood can be renovated with restoration-based likelihood via the opposite probabilistic graphic direction, licencing the patronage of various off-the-shelf restoration models and extending the strictly deterministic deterioration process to adaptable clustered processes with the supposed prototype, in what we call restorer guidance.

Deblurring Image Dehazing

PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation

no code implementations28 Jun 2024 Deyi Ji, Wenwei Jin, Hongtao Lu, Feng Zhao

The ascension of Unmanned Aerial Vehicles (UAVs) in various fields necessitates effective UAV image segmentation, which faces challenges due to the dynamic perspectives of UAV-captured images.

Decoder Image Segmentation +3

DiffLoss: unleashing diffusion model as constraint for training image restoration network

1 code implementation27 Jun 2024 Jiangtong Tan, Feng Zhao

To achieve this, we utilize the mode coverage capability of the diffusion model to approximate the distribution of natural images and explore its ability to capture image semantic attributes.

Image Generation Image Restoration

Discrete Latent Perspective Learning for Segmentation and Detection

no code implementations15 Jun 2024 Deyi Ji, Feng Zhao, Lanyun Zhu, Wenwei Jin, Hongtao Lu, Jieping Ye

In this paper, we address the challenge of Perspective-Invariant Learning in machine learning and computer vision, which involves enabling a network to understand images from varying perspectives to achieve consistent semantic interpretation.

Data Augmentation

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

no code implementations6 Jun 2024 Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Bin Lin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang

To this end, we meticulously designed a differential video captioning strategy, which is stable, scalable, and efficient for generating captions for videos with arbitrary resolution, aspect ratios, and length.

Video Captioning Video Generation +2

From Macro to Micro: Boosting micro-expression recognition via pre-training on macro-expression videos

no code implementations26 May 2024 Hanting Li, Hongjing Niu, Feng Zhao

Micro-expression recognition (MER) has drawn increasing attention in recent years due to its potential applications in intelligent medical and lie detection.

Micro Expression Recognition Micro-Expression Recognition +1

Uncovering the Text Embedding in Text-to-Image Diffusion Models

no code implementations1 Apr 2024 Hu Yu, Hao Luo, Fan Wang, Feng Zhao

The correspondence between input text and the generated image exhibits opacity, wherein minor textual modifications can induce substantial deviations in the generated image.

Are We on the Right Way for Evaluating Large Vision-Language Models?

1 code implementation29 Mar 2024 Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao

We evaluate 16 leading LVLMs on MMStar to assess their multi-modal capabilities, and on 7 benchmarks with the proposed metrics to investigate their data leakage and actual multi-modal gain.

World Knowledge

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling

no code implementations28 Mar 2024 BoWen Zhang, Yiji Cheng, Jiaolong Yang, Chunyu Wang, Feng Zhao, Yansong Tang, Dong Chen, Baining Guo

We introduce a radiance representation that is both structured and fully explicit and thus greatly facilitates 3D generative modeling.

Decoder Text to 3D

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection

no code implementations22 Mar 2024 Hongzhi Gao, Zheng Chen, Zehui Chen, Lin Chen, Jiaming Liu, Shanghang Zhang, Feng Zhao

Training high-accuracy 3D detectors necessitates massive labeled 3D annotations with 7 degree-of-freedom, which is laborious and time-consuming.

3D Object Detection object-detection +2

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

1 code implementation19 Mar 2024 Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao

Open-sourced Large Language Models (LLMs) have achieved great success in various NLP tasks, however, they are still far inferior to API-based models when acting as agents.

Hallucination

View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV

no code implementations16 Mar 2024 Deyi Ji, Siqi Gao, Lanyun Zhu, Qi Zhu, Yiru Zhao, Peng Xu, Hongtao Lu, Feng Zhao, Jieping Ye

In this paper, we address the challenge of multi-object tracking (MOT) in moving Unmanned Aerial Vehicle (UAV) scenarios, where irregular flight trajectories, such as hovering, turning left/right, and moving up/down, lead to significantly greater complexity compared to fixed-camera MOT.

Homography Estimation Multi-Object Tracking +1

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track

no code implementations27 Feb 2024 Zehui Chen, Qiuchen Wang, Zhenyu Li, Jiaming Liu, Shanghang Zhang, Feng Zhao

In this report, we present our solution to the multi-task robustness track of the 1st Visual Continual Learning (VCL) Challenge at ICCV 2023 Workshop.

3D Object Detection Continual Learning +5

Prompt Learning on Temporal Interaction Graphs

1 code implementation9 Feb 2024 Xi Chen, Siwei Zhang, Yun Xiong, Xixi Wu, Jiawei Zhang, Xiangguo Sun, Yao Zhang, Feng Zhao, Yulin kang

In detail, we propose a temporal prompt generator to offer temporally-aware prompts for different tasks.

Representation Learning

PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety

1 code implementation22 Jan 2024 Zaibin Zhang, Yongting Zhang, Lijun Li, Hongzhi Gao, Lijun Wang, Huchuan Lu, Feng Zhao, Yu Qiao, Jing Shao

In this paper, we explore these concerns through the innovative lens of agent psychology, revealing that the dark psychological states of agents constitute a significant threat to safety.

Stream Query Denoising for Vectorized HD Map Construction

no code implementations17 Jan 2024 Shuo Wang, Fan Jia, Yingfei Liu, Yucheng Zhao, Zehui Chen, Tiancai Wang, Chi Zhang, Xiangyu Zhang, Feng Zhao

This paper introduces the Stream Query Denoising (SQD) strategy as a novel approach for temporal modeling in high-definition map (HD-map) construction.

Autonomous Driving Denoising

Probing Synergistic High-Order Interaction in Infrared and Visible Image Fusion

1 code implementation CVPR 2024 Naishan Zheng, Man Zhou, Jie Huang, JunMing Hou, Haoying Li, Yuan Xu, Feng Zhao

To bridge this gap we introduce a Synergistic High-order Interaction Paradigm (SHIP) designed to systematically investigate spatial fine-grained and global statistics collaborations between infrared and visible images across two fundamental dimensions: 1) Spatial dimension: we construct spatial fine-grained interactions through element-wise multiplication mathematically equivalent to global interactions and then foster high-order formats by iteratively aggregating and evolving complementary information enhancing both efficiency and flexibility.

Infrared And Visible Image Fusion

Empowering Resampling Operation for Ultra-High-Definition Image Enhancement with Model-Aware Guidance

1 code implementation CVPR 2024 Wei Yu, Jie Huang, Bing Li, Kaiwen Zheng, Qi Zhu, Man Zhou, Feng Zhao

At the second stage the image-wise compensatory information is derived with the compensatory kernels and embedded into the rescaled input images.

Image Enhancement

ChangeNet: Multi-Temporal Asymmetric Change Detection Dataset

no code implementations29 Dec 2023 Deyi Ji, Siqi Gao, Mingyuan Tao, Hongtao Lu, Feng Zhao

The ChangeNet dataset is suitable for both binary change detection (BCD) and semantic change detection (SCD) tasks.

Change Detection

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

1 code implementation21 Nov 2023 Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Conghui He, Jiaqi Wang, Feng Zhao, Dahua Lin

In the realm of large multi-modal models (LMMs), efficient modality alignment is crucial yet often constrained by the scarcity of high-quality image-text data.

Descriptive visual instruction following +2

RSG: Fast Learning Adaptive Skills for Quadruped Robots by Skill Graph

no code implementations10 Nov 2023 Hongyin Zhang, Diyuan Shi, Zifeng Zhuang, Han Zhao, Zhenyu Wei, Feng Zhao, Sibo Gai, Shangke Lyu, Donglin Wang

Developing robotic intelligent systems that can adapt quickly to unseen wild situations is one of the critical challenges in pursuing autonomous robotics.

Implicit Relations

Unmasking Bias in Diffusion Model Training

1 code implementation12 Oct 2023 Hu Yu, Li Shen, Jie Huang, Hongsheng Li, Feng Zhao

In this paper, we identify that these obstacles can be largely attributed to bias and suboptimality inherent in the default training paradigm of diffusion models.

Denoising Image Generation

Empowering Low-Light Image Enhancer through Customized Learnable Priors

1 code implementation ICCV 2023 Naishan Zheng, Man Zhou, Yanmeng Dong, Xiangyu Rui, Jie Huang, Chongyi Li, Feng Zhao

In this work, we propose a paradigm for low-light image enhancement that explores the potential of customized learnable priors to improve the transparency of the deep unfolding paradigm.

Low-Light Image Enhancement

High-quality Image Dehazing with Diffusion Model

1 code implementation23 Aug 2023 Hu Yu, Jie Huang, Kaiwen Zheng, Feng Zhao

The latter stage exploits the strong generation ability of DDPM to compensate for the haze-induced huge information loss, by working in conjunction with the physical modelling.

Denoising Image Dehazing

Decomposition Ascribed Synergistic Learning for Unified Image Restoration

no code implementations1 Aug 2023 Jinghao Zhang, Feng Zhao

Learning to restore multiple image degradations within a single model is quite beneficial for real-world applications.

Deblurring Image Deblurring +5

Coverage Enhancement Strategy in WMSNs Based on a Novel Swarm Intelligence Algorithm: Army Ant Search Optimizer

no code implementations3 Jul 2023 Yindi Yao, Qin Wen, Yanpeng Cui, Feng Zhao, Bozhan Zhao, Yaoping Zeng

As one of the most crucial scenarios of the Internet of Things (IoT), wireless multimedia sensor networks (WMSNs) pay more attention to the information-intensive data (e. g., audio, video, image) for remote environments.

Guided Patch-Grouping Wavelet Transformer with Spatial Congruence for Ultra-High Resolution Segmentation

no code implementations3 Jul 2023 Deyi Ji, Feng Zhao, Hongtao Lu

For the sake of high inference speed and low computation complexity, $\mathcal{T}$ partitions the original UHR image into patches and groups them dynamically, then learns the low-level local details with the lightweight multi-head Wavelet Transformer (WFormer) network.

Cooperative IoT Data Sharing with Heterogeneity of Participants Based on Electricity Retail

no code implementations31 May 2023 Bohong Wang, Qinglai Guo, Tian Xia, Qiang Li, Di Liu, Feng Zhao

With the development of Internet of Things (IoT) and big data technology, the data value is increasingly explored in multiple practical scenarios, including electricity transactions.

Data Valuation Fairness

Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel Benchmark

1 code implementation CVPR 2023 Deyi Ji, Feng Zhao, Hongtao Lu, Mingyuan Tao, Jieping Ye

With the increasing interest and rapid development of methods for Ultra-High Resolution (UHR) segmentation, a large-scale benchmark covering a wide range of scenes with full fine-grained dense annotations is urgently needed to facilitate the field.

Land Cover Classification Semantic Segmentation

Novel Quality Measure and Efficient Resolution of Convex Hull Pricing for Unit Commitment

no code implementations17 Apr 2023 Mikhail A. Bragin, Farhan Hyder, Bing Yan, Peter B. Luh, Jinye Zhao, Feng Zhao, Dane A. Schiro, Tongxin Zheng

Several CH pricing methods have been presented, and a feasible cost has been used as a quality measure for the CH price.

Towards Domain Generalization for Multi-view 3D Object Detection in Bird-Eye-View

no code implementations CVPR 2023 Shuo Wang, Xinhai Zhao, Hai-Ming Xu, Zehui Chen, Dameng Yu, Jiahao Chang, Zhen Yang, Feng Zhao

Based on the covariate shift assumption, we find that the gap mainly attributes to the feature distribution of BEV, which is determined by the quality of both depth estimation and 2D image's feature representation.

3D Object Detection Depth Estimation +3

Learning Sample Relationship for Exposure Correction

no code implementations CVPR 2023 Jie Huang, Feng Zhao, Man Zhou, Jie Xiao, Naishan Zheng, Kaiwen Zheng, Zhiwei Xiong

Exposure correction task aims to correct the underexposure and its adverse overexposure images to the normal exposure in a single network.

Exposure Correction Task 2

BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection

1 code implementation17 Nov 2022 Zehui Chen, Zhenyu Li, Shiquan Zhang, Liangji Fang, Qinhong Jiang, Feng Zhao

Instead of directly training a depth prediction network, we unify the image and LiDAR features in the Bird-Eye-View (BEV) space and adaptively transfer knowledge across non-homogenous representations in a teacher-student paradigm.

3D Object Detection Depth Estimation +4

Panchromatic and Multispectral Image Fusion via Alternating Reverse Filtering Network

no code implementations15 Oct 2022 Keyu Yan, Man Zhou, Jie Huang, Feng Zhao, Chengjun Xie, Chongyi Li, Danfeng Hong

Panchromatic (PAN) and multi-spectral (MS) image fusion, named Pan-sharpening, refers to super-resolve the low-resolution (LR) multi-spectral (MS) images in the spatial domain to generate the expected high-resolution (HR) MS images, conditioning on the corresponding high-resolution PAN images.

Deep Fourier Up-Sampling

1 code implementation11 Oct 2022 Man Zhou, Hu Yu, Jie Huang, Feng Zhao, Jinwei Gu, Chen Change Loy, Deyu Meng, Chongyi Li

Existing convolutional neural networks widely adopt spatial down-/up-sampling for multi-scale modeling.

Image Dehazing Image Segmentation +4

Domain-Unified Prompt Representations for Source-Free Domain Generalization

1 code implementation29 Sep 2022 Hongjing Niu, Hanting Li, Feng Zhao, Bin Li

The proposed scheme generates diverse prompts from a domain bank that contains many more diverse domains than existing DG datasets.

Diversity Source-free Domain Generalization

KSG: Knowledge and Skill Graph

no code implementations13 Sep 2022 Feng Zhao, Ziqi Zhang, Donglin Wang

This is the first study that we are aware of that looks into dynamic KSG for skill retrieval and learning.

Attribute Deep Reinforcement Learning +3

CNSNet: A Cleanness-Navigated-Shadow Network for Shadow Removal

1 code implementation6 Sep 2022 Qianhao Yu, Naishan Zheng, Jie Huang, Feng Zhao

The key to shadow removal is recovering the contents of the shadow regions with the guidance of the non-shadow regions.

Long-range modeling Shadow Removal

Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild

1 code implementation19 Aug 2022 Hanting Li, Hongjing Niu, Zhaoqing Zhu, Feng Zhao

One of the main reasons is that video sequences often contain frames with different expression intensities, especially for the facial expressions in the real-world scenarios, while the images in SFER frequently present uniform and high expression intensities.

Dynamic Facial Expression Recognition Facial Expression Recognition +1

Source-Free Domain Adaptation for Real-world Image Dehazing

no code implementations14 Jul 2022 Hu Yu, Jie Huang, Yajing Liu, Qi Zhu, Man Zhou, Feng Zhao

Although certain Domain Adaptation (DA) dehazing methods have been presented, they inevitably require access to the source dataset to reduce the gap between the source synthetic and target real domains.

Image Dehazing Source-Free Domain Adaptation +1

NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition

no code implementations10 Jun 2022 Hanting Li, Mingzhe Sui, Zhaoqing Zhu, Feng Zhao

Dynamic facial expression recognition (DFER) in the wild is an extremely challenging task, due to a large number of noisy frames in the video sequences.

Dynamic Facial Expression Recognition Facial Expression Recognition +1

Underdetermined 2D-DOD and 2D-DOA Estimation for Bistatic Coprime EMVS-MIMO Radar: From the Difference Coarray Perspective

no code implementations6 Jun 2022 Qianpeng Xie, Yihang Du, He Wang, Xiaoyi Pan, Feng Zhao

Firstly, a 5-D tensor model was constructed by using the multi-dimensional space-time characteristics of the received data.

8D Parameters Estimation for Bistatic EMVS-MIMO Radar via the nested PARAFAC

no code implementations4 Jun 2022 Qianpeng Xie, He Wang, Yihang Du, Xiaoyi Pan, Feng Zhao

Firstly, the outer part PARAFAC algorithm was carried out to estimate the receive spatial response matrix and its first way factor matrix.

AFNet-M: Adaptive Fusion Network with Masks for 2D+3D Facial Expression Recognition

no code implementations24 May 2022 Mingzhe Sui, Hanting Li, Zhaoqing Zhu, Feng Zhao

2D+3D facial expression recognition (FER) can effectively cope with illumination changes and pose variations by simultaneously merging 2D texture and more robust 3D depth information.

3D Facial Expression Recognition Facial Expression Recognition

Coverage Control Algorithm for DSNs Based on Improved Gravitational Search

no code implementations IEEE Sensors Journal 2022 Yindi Yao, Huanmin Liao, Xiong Li, Student Member, IEEE, Feng Zhao, Xuan Yang, and Shanshan Hu

—In directional sensor networks (DSNs), coverage control is an important way to ensure efficient communication and reliable data transmission.

Position

Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth

no code implementations3 Feb 2022 Guangkai Xu, Wei Yin, Hao Chen, Chunhua Shen, Kai Cheng, Feng Wu, Feng Zhao

However, in some video-based scenarios such as video depth estimation and 3D scene reconstruction from a video, the unknown scale and shift residing in per-frame prediction may cause the depth inconsistency.

3D Scene Reconstruction Depth Completion +1

MMNet: Muscle motion-guided network for micro-expression recognition

1 code implementation14 Jan 2022 Hanting Li, Mingzhe Sui, Zhaoqing Zhu, Feng Zhao

By adding the position embeddings of the face generated by PC module at the end of the two branches, the PC module can help to add position information to facial muscle motion pattern features for the MER.

Micro Expression Recognition Micro-Expression Recognition +1

Exposure Normalization and Compensation for Multiple-Exposure Correction

no code implementations CVPR 2022 Jie Huang, Yajing Liu, Xueyang Fu, Man Zhou, Yang Wang, Feng Zhao, Zhiwei Xiong

However, the procedures of correcting underexposure and overexposure to normal exposures are much different from each other, leading to large discrepancies for the network in correcting multiple exposures, thus resulting in poor performance.

Exposure Correction Image Enhancement

Bijective Mapping Network for Shadow Removal

2 code implementations CVPR 2022 Yurui Zhu, Jie Huang, Xueyang Fu, Feng Zhao, Qibin Sun, Zheng-Jun Zha

Shadow removal, which aims to restore the background in the shadow regions, is challenging due to the highly ill-posed nature.

Shadow Removal

Mutual Information-Driven Pan-Sharpening

no code implementations CVPR 2022 Man Zhou, Keyu Yan, Jie Huang, Zihe Yang, Xueyang Fu, Feng Zhao

Despite the remarkable progress, existing state-of-the-art Pan-sharpening methods don't explicitly enforce the complementary information learning between two modalities of PAN and MS images.

Unleashing Potential of Unsupervised Pre-Training With Intra-Identity Regularization for Person Re-Identification

no code implementations CVPR 2022 Zizheng Yang, Xin Jin, Kecheng Zheng, Feng Zhao

During the pre-training, we attempt to address two critical issues for learning fine-grained ReID features: (1) the augmentations in CL pipeline may distort the discriminative clues in person images.

Contrastive Learning Person Re-Identification +2

Unleashing the Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification

1 code implementation1 Dec 2021 Zizheng Yang, Xin Jin, Kecheng Zheng, Feng Zhao

During the pre-training, we attempt to address two critical issues for learning fine-grained ReID features: (1) the augmentations in CL pipeline may distort the discriminative clues in person images.

Contrastive Learning Person Re-Identification +2

A similarity measurement for time series and its application to the stock market

no code implementations Expert Systems with Applications 2021 Feng Zhao, Yating Gao, Xinning Li, Zhiyong An, Shiyu Ge, Caiming Zhang

In this paper, for accurately describing the similarity between a pair of time series, a novel similarity measurement is proposed, which is named as the dynamic multi-perspective personalized similarity measurement (DMPSM).

Dynamic Time Warping Time Series +1

Performance-Guaranteed ODE Solvers with Complexity-Informed Neural Networks

no code implementations NeurIPS Workshop DLDE 2021 Feng Zhao, Xiang Chen, Jun Wang, Zuoqiang Shi, Shao-Lun Huang

Traditionally, we provide technical parameters for ODE solvers, such as the order, the stepsize and the local error threshold.

Disentangle Your Dense Object Detector

2 code implementations7 Jul 2021 Zehui Chen, Chenhongyi Yang, Qiaofei Li, Feng Zhao, Zheng-Jun Zha, Feng Wu

Extensive experiments on MS COCO benchmark show that our approach can lead to 2. 0 mAP, 2. 4 mAP and 2. 2 mAP absolute improvements on RetinaNet, FCOS, and ATSS baselines with negligible extra overhead.

Disentanglement Object +2

MVT: Mask Vision Transformer for Facial Expression Recognition in the wild

no code implementations8 Jun 2021 Hanting Li, Mingzhe Sui, Feng Zhao, ZhengJun Zha, Feng Wu

Facial Expression Recognition (FER) in the wild is an extremely challenging task in computer vision due to variant backgrounds, low-quality facial images, and the subjectiveness of annotators.

Facial Expression Recognition Facial Expression Recognition (FER)

Towards Fine-grained Large Object Segmentation 1st Place Solution to 3D AI Challenge 2020 -- Instance Segmentation Track

1 code implementation10 Sep 2020 Zehui Chen, Qiaofei Li, Feng Zhao

This technical report introduces our solutions of Team 'FineGrainedSeg' for Instance Segmentation track in 3D AI Challenge 2020.

Instance Segmentation Semantic Segmentation

Structural Learning for Template-free Protein Folding

no code implementations6 Nov 2013 Feng Zhao

The thesis is aimed to solve the template-free protein folding problem by tackling two important components: efficient sampling in vast conformation space, and design of knowledge-based potentials with high accuracy.

Protein Folding

Cannot find the paper you are looking for? You can Submit a new open access paper.