1 code implementation • ECCV 2020 • Xiangyu Zhu, Fan Yang, Di Huang, Chang Yu, Hao Wang, Jianzhu Guo, Zhen Lei, Stan Z. Li
However, most of their training data is constructed by 3D Morphable Model, whose space spanned is only a small part of the shape space.
no code implementations • 10 Sep 2024 • Junyi Chen, Weicai Ye, Yifan Wang, Danpeng Chen, Di Huang, Wanli Ouyang, Guofeng Zhang, Yu Qiao, Tong He
To this end, we propose GigaGS, the first work for high-quality surface reconstruction for large-scale scenes using 3DGS.
no code implementations • 7 Sep 2024 • Yan Chen, Di Huang, Zhichao Liao, Xi Cheng, Xinghui Li, Lone Zeng
For the geometric branch, we adopt a non-parametric strategy to extract geometric features.
no code implementations • 2 Sep 2024 • Xiangyuan Xue, Zeyu Lu, Di Huang, Wanli Ouyang, Lei Bai
Much previous AI research has focused on developing monolithic models to maximize their intelligence and capability, with the primary goal of enhancing performance on specific tasks.
no code implementations • 22 Aug 2024 • Ziyu Tang, Weicai Ye, Yifan Wang, Di Huang, Hujun Bao, Tong He, Guofeng Zhang
Unlike previous methods that uniformly apply geometric priors on all samples, introducing significant bias in accuracy, our proposed normal deflection field dynamically learns and adapts the utilization of samples based on their specific characteristics, thereby improving both the accuracy and effectiveness of the model.
no code implementations • 19 Aug 2024 • Yifan Wang, Di Huang, Weicai Ye, Guofeng Zhang, Wanli Ouyang, Tong He
Signed Distance Function (SDF)-based volume rendering has demonstrated significant capabilities in surface reconstruction.
1 code implementation • 12 Aug 2024 • Zhichao Liao, Di Huang, Heming Fang, Yue Ma, Fengyuan Piao, Xinghui Li, Long Zeng, Pingfa Feng
To address this issue, we design a two-stage generative framework mimicking the human sketching behavior pattern, called MSFormer, which is the first time to produce humanoid freehand sketches tailored for mechanical components.
1 code implementation • 11 Aug 2024 • Yingjie Gao, Yanan Zhang, Ziyue Huang, Nanqing Liu, Di Huang
Specifically, we design a Test-Time Learning (TTL) module that employs a mean-teacher network for self-training to discover novel instances from test data, allowing detectors to learn better representations and classifiers for novel classes.
no code implementations • 19 Jul 2024 • Huiqun Wang, Yiping Bao, Panwang Pan, Zeming Li, Xiao Liu, Ruijie Yang, Di Huang
Recent advancements in multi-modal pre-training for 3D point clouds have demonstrated promising results by aligning heterogeneous features across 3D shapes and their corresponding 2D images and language descriptions.
1 code implementation • 17 Jul 2024 • Zhuguanyu Wu, Jiaxin Chen, Hanwen Zhong, Di Huang, Yunhong Wang
To address these issues, we propose a novel non-uniform quantizer, dubbed the Adaptive Logarithm AdaLog (AdaLog) quantizer.
1 code implementation • 16 Jul 2024 • Zhi Cai, Yingjie Gao, Yaoyan Zheng, Nan Zhou, Di Huang
In computer vision, object detection is an important task that finds its application in many scenarios.
no code implementations • 15 Jul 2024 • Yang Zhao, Di Huang, Chongxiao Li, Pengwei Jin, Ziyuan Nan, TianYun Ma, Lei Qi, Yansong Pan, Zhenxing Zhang, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen
Instruction-tuned large language models (LLMs) have demonstrated remarkable performance in automatically generating code for general-purpose programming languages like Python.
no code implementations • 11 Jul 2024 • Zidong Wang, Zeyu Lu, Di Huang, Tong He, Xihui Liu, Wanli Ouyang, Lei Bai
In this paper, we introduce PredBench, a benchmark tailored for the holistic evaluation of spatio-temporal prediction networks.
1 code implementation • 8 Jul 2024 • Yutong Wu, Di Huang, Wenxuan Shi, Wei Wang, Lingzhe Gao, Shihao Liu, Ziyuan Nan, Kaizhao Yuan, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Yewen Pu, Dawei Yin, Xing Hu, Yunji Chen
Recent advancements in open-source code large language models (LLMs) have demonstrated remarkable coding abilities by fine-tuning on the data generated from powerful closed-source LLMs such as GPT-3. 5 and GPT-4 for instruction tuning.
no code implementations • 27 Jun 2024 • Yanan Zhang, Chao Zhou, Di Huang
Existing 3D object detection suffers from expensive annotation costs and poor transferability to unknown data due to the domain gap, Unsupervised Domain Adaptation (UDA) aims to generalize detection models trained in labeled source domains to perform robustly on unexplored target domains, providing a promising solution for cross-domain 3D object detection.
no code implementations • 17 Jun 2024 • YuAn Wang, Zhao Wang, Junhao Gong, Di Huang, Tong He, Wanli Ouyang, Jile Jiao, Xuetao Feng, Qi Dou, Shixiang Tang, Dan Xu
In this paper, we introduce a novel path to $\textit{general}$ human motion generation by focusing on 2D space.
1 code implementation • 14 Jun 2024 • YiWen Chen, Tong He, Di Huang, Weicai Ye, Sijin Chen, Jiaxiang Tang, Xin Chen, Zhongang Cai, Lei Yang, Gang Yu, Guosheng Lin, Chi Zhang
Recently, 3D assets created via reconstruction and generation have matched the quality of manually crafted assets, highlighting their potential for replacement.
no code implementations • 4 Jun 2024 • Xiefan Guo, Jinlin Liu, Miaomiao Cui, Di Huang
Text-to-video generation has lagged behind text-to-image synthesis in quality and diversity due to the complexity of spatio-temporal modeling and limited video-text datasets.
no code implementations • 24 May 2024 • Yuxuan Guo, Shaohui Peng, Jiaming Guo, Di Huang, Xishan Zhang, Rui Zhang, Yifan Hao, Ling Li, Zikang Tian, Mingju Gao, Yutai Li, Yiming Gan, Shuai Liang, Zihao Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen
In this work, we introduce autonomous embodied verification techniques for agents to fill the gap, laying the groundwork for creative tasks.
1 code implementation • 4 May 2024 • Yanan Zhang, Jinqing Zhang, Zengran Wang, Junhao Xu, Di Huang
In recent years, autonomous driving has garnered escalating attention for its potential to relieve drivers' burdens and improve driving safety.
1 code implementation • 25 Apr 2024 • Guohao Li, Hongyu Yang, Di Huang, Yunhong Wang
Generative 3D face models featuring disentangled controlling factors hold immense potential for diverse applications in computer vision and computer graphics.
no code implementations • 14 Apr 2024 • Xuening Yuan, Hongyu Yang, Yueming Zhao, Di Huang
Recent progress in text-to-3D creation has been propelled by integrating the potent prior of Diffusion Models from text-to-image generation into the 3D domain.
no code implementations • 8 Apr 2024 • Nan Zhou, Jiaxin Chen, Di Huang
It innovatively incorporates a cross-layer dynamic connection (CDC) for input prompt tokens from adjacent layers, enabling effective sharing of task-relevant information.
1 code implementation • CVPR 2024 • Xiefan Guo, Jinlin Liu, Miaomiao Cui, Jiankai Li, Hongyu Yang, Di Huang
Recent strides in the development of diffusion models, exemplified by advancements such as Stable Diffusion, have underscored their remarkable prowess in generating visually compelling images.
1 code implementation • CVPR 2024 • Haoxiang Ma, Modi shi, Boyang Gao, Di Huang
We focus on the generalization ability of the 6-DoF grasp detection method in this paper.
no code implementations • 25 Mar 2024 • Chengxuan Li, Di Huang, Zeyu Lu, Yang Xiao, Qingqi Pei, Lei Bai
Video generation is a rapidly advancing research area, garnering significant attention due to its broad range of applications.
no code implementations • 19 Mar 2024 • Xianglong He, Junyi Chen, Sida Peng, Di Huang, Yangguang Li, Xiaoshui Huang, Chun Yuan, Wanli Ouyang, Tong He
To better optimize the representation of these details, we propose a unique pruning and densifying method named the Candidate Pool Strategy, enhancing detail fidelity through selective optimization.
1 code implementation • 18 Mar 2024 • Haoxiang Ma, Ran Qin, Modi shi, Boyang Gao, Di Huang
This paper focuses on the sim-to-real issue of RGB-D grasp detection and formulates it as a domain adaptation problem.
no code implementations • 18 Mar 2024 • Sha Zhang, Di Huang, Jiajun Deng, Shixiang Tang, Wanli Ouyang, Tong He, Yanyong Zhang
The ability to understand and reason the 3D real world is a crucial milestone towards artificial general intelligence.
no code implementations • 13 Mar 2024 • Hebeizi Li, Hongyu Yang, Di Huang
Facial Expression Recognition (FER) has consistently been a focal point in the field of facial analysis.
no code implementations • 6 Mar 2024 • Yajie Liu, Pu Ge, Qingjie Liu, Di Huang
Recently, learning open-vocabulary semantic segmentation from text supervision has achieved promising downstream performance.
no code implementations • 5 Mar 2024 • Yaoyan Zheng, Hongyu Yang, Di Huang
Recent advancements in video semantic segmentation have made substantial progress by exploiting temporal correlations.
1 code implementation • 22 Feb 2024 • Chenxi Huang, Yuenan Hou, Weicai Ye, Di Huang, Xiaoshui Huang, Binbin Lin, Deng Cai, Wanli Ouyang
We project the freely available 3D segmentation annotations onto the 2D plane and leverage the corresponding 2D semantic maps as the supervision signal, significantly enhancing the semantic awareness of multi-view detectors.
2 code implementations • 19 Feb 2024 • Zeyu Lu, Zidong Wang, Di Huang, Chengyue Wu, Xihui Liu, Wanli Ouyang, Lei Bai
Nature is infinitely resolution-free.
1 code implementation • 4 Feb 2024 • Haoyi Zhu, Yating Wang, Di Huang, Weicai Ye, Wanli Ouyang, Tong He
These outcomes suggest that the 3D point cloud is a valuable observation modality for intricate robotic tasks.
no code implementations • 23 Jan 2024 • Yunpu Zhao, Rui Zhang, Wenyi Li, Di Huang, Jiaming Guo, Shaohui Peng, Yifan Hao, Yuanbo Wen, Xing Hu, Zidong Du, Qi Guo, Ling Li, Yunji Chen
This paper aims to establish an efficient framework for assessing the level of creativity in LLMs.
1 code implementation • 7 Dec 2023 • Mingwu Zheng, Haiyu Zhang, Hongyu Yang, Liming Chen, Di Huang
Accurate representations of 3D faces are of paramount importance in various computer vision and graphics applications.
no code implementations • 1 Dec 2023 • Yajie Liu, Pu Ge, Haoxiang Ma, Shichao Fan, Qingjie Liu, Di Huang, Yunhong Wang
Referring image segmentation (RIS) aims to segment objects in an image conditioning on free-from text descriptions.
1 code implementation • 29 Oct 2023 • Srikumar Sastry, Subash Khanal, Aayush Dhakal, Di Huang, Nathan Jacobs
We propose a metadata-aware self-supervised learning~(SSL)~framework useful for fine-grained classification and ecological mapping of bird species around the world.
1 code implementation • CVPR 2024 • Honghui Yang, Sha Zhang, Di Huang, Xiaoyang Wu, Haoyi Zhu, Tong He, Shixiang Tang, Hengshuang Zhao, Qibo Qiu, Binbin Lin, Xiaofei He, Wanli Ouyang
In the context of autonomous driving, the significance of effective feature learning is widely acknowledged.
1 code implementation • 12 Oct 2023 • Haoyi Zhu, Honghui Yang, Xiaoyang Wu, Di Huang, Sha Zhang, Xianglong He, Hengshuang Zhao, Chunhua Shen, Yu Qiao, Tong He, Wanli Ouyang
In this paper, we introduce a novel universal 3D pre-training framework designed to facilitate the acquisition of efficient 3D representation, thereby establishing a pathway to 3D foundational models.
Ranked #2 on Semantic Segmentation on S3DIS (using extra training data)
no code implementations • 4 Sep 2023 • Shaohui Peng, Xing Hu, Qi Yi, Rui Zhang, Jiaming Guo, Di Huang, Zikang Tian, Ruizhi Chen, Zidong Du, Qi Guo, Yunji Chen, Ling Li
Large language models (LLMs) show their powerful automatic reasoning and planning capability with a wealth of semantic knowledge about the human world.
1 code implementation • ICCV 2023 • Nan Zhou, Jiaxin Chen, Di Huang
Furthermore, to alleviate the interference by semantic drift, we develop the semantic calibration (SC) module to align the global shape and class centers of the pretrained and downstream feature distributions.
no code implementations • ICCV 2023 • Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, Di Huang
Anomaly detection (AD), aiming to find samples that deviate from the training distribution, is essential in safety-critical applications.
no code implementations • 19 Jun 2023 • Yaqi Zhang, Di Huang, Bin Liu, Shixiang Tang, Yan Lu, Lu Chen, Lei Bai, Qi Chu, Nenghai Yu, Wanli Ouyang
Generating realistic human motion from given action descriptions has experienced significant advancements because of the emerging requirement of digital humans.
1 code implementation • 3 Jun 2023 • Pucheng Dang, Xing Hu, Kaidi Xu, Jinhao Duan, Di Huang, Husheng Han, Rui Zhang, Zidong Du, Qi Guo, Yunji Chen
Unlearning techniques are proposed to prevent third parties from exploiting unauthorized data, which generate unlearnable samples by adding imperceptible perturbations to data for public publishing.
1 code implementation • NeurIPS 2023 • Di Huang, Ziyuan Nan, Xing Hu, Pengwei Jin, Shaohui Peng, Yuanbo Wen, Rui Zhang, Zidong Du, Qi Guo, Yewen Pu, Yunji Chen
We deploy ANPL on the Abstraction and Reasoning Corpus (ARC), a set of unique tasks that are challenging for state-of-the-art AI systems, showing it outperforms baseline programming systems that (a) without the ability to decompose tasks interactively and (b) without the guarantee that the modules can be correctly composed together.
Ranked #12 on Code Generation on HumanEval
no code implementations • 4 May 2023 • Tao Xu, Bo Wu, Ruilong Fan, Yun Zhou, Di Huang
Furthermore, our method outperforms existing lightweight methods in terms of accuracy and efficiency for the gaze estimation task.
1 code implementation • 15 Apr 2023 • Zhi Cai, Songtao Liu, Guodong Wang, Zheng Ge, Xiangyu Zhang, Di Huang
We propose a metric, recall of best-regressed samples, to quantitively evaluate the misalignment problem.
1 code implementation • CVPR 2023 • Bowei Du, Yecheng Huang, Jiaxin Chen, Di Huang
Object detection on drone images with low-latency is an important but challenging task on the resource-constrained unmanned aerial vehicle (UAV) platform.
1 code implementation • CVPR 2023 • Mingwu Zheng, Haiyu Zhang, Hongyu Yang, Di Huang
Realistic face rendering from multi-view images is beneficial to various computer vision and graphics applications.
no code implementations • CVPR 2023 • Chao Zhou, Yanan Zhang, Jiaxin Chen, Di Huang
A key challenge for LiDAR-based 3D object detection is to capture sufficient features from large scale 3D scenes especially for distant or/and occluded objects.
1 code implementation • ICCV 2023 • Weilai Xiang, Hongyu Yang, Di Huang, Yunhong Wang
Inspired by recent advances in diffusion models, which are reminiscent of denoising autoencoders, we investigate whether they can acquire discriminative representations for classification via generative pre-training.
no code implementations • 28 Feb 2023 • Ran Qin, Haoxiang Ma, Boyang Gao, Di Huang
Planar grasp detection is one of the most fundamental tasks to robotic manipulation, and the recent progress of consumer-grade RGB-D sensors enables delivering more comprehensive features from both the texture and shape modalities.
no code implementations • 21 Feb 2023 • Pengwei Jin, Di Huang, Rui Zhang, Xing Hu, Ziyuan Nan, Zidong Du, Qi Guo, Yunji Chen
Symbolic regression, the task of extracting mathematical expressions from the observed data $\{ \vx_i, y_i \}$, plays a crucial role in scientific discovery.
no code implementations • 18 Jan 2023 • Xingyi He, Jiaming Sun, Yuang Wang, Di Huang, Hujun Bao, Xiaowei Zhou
We propose a new method for object pose estimation without CAD models.
no code implementations • ICCV 2023 • Di Huang, Sida Peng, Tong He, Honghui Yang, Xiaowei Zhou, Wanli Ouyang
We propose a novel approach to self-supervised learning of point cloud representations by differentiable neural rendering.
1 code implementation • 10 Dec 2022 • Haoxiang Ma, Di Huang
Moreover, a Scale Balanced Learning (SBL) loss and an Object Balanced Sampling (OBS) strategy are designed, where SBL enlarges the gradients of the samples whose scales are in low frequency by apriori weights while OBS captures more points on small-scale objects with the help of an auxiliary segmentation network.
Ranked #2 on Robotic Grasping on GraspNet-1Billion
no code implementations • 7 Dec 2022 • Kaicheng Li, Hongyu Yang, Binghui Chen, Pengyu Li, Biao Wang, Di Huang
Along with the widespread use of face recognition systems, their vulnerability has become highlighted.
no code implementations • 30 Nov 2022 • Di Huang, Xiaopeng Ji, Xingyi He, Jiaming Sun, Tong He, Qing Shuai, Wanli Ouyang, Xiaowei Zhou
The key idea is that the hand motion naturally provides multiple views of the object and the motion can be reliably estimated by a hand pose tracker.
no code implementations • 10 Oct 2022 • Neal Patwari, Di Huang, Kiki Bonetta-Misteli
Studies have shown pulse oximeter measurements of blood oxygenation have statistical bias that is a function of race, which results in higher rates of occult hypoxemia, i. e., missed detection of dangerously low oxygenation, in patients of color.
no code implementations • 22 Aug 2022 • Zengran Wang, Chen Min, Zheng Ge, Yinhao Li, Zeming Li, Hongyu Yang, Di Huang
Instead of using a sole monocular depth method, in this work, we propose a novel Surround-view Temporal Stereo (STS) technique that leverages the geometry correspondence between frames across time to facilitate accurate depth learning.
no code implementations • 12 Aug 2022 • Jingcheng Ni, Nan Zhou, Jie Qin, Qian Wu, Junqi Liu, Boxun Li, Di Huang
Contrastive learning has shown great potential in video representation learning.
1 code implementation • 20 Jul 2022 • Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, Di Huang
Video Anomaly Detection (VAD) is an important topic in computer vision.
Ranked #6 on Anomaly Detection on ShanghaiTech
no code implementations • ICLR 2022 • Di Huang, Rui Zhang, Xing Hu, Xishan Zhang, Pengwei Jin, Nan Li, Zidong Du, Qi Guo, Yunji Chen
In this work, we propose a query-based framework that trains a query neural network to generate informative input-output examples automatically and interactively from a large query space.
no code implementations • 7 May 2022 • Bing Li, Jiaxin Chen, Dongming Zhang, Xiuguo Bao, Di Huang
To address the two issues above, this paper proposes a novel framework, namely Attentive Cross-modal Interaction Network with Motion Enhancement (MEACI-Net).
no code implementations • CVPR 2022 • Jiaxi Wu, Jiaxin Chen, Mengzhe He, Yiru Wang, Bo Li, Bingqi Ma, Weihao Gan, Wei Wu, Yali Wang, Di Huang
Specifically, TRKP adopts the teacher-student framework, where the multi-head teacher network is built to extract knowledge from labeled source domains and guide the student network to learn detectors in unlabeled target domain.
no code implementations • CVPR 2022 • Jiaxi Wu, Jiaxin Chen, Di Huang
Active learning is a promising alternative to alleviate the issue of high annotation cost in the computer vision tasks by consciously selecting more informative samples to label.
no code implementations • 9 Apr 2022 • Xiangyu Zhu, Chang Yu, Di Huang, Zhen Lei, Hao Wang, Stan Z. Li
3D Morphable Model (3DMM) fitting has widely benefited face analysis due to its strong 3D priori.
no code implementations • CVPR 2022 • Yanan Zhang, Jiaxin Chen, Di Huang
In autonomous driving, LiDAR point-clouds and RGB images are two major data modalities with complementary cues for 3D object detection.
2 code implementations • CVPR 2022 • Mingwu Zheng, Hongyu Yang, Di Huang, Liming Chen
Precise representations of 3D faces are beneficial to various computer vision and graphics applications.
Ranked #2 on Face Alignment on FaceScape
1 code implementation • CVPR 2022 • Biwen Lei, Xiefan Guo, Hongyu Yang, Miaomiao Cui, Xuansong Xie, Di Huang
The network is mainly composed of two components: a context-aware local retouching layer (LRL) and an adaptive blend pyramid layer (BPL).
1 code implementation • 21 Dec 2021 • Zichen Yang, Jie Qin, Di Huang
Weakly-supervised temporal action localization (WTAL) in untrimmed videos has emerged as a practical but challenging task since only video-level labels are available.
Weakly-supervised Temporal Action Localization Weakly Supervised Temporal Action Localization
no code implementations • 20 Dec 2021 • Yecheng Huang, Jiaxin Chen, Di Huang
This paper proposes a novel approach to object detection on drone imagery, namely Multi-Proxy Detection Network with Unified Foreground Packing (UFPMP-Det).
no code implementations • 15 Dec 2021 • Xiangnan Yin, Di Huang, Zehua Fu, Yunhong Wang, Liming Chen
The proposed model consists of a 3D face reconstruction module, a face segmentation module, and an image generation module.
1 code implementation • 27 Oct 2021 • Haoxiang Ma, Hongyu Yang, Di Huang
The recent studies on semantic segmentation are starting to notice the significance of the boundary information, where most approaches see boundaries as the supplement of semantic details.
no code implementations • 26 Aug 2021 • Jingcheng Ni, Jie Qin, Di Huang
Action detection plays an important role in high-level video understanding and media interpretation.
no code implementations • 25 Aug 2021 • Huiqun Wang, Ruijie Yang, Di Huang, Yunhong Wang
Differentiable ARchiTecture Search (DARTS) uses a continuous relaxation of network representation and dramatically accelerates Neural Architecture Search (NAS) by almost thousands of times in GPU-day.
Ranked #9 on Neural Architecture Search on CIFAR-10
no code implementations • ICCV 2021 • Guangyuan Zhou, Huiqun Wang, Jiaxin Chen, Di Huang
This paper proposes a novel deep learning approach, namely Graph Convolutional Network with Point Refinement (PR-GCN), to simultaneously address the issues above in a unified way.
4 code implementations • ICCV 2021 • Xiefan Guo, Hongyu Yang, Di Huang
Deep generative approaches have recently made considerable progress in image inpainting by introducing structure priors.
1 code implementation • 8 Aug 2021 • Di Huang, Jacob Bartel, John Palowitch
The widespread adoption of online social networks in daily life has created a pressing need for effectively classifying user-generated content.
no code implementations • 3 Aug 2021 • Yao Wang, Yangtao Zheng, Boyang Gao, Di Huang
This paper proposes a new deep learning approach to antipodal grasp detection, named Double-Dot Network (DD-Net).
no code implementations • 14 Jun 2021 • Xiangnan Yin, Di Huang, Hongyu Yang, Zehua Fu, Yunhong Wang, Liming Chen
The existing auto-encoder based face pose editing methods primarily focus on modeling the identity preserving ability during pose synthesis, but are less able to preserve the image style properly, which refers to the color, brightness, saturation, etc.
no code implementations • 14 Jun 2021 • Xiangnan Yin, Di Huang, Zehua Fu, Yunhong Wang, Liming Chen
Missing textures in the incomplete UV map are further full-filled by the UV generator.
no code implementations • 5 May 2021 • Qingkai Zhen, Di Huang, Yunhong Wang, Hassen Drira, Boulbaba Ben Amor, Mohamed Daoudi
In this paper, an effective pipeline to automatic 4D Facial Expression Recognition (4D FER) is proposed.
10 code implementations • 7 Mar 2021 • Guodong Wang, Shumin Han, Errui Ding, Di Huang
Anomaly detection is a challenging task and usually formulated as an one-class learning problem for the unexpectedness of anomalies.
Ranked #28 on Anomaly Detection on VisA
no code implementations • 24 Dec 2020 • Ran Qin, Qingjie Liu, Guangshuai Gao, Di Huang, Yunhong Wang
Objects in aerial images usually have arbitrary orientations and are densely located over the ground, making them extremely challenge to be detected.
no code implementations • 18 Dec 2020 • Yanan Zhang, Di Huang, Yunhong Wang
LiDAR-based 3D object detection is an important task for autonomous driving and current approaches suffer from sparse and partial point clouds of distant and occluded objects.
Ranked #4 on 3D Object Detection on KITTI Cars Hard val
4 code implementations • ECCV 2020 • Jiaxi Wu, Songtao Liu, Di Huang, Yunhong Wang
Few-shot object detection (FSOD) helps detectors adapt to unseen classes with few training instances, and is useful when manual annotation is time-consuming or data acquisition is limited.
Ranked #18 on Few-Shot Object Detection on MS-COCO (30-shot)
no code implementations • ECCV 2020 • Yandong Li, Di Huang, Danfeng Qin, Liqiang Wang, Boqing Gong
They fail to improve object detectors in their vanilla forms due to the domain gap between the Web images and curated datasets.
no code implementations • CVPR 2020 • Yangtao Zheng, Di Huang, Songtao Liu, Yunhong Wang
Thanks to this coarse-to-fine feature adaptation, domain knowledge in foreground regions can be effectively transferred.
no code implementations • 3 Feb 2020 • Di Huang, Xishan Zhang, Rui Zhang, Tian Zhi, Deyuan He, Jiaming Guo, Chang Liu, Qi Guo, Zidong Du, Shaoli Liu, Tianshi Chen, Yunji Chen
In this paper, we propose a novel Decomposable Winograd Method (DWM), which breaks through the limitation of original Winograd's minimal filtering algorithm to a wide and general convolutions.
2 code implementations • 10 Dec 2019 • Jinjin Zhang, Wei Wang, Di Huang, Qingjie Liu, Yunhong Wang
Deep learning based methods have achieved surprising progress in Scene Text Recognition (STR), one of classic problems in computer vision.
no code implementations • 26 Nov 2019 • Mingda Wu, Di Huang, Yuanfang Guo, Yunhong Wang
Recently, Human Attribute Recognition (HAR) has become a hot topic due to its scientific challenges and application potentials, where localizing attributes is a crucial stage but not well handled.
2 code implementations • ICML 2020 • Lu Jiang, Di Huang, Mason Liu, Weilong Yang
Due to the lack of suitable datasets, previous research has only examined deep learning on controlled synthetic label noise, and real-world label noise has never been studied in a controlled setting.
Ranked #12 on Image Classification on WebVision-1000
1 code implementation • 21 Nov 2019 • Songtao Liu, Di Huang, Yunhong Wang
Pyramidal feature representation is the common practice to address the challenge of scale variation in object detection.
Ranked #147 on Object Detection on COCO test-dev
no code implementations • 1 Nov 2019 • Xishan Zhang, Shaoli Liu, Rui Zhang, Chang Liu, Di Huang, Shiyi Zhou, Jiaming Guo, Yu Kang, Qi Guo, Zidong Du, Yunji Chen
Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers.
no code implementations • 25 Sep 2019 • Lu Jiang, Di Huang, Weilong Yang
Performing controlled experiments on noisy data is essential in thoroughly understanding deep learning across a spectrum of noise levels.
no code implementations • 6 Sep 2019 • Arquimedes Canedo, Palash Goyal, Di Huang, Amit Pandey, Gustavo Quiros
We show that machine learning can be leveraged to assist the automation engineer in classifying automation, finding similar code snippets, and reasoning about the hardware selection of sensors and actuators.
1 code implementation • 6 Sep 2019 • Palash Goyal, Di Huang, Sujit Rokka Chhetri, Arquimedes Canedo, Jaya Shree, Evan Patterson
In this work, we introduce the problem of graph representation ensemble learning and provide a first of its kind framework to aggregate multiple graph embedding methods efficiently.
1 code implementation • 19 Aug 2019 • Palash Goyal, Di Huang, Ankita Goswami, Sujit Rokka Chhetri, Arquimedes Canedo, Emilio Ferrara
We use the comparisons on our 100 benchmark graphs to define GFS-score, that can be applied to any embedding method to quantify its performance.
1 code implementation • 28 May 2019 • Weicheng Li, Rui Wang, Zhongzhi Luan, Di Huang, Zidong Du, Yunji Chen, Depei Qian
Convolutional Neural Network (CNN) based Deep Learning (DL) has achieved great progress in many real-life applications.
no code implementations • CVPR 2019 • Songtao Liu, Di Huang, Yunhong Wang
Pedestrian detection in a crowd is a very challenging issue.
Ranked #18 on Object Detection on CrowdHuman (full body)
no code implementations • 10 Jan 2019 • Hongyu Yang, Di Huang, Yunhong Wang, Anil K. Jain
The two underlying requirements of face age progression, i. e. aging accuracy and identity permanence, are not well studied in the literature.
1 code implementation • CVPR 2018 • Hongyu Yang, Di Huang, Yunhong Wang, Anil K. Jain
The two underlying requirements of face age progression, i. e. aging accuracy and identity permanence, are not well studied in the literature.
no code implementations • 26 Nov 2017 • Qiang Chen, Yunhong Wang, Zheng Liu, Qingjie Liu, Di Huang
In this paper, we develop a novel convolutional neural network based approach to extract and aggregate useful information from gait silhouette sequence images instead of simply representing the gait process by averaging silhouette images.
7 code implementations • ECCV 2018 • Songtao Liu, Di Huang, Yunhong Wang
Current top-performing object detectors depend on deep CNN backbones, such as ResNet-101 and Inception, benefiting from their powerful feature representations but suffering from high computational costs.
no code implementations • 18 Feb 2017 • Chunlei Li, Guangshuai Gao, Zhoufeng Liu, Di Huang, Sheng Liu, Miao Yu
In order to accurately detect defects in patterned fabric images, a novel detection algorithm based on Gabor-HOG (GHOG) and low-rank decomposition is proposed in this paper.
no code implementations • 4 Nov 2015 • Hongyu Yang, Di Huang, Yunhong Wang, Heng Wang, Yuanyan Tang
Face aging simulation has received rising investigations nowadays, whereas it still remains a challenge to generate convincing and natural age-progressed face images.