no code implementations • 13 Mar 2025 • Zeyi Xu, Jinfan Liu, Kuangxu Chen, Ye Chen, Zhangli Hu, Bingbing Ni
Additionally, compared to ViT, our approach achieves a reduction in FLOPs of up to 60 times.
no code implementations • 22 Feb 2025 • Yuxuan Xiong, Yue Shi, Yishun Dou, Bingbing Ni
Moreover, a simulated annealing strategy is embedded into IDU to endow our model with the power of addressing local optima issues.
no code implementations • 29 Nov 2024 • Xianfeng Tan, Yuhan Li, Wenxiang Shang, Yubo Wu, Jian Wang, Xuanhong Chen, Yi Zhang, Ran Lin, Bingbing Ni
Standard clothing asset generation involves creating forward-facing flat-lay garment images displayed on a clear background by extracting clothing information from diverse real-world contexts, which presents significant challenges due to highly standardized sampling distributions and precise structural requirements in the generated images.
1 code implementation • 18 Nov 2024 • Shibin Mei, Hang Wang, Bingbing Ni
Furthermore, we propose a sensor consistency training framework that enables denoising models to learn the sensor-invariant features, thereby facilitating the generalization of the consistent model to unseen sensors.
1 code implementation • 12 Aug 2024 • Jiameng Li, Yue Shi, JieZhang Cao, Bingbing Ni, Wenjun Zhang, Kai Zhang, Luc van Gool
3D Gaussian Splatting (3DGS) has attracted great attention in novel view synthesis because of its superior rendering efficiency and high fidelity.
no code implementations • 16 Jul 2024 • Qiaoqiao Jin, Rui Shi, Yishun Dou, Bingbing Ni
Current Facial Action Unit (FAU) detection methods generally encounter difficulties due to the scarcity of labeled video training data and the limited number of training face IDs, which renders the trained feature extractor insufficient coverage for modeling the large diversity of inter-person facial structures and movements.
no code implementations • 29 Jun 2024 • Yangzhou Jiang, Yinxin Lin, Yaoming Wang, Teng Li, Bilian Ke, Bingbing Ni
Appearance-based supervised methods with full-face image input have made tremendous advances in recent gaze estimation tasks.
no code implementations • 28 May 2024 • Yuhan Li, Hao Zhou, Wenxiang Shang, Ran Lin, Xuanhong Chen, Bingbing Ni
While image-based virtual try-on has made significant strides, emerging approaches still fall short of delivering high-fidelity and robust fitting images across various scenarios, as their models suffer from issues of ill-fitted garment styles and quality degrading during the training process, not to mention the lack of support for various combinations of attire.
no code implementations • 22 Mar 2024 • Qiaoqiao Jin, Xuanhong Chen, Meiguang Jin, Ying Chen, Rui Shi, Yucheng Zheng, Yupeng Zhu, Bingbing Ni
The core idea of DAL lies in employing a Diffusion-based Data Amplifier (DDA) to "amplify" limited images for the model training, thereby enabling accurate pixel-to-pixel supervision with merely a handful of annotations.
no code implementations • 14 Feb 2024 • Jiancheng Yang, Rui Shi, Liang Jin, Xiaoyang Huang, Kaiming Kuang, Donglai Wei, Shixuan Gu, Jianying Liu, PengFei Liu, Zhizhong Chai, Yongjie Xiao, Hao Chen, Liming Xu, Bang Du, Xiangyi Yan, Hao Tang, Adam Alessio, Gregory Holste, Jiapeng Zhang, Xiaoming Wang, Jianye He, Lixuan Che, Hanspeter Pfister, Ming Li, Bingbing Ni
The resulting FracNet+ demonstrates competitive performance in rib fracture detection, which lays a foundation for further research and development in AI-assisted rib fracture detection and diagnosis.
no code implementations • CVPR 2024 • Zhongyin Zhao, Ye Chen, Zhangli Hu, Xuanhong Chen, Bingbing Ni
Intelligent generation of vector graphics has very promising applications in the fields of advertising and logo design artistic painting animation production etc.
no code implementations • CVPR 2024 • Yishun Dou, Zhong Zheng, Qiaoqiao Jin, Rui Shi, Yuhan Li, Bingbing Ni
Micro-mesh (u-mesh) is a new graphics primitive for compact representation of extreme geometry consisting of a low-polygon base mesh enriched by per micro-vertex displacement.
no code implementations • CVPR 2024 • Ye Chen, Bingbing Ni, Jinfan Liu, Xiaoyang Huang, Xuanhong Chen
We develop a novel vectorized image representation scheme accommodating both shape/geometry and texture in a decoupled way particularly tailored for reconstruction and editing tasks of artistic/design images such as Emojis and Cliparts.
no code implementations • CVPR 2024 • Yishun Dou, Zhong Zheng, Qiaoqiao Jin, Bingbing Ni, Yugang Chen, Junxiang Ke
We propose a novel compact and efficient neural BRDF offering highly versatile material representation, yet with very-light memory and neural computation consumption towards achieving real-time rendering.
no code implementations • 24 Aug 2023 • Shengchao Yuan, Yishun Dou, Rui Shi, Bingbing Ni, Zhong Zheng
Meshes are widely used in 3D computer vision and graphics, but their irregular topology poses challenges in applying them to existing neural network architectures.
no code implementations • 21 Aug 2023 • Yuhan Li, Yishun Dou, Yue Shi, Yu Lei, Xuanhong Chen, Yi Zhang, Peng Zhou, Bingbing Ni
While text-3D editing has made significant strides in leveraging score distillation sampling, emerging approaches still fall short in delivering separable, precise and consistent outcomes that are vital to content creation.
1 code implementation • CVPR 2023 • Hang Wang, Xuanhong Chen, Bingbing Ni, Yutian Liu, Jinfan Liu
While lightweight ViT framework has made tremendous progress in image super-resolution, its uni-dimensional self-attention modeling, as well as homogeneous aggregation scheme, limit its effective receptive field (ERF) to include more comprehensive interactions from both spatial and channel dimensions.
no code implementations • 10 Apr 2023 • Jinxian Liu, Ye Chen, Bingbing Ni, Jiyao Mao, Zhenbo Yu
Humans have a strong intuitive understanding of physical processes such as fluid falling by just a glimpse of such a scene picture, i. e., quickly derived from our immersive visual experiences in memory.
1 code implementation • 18 Mar 2023 • Yuhan Li, Yishun Dou, Xuanhong Chen, Bingbing Ni, Yilin Sun, Yutian Liu, Fuzhen Wang
We develop a generalized 3D shape generation prior model, tailored for multiple 3D tasks including unconditional shape generation, point cloud completion, and cross-modality shape generation, etc.
1 code implementation • CVPR 2023 • Yi Zhang, Xiaoyang Huang, Bingbing Ni, Teng Li, Wenjun Zhang
We develop an effective point cloud rendering pipeline for novel view synthesis, which enables high fidelity local detail reconstruction, real-time rendering and user-friendly editing.
no code implementations • 21 Feb 2023 • Yue Shi, Yuxuan Xiong, Jingyi Chai, Bingbing Ni, Wenjun Zhang
To address these issues, we propose an unsupervised separated 3D garments and human reconstruction model (USR), which reconstructs the human body and authentic textured clothes in layers without 3D models.
1 code implementation • 30 Jan 2023 • Xiaoyang Huang, Yanjun Wang, Yang Liu, Bingbing Ni, Wenjun Zhang, Jinxian Liu, Teng Li
To this end, we propose to achieve personalized spatial audio by reconstructing 3D human ears with single-view images.
1 code implementation • ICCV 2023 • Xiaoyang Huang, Yi Zhang, Kai Chen, Teng Li, Wenjun Zhang, Bingbing Ni
In this work, a novel regularization term named Implicit Convexity Regularization (ICR) imposed on implicit primitive learning is proposed to tackle this problem.
no code implementations • CVPR 2023 • Yishun Dou, Zhong Zheng, Qiaoqiao Jin, Bingbing Ni
We develop a simple yet surprisingly effective implicit representing scheme called Multiplicative Fourier Level of Detail (MFLOD) motivated by the recent success of multiplicative filter network.
1 code implementation • CVPR 2023 • Yuhan Li, Yishun Dou, Xuanhong Chen, Bingbing Ni, Yilin Sun, Yutian Liu, Fuzhen Wang
We develop a generalized 3D shape generation prior model, tailored for multiple 3D tasks including unconditional shape generation, point cloud completion, and cross-modality shape generation, etc.
no code implementations • CVPR 2023 • Shibin Mei, Chenglong Zhao, Shengchao Yuan, Bingbing Ni
In this paper, we identify pattern imbalance from several aspects, and further develop a new training scheme to avert pattern preference as well as spurious correlation.
1 code implementation • CVPR 2023 • Xiaohang Wang, Xuanhong Chen, Bingbing Ni, Hang Wang, Zhengyan Tong, Yutian Liu
The ability of scale-equivariance processing blocks plays a central role in arbitrary-scale image super-resolution tasks.
no code implementations • ICCV 2023 • Ye Chen, Bingbing Ni, Xuanhong Chen, Zhangli Hu
This work explores a novel image geometric abstraction paradigm based on assembly out of a pool of pre-defined simple parametric primitives (i. e., triangle, rectangle, circle and semicircle), facilitating controllable shape editing in images.
1 code implementation • CVPR 2023 • JieZhang Cao, Qin Wang, Yongqin Xian, Yawei Li, Bingbing Ni, Zhiming Pi, Kai Zhang, Yulun Zhang, Radu Timofte, Luc van Gool
We explicitly design an implicit attention network to learn the ensemble weights for the nearby local features.
1 code implementation • 7 Dec 2022 • Xiaohang Wang, Xuanhong Chen, Bingbing Ni, Zhengyan Tong, Hang Wang
Depth map super-resolution (DSR) has been a fundamental task for 3D computer vision.
no code implementations • 5 Dec 2022 • Yue Shi, Dingyi Rong, Bingbing Ni, Chang Chen, Wenjun Zhang
To address these issues, we propose Geometry-Aware Generalized Neural Radiance Field (GARF) with a geometry-aware dynamic sampling (GADS) strategy to perform real-time novel view rendering and unsupervised depth estimation on unseen scenes without per-scene optimization.
1 code implementation • 27 Oct 2022 • Xiaoyang Huang, Yi Zhang, Bingbing Ni, Teng Li, Kai Chen, Wenjun Zhang
In this work, we focus on boosting the image quality of point clouds rendering with a compact model design.
1 code implementation • 18 Oct 2022 • Liang Jin, Shixuan Gu, Donglai Wei, Jason Ken Adhinarta, Kaiming Kuang, Yongjie Jessica Zhang, Hanspeter Pfister, Bingbing Ni, Jiancheng Yang, Ming Li
Based on the RibSeg v2, we develop a pipeline including deep learning-based methods for rib labeling, and a skeletonization-based method for centerline extraction.
1 code implementation • 9 Oct 2022 • Yunhao Li, Zhenbo Yu, Yucheng Zhu, Bingbing Ni, Guangtao Zhai, Wei Shen
Stage I introduces a test time adaptation strategy, which improves the physical plausibility of synthesized human skeleton motions by optimizing skeleton joint locations.
1 code implementation • 30 Jun 2022 • Jiancheng Yang, Rui Shi, Udaranga Wickramasinghe, Qikui Zhu, Bingbing Ni, Pascal Fua
Besides, we develop a new Adrenal gLand ANalysis (ALAN) dataset with the proposed NeAR, where each case consists of a 3D shape of adrenal gland and its diagnosis label (normal vs. abnormal) assigned by experts.
1 code implementation • 11 Jun 2022 • Dingyi Rong, Jiancheng Yang, Bingbing Ni, Bilian Ke
Projection map (PM) from optical coherence tomography (OCT) B-scan is an important tool to diagnose retinal diseases, which typically requires retinal layer segmentation.
1 code implementation • 26 May 2022 • Minghao Xu, Yuanfan Guo, Xuanyu Zhu, Jiawen Li, Zhenbang Sun, Jian Tang, Yi Xu, Bingbing Ni
This framework aims to learn multiple semantic representations for each image, and these representations are structured to encode image semantics from fine-grained to coarse-grained.
1 code implementation • ICLR 2022 • Xiaoyang Huang, Jiancheng Yang, Yanjun Wang, Ziyu Chen, Linguo Li, Teng Li, Bingbing Ni, Wenjun Zhang
In this study, we present Representation-Agnostic Shape Fields (RASF), a generalizable and computation-efficient shape embedding module for 3D deep learning.
no code implementations • 16 Mar 2022 • Zefan Li, Bingbing Ni, Teng Li, Wenjun Zhang, Wen Gao
GCGD consists of two plug-in modules: 1) inspired by the idea of gradient prediction, we propose a \textbf{GC-W} module for weight gradient correction; 2) based on Neural ODE, we propose a \textbf{GC-ODE} module for hidden states gradient correction.
2 code implementations • CVPR 2022 • Yuanfan Guo, Minghao Xu, Jiawen Li, Bingbing Ni, Xuanyu Zhu, Zhenbang Sun, Yi Xu
In this framework, a set of hierarchical prototypes are constructed and also dynamically updated to represent the hierarchical semantic structures underlying the data in the latent space.
no code implementations • CVPR 2022 • Yaoming Wang, Yangzhou Jiang, Jin Li, Bingbing Ni, Wenrui Dai, Chenglin Li, Hongkai Xiong, Teng Li
Appearance-based Gaze Estimation leverages deep neural networks to regress the gaze direction from monocular images and achieve impressive performance.
no code implementations • CVPR 2022 • Jiancheng Yang, Udaranga Wickramasinghe, Bingbing Ni, Pascal Fua
Deep implicit shape models have become popular in the computer vision community at large but less so for biomedical applications.
1 code implementation • 8 Dec 2021 • Wendong Zhang, Yunbo Wang, Bingbing Ni, Xiaokang Yang
We train the prior learner and the image generator as a unified model without any post-processing.
1 code implementation • 5 Dec 2021 • Jie Qin, Peng Zheng, Yichao Yan, Rong Quan, Xiaogang Cheng, Bingbing Ni
Person search aims to jointly localize and identify a query person from natural, uncropped images, which has been actively studied over the past few years.
Ranked #3 on
Person Search
on CUHK-SYSU
no code implementations • NeurIPS 2021 • Minghao Xu, Meng Qu, Bingbing Ni, Jian Tang
We further propose an efficient and effective algorithm for inference based on mean-field variational inference, in which we first provide a warm initialization by independently predicting the objects and their relations according to the current model, followed by a few iterations of relational reasoning.
no code implementations • 29 Nov 2021 • Yichao Yan, Junjie Li, Shengcai Liao, Jie Qin, Bingbing Ni, Xiaokang Yang
In the meantime, we design an adaptive BN layer in the domain-invariant stream, to approximate the statistics of various unseen domains.
Domain Generalization
Generalizable Person Re-identification
+1
1 code implementation • 7 Nov 2021 • Shanyan Guan, Jingwei Xu, Michelle Z. He, Yunbo Wang, Bingbing Ni, Xiaokang Yang
We consider a new problem of adapting a human mesh reconstruction model to out-of-domain streaming videos, where performance of existing SMPL-based models are significantly affected by the distribution shift represented by different camera parameters, bone lengths, backgrounds, and occlusions.
Ranked #1 on
3D Absolute Human Pose Estimation
on Surreal
3 code implementations • 27 Oct 2021 • Jiancheng Yang, Rui Shi, Donglai Wei, Zequan Liu, Lin Zhao, Bilian Ke, Hanspeter Pfister, Bingbing Ni
We introduce MedMNIST v2, a large-scale MNIST-like dataset collection of standardized biomedical images, including 12 datasets for 2D and 6 datasets for 3D.
1 code implementation • 19 Oct 2021 • Peng Zhou, Lingxi Xie, Bingbing Ni, Qi Tian
The style-based GAN (StyleGAN) architecture achieved state-of-the-art results for generating high-quality images, but it lacks explicit and precise control over camera poses.
Ranked #1 on
3D-Aware Image Synthesis
on FFHQ 256 x 256
no code implementations • 29 Sep 2021 • Xuanhong Chen, Kairui Feng, Naiyuan Liu, Yifan Lu, Bingbing Ni, Ziang Liu, Maofeng Liu
Spatial precipitation downscaling is one of the most important meteorological problems.
1 code implementation • 17 Sep 2021 • Jiancheng Yang, Yi He, Kaiming Kuang, Zudi Lin, Hanspeter Pfister, Bingbing Ni
The proposed A3D consistently outperforms symmetric context fusion operators by considerable margins, and establishes a new \emph{state of the art} on DeepLesion.
1 code implementation • 17 Sep 2021 • Jiancheng Yang, Shixuan Gu, Donglai Wei, Hanspeter Pfister, Bingbing Ni
Manual rib inspections in computed tomography (CT) scans are clinically critical but labor-intensive, as 24 ribs are typically elongated and oblique in 3D volumes.
no code implementations • 9 Sep 2021 • Ruoxi Shi, Borui Yang, Yangzhou Jiang, Chenglong Zhao, Bingbing Ni
Base on the eigenvalues, we can model the energy distribution of adversarial perturbations.
1 code implementation • ICCV 2021 • Minghao Xu, Hang Wang, Bingbing Ni, Riheng Zhu, Zhenbang Sun, Changhu Wang
For tackling such practical problem, we propose a Dual-Learner-based Video Highlight Detection (DL-VHD) framework.
no code implementations • 5 Aug 2021 • Ji Yang, Xinxin Zuo, Sen Wang, Zhenbo Yu, Xingyu Li, Bingbing Ni, Minglun Gong, Li Cheng
A dataset of generic 3D objects with ground-truth annotated skeletons is collected.
3 code implementations • 19 Jun 2021 • Yichao Yan, Jinpeng Li, Shengcai Liao, Jie Qin, Bingbing Ni, Xiaokang Yang, Ling Shao
This paper inventively considers weakly supervised person search with only bounding box annotations.
1 code implementation • 14 Jun 2021 • Wendong Zhang, Junwei Zhu, Ying Tai, Yunbo Wang, Wenqing Chu, Bingbing Ni, Chengjie Wang, Xiaokang Yang
Based on the semantic priors, we further propose a context-aware image inpainting model, which adaptively integrates global semantics and local features in a unified image generator.
2 code implementations • 11 Jun 2021 • Renwang Chen, Xuanhong Chen, Bingbing Ni, Yanhao Ge
In contrast to previous approaches that either lack the ability to generalize to arbitrary identity or fail to preserve attributes like facial expression and gaze direction, our framework is capable of transferring the identity of an arbitrary source face into an arbitrary target face while preserving the attributes of the target face.
Ranked #2 on
Face Swapping
on FaceForensics++
no code implementations • CVPR 2021 • Zefan Li, Chenxi Liu, Alan Yuille, Bingbing Ni, Wenjun Zhang, Wen Gao
For a given unsupervised task, we design multilevel tasks and define different learning stages for the deep network.
1 code implementation • 8 Jun 2021 • Minghao Xu, Hang Wang, Bingbing Ni, Hongyu Guo, Jian Tang
This paper studies unsupervised/self-supervised whole-graph representation learning, which is critical in many tasks such as molecule properties prediction in drug and material discovery.
no code implementations • 4 Jun 2021 • Xuanhong Chen, Hang Wang, Bingbing Ni
Convolution and self-attention are acting as two fundamental building blocks in deep neural networks, where the former extracts local image features in a linear way while the latter non-locally encodes high-order contextual relationships.
Ranked #84 on
Instance Segmentation
on COCO minival
1 code implementation • CVPR 2021 • Linguo Li, Minsi Wang, Bingbing Ni, Hang Wang, Jiancheng Yang, Wenjun Zhang
In this work, we propose a Cross-view Contrastive Learning framework for unsupervised 3D skeleton-based action Representation (CrosSCLR), by leveraging multi-view complementary supervision signal.
1 code implementation • 29 Apr 2021 • Yichao Yan, Jie Qin, Bingbing Ni, Jiaxin Chen, Li Liu, Fan Zhu, Wei-Shi Zheng, Xiaokang Yang, Ling Shao
Extensive experiments on the novel dataset as well as three existing datasets clearly demonstrate the effectiveness of the proposed framework for both group-based re-id tasks.
2 code implementations • 27 Apr 2021 • Minghao Xu, Hang Wang, Bingbing Ni
Multi-Source Domain Adaptation (MSDA) focuses on transferring the knowledge from multiple source domains to the target domain, which is a more practical and challenging problem compared to the conventional single-source domain adaptation.
1 code implementation • CVPR 2021 • Shanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang
This paper considers a new problem of adapting a pre-trained model of human mesh reconstruction to out-of-domain streaming videos.
Ranked #49 on
3D Human Pose Estimation
on 3DPW
no code implementations • ICCV 2021 • Ye Chen, Jinxian Liu, Bingbing Ni, Hang Wang, Jiancheng Yang, Ning Liu, Teng Li, Qi Tian
Then the destroyed shape and the normal shape are sent into a point cloud network to get representations, which are employed to segment points that belong to distorted parts and further reconstruct them to restore the shape to normal.
no code implementations • ICCV 2021 • Zhenbo Yu, Bingbing Ni, Jingwei Xu, Junjie Wang, Chenglong Zhao, Wenjun Zhang
Furthermore, two temporal constraints are proposed to alleviate the scale and pose ambiguity respectively.
Monocular 3D Human Pose Estimation
Unsupervised 3D Human Pose Estimation
no code implementations • ICCV 2021 • Yue Shi, Bingbing Ni, Jinxian Liu, Dingyi Rong, Ye Qian, Wenjun Zhang
Pixel-to-mesh has wide applications, especially in virtual or augmented reality, animation and game industry.
no code implementations • 1 Jan 2021 • Minghao Xu, Hang Wang, Bingbing Ni, Wenjun Zhang, Jian Tang
We propose to disentangle graph structure and node attributes into two distinct sets of representations, and such disentanglement can be done in either the input or the embedding space.
no code implementations • ICCV 2021 • Zhenbo Yu, Junjie Wang, Jingwei Xu, Bingbing Ni, Chenglong Zhao, Minsi Wang, Wenjun Zhang
The challenges of the latter task are two folds: (1) pose failure (i. e., pose mismatching -- different skeleton definitions in dataset and SMPL , and pose ambiguity -- endpoints have arbitrary joint angle configurations for the same 3D joint coordinates).
1 code implementation • 21 Dec 2020 • Xuanhong Chen, Ziang Liu, Ting Qiu, Bingbing Ni, Naiyuan Liu, XiWei Hu, Yuhan Li
Extensive experiments well demonstrate the effectiveness and feasibility of our framework in different image-translation tasks.
1 code implementation • 17 Dec 2020 • Xuanhong Chen, Kairui Feng, Naiyuan Liu, Bingbing Ni, Yifan Lu, Zhengyan Tong, Ziang Liu
To alleviate these obstacles, we present the first large-scale spatial precipitation downscaling dataset named RainNet, which contains more than $62, 400$ pairs of high-quality low/high-resolution precipitation maps for over $17$ years, ready to help the evolution of deep learning models in precipitation downscaling.
1 code implementation • 16 Dec 2020 • Zhengyan Tong, Xuanhong Chen, Bingbing Ni, Xiaohang Wang
Existing pencil sketch algorithms are based on texture rendering rather than the direct imitation of strokes, making them unable to show the drawing process but only a final result.
3 code implementations • ICCV 2021 • Peng Zhou, Lingxi Xie, Bingbing Ni, Cong Geng, Qi Tian
The conditional generative adversarial network (cGAN) is a powerful tool of generating high-quality images, but existing approaches mostly suffer unsatisfying performance or the risk of mode collapse.
Ranked #9 on
Conditional Image Generation
on ImageNet 128x128
1 code implementation • ECCV 2020 • Xuanhong Chen, Bingbing Ni, Naiyuan Liu, Ziang Liu, Yiliu Jiang, Loc Truong, Qi Tian
In contrast to great success of memory-consuming face editing methods at a low resolution, to manipulate high-resolution (HR) facial images, i. e., typically larger than 7682 pixels, with very limited memory is still challenging.
3 code implementations • 28 Oct 2020 • Jiancheng Yang, Rui Shi, Bingbing Ni
We present MedMNIST, a collection of 10 pre-processed medical open datasets.
1 code implementation • NeurIPS 2020 • Jiancheng Yang, Yangzhou Jiang, Xiaoyang Huang, Bingbing Ni, Chenglong Zhao
This paper addresses the challenging black-box adversarial attack problem, where only classification confidence of a victim model is available.
1 code implementation • 16 Oct 2020 • Xuanhong Chen, Xirui Yan, Naiyuan Liu, Ting Qiu, Bingbing Ni
Furthermore, the results are with distinctive artistic style and retain the anisotropic semantic information.
1 code implementation • 8 Oct 2020 • Jiancheng Yang, Jiajun Chen, Kaiming Kuang, Tiancheng Lin, Junjun He, Bingbing Ni
Furthermore, we experiment the proposed method on an in-house, retrospective dataset of real-world non-small cell lung cancer patients under anti-PD-1 immunotherapy.
Ranked #1 on
Text-To-Speech Synthesis
on 20000 utterances
(using extra training data)
no code implementations • 8 Oct 2020 • Jiancheng Yang, Mingze Gao, Kaiming Kuang, Bingbing Ni, Yunlang She, Dong Xie, Chang Chen
A three-level hierarchical classification system for pulmonary lesions is developed, which covers most diseases in cancer-related diagnosis.
no code implementations • ECCV 2020 • Jingwei Xu, Huazhe Xu, Bingbing Ni, Xiaokang Yang, Xiaolong Wang, Trevor Darrell
Generating diverse and natural human motion is one of the long-standing goals for creating intelligent characters in the animated world.
no code implementations • ECCV 2020 • Jinxian Liu, Minghui Yu, Bingbing Ni, Ye Chen
We develop a novel learning scheme named Self-Prediction for 3D instance and semantic segmentation of point clouds.
1 code implementation • ECCV 2020 • Hang Wang, Minghao Xu, Bingbing Ni, Wenjun Zhang
Transferring knowledges learned from multiple source domains to target domain is a more practical and challenging task than conventional single-source domain adaptation.
Domain Adaptation
Multi-Source Unsupervised Domain Adaptation
1 code implementation • ICML 2020 • Jingwei Xu, Huazhe Xu, Bingbing Ni, Xiaokang Yang, Trevor Darrell
In video prediction tasks, one major challenge is to capture the multi-modal nature of future contents and dynamics.
no code implementations • 3 Jul 2020 • Shanyan Guan, Ying Tai, Bingbing Ni, Feida Zhu, Feiyue Huang, Xiaokang Yang
The latent code of the recent popular model StyleGAN has learned disentangled representations thanks to the multi-layer style-based generator.
1 code implementation • 25 Jun 2020 • Peng Zhou, Lingxi Xie, Xiaopeng Zhang, Bingbing Ni, Qi Tian
To learn the sampling policy, a Markov decision process is embedded into the search algorithm and a moving average is applied for better stability.
1 code implementation • 5 May 2020 • Jiancheng Yang, Yi He, Xiaoyang Huang, Jingwei Xu, Xiaodan Ye, Guangyu Tao, Bingbing Ni
This paper addresses a fundamental challenge in 3D medical image processing: how to deal with imaging thickness.
no code implementations • 12 Apr 2020 • Jiancheng Yang, Haoran Deng, Xiaoyang Huang, Bingbing Ni, Yi Xu
In this study, we propose a multiple instance learning (MIL) approach and empirically prove the benefit to learn the relations between multiple nodules.
1 code implementation • CVPR 2020 • Minghao Xu, Hang Wang, Bingbing Ni, Qi Tian, Wenjun Zhang
To mitigate these problems, we propose a Graph-induced Prototype Alignment (GPA) framework to seek for category-level domain alignment via elaborate prototype representations.
no code implementations • ICLR 2020 • Peng Zhou, Bingbing Ni, Lingxi Xie, Xiaopeng Zhang, Hang Wang, Cong Geng, Qi Tian
In the field of Generative Adversarial Networks (GANs), how to design a stable training strategy remains an open problem.
1 code implementation • 4 Dec 2019 • Minghao Xu, Jian Zhang, Bingbing Ni, Teng Li, Chengjie Wang, Qi Tian, Wenjun Zhang
In this paper, we present adversarial domain adaptation with domain mixup (DM-ADA), which guarantees domain-invariance in a more continuous latent space and guides the domain discriminator in judging samples' difference relative to source and target domains.
2 code implementations • 24 Nov 2019 • Jiancheng Yang, Xiaoyang Huang, Yi He, Jingwei Xu, Canqian Yang, Guozheng Xu, Bingbing Ni
Theoretically, ANY 2D CNN (ResNet, DenseNet, or DeepLab) is able to be converted into a 3D ACS CNN, with pretrained weight of a same parameter size.
no code implementations • 14 Nov 2019 • Yugang Chen, Muchun Chen, Chaoyue Song, Bingbing Ni
In a nutshell, our method maps photo into a feature model and renders the feature model back into image space.
no code implementations • 30 Oct 2019 • Chaoyue Song, Yugang Chen, Shulai Zhang, Bingbing Ni
In this work, we use facial landmarks to make the deformation for facial images more authentic.
no code implementations • 20 Oct 2019 • Jiancheng Yang, Rongyao Fang, Bingbing Ni, Yamin Li, Yi Xu, Linguo Li
The final diagnosis is obtained by combining the ambiguity prior sample and lesion representation, and the whole network named $DenseSharp^{+}$ is end-to-end trainable.
no code implementations • 25 Sep 2019 • Jingwei Xu, Huazhe Xu, Bingbing Ni, Xiaokang Yang, Trevor Darrell
Learning diverse and natural behaviors is one of the longstanding goal for creating intelligent characters in the animated world.
no code implementations • 13 Sep 2019 • Xiaoyang Huang, Jiancheng Yang, Linguo Li, Haoran Deng, Bingbing Ni, Yi Xu
Emergence of artificial intelligence techniques in biomedical applications urges the researchers to pay more attention on the uncertainty quantification (UQ) in machine-assisted medical decision making.
no code implementations • 6 Aug 2019 • Yunxiang Zhang, Chenglong Zhao, Bingbing Ni, Jian Zhang, Haoran Deng
To address the limitations of existing magnitude-based pruning algorithms in cases where model weights or activations are of large and similar magnitude, we propose a novel perspective to discover parameter redundancy among channels and accelerate deep CNNs via channel pruning.
no code implementations • CVPR 2019 • Jiancheng Yang, Qiang Zhang, Bingbing Ni, Linguo Li, Jinxian Liu, Mengdie Zhou, Qi Tian
Thereby, we for the first time propose an end-to-end learnable and task-agnostic sampling operation, named Gumbel Subset Sampling (GSS), to select a representative subset of input points.
no code implementations • CVPR 2019 • Yichao Yan, Qiang Zhang, Bingbing Ni, Wendong Zhang, Minghao Xu, Xiaokang Yang
Person re-identification has achieved great progress with deep convolutional neural networks.
no code implementations • 28 Feb 2019 • Jiancheng Yang, Qiang Zhang, Rongyao Fang, Bingbing Ni, Jinxian Liu, Qi Tian
A set of novel 3D point cloud attack operations are proposed via pointwise gradient perturbation and adversarial point attachment / detachment.
no code implementations • 27 Feb 2019 • Zhenyu Duan, Martin Renqiang Min, Li Erran Li, Mingbo Cai, Yi Xu, Bingbing Ni
In spite of achieving revolutionary successes in machine learning, deep convolutional neural networks have been recently found to be vulnerable to adversarial attacks and difficult to generalize to novel test images with reasonably large geometric transformations.
1 code implementation • NeurIPS 2018 • Jingwei Xu, Bingbing Ni, Xiaokang Yang
Most adversarial learning based video prediction methods suffer from image blur, since the commonly used adversarial and regression loss pair work rather in a competitive way than collaboration, yielding compromised blur effect.
1 code implementation • ECCV 2018 • Xiankai Lu, Chao Ma, Bingbing Ni, Xiaokang Yang, Ian Reid, Ming-Hsuan Yang
Regression trackers directly learn a mapping from regularly dense samples of target objects to soft labels, which are usually generated by a Gaussian function, to estimate target positions.
no code implementations • ECCV 2018 • Jie Zhang, Yi Xu, Bingbing Ni, Zhenyu Duan
The main contributions of the proposed frame- work are highlighted in two facets: (1) We put forward a multiple-task learning framework with mutually interlinked sub-structures between lane segmentation and lane boundary detection to improve overall performance.
no code implementations • ECCV 2018 • Yang Shen, Bingbing Ni, Zefan Li, Ning Zhuang
Predicting future activities from an egocentric viewpoint is of particular interest in assisted living.
no code implementations • CVPR 2018 • Jingwei Xu, Bingbing Ni, Zefan Li, Shuo Cheng, Xiaokang Yang
Despite recent emergence of adversarial based methods for video prediction, existing algorithms often produce unsatisfied results in image regions with rich structural information (i. e., object boundary) and detailed motion (i. e., articulated body movement).
no code implementations • CVPR 2018 • Huanyu Yu, Shuo Cheng, Bingbing Ni, Minsi Wang, Jian Zhang, Xiaokang Yang
First, to facilitate this novel research of fine-grained video caption, we collected a novel dataset called Fine-grained Sports Narrative dataset (FSN) that contains 2K sports videos with ground-truth narratives from YouTube. com.
1 code implementation • CVPR 2018 • Zan Shen, Yi Xu, Bingbing Ni, Minsi Wang, Jianguo Hu, Xiaokang Yang
Crowd counting or density estimation is a challenging task in computer vision due to large scale variations, perspective distortions and serious occlusions, etc.
Ranked #4 on
Crowd Counting
on WorldExpo’10
no code implementations • CVPR 2018 • Peng Zhou, Bingbing Ni, Cong Geng, Jianguo Hu, Yi Xu
Scale problem lies in the heart of object detection.
no code implementations • CVPR 2018 • Taiping Yao, Minsi Wang, Bingbing Ni, Huawei Wei, Xiaokang Yang
Most human activity analysis works (i. e., recognition orãprediction) only focus on a single granularity, i. e., eitherãmodelling global motion based on the coarse level movement such as human trajectories orãforecasting future detailed action based on body partsâ movement such as skeleton motion.
no code implementations • CVPR 2018 • Jinxian Liu, Bingbing Ni, Yichao Yan, Peng Zhou, Shuo Cheng, Jianguo Hu
On the other hand, in addition to the conventional discriminator of GAN (i. e., to distinguish between REAL/FAKE samples), we propose a novel guider sub-network which encourages the generated sample (i. e., with novel pose) towards better satisfying the ReID loss (i. e., cross-entropy ReID loss, triplet ReID loss).
no code implementations • 13 Sep 2017 • Lixue Zhuang, Yi Xu, Bingbing Ni, Hongteng Xu
In this work, we reveal an important fact that binarizing different layers has a widely-varied effect on the compression ratio of network and the loss of performance.
no code implementations • ICCV 2017 • Zefan Li, Bingbing Ni, Wenjun Zhang, Xiaokang Yang, Wen Gao
Input binarization has shown to be an effective way for network acceleration.
no code implementations • 4 Jul 2017 • Yichao Yan, Jingwei Xu, Bingbing Ni, Xiaokang Yang
This work make the first attempt to generate articulated human motion sequence from a single image.
Ranked #2 on
Gesture-to-Gesture Translation
on NTU Hand Digit
no code implementations • CVPR 2017 • Jie Qin, Li Liu, Ling Shao, Bingbing Ni, Chen Chen, Fumin Shen, Yunhong Wang
Extensive experiments on four realistic action datasets in terms of three tasks (i. e., partial action retrieval, recognition and prediction) clearly show the superiority of PRBC over the state-of-the-art methods, along with significantly reduced memory load and computational costs during the online test.
no code implementations • CVPR 2017 • Rui Yang, Bingbing Ni, Chao Ma, Yi Xu, Xiaokang Yang
We introduce a Multiple Granularity Analysis framework for video segmentation in a coarse-to-fine manner.
no code implementations • CVPR 2017 • Minsi Wang, Bingbing Ni, Xiaokang Yang
However, most of the previous activity recognition methods do not offer a flexible and scalable scheme to handle the high order context modeling problem.
no code implementations • CVPR 2017 • Jie Qin, Li Liu, Ling Shao, Fumin Shen, Bingbing Ni, Jiaxin Chen, Yunhong Wang
Our ZSECOC equips the conventional ECOC with the additional capability of ZSAR, by addressing the domain shift problem.
Ranked #4 on
Zero-Shot Action Recognition
on Olympics
no code implementations • 10 Jun 2017 • Donghao Luo, Bingbing Ni, Yichao Yan, Xiaokang Yang
Towards this end, we propose a novel loopy recurrent neural network (Loopy RNN), which is capable of aggregating relationship information of two input images in a progressive/iterative manner and outputting the consolidated matching score in the final iteration.
no code implementations • 1 Jun 2017 • Wendong Zhang, Bingbing Ni, Yichao Yan, Jingwei Xu, Xiaokang Yang
Key to automatically generate natural scene images is to properly arrange among various spatial elements, especially in the depth direction.
no code implementations • 26 May 2017 • Yichao Yan, Bingbing Ni, Xiaokang Yang
Predicting human interaction is challenging as the on-going activity has to be inferred based on a partially observed video.
1 code implementation • 23 Jan 2017 • Yichao Yan, Bingbing Ni, Zhichao Song, Chao Ma, Yan Yan, Xiaokang Yang
We address the person re-identification problem by effectively exploiting a globally discriminative feature representation from a sequence of tracked human regions/patches.
no code implementations • CVPR 2016 • Jun Yuan, Bingbing Ni, Xiaokang Yang, Ashraf A. Kassim
We investigate the feature design and classification architectures in temporal action localization.
no code implementations • CVPR 2016 • Bingbing Ni, Xiaokang Yang, Shenghua Gao
Fine grained video action analysis often requires reliable detection and tracking of various interacting objects and human body parts, denoted as interactional object parsing.
no code implementations • CVPR 2016 • Yang Zhou, Bingbing Ni, Richang Hong, Xiaokang Yang, Qi Tian
Firstly, a novel EM-like learning framework is proposed to train the pixel-level deep convolutional neural network (DCNN) by seamlessly integrating weakly supervised data (i. e., massive bounding box annotations) with a small set of strongly supervised data (i. e., fully annotated hand segmentation maps) to achieve state-of-the-art hand segmentation performance.
no code implementations • 2 Sep 2015 • Changzhi Luo, Bingbing Ni, Jun Yuan, Jian-Feng Wang, Shuicheng Yan, Meng Wang
This scheme leverages motion cues such as motion boundary and motion magnitude (in contrast, camera motion is usually considered as "noise" for most previous methods) to generate a more compact and discriminative set of object proposals, which are more closely related to the objects which are being manipulated.
no code implementations • CVPR 2015 • Yang Zhou, Bingbing Ni, Richang Hong, Meng Wang, Qi Tian
Secondly, these object regions are matched and tracked across frames to form a large spatio-temporal graph based on the appearance matching and the dense motion trajectories through them.
Fine-grained Action Recognition
Human-Object Interaction Detection
+2
no code implementations • CVPR 2015 • Bingbing Ni, Pierre Moulin, Xiaokang Yang, Shuicheng Yan
Inspired by the recent advance in sentence regularization for text classification, we introduce a Motion Part Regularization framework to mining discriminative semi-local groups of dense trajectories.
no code implementations • 6 Feb 2015 • Teng Li, Huan Chang, Meng Wang, Bingbing Ni, Richang Hong, Shuicheng Yan
Then, existing models, popular algorithms, evaluation protocols, as well as system performance are provided corresponding to different aspects of crowded scene analysis.
no code implementations • 22 Dec 2014 • Jun Yuan, Bingbing Ni, Ashraf A. Kassim
This is a generic regression framework that fits many applications.
no code implementations • 22 Jun 2014 • Yunchao Wei, Wei Xia, Junshi Huang, Bingbing Ni, Jian Dong, Yao Zhao, Shuicheng Yan
Convolutional Neural Network (CNN) has demonstrated promising performance in single-label image classification tasks.
no code implementations • CVPR 2014 • Bingbing Ni, Vignesh R. Paramathayalan, Pierre Moulin
We propose to decompose the fine-grained human activity analysis problem into two sequential tasks with increasing granularity.
no code implementations • CVPR 2014 • Bingbing Ni, Teng Li, Pierre Moulin
Specifically, for the kernel representation calculated for each input feature instance, we multiply it element-wise with a latent binary vector named as instance selection variables, which targets at selecting good instances and attenuate the effect of ambiguous ones in the resulting new kernel representation.