no code implementations • 23 Jan 2025 • Wailing Tang, Biqi Yang, Pheng-Ann Heng, Yun-hui Liu, Chi-Wing Fu
Few-shot Semantic Segmentation (FSS) is a challenging task that utilizes limited support images to segment associated unseen objects in query images.
1 code implementation • 23 Jan 2025 • Ziyu Guo, Renrui Zhang, Chengzhuo Tong, Zhizheng Zhao, Peng Gao, Hongsheng Li, Pheng-Ann Heng
We hope our study provides unique insights and paves a new path for integrating CoT reasoning with autoregressive image generation.
no code implementations • 20 Jan 2025 • Yiyi Zhang, Xingyu Chen, Kexin Chen, Yuyang Du, Xilin Dang, Pheng-Ann Heng
By incorporating an innovative balanced seed in the data generation process, our framework systematically considers both legitimate and illegitimate requests.
no code implementations • 16 Jan 2025 • Shi Qiu, Binzhu Xie, Qixuan Liu, Pheng-Ann Heng
3D Gaussian Splatting (3DGS) has recently emerged as an innovative and efficient 3D representation technique.
1 code implementation • 18 Dec 2024 • Jiacheng Liu, Peng Tang, Wenfeng Wang, Yuhang Ren, Xiaofeng Hou, Pheng-Ann Heng, Minyi Guo, Chao Li
This survey both provides a structured overview of existing solutions and identifies key challenges and promising research directions in MoE inference optimization.
no code implementations • 12 Dec 2024 • Yuqi Tong, Yue Qiu, Ruiyang Li, Shi Qiu, Pheng-Ann Heng
We present MS2Mesh-XR, a novel multi-modal sketch-to-mesh generation pipeline that enables users to create realistic 3D objects in extended reality (XR) environments using hand-drawn sketches assisted by voice inputs.
no code implementations • 9 Dec 2024 • Shi Qiu, Binzhu Xie, Qixuan Liu, Pheng-Ann Heng
3D Gaussian Splatting (3DGS) has attracted significant attention for its potential to revolutionize 3D representation, rendering, and interaction.
no code implementations • 22 Nov 2024 • Yi Wang, Jiaze Wang, Ziyu Guo, Renrui Zhang, Donghao Zhou, Guangyong Chen, Anfeng Liu, Pheng-Ann Heng
Recently, Transformer-based models have advanced point cloud understanding by leveraging self-attention mechanisms; however, these methods often overlook latent information in less prominent regions, leading to increased sensitivity to perturbations and limited global comprehension.
no code implementations • 3 Nov 2024 • Peng Tang, Jiacheng Liu, Xiaofeng Hou, YiFei PU, Jing Wang, Pheng-Ann Heng, Chao Li, Minyi Guo
We present HOBBIT, a mixed precision expert offloading system to enable flexible and efficient MoE inference.
no code implementations • 19 Oct 2024 • Hanqun Cao, Mutian He, Ning Ma, Chang-Yu Hsieh, Chunbin Gu, Pheng-Ann Heng
DNA-encoded library (DEL) screening has revolutionized the detection of protein-ligand interactions through read counts, enabling rapid exploration of vast chemical spaces.
no code implementations • 17 Oct 2024 • Donghao Zhou, Jiancheng Huang, Jinbin Bai, Jiaze Wang, Hao Chen, Guangyong Chen, Xiaowei Hu, Pheng-Ann Heng
Recent text-to-image models generate high-quality images from text prompts but lack precise control over specific components within visual concepts.
1 code implementation • 14 Oct 2024 • Runsong Zhu, Shi Qiu, Qianyi Wu, Ka-Hei Hui, Pheng-Ann Heng, Chi-Wing Fu
Panoptic lifting is an effective technique to address the 3D panoptic segmentation task by unprojecting 2D panoptic segmentations from multiple views to a 3D scene.
no code implementations • 26 Sep 2024 • Yusong Wang, Chaoran Cheng, Shaoning Li, Yuxuan Ren, Bin Shao, Ge Liu, Pheng-Ann Heng, Nanning Zheng
Geometric graph neural networks (GNNs) have emerged as powerful tools for modeling molecular geometry.
no code implementations • 23 Sep 2024 • Rui Cao, Chuanxin Song, Biqi Yang, Jiangliu Wang, Pheng-Ann Heng, Yun-hui Liu
Unseen Object Instance Segmentation (UOIS) is crucial for autonomous robots operating in unstructured environments.
1 code implementation • 3 Sep 2024 • Jiaqi Xu, Mengyang Wu, Xiaowei Hu, Chi-Wing Fu, Qi Dou, Pheng-Ann Heng
For clearness enhancement, we use real-world data, utilizing a dual-step strategy with pseudo-labels assessed by vision-language models and weather prompt learning.
1 code implementation • 3 Sep 2024 • Xiaowei Hu, Zhenghao Xing, Tianyu Wang, Chi-Wing Fu, Pheng-Ann Heng
Shadows are formed when light encounters obstacles, leading to areas of diminished illumination.
1 code implementation • 29 Aug 2024 • Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Chengzhuo Tong, Peng Gao, Chunyuan Li, Pheng-Ann Heng
We introduce SAM2Point, a preliminary exploration adapting Segment Anything Model 2 (SAM 2) for zero-shot and promptable 3D segmentation.
1 code implementation • 20 Aug 2024 • Diandian Guo, Weixin Si, Zhixi Li, Jialun Pei, Pheng-Ann Heng
Pringle maneuver (PM) in laparoscopic liver resection aims to reduce blood loss and provide a clear surgical view by intermittently blocking blood inflow of the liver, whereas prolonged PM may cause ischemic injury.
1 code implementation • 18 Aug 2024 • Haoxin Yang, Xuemiao Xu, Cheng Xu, Huaidong Zhang, Jing Qin, Yi Wang, Pheng-Ann Heng, Shengfeng He
This paper introduces G²Face, which leverages both generative and geometric priors to enhance identity manipulation, achieving high-quality reversible face anonymization without compromising data utility.
1 code implementation • 16 Aug 2024 • Kaixiang Yang, Wenqi Shan, Xudong Li, Xuan Wang, Xikai Yang, Xi Wang, Pheng-Ann Heng, Qiang Li, Zhiwei Wang
Multi-modal brain tumor segmentation typically involves four magnetic resonance imaging (MRI) modalities, while incomplete modalities significantly degrade performance.
no code implementations • 8 Jul 2024 • Shaoning Li, Mingyu Li, Yusong Wang, Xinheng He, Nanning Zheng, Jian Zhang, Pheng-Ann Heng
Investigating conformational landscapes of proteins is a crucial way to understand their biological functions and properties.
1 code implementation • 7 Jul 2024 • Juzheng Miao, Cheng Chen, Keli Zhang, Jie Chuai, Quanzheng Li, Pheng-Ann Heng
To harness the power of foundation models for application in SSL, we propose a cross prompting consistency method with segment anything model (CPC-SAM) for semi-supervised medical image segmentation.
1 code implementation • 7 Jul 2024 • Juzheng Miao, Cheng Chen, Keli Zhang, Jie Chuai, Quanzheng Li, Pheng-Ann Heng
By using solely a single template image, our method demonstrates significant superiority over strong state-of-the-art one-shot landmark detection methods.
no code implementations • 30 Jun 2024 • Haiqiao Wang, Hong Wu, Zhuoyuan Wang, Peiyan Yue, Dong Ni, Pheng-Ann Heng, Yi Wang
In consequence, this survey provides a narrative analysis of this field, outlining the evolution of image processing methods in the context of TRUS image analysis and meanwhile highlighting their relevant contributions.
1 code implementation • 28 Jun 2024 • Wei Li, Jingyang Zhang, Pheng-Ann Heng, Lixu Gu
Generalist segmentation models are increasingly favored for diverse tasks involving various objects from different image sources.
1 code implementation • 26 Jun 2024 • Dunyuan Xu, Xi Wang, Jingyang Zhang, Pheng-Ann Heng
To achieve this, we create the orientational gradient alignment to ensure memorizability on previous sites, and arbitrary gradient alignment to enhance generalizability on unseen sites.
1 code implementation • 25 Jun 2024 • Jialun Pei, Ruize Cui, Yaoqian Li, Weixin Si, Jing Qin, Pheng-Ann Heng
Liver anatomical landmarks, e.g., ridge and ligament, serve as important markers for 2D-3D alignment, which can significantly enhance the spatial perception of surgeons for precise surgery.
1 code implementation • 20 Jun 2024 • Long Lei, Jun Zhou, Jialun Pei, Baoliang Zhao, Yueming Jin, Yuen-Chun Jeremy Teoh, Jing Qin, Pheng-Ann Heng
A comprehensive guidance view for cardiac interventional surgery can be provided by the real-time fusion of the intraoperative 2D images and preoperative 3D volume based on the ultrasound frame-to-volume registration.
no code implementations • 29 May 2024 • Jiaze Wang, Hao Chen, Hongcan Xu, Jinpeng Li, Bowen Wang, Kun Shao, Furui Liu, Huaxi Chen, Guangyong Chen, Pheng-Ann Heng
Weather forecasting plays a critical role in various sectors, driving decision-making and risk management.
no code implementations • 28 May 2024 • Jiaze Wang, Yi Wang, Ziyu Guo, Renrui Zhang, Donghao Zhou, Guangyong Chen, Anfeng Liu, Pheng-Ann Heng
We introduce MM-Mixing, a multi-modal mixing alignment framework for 3D understanding.
no code implementations • 14 Apr 2024 • Diandian Guo, Manxi Lin, Jialun Pei, He Tang, Yueming Jin, Pheng-Ann Heng
A comprehensive understanding of surgical scenes allows for monitoring of the surgical process, reducing the occurrence of accidents and enhancing efficiency for medical professionals.
1 code implementation • 8 Mar 2024 • Shoujin Huang, GuanXiong Luo, Xi Wang, Ziran Chen, Yuwan Wang, Huaishui Yang, Pheng-Ann Heng, Lingyan Zhang, Mengye Lyu
In general, diffusion model-based MRI reconstruction methods incrementally remove artificially added noise while imposing data consistency to reconstruct the underlying images.
1 code implementation • 26 Feb 2024 • Yuyang Du, Kexin Chen, Yue Zhan, Chang Han Low, Tao You, Mobarakol Islam, Ziyu Guo, Yueming Jin, Guangyong Chen, Pheng-Ann Heng
We further design an adaptive weight assignment approach that balances the generalization ability of the LLM and the domain expertise of the old CL model.
1 code implementation • 22 Feb 2024 • Jialun Pei, Diandian Guo, Jingyang Zhang, Manxi Lin, Yueming Jin, Pheng-Ann Heng
In this study, we introduce a novel single-stage bi-modal transformer framework for SGG in the OR, termed S²Former-OR, which aims to leverage multi-view 2D scenes and 3D point clouds in a complementary way for SGG in an end-to-end manner.
no code implementations • 21 Feb 2024 • Xikai Yang, Jian Wu, Xi Wang, Yuchen Yuan, Ning Li Wang, Pheng-Ann Heng
Extensive experiments on the Sequential fundus Images for Glaucoma Forecast (SIGF) dataset demonstrate the superiority of the proposed MST-former method, achieving an AUC of 98.6% for glaucoma forecasting.
1 code implementation • 4 Feb 2024 • Lanqing Li, Hai Zhang, Xinyu Zhang, Shatong Zhu, Yang Yu, Junqiao Zhao, Pheng-Ann Heng
As demonstrations, we propose a supervised and a self-supervised implementation of $I(Z; M)$, and empirically show that the corresponding optimization algorithms exhibit remarkable generalization across a broad spectrum of RL benchmarks, context shift scenarios, data qualities and deep learning architectures.
1 code implementation • 2 Feb 2024 • Yinqiao Wang, Hao Xu, Pheng-Ann Heng, Chi-Wing Fu
Estimating 3D hand mesh from RGB images is a longstanding track, in which occlusion is one of the most challenging problems.
no code implementations • 22 Jan 2024 • Hao Chen, Jiaze Wang, Ziyu Guo, Jinpeng Li, Donghao Zhou, Bian Wu, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng
Sign language recognition (SLR) plays a vital role in facilitating communication for the hearing-impaired community.
no code implementations • 22 Jan 2024 • Yu Zhu, Kang Li, Lequan Yu, Pheng-Ann Heng
Recent studies have made remarkable progress in histopathology classification.
no code implementations • 17 Jan 2024 • Dunyuan Xu, Xi Wang, Jinyue Cai, Pheng-Ann Heng
Brain tumors represent one of the most fatal cancers around the world, and are very common in children and the elderly.
no code implementations • 8 Nov 2023 • Biqi Yang, Weiliang Tang, Xiaojie Gao, Xianzhi Li, Yun-hui Liu, Chi-Wing Fu, Pheng-Ann Heng
In large-scale storehouses, precise instance masks are crucial for robotic bin picking but are challenging to obtain.
1 code implementation • 16 Sep 2023 • Cheng Chen, Juzheng Miao, Dufan Wu, Zhiling Yan, Sekeun Kim, Jiang Hu, Aoxiao Zhong, Zhengliang Liu, Lichao Sun, Xiang Li, Tianming Liu, Pheng-Ann Heng, Quanzheng Li
The Segment Anything Model (SAM), a foundation model for general image segmentation, has demonstrated impressive zero-shot performance across numerous natural image segmentation tasks.
5 code implementations • 1 Sep 2023 • Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng
We introduce Point-Bind, a 3D multi-modality model aligning point clouds with 2D image, language, audio, and video.
Ranked #5 on 3D Question Answering (3D-QA) on 3D MM-Vet
1 code implementation • 23 Aug 2023 • Donghao Zhou, Jialin Li, Jinpeng Li, Jiancheng Huang, Qiang Nie, Yong liu, Bin-Bin Gao, Qiong Wang, Pheng-Ann Heng, Guangyong Chen
Unfortunately, the resultant noisy bounding boxes could cause corrupt supervision signals and thus diminish detection performance.
1 code implementation • 26 Jul 2023 • Jialun Pei, Zhangjun Zhou, Yueming Jin, He Tang, Pheng-Ann Heng
First, a dual-size input feeds into the shared backbone to produce more holistic and detailed features while keeping the model lightweight.
1 code implementation • 16 Jul 2023 • Jialun Pei, Tao Jiang, He Tang, Nian Liu, Yueming Jin, Deng-Ping Fan, Pheng-Ann Heng
We propose a novel approach for RGB-D salient instance segmentation using a dual-branch cross-modal feature calibration architecture called CalibNet.
1 code implementation • 23 Jun 2023 • Shizhan Gong, Yuan Zhong, Wenao Ma, Jinpeng Li, Zhao Wang, Jingyang Zhang, Pheng-Ann Heng, Qi Dou
Notably, the original SAM architecture is designed for 2D natural images, and therefore cannot effectively extract 3D spatial information from volumetric medical data.
1 code implementation • 23 Jun 2023 • Zhizhong Chai, Luyang Luo, Huangjing Lin, Pheng-Ann Heng, Hao Chen
To tackle this challenge, the literature on object detection has witnessed an increase of weakly-supervised and semi-supervised approaches, yet still lacks a unified framework that leverages various forms of fully-labeled, weakly-labeled, and unlabeled data.
no code implementations • 30 May 2023 • Yanwen Li, Luyang Luo, Huangjing Lin, Pheng-Ann Heng, Hao Chen
To guide the segmentation branch to learn from richer high-resolution features, we propose a feature affinity module and a scale affinity module to enhance the multi-task learning of the dual branches.
1 code implementation • 21 Mar 2023 • Yang Yu, Danruo Deng, Furui Liu, Yueming Jin, Qi Dou, Guangyong Chen, Pheng-Ann Heng
Open-set semi-supervised learning (Open-set SSL) considers a more practical scenario, where unlabeled data and test data contain new categories (outliers) not observed in labeled data (inliers).
1 code implementation • CVPR 2023 • Jiaqi Xu, Xiaowei Hu, Lei Zhu, Qi Dou, Jifeng Dai, Yu Qiao, Pheng-Ann Heng
Video dehazing aims to recover haze-free frames with high visibility and contrast.
no code implementations • 13 Mar 2023 • Junde Xu, Zikai Lin, Donghao Zhou, Yaodong Yang, Xiangyun Liao, Bian Wu, Guangyong Chen, Pheng-Ann Heng
In particular, we evaluate our method on two representative MIM frameworks, MAE and iBOT.
no code implementations • ICCV 2023 • Hao Chen, Jiaze Wang, Kun Shao, Furui Liu, Jianye Hao, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng
Specifically, our Traj-MAE employs diverse masking strategies to pre-train the trajectory encoder and map encoder, allowing for the capture of social and temporal information among agents while leveraging the effect of environment from multiple granularities.
no code implementations • 12 Mar 2023 • Yi Wang, Jiaze Wang, Jinpeng Li, Zixu Zhao, Guangyong Chen, Anfeng Liu, Pheng-Ann Heng
With Point-MAE as our baseline, our model surpasses previous methods by a significant margin, achieving 86.3% accuracy on ScanObjectNN and 94.1% accuracy on ModelNet40.
1 code implementation • 6 Mar 2023 • Bowen Wang, Chen Liang, Jiaze Wang, Furui Liu, Shaogang Hao, Dong Li, Jianye Hao, Guangyong Chen, Xiaolong Zou, Pheng-Ann Heng
Reversely, the model Reconstructs a more robust equilibrium state prediction by transforming edge-level predictions to node-level with a sphere-fitting algorithm.
Graph Neural Network
Initial Structure to Relaxed Energy (IS2RE), Direct
1 code implementation • 3 Mar 2023 • Danruo Deng, Guangyong Chen, Yang Yu, Furui Liu, Pheng-Ann Heng
To address this problem, we propose a novel method, Fisher Information-based Evidential Deep Learning ($\mathcal{I}$-EDL).
no code implementations • 27 Feb 2023 • Ziyu Guo, Renrui Zhang, Longtian Qiu, Xianzhi Li, Pheng-Ann Heng
In this paper, we explore how the 2D modality can benefit 3D masked autoencoding, and propose Joint-MAE, a 2D-3D joint MAE framework for self-supervised 3D point cloud pre-training.
1 code implementation • CVPR 2023 • Deng-Bao Wang, Lanqing Li, Peilin Zhao, Pheng-Ann Heng, Min-Ling Zhang
It has been recently found that models trained with mixup also perform well on uncertainty calibration.
1 code implementation • CVPR 2023 • Zhipeng Zhou, Lanqing Li, Peilin Zhao, Pheng-Ann Heng, Wei Gong
It's widely acknowledged that deep learning models with flatter minima in their loss landscapes tend to generalize better.
1 code implementation • ICCV 2023 • Juzheng Miao, Cheng Chen, Furui Liu, Hao Wei, Pheng-Ann Heng
Specifically, we first point out the importance of algorithmic independence between two networks or branches in SSL, which is often overlooked in the literature.
2 code implementations • CVPR 2023 • Donghao Zhou, Chunbin Gu, Junde Xu, Furui Liu, Qiong Wang, Guangyong Chen, Pheng-Ann Heng
In biological research, fluorescence staining is a key technique to reveal the locations and morphology of subcellular structures.
1 code implementation • 23 Nov 2022 • Zhenghao Xing, Tianyu Wang, Xiaowei Hu, Haoran Wu, Chi-Wing Fu, Pheng-Ann Heng
Instance shadow detection, crucial for applications such as photo editing and light direction estimation, has undergone significant advancements in predicting shadow instances, object instances, and their associations.
1 code implementation • 10 Nov 2022 • Liansheng Wang, Jiacheng Wang, Lei Zhu, Huazhu Fu, Ping Li, Gary Cheng, Zhipeng Feng, Shuo Li, Pheng-Ann Heng
Automated detection of lung infections from computed tomography (CT) data plays an important role in combating COVID-19.
no code implementations • 9 Nov 2022 • Kang Li, Lequan Yu, Pheng-Ann Heng
Particularly, we first present a style-oriented replay module to enable structure-realistic and memory-efficient reproduction of past data, and then incorporate the replayed past data to jointly optimize the model with current data to alleviate catastrophic forgetting.
1 code implementation • 16 Sep 2022 • Lanqing Li, Liang Zeng, Ziqi Gao, Shen Yuan, Yatao Bian, Bingzhe Wu, Hengtong Zhang, Yang Yu, Chan Lu, Zhipeng Zhou, Hongteng Xu, Jia Li, Peilin Zhao, Pheng-Ann Heng
The last decade has witnessed a prosperous development of computational methods and dataset curation for AI-aided drug discovery (AIDD).
no code implementations • 15 Sep 2022 • Chen Liang, Bowen Wang, Shaogang Hao, Guangyong Chen, Pheng-Ann Heng, Xiaolong Zou
Graph neural networks (GNNs) have drawn more and more attention from material scientists and demonstrated a high capacity to establish connections between the structure and properties.
1 code implementation • 6 Sep 2022 • Hanqun Cao, Cheng Tan, Zhangyang Gao, Yilun Xu, Guangyong Chen, Pheng-Ann Heng, Stan Z. Li
Deep generative models are a prominent approach for data generation, and have been used to produce high quality samples in various domains.
no code implementations • 22 Aug 2022 • Kexin Chen, Guangyong Chen, Junyou Li, Yuansheng Huang, Pheng-Ann Heng
In high-throughput experimentation (HTE) datasets, the average yield of our methodology's top 10 high-yield reactions is relatively close to the results of ideal yield selection.
no code implementations • 20 Jul 2022 • Yang Yu, Zixu Zhao, Yueming Jin, Guangyong Chen, Qi Dou, Pheng-Ann Heng
Concretely, for trusty representation learning, we propose to incorporate pseudo labels to instruct the pair selection, obtaining more reliable representation pairs for pixel contrast.
2 code implementations • 11 Jul 2022 • Tianyu Wang, Xiaowei Hu, Pheng-Ann Heng, Chi-Wing Fu
This paper formulates a new problem, instance shadow detection, which aims to detect shadow instances and the associated object instances that cast each shadow in the input image.
Ranked #1 on Instance Shadow Detection on SOBA
no code implementations • 8 Jul 2022 • Jinpeng Li, Haibo Jin, Shengcai Liao, Ling Shao, Pheng-Ann Heng
This paper presents a Refinement Pyramid Transformer (RePFormer) for robust facial landmark detection.
no code implementations • 5 Jul 2022 • Zhizhong Chai, Huangjing Lin, Luyang Luo, Pheng-Ann Heng, Hao Chen
In this paper, we propose a novel omni-supervised object detection network, which can exploit multiple different forms of annotated data to further improve the detection performance.
no code implementations • 29 Jun 2022 • Quande Liu, Cheng Chen, Qi Dou, Pheng-Ann Heng
Domain generalization typically requires data from multiple source domains for model learning.
1 code implementation • 27 Jun 2022 • Meirui Jiang, Hongzheng Yang, Xiaoxiao Li, Quande Liu, Pheng-Ann Heng, Qi Dou
Despite recent progress on semi-supervised federated learning (FL) for medical image diagnosis, the problem of imbalanced class distributions among unlabeled clients is still unsolved for real-world use.
no code implementations • 14 Jun 2022 • Runsong Zhu, Di Kang, Ka-Hei Hui, Yue Qian, Xuefei Zhe, Zhen Dong, Linchao Bao, Pheng-Ann Heng, Chi-Wing Fu
To guide the network to quickly fit the coarse shape, we propose to utilize the signed supervision in regions that are obviously outside the object and can be easily determined, resulting in our semi-signed supervision.
no code implementations • 10 May 2022 • Cheng Xue, Lequan Yu, Pengfei Chen, Qi Dou, Pheng-Ann Heng
In this paper, we propose a novel collaborative training paradigm with global and local representation learning for robust medical image classification from noisy-labeled data to combat the lack of high quality annotated medical data.
1 code implementation • 30 Mar 2022 • Donghao Zhou, Pengfei Chen, Qiong Wang, Guangyong Chen, Pheng-Ann Heng
Due to the difficulty of collecting exhaustive multi-label annotations, multi-label datasets often contain partial labels.
1 code implementation • 29 Mar 2022 • Yueming Jin, Yang Yu, Cheng Chen, Zixu Zhao, Pheng-Ann Heng, Danail Stoyanov
Automatic surgical scene segmentation is fundamental for facilitating cognitive intelligence in the modern operating theatre.
1 code implementation • 18 Mar 2022 • Luyang Luo, Dunyuan Xu, Hao Chen, Tien-Tsin Wong, Pheng-Ann Heng
Deep learning models were frequently reported to learn from shortcuts like dataset biases.
no code implementations • 5 Mar 2022 • Yidan Feng, Biqi Yang, Xianzhi Li, Chi-Wing Fu, Rui Cao, Kai Chen, Qi Dou, Mingqiang Wei, Yun-hui Liu, Pheng-Ann Heng
Industrial bin picking is a challenging task that requires accurate and robust segmentation of individual object instances.
no code implementations • 17 Feb 2022 • Zixu Zhao, Yueming Jin, Pheng-Ann Heng
Specifically, we introduce a prior query that is encoded with previous temporal knowledge, to transfer tracking signals to current instances via identity matching.
1 code implementation • 8 Nov 2021 • Jiacheng Wang, Yueming Jin, Shuntian Cai, Hongzhi Xu, Pheng-Ann Heng, Jing Qin, Liansheng Wang
Compared with existing solutions, which either neglect geometric relationships among targeting objects or capture the relationships by using complicated aggregation schemes, the proposed network is capable of achieving satisfactory accuracy while maintaining real-time performance by taking full advantage of the spatial relations among landmarks.
no code implementations • 5 Nov 2021 • Mian Wu, Yinling Qian, Xiangyun Liao, Qiong Wang, Pheng-Ann Heng
In practice, we introduce the voxel-wise embedding rather than patch-wise embedding to locate precise liver vessel voxels, and adopt multi-scale convolutional operators to gain local spatial information.
1 code implementation • NeurIPS 2021 • Danruo Deng, Guangyong Chen, Jianye Hao, Qiong Wang, Pheng-Ann Heng
The backpropagation networks are notably susceptible to catastrophic forgetting, where networks tend to forget previously learned skills upon learning new ones.
no code implementations • ICCV 2021 • Zixu Zhao, Yueming Jin, Pheng-Ann Heng
This paper presents a self-supervised method for learning reliable visual correspondence from unlabeled videos.
1 code implementation • 28 Sep 2021 • Jiacheng Wang, Yueming Jin, Liansheng Wang, Shuntian Cai, Pheng-Ann Heng, Jing Qin
On the other hand, we develop an active global memory to gather global semantic correlations over a long temporal range for the current frame, in which we gather the most informative frames derived from model uncertainty and frame similarity.
1 code implementation • 19 Sep 2021 • Cheng Chen, Quande Liu, Yueming Jin, Qi Dou, Pheng-Ann Heng
We present a novel denoised pseudo-labeling method for this problem, which effectively makes use of the source model and unlabeled target data to promote model self-adaptation from pseudo labels.
1 code implementation • 30 Aug 2021 • Jiaqi Xu, Bin Li, Bo Lu, Yun-hui Liu, Qi Dou, Pheng-Ann Heng
Ten learning-based surgical tasks are built in the platform, which are common in real autonomous surgical execution.
1 code implementation • 28 Jul 2021 • Xiaojie Gao, Yueming Jin, Qi Dou, Chi-Wing Fu, Pheng-Ann Heng
Video prediction methods generally consume substantial computing resources in training and deployment, among which keypoint-based approaches show promising improvement in efficiency by simplifying dense image prediction to light keypoint prediction.
Ranked #1 on Video Prediction on KTH
1 code implementation • CVPR 2021 • Tianyu Wang, Xiaowei Hu, Chi-Wing Fu, Pheng-Ann Heng
Instance shadow detection aims to find shadow instances paired with the objects that cast the shadows.
Ranked #2 on Instance Shadow Detection on SOBA
1 code implementation • 16 Jun 2021 • Quande Liu, Hongzheng Yang, Qi Dou, Pheng-Ann Heng
This paper studies a practical yet challenging FL problem, named Federated Semi-supervised Learning (FSSL), which aims to learn a federated model by jointly utilizing the data from both labeled and unlabeled clients (i.e., hospitals).
3 code implementations • CVPR 2021 • Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu
Point clouds produced by 3D scanning are often sparse, non-uniform, and noisy.
no code implementations • 21 Apr 2021 • Luyang Luo, Hao Chen, Yongjie Xiao, Yanning Zhou, Xi Wang, Varut Vardhanabhuti, Mingxiang Wu, Chu Han, Zaiyi Liu, Xin Hao Benjamin Fang, Efstratios Tsougenis, Huangjing Lin, Pheng-Ann Heng
The models were also compared to radiologists on a subset of the internal testing set (n=496).
1 code implementation • 7 Apr 2021 • Yanwen Li, Luyang Luo, Huangjing Lin, Hao Chen, Pheng-Ann Heng
The novel coronavirus disease 2019 (COVID-19) characterized by atypical pneumonia has caused millions of deaths worldwide.
no code implementations • 7 Apr 2021 • Zhizhong Chai, Luyang Luo, Huangjing Lin, Hao Chen, Anjia Han, Pheng-Ann Heng
Specifically, our model learns a metric space and conducts dual alignment of semantic features on both the proposal level and the prototype levels.
1 code implementation • 30 Mar 2021 • Yueming Jin, Yonghao Long, Cheng Chen, Zixu Zhao, Qi Dou, Pheng-Ann Heng
In this paper, we propose a novel end-to-end temporal memory relation network (TMRNet) for relating long-range and multi-scale temporal patterns to augment the present features.
no code implementations • 24 Mar 2021 • Zixu Zhao, Yueming Jin, Bo Lu, Chi-Fai Ng, Qi Dou, Yun-hui Liu, Pheng-Ann Heng
To greatly increase the label efficiency, we explore a new problem, i.e., adaptive instrument segmentation, which is to effectively adapt one source model to new robotic surgical videos from multiple target domains, only given the annotated instruments in the first frame.
no code implementations • 18 Mar 2021 • Xiaojie Gao, Yueming Jin, Zixu Zhao, Qi Dou, Pheng-Ann Heng
Predicting future frames for robotic surgical video is an interesting, important yet extremely challenging problem, given that the operative tasks may have complex dynamics.
1 code implementation • 17 Mar 2021 • Xiaojie Gao, Yueming Jin, Yonghao Long, Qi Dou, Pheng-Ann Heng
In this paper, we introduce, for the first time in surgical workflow analysis, Transformer to reconsider the ignored complementary effects of spatial and temporal features for accurate surgical phase recognition.
1 code implementation • CVPR 2021 • Quande Liu, Cheng Chen, Jing Qin, Qi Dou, Pheng-Ann Heng
Federated learning allows distributed medical institutions to collaboratively learn a shared prediction model with privacy protection.
no code implementations • 6 Mar 2021 • Xueying Shi, Yueming Jin, Qi Dou, Jing Qin, Pheng-Ann Heng
In this paper, we propose a novel unsupervised domain adaptation framework which can simultaneously transfer multi-modality knowledge, i.e., both kinematic and visual data, from simulator to real robot.
no code implementations • 5 Feb 2021 • Jingjing Ren, Xiaowei Hu, Lei Zhu, Xuemiao Xu, Yangyang Xu, Weiming Wang, Zijun Deng, Pheng-Ann Heng
Camouflaged object detection is a challenging task that aims to identify objects having similar texture to the surroundings.
1 code implementation • 7 Jan 2021 • Kang Li, Shujun Wang, Lequan Yu, Pheng-Ann Heng
In this way, the dual teacher models would transfer acquired inter- and intra-domain knowledge to the student model for further integration and exploitation.
no code implementations • ICCV 2021 • Yanning Zhou, Hang Xu, Wei zhang, Bin Gao, Pheng-Ann Heng
Semi-supervised semantic segmentation methods use unlabeled data to improve feature discriminability and thereby reduce the burden of annotating data.
no code implementations • ICLR 2021 • Pengfei Chen, Guangyong Chen, Junjie Ye, Jingwei Zhao, Pheng-Ann Heng
The noise in stochastic gradient descent (SGD) provides a crucial implicit regularization effect, previously studied in optimization by analyzing the dynamics of parameter updates.
1 code implementation • 10 Dec 2020 • Pengfei Chen, Junjie Ye, Guangyong Chen, Jingwei Zhao, Pheng-Ann Heng
In this work, we present a theoretical hypothesis test and prove that noise in real-world datasets is unlikely to be CCN, which confirms that label noise should depend on the instance and justifies the urgent need to go beyond the CCN assumption. The theoretical results motivate us to study the more general and practically relevant instance-dependent noise (IDN).
Ranked #45 on Image Classification on Clothing1M
1 code implementation • 8 Dec 2020 • Pengfei Chen, Junjie Ye, Guangyong Chen, Jingwei Zhao, Pheng-Ann Heng
For validation, we prove that a noisy validation set is reliable, addressing the critical demand of model selection in scenarios like hyperparameter-tuning and early stopping.
1 code implementation • 13 Oct 2020 • Shujun Wang, Lequan Yu, Kang Li, Xin Yang, Chi-Wing Fu, Pheng-Ann Heng
Our DoFE framework dynamically enriches the image features with additional domain prior knowledge learned from multi-source domains to make the semantic features more discriminative.
no code implementations • 4 Oct 2020 • Kang Li, Lequan Yu, Shujun Wang, Pheng-Ann Heng
Considering that multi-modality data with the same anatomic structures are widely available in clinical routine, in this paper, we aim to exploit the prior knowledge (e.g., shape priors) learned from one modality (a.k.a. the assistant modality) to improve the segmentation performance on another modality (a.k.a. the target modality) to make up for annotation scarcity.
1 code implementation • 21 Jul 2020 • Yanning Zhou, Hao Chen, Huangjing Lin, Pheng-Ann Heng
The teacher's self-ensemble predictions from $K$-time augmented samples are used to construct the reliable pseudo-labels for optimizing the student.
no code implementations • ECCV 2020 • Shujun Wang, Lequan Yu, Caizi Li, Chi-Wing Fu, Pheng-Ann Heng
To this end, we present a new domain generalization framework that learns how to generalize across domains simultaneously from extrinsic relationship supervision and intrinsic self-supervision for images from multi-source domains.
no code implementations • 13 Jul 2020 • Kang Li, Shujun Wang, Lequan Yu, Pheng-Ann Heng
Medical image annotations are prohibitively time-consuming and expensive to obtain.
1 code implementation • 6 Jul 2020 • Zixu Zhao, Yueming Jin, Xiaojie Gao, Qi Dou, Pheng-Ann Heng
Considering the fast instrument motion, we further introduce a flow compensator to estimate intermediate motion within continuous frames, with a novel cycle learning strategy.
1 code implementation • 4 Jul 2020 • Quande Liu, Qi Dou, Pheng-Ann Heng
We present a novel shape-aware meta-learning scheme to improve the model generalization in prostate MRI segmentation.
no code implementations • 6 Jun 2020 • Luyang Luo, Lequan Yu, Hao Chen, Quande Liu, Xi Wang, Jiaqi Xu, Pheng-Ann Heng
Recent research has demonstrated that a performance bottleneck exists in joint training on different CXR datasets, yet few efforts have been made to address this obstacle.
1 code implementation • 28 Apr 2020 • Xin Yang, Xu Wang, Yi Wang, Haoran Dou, Shengli Li, Huaxuan Wen, Yi Lin, Pheng-Ann Heng, Dong Ni
In this paper, we propose the first fully-automated solution to segment the whole fetal head in US volumes.
1 code implementation • 26 Apr 2020 • Zhaohan Xiong, Qing Xia, Zhiqiang Hu, Ning Huang, Cheng Bian, Yefeng Zheng, Sulaiman Vesal, Nishant Ravikumar, Andreas Maier, Xin Yang, Pheng-Ann Heng, Dong Ni, Caizi Li, Qianqian Tong, Weixin Si, Elodie Puybareau, Younes Khoudli, Thierry Geraud, Chen Chen, Wenjia Bai, Daniel Rueckert, Lingchao Xu, Xiahai Zhuang, Xinzhe Luo, Shuman Jia, Maxime Sermesant, Yashu Liu, Kuanquan Wang, Davide Borra, Alessandro Masci, Cristiana Corsi, Coen de Vente, Mitko Veta, Rashed Karim, Chandrakanth Jayachandran Preetha, Sandy Engelhardt, Menyun Qiao, Yuanyuan Wang, Qian Tao, Marta Nunez-Garcia, Oscar Camara, Nicolo Savioli, Pablo Lamata, Jichao Zhao
Segmentation of cardiac images, particularly late gadolinium-enhanced magnetic resonance imaging (LGE-MRI) widely used for visualizing diseased cardiac structures, is a crucial first step for clinical diagnosis and treatment.
1 code implementation • 21 Apr 2020 • Xueying Shi, Yueming Jin, Qi Dou, Pheng-Ann Heng
Specifically, we propose a non-local recurrent convolutional network (NL-RCNet), which introduces a non-local block to capture the long-range temporal dependency (LRTD) among continuous frames.
no code implementations • 23 Mar 2020 • Tobias Ross, Annika Reinke, Peter M. Full, Martin Wagner, Hannes Kenngott, Martin Apitz, Hellena Hempe, Diana Mindroc Filimon, Patrick Scholz, Thuy Nuong Tran, Pierangela Bruno, Pablo Arbeláez, Gui-Bin Bian, Sebastian Bodenstedt, Jon Lindström Bolmgren, Laura Bravo-Sánchez, Hua-Bin Chen, Cristina González, Dong Guo, Pål Halvorsen, Pheng-Ann Heng, Enes Hosgor, Zeng-Guang Hou, Fabian Isensee, Debesh Jha, Tingting Jiang, Yueming Jin, Kadir Kirtac, Sabrina Kletz, Stefan Leger, Zhixuan Li, Klaus H. Maier-Hein, Zhen-Liang Ni, Michael A. Riegler, Klaus Schoeffmann, Ruohua Shi, Stefanie Speidel, Michael Stenzel, Isabell Twick, Gutai Wang, Jiacheng Wang, Liansheng Wang, Lu Wang, Yu-Jie Zhang, Yan-Jie Zhou, Lei Zhu, Manuel Wiesenfarth, Annette Kopp-Schneider, Beat P. Müller-Stich, Lena Maier-Hein
The validation of the competing methods for the three tasks (binary segmentation, multi-instance detection and multi-instance segmentation) was performed in three different stages with an increasing domain gap between the training and the test data.
1 code implementation • 16 Mar 2020 • Xianzhi Li, Ruihui Li, Guangyong Chen, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng
Recently, many deep neural networks were designed to process 3D point clouds, but a common drawback is that rotation invariance is not ensured, leading to poor generalization to arbitrary orientations.
2 code implementations • CVPR 2020 • Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu
We present PointAugment, a new auto-augmentation framework that automatically optimizes and augments point cloud samples to enrich the data diversity when we train a classification network.
Ranked #2 on 3D Point Cloud Data Augmentation on ModelNet40
1 code implementation • 22 Feb 2020 • Cheng Chen, Qi Dou, Yueming Jin, Hao Chen, Jing Qin, Pheng-Ann Heng
We tackle this challenge and propose a novel multimodal segmentation framework which is robust to the absence of imaging modalities.
no code implementations • 20 Feb 2020 • Xiaojie Gao, Yueming Jin, Qi Dou, Pheng-Ann Heng
Automatic surgical gesture recognition is fundamental for improving intelligence in robot-assisted surgery, such as conducting complicated tasks of surgery surveillance and skill evaluation.
Ranked #2 on Action Segmentation on JIGSAWS
2 code implementations • 16 Nov 2019 • Xiaowei Hu, Tianyu Wang, Chi-Wing Fu, Yitong Jiang, Qiong Wang, Pheng-Ann Heng
Shadow detection in general photos is a nontrivial problem, due to the complexity of the real world.
Ranked #10 on Shadow Detection on CUHK-Shadow
3 code implementations • CVPR 2020 • Tianyu Wang, Xiao-Wei Hu, Qiong Wang, Pheng-Ann Heng, Chi-Wing Fu
Then, we pair up the predicted shadow and object instances, and match them with the predicted shadow-object associations to generate the final results.
Ranked #3 on Instance Shadow Detection on SOBA
1 code implementation • 4 Nov 2019 • Xiaomeng Li, Xiao-Wei Hu, Lequan Yu, Lei Zhu, Chi-Wing Fu, Pheng-Ann Heng
In this paper, we present a novel cross-disease attention network (CANet) to jointly grade DR and DME by exploring the internal relationship between the diseases with only image-level supervision.
no code implementations • 11 Oct 2019 • Xin Yang, Wenlong Shi, Haoran Dou, Jikuan Qian, Yi Wang, Wufeng Xue, Shengli Li, Dong Ni, Pheng-Ann Heng
(i) This is the first work in the literature on 3D pose estimation of the fetus.
1 code implementation • 10 Oct 2019 • Haoran Dou, Xin Yang, Jikuan Qian, Wufeng Xue, Hao Qin, Xu Wang, Lequan Yu, Shujun Wang, Yi Xiong, Pheng-Ann Heng, Dong Ni
In this study, we propose a novel reinforcement learning (RL) framework to automatically localize fetal brain standard planes in 3D US.
no code implementations • 8 Oct 2019 • José Ignacio Orlando, Huazhu Fu, João Barbossa Breda, Karel van Keer, Deepti. R. Bathula, Andrés Diaz-Pinto, Ruogu Fang, Pheng-Ann Heng, Jeyoung Kim, Joonho Lee, Joonseok Lee, Xiaoxiao Li, Peng Liu, Shuai Lu, Balamurali Murugesan, Valery Naranjo, Sai Samarth R. Phaye, Sharath M. Shankaranarayana, Apoorva Sikka, Jaemin Son, Anton Van Den Hengel, Shujun Wang, Junyan Wu, Zifeng Wu, Guanghui Xu, Yongli Xu, Pengshuai Yin, Fei Li, Yanwu Xu, Xiulan Zhang, Hrvoje Bogunović
As part of REFUGE, we have publicly released a data set of 1200 fundus images with ground truth segmentations and clinical glaucoma labels, currently the largest existing one.
1 code implementation • 5 Sep 2019 • Xueying Shi, Qi Dou, Cheng Xue, Jing Qin, Hao Chen, Pheng-Ann Heng
In this paper, we present a novel active learning framework for cost-effective skin lesion analysis.
1 code implementation • 3 Sep 2019 • Yanning Zhou, Simon Graham, Navid Alemi Koohbanani, Muhammad Shaban, Pheng-Ann Heng, Nasir Rajpoot
Furthermore, to deal with redundancy in the graph, we propose a sampling technique that removes nodes in areas of dense nuclear activity.
no code implementations • 31 Aug 2019 • Xu Wang, Xin Yang, Haoran Dou, Shengli Li, Pheng-Ann Heng, Dong Ni
In this paper, we propose an effective framework for simultaneous segmentation and landmark localization in prenatal ultrasound volumes.
1 code implementation • 19 Aug 2019 • Yanning Zhou, Hao Chen, Jiaqi Xu, Qi Dou, Pheng-Ann Heng
In this paper, we propose a novel Instance Relation Network (IRNet) for robust overlapping cell segmentation by exploring instance relation interaction.
no code implementations • 26 Jul 2019 • Xi Wang, Hao Chen, Luyang Luo, An-ran Ran, Poemen P. Chan, Clement C. Tham, Carol Y. Cheung, Pheng-Ann Heng
In addition, the proposed multi-task learning network can explore the structure-function relationship from the OCT image and visual field measurement simultaneously, which helps boost classification performance.
3 code implementations • ICCV 2019 • Ruihui Li, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng
Point clouds acquired from range scans are often sparse, noisy, and non-uniform.
1 code implementation • 18 Jul 2019 • Yueming Jin, Keyun Cheng, Qi Dou, Pheng-Ann Heng
In this paper, we propose a novel framework to leverage instrument motion information by incorporating a derived temporal prior into an attention pyramid network for accurate segmentation.
8 code implementations • 16 Jul 2019 • Lequan Yu, Shujun Wang, Xiaomeng Li, Chi-Wing Fu, Pheng-Ann Heng
We design a novel uncertainty-aware scheme to enable the student model to gradually learn from the meaningful and reliable targets by exploiting the uncertainty information.
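One way to picture an uncertainty-aware consistency target is sketched below. The excerpt does not say how uncertainty is measured, so this sketch assumes predictive entropy of the teacher's output and a fixed threshold; `entropy`, `masked_consistency_loss`, and `threshold` are illustrative names, not the paper's.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of one probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def masked_consistency_loss(student_probs, teacher_probs, threshold):
    """Mean squared error between student and teacher predictions,
    computed only on items whose teacher entropy is below `threshold`.

    Returns the loss and the number of items that were kept.
    """
    total, kept = 0.0, 0
    for s, t in zip(student_probs, teacher_probs):
        if entropy(t) < threshold:
            total += sum((a - b) ** 2 for a, b in zip(s, t)) / len(t)
            kept += 1
    loss = total / kept if kept else 0.0
    return loss, kept


teacher = [[0.9, 0.1], [0.5, 0.5]]  # second item is maximally uncertain
student = [[0.8, 0.2], [0.9, 0.1]]
loss, kept = masked_consistency_loss(student, teacher, threshold=0.5)
```

In this toy input only the first item passes the entropy filter, so the student is not pulled toward the teacher's uncertain second prediction.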