1 code implementation • 20 Jan 2025 • Guankun Wang, Long Bai, Junyi Wang, Kun Yuan, Zhen Li, Tianxu Jiang, Xiting He, Jinlin Wu, Zhen Chen, Zhen Lei, Hongbin Liu, Jiazheng Wang, Fan Zhang, Nicolas Padoy, Nassir Navab, Hongliang Ren
Recently, Multimodal Large Language Models (MLLMs) have demonstrated their immense potential in computer-aided diagnosis and decision-making.
no code implementations • 28 Dec 2024 • Gaoang Wang, Hang Wu, Yang Liao, Zhen Chen, Qing Zhou, Wenxing Wang, Yifei Liu, Yilin Wang, Meijing Wu, Ruiqi Xiang, Yuntao Yu, Xi Zhou, Feng Zhu, Zhonghua Liu, Tingjun Hou
Biotoxins, mainly produced by venomous animals, plants and microorganisms, exhibit high physiological activity and unique effects such as lowering blood pressure and analgesia.
1 code implementation • 16 Dec 2024 • Yuanfan Zheng, Jinlin Wu, Wuyang Li, Zhen Chen
IDSA uses instance-level sampling to mine domain-shared category samples and computes alignment weights from a Gaussian distribution to align the domain-shared categories across domains, addressing feature heterogeneity.
1 code implementation • 6 Dec 2024 • Jinlin Wu, Xusheng Liang, Xuexue Bai, Zhen Chen
To address these cognitive challenges in surgical training and operation, we propose SurgBox, an agent-driven sandbox framework to systematically enhance the cognitive capabilities of surgeons in immersive surgical simulations.
no code implementations • 11 Oct 2024 • Hao Jiang, Wangqi Shi, Xiao Chen, Qiuming Zhu, Zhen Chen
A comprehensive analysis is conducted to investigate the influence of the height of the BS, motion characteristics of the MR, and antenna configurations on the channel statistics.
1 code implementation • 26 Sep 2024 • Yi Zhang, Zhen Chen, Chih-Hong Cheng, Wenjie Ruan, Xiaowei Huang, Dezong Zhao, David Flynn, Siddartha Khastgir, Xingyu Zhao
In this survey, we provide a timely and focused review of the literature on trustworthy T2I DMs, covering a concise-structured taxonomy from the perspectives of property, means, benchmarks and applications.
1 code implementation • 21 Sep 2024 • Qi Chen, Xiaohan Xing, Zhen Chen, Zhiwei Xiong
To exploit complementary information from the auxiliary modality, we propose a Cross-Modal Selective fusion (CMS-fusion) module that selectively incorporates the frequency and spatial features from the auxiliary modality to enhance the corresponding branch of the target modality.
1 code implementation • 19 Sep 2024 • Zhen Chen, Xingjian Luo, Jinlin Wu, Long Bai, Zhen Lei, Hongliang Ren, Sebastien Ourselin, Hongbin Liu
To ensure a global understanding of the surgical procedure, we devise a phase localization strategy for SurgPLAN++ to predict phase segments across the entire video through phase proposals.
1 code implementation • 9 Sep 2024 • Qingyao Tian, Zhen Chen, Huai Liao, Xinyan Huang, Lujie Li, Sebastien Ourselin, Hongbin Liu
In this work, we present EndoOmni, the first foundation model for zero-shot cross-domain depth estimation for endoscopy.
1 code implementation • 4 Sep 2024 • Wenwu Guo, Jinlin Wu, Zhen Chen, Qingxiang Zhao, Miao Xu, Zhen Lei, Hongbin Liu
Compared with 2D instrument tracking methods, 3D instrument tracking has broader value in clinical practice, but is also more challenging due to weak texture, occlusion, and lack of Computer-Aided Design (CAD) models for 3D registration.
no code implementations • 2 Sep 2024 • Adrito Das, Danyal Z. Khan, Dimitrios Psychogyios, Yitong Zhang, John G. Hanrahan, Francisco Vasconcelos, You Pang, Zhen Chen, Jinlin Wu, Xiaoyang Zou, Guoyan Zheng, Abdul Qayyum, Moona Mazher, Imran Razzak, Tianbin Li, Jin Ye, Junjun He, Szymon Płotka, Joanna Kaleta, Amine Yamlahi, Antoine Jund, Patrick Godau, Satoshi Kondo, Satoshi Kasai, Kousuke Hirasawa, Dominik Rivoir, Alejandra Pérez, Santiago Rodriguez, Pablo Arbeláez, Danail Stoyanov, Hani J. Marcus, Sophia Bano
The field of computer vision applied to videos of minimally invasive surgery is ever-growing.
1 code implementation • 21 Aug 2024 • Zhenye Lou, Qing Xu, Zekun Jiang, Xiangjian He, Zhen Chen, Yi Wang, Chenxin Li, Maggie M. He, Wenting Duan
To alleviate the labor-intensive requirement of manual prompts, we introduce a Gaussian-Kernel Prompt Encoder (GKP-Encoder) to generate density maps driven by a single point, which guides segmentation predictions by mixing position prompts and semantic prompts.
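As a rough illustration of a density map driven by a single point, the sketch below places an isotropic Gaussian kernel at a clicked pixel. It is not the GKP-Encoder itself: the function name `gaussian_point_map`, the `sigma` parameter, and the unnormalized peak value of 1 are all assumptions made for this example.

```python
import numpy as np


def gaussian_point_map(height, width, point, sigma=5.0):
    """Return a (height, width) map that peaks at `point` and decays as a Gaussian.

    `point` is a (row, col) pixel coordinate; `sigma` (hypothetical parameter)
    controls how quickly the density falls off around the point.
    """
    ys, xs = np.mgrid[0:height, 0:width]
    py, px = point
    sq_dist = (ys - py) ** 2 + (xs - px) ** 2
    # Unnormalized Gaussian kernel: exactly 1.0 at the prompt point.
    return np.exp(-sq_dist / (2.0 * sigma ** 2))


density = gaussian_point_map(64, 64, point=(20, 30), sigma=4.0)
```

A map like this could be fed to a segmentation model as a soft position prompt; how the paper combines it with semantic prompts is not specified in the excerpt above.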
1 code implementation • 16 Aug 2024 • Li Pan, Yupei Zhang, Qiushi Yang, Tan Li, Xiaohan Xing, Maximus C. F. Yeung, Zhen Chen
Remarkably, our FoF achieves superior performance using only histopathology slides compared to existing multimodal methods.
1 code implementation • 28 Jul 2024 • Zhen Chen, Zongming Zhang, Wenwu Guo, Xingjian Luo, Long Bai, Jinlin Wu, Hongliang Ren, Hongbin Liu
To address these limitations in operating rooms, we propose an audio-driven surgical instrument segmentation framework, named ASI-Seg, to accurately segment the required surgical instruments by parsing the audio commands of surgeons.
1 code implementation • 19 Jul 2024 • Qing Xu, Jiaxuan Li, Xiangjian He, Ziyu Liu, Zhen Chen, Wenting Duan, Chenxin Li, Maggie M. He, Fiseha B. Tesema, Wooi P. Cheah, Yi Wang, Rong Qu, Jonathan M. Garibaldi
Finally, we design the Query-Decoupled Modality Decoder (QDMD) that leverages a one-to-one strategy to provide an independent decoding channel for every modality.
no code implementations • 8 Jul 2024 • Qingyao Tian, Zhen Chen, Huai Liao, Xinyan Huang, Bingyu Yang, Lujie Li, Hongbin Liu
To overcome these challenges, we propose a novel Probabilistic Airway Navigation System (PANS), leveraging a Monte-Carlo method with pose hypotheses and likelihoods to achieve robust and real-time bronchoscope localization.
no code implementations • 25 Jun 2024 • Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao
Previous works on adversarial attacks have primarily focused on white-box attacks that directly perturb the states or actions of victim agents, often in scenarios with a limited number of attacks.
1 code implementation • 19 Jun 2024 • Long Bai, Tong Chen, Qiaozhi Tan, Wan Jun Nah, Yanheng Li, ZhiCheng He, Sishen Yuan, Zhen Chen, Jinlin Wu, Mobarakol Islam, Zhen Li, Hongbin Liu, Hongliang Ren
While researchers have discussed the challenges of low-light enhancement in WCE, the issue of correcting for different exposure levels remains underexplored.
no code implementations • 18 Jun 2024 • Huan Xu, Jinlin Wu, Guanglin Cao, Zhen Chen, Zhen Lei, Hongbin Liu
Ultrasonography has revolutionized non-invasive diagnostic methodologies, significantly enhancing patient outcomes across various medical domains.
2 code implementations • 5 Jun 2024 • Chenxin Li, Xinyu Liu, Wuyang Li, Cheng Wang, Hengyu Liu, Yifan Liu, Zhen Chen, Yixuan Yuan
We further delved into the potential of U-KAN as an alternative U-Net noise predictor in diffusion models, demonstrating its applicability in generating task-oriented model architectures.
no code implementations • 1 Jun 2024 • Dugang Liu, Shenxian Xian, Xiaolin Lin, Xiaolian Zhang, Hong Zhu, Yuan Fang, Zhen Chen, Zhong Ming
Specifically, in the information reconstruction stage, we design a new user-level SFT task for collaborative information injection with the assistance of a pre-trained SRS model, which is more efficient and compatible with limited text information.
no code implementations • 1 Jun 2024 • Hanxiao Wang, Mingyang Zhao, Weize Quan, Zhen Chen, Dong-Ming Yan, Peter Wonka
To address this issue, we propose E3-Net to achieve equivariance for normal estimation.
no code implementations • 14 May 2024 • Zhen Chen, Xingjian Luo, Jinlin Wu, Danny T. M. Chan, Zhen Lei, Jinqiao Wang, Sebastien Ourselin, Hongbin Liu
In this work, by leveraging advanced multimodal large language models (MLLMs), we propose a Versatile Surgery Assistant (VS-Assistant) that can accurately understand the surgeon's intention and complete a series of surgical understanding tasks, e.g., surgical scene analysis, surgical instrument detection, and segmentation on demand.
no code implementations • 1 May 2024 • Huan Xu, Jinlin Wu, Guanglin Cao, Zhen Lei, Zhen Chen, Hongbin Liu
Ultrasound robots are increasingly used in medical diagnostics and early disease screening.
no code implementations • 29 Apr 2024 • Liyuan Wang, Yan Jin, Zhen Chen, Jinlin Wu, Mengke Li, Yang Lu, Hanzi Wang
The vision-language pre-training has enabled deep models to make a huge step forward in generalizing across unseen domains.
1 code implementation • 12 Apr 2024 • Dongbo Xi, Zhen Chen, Yuexian Wang, He Cui, Chong Peng, Fuzhen Zhuang, Peng Yan
Besides, by personalized integration of domain features from other domains for each user and the innovation in the training mode, the DFEI framework can yield more accurate conversion identification.
1 code implementation • 9 Apr 2024 • Yupei Zhang, Li Pan, Qiushi Yang, Tan Li, Zhen Chen
Specifically, to enhance the representation abilities of vision and language encoders, we propose the Multi-level Reconstruction Pre-training (MR-Pretrain) strategy, including feature-level and data-level reconstruction, which guides models to capture the semantic information from masked inputs of different modalities.
1 code implementation • 26 Mar 2024 • Junjie Ye, Lei Huang, Zhen Chen, Peichang Zhang, Mohamed Rihan
It is critical to design efficient beamforming in reconfigurable intelligent surface (RIS)-aided integrated sensing and communication (ISAC) systems for enhancing spectrum utilization.
no code implementations • 17 Mar 2024 • Chenxin Li, Hengyu Liu, Yifan Liu, Brandon Y. Feng, Wuyang Li, Xinyu Liu, Zhen Chen, Jing Shao, Yixuan Yuan
In a nutshell, Endora marks a notable breakthrough in the deployment of generative AI for clinical endoscopy research, setting a substantial stage for further advances in medical content generation.
1 code implementation • 14 Mar 2024 • Yuhang Zheng, Xiangyu Chen, Yupeng Zheng, Songen Gu, Runyi Yang, Bu Jin, Pengfei Li, Chengliang Zhong, Zengmao Wang, Lina Liu, Chao Yang, Dawei Wang, Zhen Chen, Xiaoxiao Long, Meiqing Wang
In particular, we propose an Efficient Feature Distillation (EFD) module that employs contrastive learning to efficiently and accurately distill language embeddings derived from foundational models.
1 code implementation • 26 Feb 2024 • Zhen Chen, Qing Xu, Xinyu Liu, Yixuan Yuan
Moreover, to unleash the generalization capability of SAM across a variety of nuclei images, we devise a Domain-adaptive Tuning Encoder (DT-Encoder) to seamlessly harmonize visual features with domain-common and domain-specific knowledge, and further devise a Domain Query-enhanced Decoder (DQ-Decoder) by leveraging learnable domain queries for segmentation decoding in different nuclei domains.
1 code implementation • 20 Jan 2024 • Zhen Chen, Jingping Liu, Deqing Yang, Yanghua Xiao, Huimin Xu, ZongYu Wang, Rui Xie, Yunsen Xian
Open information extraction (OpenIE) aims to extract the schema-free triplets in the form of (subject, predicate, object) from a given sentence.
no code implementations • 28 Nov 2023 • Junjie Ye, Mohamed Rihan, Peichang Zhang, Lei Huang, Stefano Buzzi, Zhen Chen
Energy efficiency (EE) is a challenging task in integrated sensing and communication (ISAC) systems, where high spectral efficiency and low energy consumption appear as conflicting requirements.
no code implementations • 21 Nov 2023 • Zhen Chen, Yuhao Zhai, Jun Zhang, Jinqiao Wang
Specifically, we propose an efficient multi-scale surgical temporal action (MS-STA) module, which integrates visual features with spatial and temporal knowledge of surgical actions at the cost of 2D networks.
1 code implementation • 16 Nov 2023 • Zhen Sun, Huan Xu, Jinlin Wu, Zhen Chen, Zhen Lei, Hongbin Liu
To address this issue, we propose a novel yet effective weakly-supervised surgical instrument instance segmentation approach, named Point-based Weakly-supervised Instance Segmentation (PWISeg).
no code implementations • 16 Nov 2023 • Xingjian Luo, You Pang, Zhen Chen, Jinlin Wu, Zongmin Zhang, Zhen Lei, Hongbin Liu
To address these two challenges, we propose a Surgical Phase LocAlization Network, named SurgPLAN, to facilitate a more accurate and stable surgical phase recognition with the principle of temporal detection.
2 code implementations • 23 Sep 2023 • Rongfeng Wei, Jinlin Wu, Xuexue Bai, Ming Feng, Zhen Lei, Hongbin Liu, Zhen Chen
In minimally invasive surgery, surgical instrument localization is a crucial task for endoscopic videos, which enables various applications for improving surgical outcomes.
no code implementations • 21 Aug 2023 • Cheng Feng, Congxuan Zhang, Zhen Chen, Weiming Hu, Liyue Ge
Depth sensing is of paramount importance for unmanned aerial and autonomous vehicles.
no code implementations • 8 Mar 2023 • Yang Cheng, Zhen Chen, Daming Liu
Power line detection is a critical inspection task for electricity companies and is also useful in avoiding drone obstacles.
no code implementations • 23 Nov 2022 • Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao, Yusen He, Yaohua Wang
We first argue that the MSE minimization approach is equivalent to a conditional entropy learning problem, and then propose a mutual information learning formulation for solving regression problems by using a reparameterization technique.
no code implementations • COLING 2022 • Chengwei Hu, Deqing Yang, Haoliang Jin, Zhen Chen, Yanghua Xiao
Continual relation extraction (CRE) aims to extract relations as new data arrives continuously and iteratively, where the major challenge is the catastrophic forgetting of old tasks.
no code implementations • 6 Oct 2022 • Yan Zheng, Lemeng Wu, Xingchao Liu, Zhen Chen, Qiang Liu, QiXing Huang
We first propose a diffusion-based generative model to tackle this problem by generating voxelized shapes with close-to-reality outlines and structures.
no code implementations • 3 Oct 2022 • Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao
Deep learning systems have been reported to achieve state-of-the-art performances in many applications, and one of the keys for achieving this is the existence of well-trained classifiers on benchmark datasets which can be used as backbone feature extractors in downstream tasks.
no code implementations • 21 Sep 2022 • Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao
Deep learning systems have been reported to achieve state-of-the-art performances in many applications, and a key is the existence of well-trained classifiers on benchmark datasets.
no code implementations • 7 Mar 2022 • Victor Churchill, Steve Manns, Zhen Chen, Dongbin Xiu
In the proposed ensemble averaging method, multiple models are independently trained and model predictions are averaged at each time step.
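The averaging step can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes each trained model is a callable one-step predictor, and that the averaged prediction is fed back as the input for the next step. The name `ensemble_rollout` and the toy linear "models" are hypothetical.

```python
import numpy as np


def ensemble_rollout(models, x0, n_steps):
    """Roll out an ensemble, averaging the members' predictions at every step.

    `models` is a list of callables mapping the current state to the next one
    (stand-ins for independently trained networks).
    """
    x = np.asarray(x0, dtype=float)
    trajectory = [x]
    for _ in range(n_steps):
        # Each member predicts the next state from the same averaged state.
        preds = np.stack([m(x) for m in models])
        x = preds.mean(axis=0)
        trajectory.append(x)
    return np.stack(trajectory)


# Toy stand-ins for trained models: three slightly different linear maps.
models = [lambda x, a=a: a * x for a in (0.8, 0.9, 1.0)]
traj = ensemble_rollout(models, [1.0, 2.0], n_steps=3)
```

Averaging inside the loop, rather than averaging finished trajectories, keeps every member evaluated on the same state at each step, which is the reading of "averaged at each time step" assumed here.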
1 code implementation • 24 Oct 2021 • Xinyu Liu, Baopu Li, Zhen Chen, Yixuan Yuan
Model pruning aims to reduce the deep neural network (DNN) model size or computational overhead.
no code implementations • 10 Oct 2021 • Yuyang Zhang, Dik Hin Leung, Min Guo, Yijia Xiao, Haoyue Liu, Yunfei Li, Jiyuan Zhang, Guan Wang, Zhen Chen
Matrix multiplication is the bedrock of deep learning inference applications.
1 code implementation • 1 Oct 2021 • Zhen Chen, Meilu Zhu, Chen Yang, Yixuan Yuan
To address this problem, we propose a personalized retrogress-resilient framework to produce a superior personalized model for each client.
no code implementations • 7 Jun 2021 • Zhen Chen, Victor Churchill, Kailiang Wu, Dongbin Xiu
Consequently, a trained DNN defines a predictive model for the underlying unknown PDE over structureless grids.
3 code implementations • 18 May 2021 • Dongbo Xi, Zhen Chen, Peng Yan, Yinger Zhang, Yongchun Zhu, Fuzhen Zhuang, Yu Chen
While considerable multi-task efforts have been made in this direction, a long-standing challenge is how to explicitly model the long-path sequential dependence among audience multi-step conversions for improving the end-to-end conversion.
no code implementations • 9 Aug 2020 • Feng Xia, Nana Yaw Asabere, Haifeng Liu, Zhen Chen, Wei Wang
As a result of the importance of academic collaboration at smart conferences, various researchers have utilized recommender systems to generate effective recommendations for participants.
no code implementations • 2 Jun 2020 • Tong Qin, Zhen Chen, John Jakeman, Dongbin Xiu
To circumvent the difficulty presented by the non-autonomous nature of the system, our method transforms the solution state into piecewise integration of the system over a discrete set of time instances.
no code implementations • 4 May 2020 • Marcel Schloz, Thomas C. Pekin, Zhen Chen, Wouter Van den Broek, David A. Muller, Christoph T. Koch
The overdetermination of the mathematical problem underlying ptychography is reduced by a host of experimentally more desirable settings.
no code implementations • 5 Mar 2020 • Zhen Chen, Kailiang Wu, Dongbin Xiu
Various numerical examples are then presented to demonstrate the performance and properties of the numerical methods.
no code implementations • 23 Jan 2020 • Zhen Chen, Dongbin Xiu
When an existing coarse model is not available, we present numerical strategies for fast creation of coarse models, to be used in conjunction with the generalized ResNet.
1 code implementation • 12 Jan 2020 • Xiaoqing Guo, Zhen Chen, Yixuan Yuan
To tackle these issues, we propose a novel complementary network with adaptive receptive field learning.
no code implementations • 12 Nov 2018 • Zhen Chen, Anthimos Georgiadis
Based on different projection geometry, a fisheye image can be presented as a parameterized non-rectilinear image.
no code implementations • 3 Feb 2018 • Yeeleng S. Vang, Zhen Chen, Xiaohui Xie
In this work, we present a deep learning framework for multi-class breast cancer image classification as our submission to the International Conference on Image Analysis and Recognition (ICIAR) 2018 Grand Challenge on BreAst Cancer Histology images (BACH).