Search Results for author: Zhen Chen

Found 58 papers, 26 papers with code

EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery

1 code implementation20 Jan 2025 Guankun Wang, Long Bai, Junyi Wang, Kun Yuan, Zhen Li, Tianxu Jiang, Xiting He, Jinlin Wu, Zhen Chen, Zhen Lei, Hongbin Liu, Jiazheng Wang, Fan Zhang, Nicolas Padoy, Nassir Navab, Hongliang Ren

Recently, Multimodal Large Language Models (MLLMs) have demonstrated their immense potential in computer-aided diagnosis and decision-making.

BioTD: an online database of biotoxins

no code implementations28 Dec 2024 Gaoang Wang, Hang Wu, Yang Liao, Zhen Chen, Qing Zhou, Wenxing Wang, Yifei Liu, Yilin Wang, Meijing Wu, Ruiqi Xiang, Yuntao Yu, Xi Zhou, Feng Zhu, Zhonghua Liu, Tingjun Hou

Biotoxins, mainly produced by venomous animals, plants and microorganisms, exhibit high physiological activity and unique effects such as lowering blood pressure and analgesia.

Drug Design Drug Discovery

Universal Domain Adaptive Object Detection via Dual Probabilistic Alignment

1 code implementation16 Dec 2024 Yuanfan Zheng, Jinlin Wu, Wuyang Li, Zhen Chen

IDSA utilizes instance-level sampling to mine domain-shared category samples and calculates alignment weight through Gaussian distribution to conduct the domain-shared category domain alignment to address the feature heterogeneity.

Domain Adaptation object-detection +1

SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot

1 code implementation6 Dec 2024 Jinlin Wu, Xusheng Liang, Xuexue Bai, Zhen Chen

To address these cognitive challenges in surgical training and operation, we propose SurgBox, an agent-driven sandbox framework to systematically enhance the cognitive capabilities of surgeons in immersive surgical simulations.

Decision Making RAG

High-Efficient Near-Field Channel Characteristics Analysis for Large-Scale MIMO Communication Systems

no code implementations11 Oct 2024 Hao Jiang, Wangqi Shi, Xiao Chen, Qiuming Zhu, Zhen Chen

A comprehensive analysis is conducted to investigate the influence of the height of the BS, motion characteristics of the MR, and antenna configurations on the channel statistics.

Trustworthy Text-to-Image Diffusion Models: A Timely and Focused Survey

1 code implementation26 Sep 2024 Yi Zhang, Zhen Chen, Chih-Hong Cheng, Wenjie Ruan, Xiaowei Huang, Dezong Zhao, David Flynn, Siddartha Khastgir, Xingyu Zhao

In this survey, we provide a timely and focused review of the literature on trustworthy T2I DMs, covering a concise-structured taxonomy from the perspectives of property, means, benchmarks and applications.

Fairness Image Generation

Accelerated Multi-Contrast MRI Reconstruction via Frequency and Spatial Mutual Learning

1 code implementation21 Sep 2024 Qi Chen, Xiaohan Xing, Zhen Chen, Zhiwei Xiong

To exploit complementary information from the auxiliary modality, we propose a Cross-Modal Selective fusion (CMS-fusion) module that selectively incorporate the frequency and spatial features from the auxiliary modality to enhance the corresponding branch of the target modality.

MRI Reconstruction

SurgPLAN++: Universal Surgical Phase Localization Network for Online and Offline Inference

1 code implementation19 Sep 2024 Zhen Chen, Xingjian Luo, Jinlin Wu, Long Bai, Zhen Lei, Hongliang Ren, Sebastien Ourselin, Hongbin Liu

To ensure a global understanding of the surgical procedure, we devise a phase localization strategy for SurgPLAN++ to predict phase segments across the entire video through phase proposals.

Data Augmentation Offline surgical phase recognition +1

SurgTrack: CAD-Free 3D Tracking of Real-world Surgical Instruments

1 code implementation4 Sep 2024 Wenwu Guo, Jinlin Wu, Zhen Chen, Qingxiang Zhao, Miao Xu, Zhen Lei, Hongbin Liu

Compared with 2D instrument tracking methods, 3D instrument tracking has broader value in clinical practice, but is also more challenging due to weak texture, occlusion, and lack of Computer-Aided Design (CAD) models for 3D registration.

NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation

1 code implementation21 Aug 2024 Zhenye Lou, Qing Xu, Zekun Jiang, Xiangjian He, Zhen Chen, Yi Wang, Chenxin Li, Maggie M. He, Wenting Duan

To alleviate the labor-intensive requirement of manual prompts, we introduce a Gaussian-Kernel Prompt Encoder (GKP-Encoder) to generate density maps driven by a single point, which guides segmentation predictions by mixing position prompts and semantic prompts.

Decoder Domain Generalization +4

ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding

1 code implementation28 Jul 2024 Zhen Chen, Zongming Zhang, Wenwu Guo, Xingjian Luo, Long Bai, Jinlin Wu, Hongliang Ren, Hongbin Liu

To address these limitations in operating rooms, we propose an audio-driven surgical instrument segmentation framework, named ASI-Seg, to accurately segment the required surgical instruments by parsing the audio commands of surgeons.

Contrastive Learning Intention-oriented Segmentation +3

ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation

1 code implementation19 Jul 2024 Qing Xu, Jiaxuan Li, Xiangjian He, Ziyu Liu, Zhen Chen, Wenting Duan, Chenxin Li, Maggie M. He, Fiseha B. Tesema, Wooi P. Cheah, Yi Wang, Rong Qu, Jonathan M. Garibaldi

Finally, we design the Query-Decoupled Modality Decoder (QDMD) that leverages a one-to-one strategy to provide an independent decoding channel for every modality.

Decoder Image Segmentation +5

PANS: Probabilistic Airway Navigation System for Real-time Robust Bronchoscope Localization

no code implementations8 Jul 2024 Qingyao Tian, Zhen Chen, Huai Liao, Xinyan Huang, Bingyu Yang, Lujie Li, Hongbin Liu

To overcome these challenges, we propose a novel Probabilistic Airway Navigation System (PANS), leveraging Monte-Carlo method with pose hypotheses and likelihoods to achieve robust and real-time bronchoscope localization.

CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems

no code implementations25 Jun 2024 Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao

Previous works on adversarial attacks have primarily focused on white-box attacks that directly perturb the states or actions of victim agents, often in scenarios with a limited number of attacks.

Adversarial Attack SMAC+

Transforming Surgical Interventions with Embodied Intelligence for Ultrasound Robotics

no code implementations18 Jun 2024 Huan Xu, Jinlin Wu, Guanglin Cao, Zhen Chen, Zhen Lei, Hongbin Liu

Ultrasonography has revolutionized non-invasive diagnostic methodologies, significantly enhancing patient outcomes across various medical domains.

Motion Planning

U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation

2 code implementations5 Jun 2024 Chenxin Li, Xinyu Liu, Wuyang Li, Cheng Wang, Hengyu Liu, Yifan Liu, Zhen Chen, Yixuan Yuan

We further delved into the potential of U-KAN as an alternative U-Net noise predictor in diffusion models, demonstrating its applicability in generating task-oriented model architectures.

Image Segmentation Kolmogorov-Arnold Networks +3

A Practice-Friendly LLM-Enhanced Paradigm with Preference Parsing for Sequential Recommendation

no code implementations1 Jun 2024 Dugang Liu, Shenxian Xian, Xiaolin Lin, Xiaolian Zhang, Hong Zhu, Yuan Fang, Zhen Chen, Zhong Ming

Specifically, in the information reconstruction stage, we design a new user-level SFT task for collaborative information injection with the assistance of a pre-trained SRS model, which is more efficient and compatible with limited text information.

Sequential Recommendation

VS-Assistant: Versatile Surgery Assistant on the Demand of Surgeons

no code implementations14 May 2024 Zhen Chen, Xingjian Luo, Jinlin Wu, Danny T. M. Chan, Zhen Lei, Jinqiao Wang, Sebastien Ourselin, Hongbin Liu

In this work, by leveraging advanced multimodal large language models (MLLMs), we propose a Versatile Surgery Assistant (VS-Assistant) that can accurately understand the surgeon's intention and complete a series of surgical understanding tasks, e. g., surgical scene analysis, surgical instrument detection, and segmentation on demand.

Transitive Vision-Language Prompt Learning for Domain Generalization

no code implementations29 Apr 2024 Liyuan Wang, Yan Jin, Zhen Chen, Jinlin Wu, Mengke Li, Yang Lu, Hanzi Wang

The vision-language pre-training has enabled deep models to make a huge step forward in generalizing across unseen domains.

Domain Generalization

Large-Scale Multi-Domain Recommendation: an Automatic Domain Feature Extraction and Personalized Integration Framework

1 code implementation12 Apr 2024 Dongbo Xi, Zhen Chen, Yuexian Wang, He Cui, Chong Peng, Fuzhen Zhuang, Peng Yan

Besides, by personalized integration of domain features from other domains for each user and the innovation in the training mode, the DFEI framework can yield more accurate conversion identification.

Feature Engineering Task 2

Unified Multi-modal Diagnostic Framework with Reconstruction Pre-training and Heterogeneity-combat Tuning

1 code implementation9 Apr 2024 Yupei Zhang, Li Pan, Qiushi Yang, Tan Li, Zhen Chen

Specifically, to enhance the representation abilities of vision and language encoders, we propose the Multi-level Reconstruction Pre-training (MR-Pretrain) strategy, including a feature-level and data-level reconstruction, which guides models to capture the semantic information from masked inputs of different modalities.

Unsupervised Learning for Joint Beamforming Design in RIS-aided ISAC Systems

1 code implementation26 Mar 2024 Junjie Ye, Lei Huang, Zhen Chen, Peichang Zhang, Mohamed Rihan

It is critical to design efficient beamforming in reconfigurable intelligent surface (RIS)-aided integrated sensing and communication (ISAC) systems for enhancing spectrum utilization.

Endora: Video Generation Models as Endoscopy Simulators

no code implementations17 Mar 2024 Chenxin Li, Hengyu Liu, Yifan Liu, Brandon Y. Feng, Wuyang Li, Xinyu Liu, Zhen Chen, Jing Shao, Yixuan Yuan

In a nutshell, Endora marks a notable breakthrough in the deployment of generative AI for clinical endoscopy research, setting a substantial stage for further advances in medical content generation.

Data Augmentation Video Generation

GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping

1 code implementation14 Mar 2024 Yuhang Zheng, Xiangyu Chen, Yupeng Zheng, Songen Gu, Runyi Yang, Bu Jin, Pengfei Li, Chengliang Zhong, Zengmao Wang, Lina Liu, Chao Yang, Dawei Wang, Zhen Chen, Xiaoxiao Long, Meiqing Wang

In particular, we propose an Efficient Feature Distillation (EFD) module that employs contrastive learning to efficiently and accurately distill language embeddings derived from foundational models.

Contrastive Learning Robotic Grasping +1

UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images

1 code implementation26 Feb 2024 Zhen Chen, Qing Xu, Xinyu Liu, Yixuan Yuan

Moreover, to unleash the generalization capability of SAM across a variety of nuclei images, we devise a Domain-adaptive Tuning Encoder (DT-Encoder) to seamlessly harmonize visual features with domain-common and domain-specific knowledge, and further devise a Domain Query-enhanced Decoder (DQ-Decoder) by leveraging learnable domain queries for segmentation decoding in different nuclei domains.

Decoder Segmentation +1

Exploiting Duality in Open Information Extraction with Predicate Prompt

1 code implementation20 Jan 2024 Zhen Chen, Jingping Liu, Deqing Yang, Yanghua Xiao, Huimin Xu, ZongYu Wang, Rui Xie, Yunsen Xian

Open information extraction (OpenIE) aims to extract the schema-free triplets in the form of (\emph{subject}, \emph{predicate}, \emph{object}) from a given sentence.

Open Information Extraction Sentence

Energy Efficiency Optimization in Active Reconfigurable Intelligent Surface-Aided Integrated Sensing and Communication Systems

no code implementations28 Nov 2023 Junjie Ye, Mohamed Rihan, Peichang Zhang, Lei Huang, Stefano Buzzi, Zhen Chen

Energy efficiency (EE) is a challenging task in integrated sensing and communication (ISAC) systems, where high spectral efficiency and low energy consumption appear as conflicting requirements.

Surgical Temporal Action-aware Network with Sequence Regularization for Phase Recognition

no code implementations21 Nov 2023 Zhen Chen, Yuhao Zhai, Jun Zhang, Jinqiao Wang

Specifically, we propose an efficient multi-scale surgical temporal action (MS-STA) module, which integrates visual features with spatial and temporal knowledge of surgical actions at the cost of 2D networks.

Surgical phase recognition

PWISeg: Point-based Weakly-supervised Instance Segmentation for Surgical Instruments

1 code implementation16 Nov 2023 Zhen Sun, Huan Xu, Jinlin Wu, Zhen Chen, Zhen Lei, Hongbin Liu

To address this issue, we propose a novel yet effective weakly-supervised surgical instrument instance segmentation approach, named Point-based Weakly-supervised Instance Segmentation (PWISeg).

Instance Segmentation Segmentation +4

SurgPLAN: Surgical Phase Localization Network for Phase Recognition

no code implementations16 Nov 2023 Xingjian Luo, You Pang, Zhen Chen, Jinlin Wu, Zongmin Zhang, Zhen Lei, Hongbin Liu

To address these two challenges, we propose a Surgical Phase LocAlization Network, named SurgPLAN, to facilitate a more accurate and stable surgical phase recognition with the principle of temporal detection.

Surgical phase recognition

Weakly Supervised YOLO Network for Surgical Instrument Localization in Endoscopic Videos

2 code implementations23 Sep 2023 Rongfeng Wei, Jinlin Wu, Xuexue Bai, Ming Feng, Zhen Lei, Hongbin Liu, Zhen Chen

In minimally invasive surgery, surgical instrument localization is a crucial task for endoscopic videos, which enables various applications for improving surgical outcomes.

PL-UNeXt: Per-stage Edge Detail and Line Feature Guided Segmentation for Power Line Detection

no code implementations8 Mar 2023 Yang Cheng, Zhen Chen, Daming Liu

Power line detection is a critical inspection task for electricity companies and is also useful in avoiding drone obstacles.

Line Detection Segmentation

Mutual Information Learned Regressor: an Information-theoretic Viewpoint of Training Regression Systems

no code implementations23 Nov 2022 Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao, Yusen He, Yaohua Wang

We first argue that the MSE minimization approach is equivalent to a conditional entropy learning problem, and then propose a mutual information learning formulation for solving regression problems by using a reparameterization technique.

regression

Improving Continual Relation Extraction through Prototypical Contrastive Learning

no code implementations COLING 2022 Chengwei Hu, Deqing Yang, Haoliang Jin, Zhen Chen, Yanghua Xiao

Continual relation extraction (CRE) aims to extract relations towards the continuous and iterative arrival of new data, of which the major challenge is the catastrophic forgetting of old tasks.

Continual Relation Extraction Contrastive Learning +1

Neural Volumetric Mesh Generator

no code implementations6 Oct 2022 Yan Zheng, Lemeng Wu, Xingchao Liu, Zhen Chen, Qiang Liu, QiXing Huang

We first propose a diffusion-based generative model to tackle this problem by generating voxelized shapes with close-to-reality outlines and structures.

Mutual Information Learned Classifiers: an Information-theoretic Viewpoint of Training Deep Learning Classification Systems

no code implementations3 Oct 2022 Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao

Deep learning systems have been reported to acheive state-of-the-art performances in many applications, and one of the keys for achieving this is the existence of well trained classifiers on benchmark datasets which can be used as backbone feature extractors in downstream tasks.

Binary Classification Data Augmentation

Mutual Information Learned Classifiers: an Information-theoretic Viewpoint of Training Deep Learning Classification Systems

no code implementations21 Sep 2022 Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao

Deep learning systems have been reported to achieve state-of-the-art performances in many applications, and a key is the existence of well trained classifiers on benchmark datasets.

Binary Classification

Robust Modeling of Unknown Dynamical Systems via Ensemble Averaged Learning

no code implementations7 Mar 2022 Victor Churchill, Steve Manns, Zhen Chen, Dongbin Xiu

In the proposed ensemble averaging method, multiple models are independently trained and model predictions are averaged at each time step.

Stochastic Optimization

Exploring Gradient Flow Based Saliency for DNN Model Compression

1 code implementation24 Oct 2021 Xinyu Liu, Baopu Li, Zhen Chen, Yixuan Yuan

Model pruning aims to reduce the deep neural network (DNN) model size or computational overhead.

Image Classification Image Denoising +1

Personalized Retrogress-Resilient Framework for Real-World Medical Federated Learning

1 code implementation1 Oct 2021 Zhen Chen, Meilu Zhu, Chen Yang, Yixuan Yuan

To address this problem, we propose a personalized retrogress-resilient framework to produce a superior personalized model for each client.

Federated Learning

Deep Neural Network Modeling of Unknown Partial Differential Equations in Nodal Space

no code implementations7 Jun 2021 Zhen Chen, Victor Churchill, Kailiang Wu, Dongbin Xiu

Consequently, a trained DNN defines a predictive model for the underlying unknown PDE over structureless grids.

Modeling the Sequential Dependence among Audience Multi-step Conversions with Multi-task Learning in Targeted Display Advertising

3 code implementations18 May 2021 Dongbo Xi, Zhen Chen, Peng Yan, Yinger Zhang, Yongchun Zhu, Fuzhen Zhuang, Yu Chen

While considerable multi-task efforts have been made in this direction, a long-standing challenge is how to explicitly model the long-path sequential dependence among audience multi-step conversions for improving the end-to-end conversion.

Multi-Task Learning

Socially-Aware Conference Participant Recommendation with Personality Traits

no code implementations9 Aug 2020 Feng Xia, Nana Yaw Asabere, Haifeng Liu, Zhen Chen, Wei Wang

As a result of the importance of academic collaboration at smart conferences, various researchers have utilized recommender systems to generate effective recommendations for participants.

Recommendation Systems

Data-driven learning of non-autonomous systems

no code implementations2 Jun 2020 Tong Qin, Zhen Chen, John Jakeman, Dongbin Xiu

To circumvent the difficulty presented by the non-autonomous nature of the system, our method transforms the solution state into piecewise integration of the system over a discrete set of time instances.

Overcoming information reduced data and experimentally uncertain parameters in ptychography with regularized optimization

no code implementations4 May 2020 Marcel Schloz, Thomas C. Pekin, Zhen Chen, Wouter Van den Broek, David A. Muller, Christoph T. Koch

The overdetermination of the mathematical problem underlying ptychography is reduced by a host of experimentally more desirable settings.

Methods to Recover Unknown Processes in Partial Differential Equations Using Data

no code implementations5 Mar 2020 Zhen Chen, Kailiang Wu, Dongbin Xiu

Various numerical examples are then presented to demonstrate the performance and properties of the numerical methods.

Vocal Bursts Type Prediction

On generalized residue network for deep learning of unknown dynamical systems

no code implementations23 Jan 2020 Zhen Chen, Dongbin Xiu

When an existing coarse model is not available, we present numerical strategies for fast creation of coarse models, to be used in conjunction with the generalized ResNet.

Complementary Network with Adaptive Receptive Fields for Melanoma Segmentation

1 code implementation12 Jan 2020 Xiaoqing Guo, Zhen Chen, Yixuan Yuan

To tackle these issues, we propose a novel complementary network with adaptive receptive filed learning.

Lesion Segmentation Segmentation +1

Parameterized Synthetic Image Data Set for Fisheye Lens

no code implementations12 Nov 2018 Zhen Chen, Anthimos Georgiadis

Based on different projection geometry, a fisheye image can be presented as a parameterized non-rectilinear image.

Diversity

Deep Learning Framework for Multi-class Breast Cancer Histology Image Classification

no code implementations3 Feb 2018 Yeeleng S. Vang, Zhen Chen, Xiaohui Xie

In this work, we present a deep learning framework for multi-class breast cancer image classification as our submission to the International Conference on Image Analysis and Recognition (ICIAR) 2018 Grand Challenge on BreAst Cancer Histology images (BACH).

Breast Cancer Histology Image Classification General Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.