Search Results for author: Yifei Chen

Found 21 papers, 11 papers with code

CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model

no code implementations • 8 Mar 2024 • Zhengyi Wang, Yikai Wang, Yifei Chen, Chendong Xiang, Shuo Chen, Dajiang Yu, Chongxuan Li, Hang Su, Jun Zhu

In this work, we present the Convolutional Reconstruction Model (CRM), a high-fidelity feed-forward single image-to-3D generative model.

Image to 3D

Paper
Add Code

Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction

1 code implementation • 29 Feb 2024 • Hao Li, Ying Chen, Yifei Chen, Wenxian Yang, Bowen Ding, Yuchen Han, Liansheng Wang, Rongshan Yu

It is designed to enhance the model's generalizability by leveraging the interaction between localized visual patterns and fine-grained pathological semantics.

Image Classification Language Modelling +3

Paper
Code

TC-DiffRecon: Texture coordination MRI reconstruction method based on diffusion model and modified MF-UNet method

1 code implementation • 17 Feb 2024 • Chenyan Zhang, Yifei Chen, Zhenxiong Fan, Yiyu Huang, Wenchao Weng, Ruiquan Ge, Dong Zeng, Changmiao Wang

We also suggest the incorporation of the MF-UNet module, designed to enhance the quality of MRI images generated by the model while mitigating the over-smoothing issue to a certain extent.

Image Generation MRI Reconstruction

Paper
Code

Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation Strategies

1 code implementation • 17 Feb 2024 • Yifei Chen, Chenyan Zhang, Yifan Ke, Yiyu Huang, Xuezhou Dai, Feiwei Qin, Yongquan Zhang, Xiaodong Zhang, Changmiao Wang

Traditional supervised learning methods have historically encountered certain constraints in medical image segmentation due to the challenging collection process, high labeling cost, low signal-to-noise ratio, and complex features characterizing biomedical images.

Data Augmentation Image Segmentation +2

Paper
Code

A Survey on Hallucination in Large Vision-Language Models

no code implementations • 1 Feb 2024 • Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng

In this comprehensive survey, we dissect LVLM-related hallucinations in an attempt to establish an overview and facilitate future mitigation.

Hallucination

Paper
Add Code

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

1 code implementation • 12 Jan 2024 • Yutao Zhu, Peitian Zhang, Chenghao Zhang, Yifei Chen, Binyu Xie, Zhicheng Dou, Zheng Liu, Ji-Rong Wen

Despite this, their application to information retrieval (IR) tasks is still challenging due to the infrequent occurrence of many IR-specific concepts in natural language.

document understanding Information Retrieval +2

182

Paper
Code

Accurate Leukocyte Detection Based on Deformable-DETR and Multi-Level Feature Fusion for Aiding Diagnosis of Blood Diseases

1 code implementation • 1 Jan 2024 • Yifei Chen, Chenyan Zhang, Ben Chen, Yiyu Huang, Yifei Sun, Changmiao Wang, Xianjun Fu, Yuxing Dai, Feiwei Qin, Yong Peng, Yu Gao

To address these issues, this paper proposes an innovative method of leukocyte detection: the Multi-level Feature Fusion and Deformable Self-attention DETR (MFDS-DETR).

Paper
Code

SCUNet++: Swin-UNet and CNN Bottleneck Hybrid Architecture with Multi-Fusion Dense Skip Connection for Pulmonary Embolism CT Image Segmentation

1 code implementation • 22 Dec 2023 • Yifei Chen, Binfeng Zou, Zhaoxin Guo, Yiyu Huang, Yifan Huang, Feiwei Qin, Qinhai Li, Changmiao Wang

These findings demonstrate that our method exhibits strong performance in PE segmentation tasks, potentially enhancing the accuracy of automatic segmentation of PE and providing a powerful diagnostic tool for clinical physicians.

Image Segmentation Segmentation +1

Paper
Code

Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition

no code implementations • 27 Nov 2023 • Yifei Chen, Dapeng Chen, Ruijin Liu, Sai Zhou, Wenyuan Xue, Wei Peng

With the aligned entities, we feed their text embeddings to a transformer-based video adapter as the queries, which can help extract the semantics of the most important entities from a video to a vector.

Action Recognition Representation Learning +1

Paper
Add Code

Warfare:Breaking the Watermark Protection of AI-Generated Content

no code implementations • 27 Sep 2023 • Guanlin Li, Yifei Chen, Jie Zhang, Jiwei Li, Shangwei Guo, Tianwei Zhang

We propose Warfare, a unified methodology to achieve both attacks in a holistic way.

Generative Adversarial Network

Paper
Add Code

Unveiling the Hidden Realm: Self-supervised Skeleton-based Action Recognition in Occluded Environments

1 code implementation • 21 Sep 2023 • Yifei Chen, Kunyu Peng, Alina Roitberg, David Schneider, Jiaming Zhang, Junwei Zheng, Ruiping Liu, Yufan Chen, Kailun Yang, Rainer Stiefelhagen

To integrate action recognition methods into autonomous robotic systems, it is crucial to consider adverse situations involving target occlusions.

Action Recognition Imputation +1

Paper
Code

ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition

no code implementations • 15 Aug 2023 • Wenyuan Xue, Dapeng Chen, Baosheng Yu, Yifei Chen, Sai Zhou, Wei Peng

Visual chart recognition systems are gaining increasing attention due to the growing demand for automatically identifying table headers and values from chart images.

Keypoint Detection

Paper
Add Code

ICAFusion: Iterative Cross-Attention Guided Feature Fusion for Multispectral Object Detection

1 code implementation • 15 Aug 2023 • Jifeng Shen, Yifei Chen, Yue Liu, Xin Zuo, Heng Fan, Wankou Yang

Effective feature fusion of multispectral images plays a crucial role in multi-spectral object detection.

Ranked #2 on Object Detection on VEDAI

Multispectral Object Detection object-detection +1

Paper
Code

Efficient information recovery from Pauli noise via classical shadow

no code implementations • 6 May 2023 • Yifei Chen, Zhan Yu, Chenghong Zhu, Xin Wang

The rapid advancement of quantum computing has led to an extensive demand for effective techniques to extract classical information from quantum systems, particularly in fields like quantum machine learning and quantum chemistry.

Quantum Machine Learning

Paper
Add Code

Video Action Recognition with Attentive Semantic Units

no code implementations • ICCV 2023 • Yifei Chen, Dapeng Chen, Ruijin Liu, Hao Li, Wei Peng

Supervised by the semantics of action labels, recent works adapt the visual branch of VLMs to learn video representations.

Action Recognition Temporal Action Localization +1

Paper
Add Code

PPT: token-Pruned Pose Transformer for monocular and multi-view human pose estimation

1 code implementation • 16 Sep 2022 • Haoyu Ma, Zhe Wang, Yifei Chen, Deying Kong, Liangjian Chen, Xingwei Liu, Xiangyi Yan, Hao Tang, Xiaohui Xie

In this paper, we propose the token-Pruned Pose Transformer (PPT) for 2D human pose estimation, which can locate a rough human mask and performs self-attention only within selected tokens.

Ranked #17 on 3D Human Pose Estimation on Human3.6M (using extra training data)

2D Human Pose Estimation 3D Human Pose Estimation

Paper
Code

Real-Time Video Super-Resolution on Smartphones with Deep Learning, Mobile AI 2021 Challenge: Report

no code implementations • 17 May 2021 • Andrey Ignatov, Andres Romero, Heewon Kim, Radu Timofte, Chiu Man Ho, Zibo Meng, Kyoung Mu Lee, Yuxiang Chen, Yutong Wang, Zeyu Long, Chenhao Wang, Yifei Chen, Boshen Xu, Shuhang Gu, Lixin Duan, Wen Li, Wang Bofei, Zhang Diankai, Zheng Chengjian, Liu Shaoli, Gao Si, Zhang Xiaofeng, Lu Kaidi, Xu Tianyu, Zheng Hui, Xinbo Gao, Xiumei Wang, Jiaming Guo, Xueyi Zhou, Hao Jia, Youliang Yan

Video super-resolution has recently become one of the most important mobile-related problems due to the rise of video communication and streaming services.

Video Super-Resolution

Paper
Add Code

Rotation-invariant Mixed Graphical Model Network for 2D Hand Pose Estimation

no code implementations • 5 Feb 2020 • Deying Kong, Haoyu Ma, Yifei Chen, Xiaohui Xie

In this paper, we propose a new architecture named Rotation-invariant Mixed Graphical Model Network (R-MGMN) to solve the problem of 2D hand pose estimation from a monocular RGB image.

Hand Pose Estimation

Paper
Add Code

Nonparametric Structure Regularization Machine for 2D Hand Pose Estimation

1 code implementation • 24 Jan 2020 • Yifei Chen, Haoyu Ma, Deying Kong, Xiangyi Yan, Jianbao Wu, Wei Fan, Xiaohui Xie

We propose a novel Nonparametric Structure Regularization Machine (NSRM) for 2D hand pose estimation, adopting a cascade multi-task architecture to learn hand structure and keypoint representations jointly.

Hand Pose Estimation

102

Paper
Code

Adaptive Graphical Model Network for 2D Handpose Estimation

1 code implementation • 18 Sep 2019 • Deying Kong, Yifei Chen, Haoyu Ma, Xiangyi Yan, Xiaohui Xie

In this paper, we propose a new architecture called Adaptive Graphical Model Network (AGMN) to tackle the task of 2D hand pose estimation from a monocular RGB image.

Hand Pose Estimation

Paper
Code

Inference in Kingman's Coalescent with Particle Markov Chain Monte Carlo Method

no code implementations • 3 May 2013 • Yifei Chen, Xiaohui Xie

We propose a new algorithm to do posterior sampling of Kingman's coalescent, based upon the Particle Markov Chain Monte Carlo methodology.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.