Search Results for author: Kaidong Zhang

Found 7 papers, 6 papers with code

Surfer: Progressive Reasoning with World Models for Robotic Manipulation

no code implementations20 Jun 2023 Pengzhen Ren, Kaidong Zhang, Hetao Zheng, Zixuan Li, Yuhang Wen, Fengda Zhu, Mas Ma, Xiaodan Liang

To conduct a comprehensive and systematic evaluation of the robot manipulation model in terms of language understanding and physical execution, we also created a robotic manipulation benchmark with progressive reasoning tasks, called SeaWave.

Decision Making Natural Language Understanding +2

A Dataset for Deep Learning-based Bone Structure Analyses in Total Hip Arthroplasty

1 code implementation7 Jun 2023 Kaidong Zhang, Ziyang Gan, Dong Liu, Xifu Shang

For THA, it is of clinical significance to analyze the bone structure from the CT images, especially to observe the structure of the acetabulum and femoral head, before the surgical procedure.

Active Learning Anatomy +3

Towards Interactive Image Inpainting via Sketch Refinement

1 code implementation1 Jun 2023 Chang Liu, Shunxin Xu, Jialun Peng, Kaidong Zhang, Dong Liu

To address this problem, we propose a two-stage image inpainting method termed SketchRefiner.

Image Inpainting

Customized Segment Anything Model for Medical Image Segmentation

1 code implementation26 Apr 2023 Kaidong Zhang, Dong Liu

Different from the previous methods, SAMed is built upon the large-scale image segmentation model, Segment Anything Model (SAM), to explore the new research paradigm of customizing large-scale models for medical image segmentation.

Image Segmentation Medical Image Segmentation +3

Exploiting Optical Flow Guidance for Transformer-Based Video Inpainting

2 code implementations24 Jan 2023 Kaidong Zhang, Jialun Peng, Jingjing Fu, Dong Liu

Transformers have been widely used for video processing owing to the multi-head self attention (MHSA) mechanism.

 Ranked #1 on Video Inpainting on DAVIS (SSIM (square) metric)

Optical Flow Estimation Video Inpainting

Flow-Guided Transformer for Video Inpainting

1 code implementation14 Aug 2022 Kaidong Zhang, Jingjing Fu, Dong Liu

Especially in spatial transformer, we design a dual perspective spatial MHSA, which integrates the global tokens to the window-based attention.

Retrieval Video Inpainting

Inertia-Guided Flow Completion and Style Fusion for Video Inpainting

1 code implementation CVPR 2022 Kaidong Zhang, Jingjing Fu, Dong Liu

We propose a flow completion network to align and aggregate flow features from the consecutive flow sequences based on the inertia prior.

Optical Flow Estimation valid +1

Cannot find the paper you are looking for? You can Submit a new open access paper.