Search Results for author: Kun Hu

Found 18 papers, 7 papers with code

SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation

2 code implementations22 Dec 2023 Wenxi Yue, Jing Zhang, Kun Hu, Qiuxia Wu, ZongYuan Ge, Yong Xia, Jiebo Luo, Zhiyong Wang

Specifically, we achieve this by proposing (1) Collaborative Prompts that describe instrument structures via collaborating category-level and part-level texts; (2) Cross-Modal Prompt Encoder that encodes text prompts jointly with visual embeddings into discriminative part-level representations; and (3) Part-to-Whole Adaptive Fusion and Hierarchical Decoding that adaptively fuse the part-level representations into a whole for accurate instrument segmentation in surgical scenarios.

Segmentation Semantic Segmentation

Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance

no code implementations31 Aug 2023 Zexin Hu, Kun Hu, Clinton Mo, Lei Pan, Zhiyong Wang

Sketch-based terrain generation seeks to create realistic landscapes for virtual environments in various applications such as computer games, animation and virtual reality.

Denoising

Bridging the Gap: Fine-to-Coarse Sketch Interpolation Network for High-Quality Animation Sketch Inbetweening

no code implementations25 Aug 2023 Jiaming Shen, Kun Hu, Wei Bao, Chang Wen Chen, Zhiyong Wang

The 2D animation workflow is typically initiated with the creation of keyframes using sketch-based drawing.

Robust Audio Anti-Spoofing with Fusion-Reconstruction Learning on Multi-Order Spectrograms

1 code implementation18 Aug 2023 Penghui Wen, Kun Hu, Wenxi Yue, Sen Zhang, Wanlei Zhou, Zhiyong Wang

Robust audio anti-spoofing has been increasingly challenging due to the recent advancements on deepfake techniques.

Face Swapping

SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation

1 code implementation17 Aug 2023 Wenxi Yue, Jing Zhang, Kun Hu, Yong Xia, Jiebo Luo, Zhiyong Wang

However, we observe two problems with this naive pipeline: (1) the domain gap between natural objects and surgical instruments leads to inferior generalisation of SAM; and (2) SAM relies on precise point or box locations for accurate segmentation, requiring either extensive manual guidance or a well-performing specialist detector for prompt preparation, which leads to a complex multi-stage pipeline.

Image Segmentation Segmentation +1

Continuous Intermediate Token Learning with Implicit Motion Manifold for Keyframe Based Motion Interpolation

1 code implementation CVPR 2023 Clinton Ansun Mo, Kun Hu, Chengjiang Long, Zhiyong Wang

Deriving sophisticated 3D motions from sparse keyframes is a particularly challenging problem, due to continuity and exceptionally skeletal precision.

Motion Interpolation Motion Synthesis

Multi-Scale Control Signal-Aware Transformer for Motion Synthesis without Phase

no code implementations3 Mar 2023 Lintao Wang, Kun Hu, Lei Bai, Yu Ding, Wanli Ouyang, Zhiyong Wang

As past poses often contain useful auxiliary hints, in this paper, we propose a task-agnostic deep learning method, namely Multi-scale Control Signal-aware Transformer (MCS-T), with an attention based encoder-decoder architecture to discover the auxiliary information implicitly for synthesizing controllable motion without explicitly requiring auxiliary information such as phase.

Feature Engineering Motion Synthesis

Robust Knowledge Adaptation for Federated Unsupervised Person ReID

no code implementations18 Jan 2023 Jianfeng Weng, Kun Hu, Tingting Yao, Jingya Wang, Zhiyong Wang

Thus, in this work, a federated unsupervised cluster-contrastive (FedUCC) learning method is proposed for Person ReID.

Federated Learning Person Re-Identification

ICD-Face: Intra-class Compactness Distillation for Face Recognition

no code implementations ICCV 2023 Zhipeng Yu, Jiaheng Liu, Haoyu Qin, Yichao Wu, Kun Hu, Jiayi Tian, Ding Liang

Knowledge distillation is an effective model compression method to improve the performance of a lightweight student model by transferring the knowledge of a well-performed teacher model, which has been widely adopted in many computer vision tasks, including face recognition (FR).

Face Recognition Knowledge Distillation +1

TLDW: Extreme Multimodal Summarisation of News Videos

no code implementations16 Oct 2022 Peggy Tang, Kun Hu, Lei Zhang, Jiebo Luo, Zhiyong Wang

Multimodal summarisation with multimodal output is drawing increasing attention due to the rapid growth of multimedia data.

Sentence

Multi-level Adversarial Spatio-temporal Learning for Footstep Pressure based FoG Detection

no code implementations22 Sep 2022 Kun Hu, Shaohui Mei, Wei Wang, Kaylena A. Ehgoetz Martens, Liang Wang, Simon J. G. Lewis, David D. Feng, Zhiyong Wang

The proposed scheme also sheds light on improving subject-level clinical studies from other scenarios as it can be integrated with many existing deep architectures.

M2-Net: Multi-stages Specular Highlight Detection and Removal in Multi-scenes

1 code implementation20 Jul 2022 Zhaoyangfan Huang, Kun Hu, Xingjun Wang

The framework consists of three main components, highlight feature extractor module, highlight coarse removal module, and highlight refine removal module.

Highlight Detection highlight removal

OTExtSum: Extractive Text Summarisation with Optimal Transport

1 code implementation Findings (NAACL) 2022 Peggy Tang, Kun Hu, Rui Yan, Lei Zhang, Junbin Gao, Zhiyong Wang

Optimal sentence extraction is conceptualised as obtaining an optimal summary that minimises the transportation cost to a given document regarding their semantic distributions.

Sentence

Sign Language Translation with Hierarchical Spatio-TemporalGraph Neural Network

no code implementations14 Nov 2021 Jichao Kan, Kun Hu, Markus Hagenbuchner, Ah Chung Tsoi, Mohammed Bennamounm, Zhiyong Wang

Therefore, in this paper, these unique characteristics of sign languages are formulated as hierarchical spatio-temporal graph representations, including high-level and fine-level graphs of which a vertex characterizes a specified body part and an edge represents their interactions.

Machine Translation NMT +2

A Framework in CRM Customer Lifecycle: Identify Downward Trend and Potential Issues Detection

no code implementations25 Feb 2018 Kun Hu, Zhe Li, Ying Liu, Luyin Cheng, Qi Yang, Yan Li

In the first prediction part, we focus on predicting the downward trend, which is an earlier stage of the customer lifecycle compared to churn.

Causal Inference Management +1

Cannot find the paper you are looking for? You can Submit a new open access paper.