Search Results for author: Hongxiang Li

Found 8 papers, 2 papers with code

Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation

no code implementations • 3 Apr 2024 • Xiaoshuang Huang, Hongxiang Li, Meng Cao, Long Chen, Chenyu You, Dong An

Recent developments underscore the potential of textual information in enhancing learning models for a deeper understanding of medical visual semantics.

Image Segmentation Medical Image Segmentation +2

Paper
Add Code

Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction

1 code implementation • 18 Jan 2024 • Qingyun Wang, Zixuan Zhang, Hongxiang Li, Xuan Liu, Jiawei Han, Huimin Zhao, Heng Ji

Fine-grained few-shot entity extraction in the chemical domain faces two unique challenges.

Chemical Entity Recognition Few-shot NER +1

Paper
Code

ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding

no code implementations • 19 Nov 2023 • Xuxin Cheng, Bowen Cao, Qichen Ye, Zhihong Zhu, Hongxiang Li, Yuexian Zou

Specifically, in fine-tuning, we apply mutual learning and train two SLU models on the manual transcripts and the ASR transcripts, respectively, aiming to iteratively share knowledge between these two models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory

1 code implementation • ICCV 2023 • Hongxiang Li, Meng Cao, Xuxin Cheng, Yaowei Li, Zhihong Zhu, Yuexian Zou

Due to two annoying issues in video grounding: (1) the co-existence of some visual entities in both ground truth and other moments, \ie semantic overlapping; (2) only a few moments in the video are annotated, \ie sparse annotation dilemma, vanilla contrastive learning is unable to model the correlations between temporally distant moments and learned inconsistent video representations.

Contrastive Learning Video Grounding

Paper
Code

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation

no code implementations • ICCV 2023 • Yaowei Li, Bang Yang, Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yuexian Zou

Automatic radiology report generation has attracted enormous research interest due to its practical value in reducing the workload of radiologists.

Sentence

Paper
Add Code

Exploiting Auxiliary Caption for Video Grounding

no code implementations • 15 Jan 2023 • Hongxiang Li, Meng Cao, Xuxin Cheng, Zhihong Zhu, Yaowei Li, Yuexian Zou

Video grounding aims to locate a moment of interest matching the given query sentence from an untrimmed video.

Contrastive Learning Dense Video Captioning +2

Paper
Add Code

Macroblock Classification Method for Video Applications Involving Motions

no code implementations • 28 Feb 2015 • Weiyao Lin, Ming-Ting Sun, Hongxiang Li, Zhenzhong Chen, Wei Li, Bing Zhou

We demonstrate that this low-computation-complexity method can efficiently catch the characteristics of the frame.

Change Detection Classification +2

Paper
Add Code

A new network-based algorithm for human activity recognition in video

no code implementations • 21 Feb 2015 • Weiyao Lin, Yuanzhe Chen, Jianxin Wu, Hanli Wang, Bin Sheng, Hongxiang Li

Based on this network, we further model people in the scene as packages while human activities can be modeled as the process of package transmission in the network.

Activity Detection Activity Recognition In Videos +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.