Search Results for author: Junyan Lin

Found 8 papers, 4 papers with code

Dynamic Cross-Modal Feature Interaction Network for Hyperspectral and LiDAR Data Classification

1 code implementation10 Mar 2025 Junyan Lin, Feng Gap, Lin Qi, Junyu Dong, Qian Du, Xinbo Gao

To address these limitations, we propose a novel Dynamic Cross-Modal Feature Interaction Network (DCMNet), the first framework leveraging a dynamic routing mechanism for HSI and LiDAR classification.

Classification

Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices

1 code implementation8 Mar 2025 Junyan Lin, Haoran Chen, Yue Fan, Yingqi Fan, Xin Jin, Hui Su, Jinlan Fu, Xiaoyu Shen

Multimodal Large Language Models (MLLMs) have made significant advancements in recent years, with visual features playing an increasingly critical role in enhancing model performance.

Language Modeling Language Modelling

Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task

no code implementations24 Dec 2024 Jinming Liu, Yuntao Wei, Junyan Lin, Shengyang Zhao, Heming Sun, Zhibo Chen, Wenjun Zeng, Xin Jin

While learned image compression methods have achieved impressive results in either human visual perception or machine vision tasks, they are often specialized only for one domain.

Disentanglement Image Compression

To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimodal Large Language Models

1 code implementation9 Oct 2024 Junyan Lin, Haoran Chen, Dawei Zhu, Xiaoyu Shen

However, there is still considerable debate on constructing MLLM architectures, particularly regarding the selection of appropriate connectors for perception tasks of varying granularities.

MME

Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs

no code implementations16 Aug 2024 Jinming Liu, Yuntao Wei, Junyan Lin, Shengyang Zhao, Heming Sun, Zhibo Chen, Wenjun Zeng, Xin Jin

We present a new image compression paradigm to achieve ``intelligently coding for machine'' by cleverly leveraging the common sense of Large Multimodal Models (LMMs).

Common Sense Reasoning Image Classification +5

Sparse Focus Network for Multi-Source Remote Sensing Data Classification

no code implementations3 Jun 2024 Xuepeng Jin, Junyan Lin, Feng Gao, Lin Qi, Yang Zhou

Multi-source remote sensing data classification has emerged as a prominent research topic with the advancement of various sensors.

Classification

Boosting Spatial-Spectral Masked Auto-Encoder Through Mining Redundant Spectra for HSI-SAR/LiDAR Classification

no code implementations3 Jun 2024 Junyan Lin, Xuepeng Jin, Feng Gao, Junyu Dong, Hui Yu

Although recent masked image modeling (MIM)-based HSI-LiDAR/SAR classification methods have gradually recognized the importance of the spectral information, they have not adequately addressed the redundancy among different spectra, resulting in information leakage during the pretraining stage.

Cannot find the paper you are looking for? You can Submit a new open access paper.