Search Results for author: Shan Jiang

Found 30 papers, 4 papers with code

Target-aware Bidirectional Fusion Transformer for Aerial Object Tracking

no code implementations13 Mar 2025 Xinglong Sun, Haijiang Sun, Shan Jiang, Jiacheng Wang, Jiasong Wang

The trackers based on lightweight neural networks have achieved great success in the field of aerial remote sensing, most of which aggregate multi-stage deep features to lift the tracking quality.

Object Tracking

Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

no code implementations22 Feb 2025 Qianqi Yan, Yue Fan, Hongquan Li, Shan Jiang, Yang Zhao, Xinze Guan, Ching-Chen Kuo, Xin Eric Wang

Existing Multimodal Large Language Models (MLLMs) are predominantly trained and tested on consistent visual-textual inputs, leaving open the question of whether they can handle inconsistencies in real-world, layout-rich content.

Multimodal Reasoning

Real-Time Neural-Enhancement for Online Cloud Gaming

no code implementations12 Jan 2025 Shan Jiang, Zhenhua Han, Haisheng Tan, Xinyang Jiang, Yifan Yang, Xiaoxi Zhang, Hongqiu Ni, Yuqing Yang, Xiang-Yang Li

To address this, we introduce River, a cloud gaming delivery framework designed based on the observation that video segment features in cloud gaming are typically repetitive and redundant.

Super-Resolution

Distributed satellite information networks: Architecture, enabling technologies, and trends

no code implementations17 Dec 2024 Qinyu Zhang, Liang Xu, Jianhao Huang, Tao Yang, Jian Jiao, Ye Wang, Yao Shi, Chiya Zhang, Xingjian Zhang, Ke Zhang, Yupeng Gong, Na Deng, Nan Zhao, Zhen Gao, Shujun Han, Xiaodong Xu, Li You, Dongming Wang, Shan Jiang, Dixian Zhao, Nan Zhang, Liujun Hu, Xiongwen He, Yonghui Li, Xiqi Gao, Xiaohu You

In this context, the distributed satellite information networks (DSIN), exemplified by the cohesive clustered satellites system, have emerged as an innovative architecture, bridging information gaps across diverse satellite systems, such as communication, navigation, and remote sensing, and establishing a unified, open information network paradigm to support resilient space information services.

Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding

1 code implementation27 Jun 2024 Yue Fan, Lei Ding, Ching-Chen Kuo, Shan Jiang, Yang Zhao, Xinze Guan, Jie Yang, Yi Zhang, Xin Eric Wang

Based on the tree, our ToL agent not only comprehends the content of the indicated area but also articulates the layout and spatial relationships between elements.

SHARON: Secure and Efficient Cross-shard Transaction Processing via Shard Rotation

no code implementations INFOCOM 2024 Shan Jiang, Jiannong Cao, Cheung Leong Tung, Yuqin Wang, Shan Wang

Recently, sharding has become a popular direction to scale out blockchain systems by dividing the network into shards that process transactions in parallel.

Scheduling

Multi-attention Associate Prediction Network for Visual Tracking

no code implementations25 Mar 2024 Xinglong Sun, Haijiang Sun, Shan Jiang, Jiacheng Wang, Xilai Wei, Zhonghe Hu

They are capable of fully capturing the category-related semantics for classification and the local spatial contexts for regression, respectively.

Prediction regression +1

A Clustering Method with Graph Maximum Decoding Information

no code implementations18 Mar 2024 Xinrun Xu, Manying Lv, Zhanbiao Lian, Yurong Wu, Jin Yan, Shan Jiang, Zhiming Ding

Despite its efficacy, the current clustering method utilizing the graph-based model overlooks the uncertainty associated with random walk access between nodes and the embedded structural information in the data.

Clustering Computational Efficiency +1

Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA

no code implementations29 Jan 2024 Yue Fan, Jing Gu, Kaiwen Zhou, Qianqi Yan, Shan Jiang, Ching-Chen Kuo, Xinze Guan, Xin Eric Wang

Our evaluation shows that questions in the MultipanelVQA benchmark pose significant challenges to the state-of-the-art Multimodal Large Language Models (MLLMs) tested, even though humans can attain approximately 99% accuracy on these questions.

Benchmarking Image Comprehension +4

Assessing and Enhancing Robustness of Deep Learning Models with Corruption Emulation in Digital Pathology

no code implementations31 Oct 2023 Peixiang Huang, Songtao Zhang, Yulu Gan, Rui Xu, Rongqi Zhu, Wenkang Qin, Limei Guo, Shan Jiang, Lin Luo

Deep learning in digital pathology brings intelligence and automation as substantial enhancements to pathological analysis, the gold standard of clinical diagnosis.

Diagnostic

Online Streaming Video Super-Resolution with Convolutional Look-Up Table

no code implementations1 Mar 2023 Guanghao Yin, Zefan Qu, Xinyang Jiang, Shan Jiang, Zhenhua Han, Ningxin Zheng, Xiaohong Liu, Huan Yang, Yuqing Yang, Dongsheng Li, Lili Qiu

To facilitate the research on this problem, a new benchmark dataset named LDV-WebRTC is constructed based on a real-world online streaming system.

Video Super-Resolution

A Unified Multi-view Multi-person Tracking Framework

no code implementations8 Feb 2023 Fan Yang, Shigeyuki Odashima, Sosuke Yamao, Hiroaki Fujimoto, Shoichi Masui, Shan Jiang

Although there is a significant development in 3D Multi-view Multi-person Tracking (3D MM-Tracking), current 3D MM-Tracking frameworks are designed separately for footprint and pose tracking.

3D Multi-Person Pose Estimation Multiple People Tracking +2

Multifunctional fiber-based optoacoustic emitter for non-genetic bidirectional neural communication

no code implementations9 Jan 2023 Nan Zheng, Ying Jiang, Shan Jiang, Jongwoon Kim, Yueming Li, Ji-Xin Cheng, Xiaoting Jia, Chen Yang

In vivo application of mFOE for successful simultaneous optoacoustic stimulation and electrical recording of brain activities was confirmed in mouse hippocampus in both acute and chronical applications up to 1 month.

Hippocampus

Hard to Track Objects with Irregular Motions and Similar Appearances? Make It Easier by Buffering the Matching Space

1 code implementation24 Nov 2022 Fan Yang, Shigeyuki Odashima, Shoichi Masui, Shan Jiang

To address this issue, our C-BIoU tracker adds buffers to expand the matching space of detections and tracks, which mitigates the effect of irregular motions in two aspects: one is to directly match identical but non-overlapping detections and tracks in adjacent frames, and the other is to compensate for the motion estimation bias in the matching space.

Motion Estimation Multi-Object Tracking +1

SoccerNet 2022 Challenges Results

7 code implementations5 Oct 2022 Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li

The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.

Action Spotting Camera Calibration +3

Hierarchical Reinforcement Learning with Opponent Modeling for Distributed Multi-agent Cooperation

no code implementations25 Jun 2022 Zhixuan Liang, Jiannong Cao, Shan Jiang, Divya Saxena, Huafeng Xu

To tackle the issues, we propose a hierarchical reinforcement learning approach with high-level decision-making and low-level individual control for efficient policy search.

Autonomous Vehicles Decision Making +4

From Multi-agent to Multi-robot: A Scalable Training and Evaluation Platform for Multi-robot Reinforcement Learning

no code implementations20 Jun 2022 Zhiuxan Liang, Jiannong Cao, Shan Jiang, Divya Saxena, Jinlin Chen, Huafeng Xu

Precisely, SMART consists of two components: 1) a simulation environment that provides a variety of complex interaction scenarios for training and 2) a real-world multi-robot system for realistic performance evaluation.

Multi-agent Reinforcement Learning reinforcement-learning +2

Channel-wise Alignment for Adaptive Object Detection

no code implementations7 Sep 2020 Hang Yang, Shan Jiang, Xinge Zhu, Mingyang Huang, Zhiqiang Shen, Chunxiao Liu, Jianping Shi

Existing methods on this task usually draw attention on the high-level alignment based on the whole image or object of interest, which naturally, cannot fully utilize the fine-grained channel information.

Instance Segmentation Object +3

Effects of Digital Map on the RT-based Channel Model for UAV mmWave Communications

no code implementations28 Jul 2020 Qiuming Zhu, Shan Jiang, Cheng-Xiang Wang, Boyu Hua, Kai Mao, Xiaomin Chen, Weizhi Zhong

Based on the geometry and ray tracing (RT) theory, a millimeter wave (mmWave) channel model and parameter computation method for unmanned aerial vehicle (UAV) assisted air-to-ground (A2G) communications are proposed in this paper.

DEMN: Distilled-Exposition Enhanced Matching Network for Story Comprehension

no code implementations PACLIC 2018 Chunhua Liu, Haiou Zhang, Shan Jiang, Dong Yu

We divide a complete story into three narrative segments: an \textit{exposition}, a \textit{climax}, and an \textit{ending}.

Cloze Test

Multi-turn Inference Matching Network for Natural Language Inference

1 code implementation8 Jan 2019 Chunhua Liu, Shan Jiang, Hainan Yu, Dong Yu

The inference of each turn is performed on the current matching feature and the memory.

Natural Language Inference

Cannot find the paper you are looking for? You can Submit a new open access paper.