Search Results for author: Changyin Sun

Found 21 papers, 7 papers with code

LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition

no code implementations9 Jul 2024 Teng Wang, Lingquan Meng, Lei Cheng, Changyin Sun

We start from a new perspective and attempt to build a discriminative global representations by fusing image data and text descriptions of the the visual scene.

Representation Learning Scene Understanding +2

Window-to-Window BEV Representation Learning for Limited FoV Cross-View Geo-localization

no code implementations9 Jul 2024 Lei Cheng, Teng Wang, Lingquan Meng, Changyin Sun

Subsequently, the cross-attention is performed between the matched BEV and ground windows to learn the robust BEV representation.

geo-localization Representation Learning

MCMS: Multi-Category Information and Multi-Scale Stripe Attention for Blind Motion Deblurring

no code implementations2 May 2024 Nianzu Qiao, Lamei Di, Changyin Sun

As a result, the model effectively improves motion deblurring by fusing the edge information of the high-frequency component and the structural information of the low-frequency component.

Deblurring

Single Image Super-Resolution Based on Global-Local Information Synergy

no code implementations2 May 2024 Nianzu Qiao, Lamei Di, Changyin Sun

To overcome the existing challenges, a novel super-resolution reconstruction algorithm is proposed in this paper.

Image Super-Resolution

Empowering Large Language Models on Robotic Manipulation with Affordance Prompting

no code implementations17 Apr 2024 Guangran Cheng, Chuheng Zhang, Wenzhe Cai, Li Zhao, Changyin Sun, Jiang Bian

While large language models (LLMs) are successful in completing various language processing tasks, they easily fail to interact with the physical world by generating control sequences properly.

Recurrent Aligned Network for Generalized Pedestrian Trajectory Prediction

no code implementations9 Mar 2024 Yonghao Dong, Le Wang, Sanping Zhou, Gang Hua, Changyin Sun

Previous studies have tried to tackle this problem by leveraging a portion of the trajectory data from the target domain to adapt the model.

Domain Adaptation Pedestrian Trajectory Prediction +1

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

1 code implementation25 Dec 2023 Wenzhang Liu, Wenzhe Cai, Kun Jiang, Guangran Cheng, Yuanda Wang, Jiawei Wang, Jingyu Cao, Lele Xu, Chaoxu Mu, Changyin Sun

In this paper, we present XuanCe, a comprehensive and unified deep reinforcement learning (DRL) library designed to be compatible with PyTorch, TensorFlow, and MindSpore.

Deep Reinforcement Learning reinforcement-learning

Sparse Pedestrian Character Learning for Trajectory Prediction

no code implementations27 Nov 2023 Yonghao Dong, Le Wang, Sanpin Zhou, Gang Hua, Changyin Sun

Specifically, TSNet learns the negative-removed characters in the sparse character representation stream to improve the trajectory embedding obtained in the trajectory representation stream.

Autonomous Driving Pedestrian Trajectory Prediction +1

Robust Navigation with Cross-Modal Fusion and Knowledge Transfer

1 code implementation23 Sep 2023 Wenzhe Cai, Guangran Cheng, Lingyue Kong, Lu Dong, Changyin Sun

We consider the problem of improving the generalization of mobile robots and achieving sim-to-real transfer for navigation skills.

Transfer Learning

LANDMARK: Language-guided Representation Enhancement Framework for Scene Graph Generation

1 code implementation2 Mar 2023 Xiaoguang Chang, Teng Wang, Shaowei Cai, Changyin Sun

Besides, representation-level unbiased strategies endow LANDMARK the advantage of compatibility with other methods.

Graph Generation Object +3

Transformer-Guided Convolutional Neural Network for Cross-View Geolocalization

no code implementations21 Apr 2022 Teng Wang, Shujuan Fan, Daikun Liu, Changyin Sun

Furthermore, we design a dual-branch Transformer head network to combine image features from multi-scale windows in order to improve details of the global feature representation.

Representation Learning

Biasing Like Human: A Cognitive Bias Framework for Scene Graph Generation

1 code implementation17 Mar 2022 Xiaoguang Chang, Teng Wang, Changyin Sun, Wenzhe Cai

Scene graph generation is a sophisticated task because there is no specific recognition pattern (e. g., "looking at" and "near" have no conspicuous difference concerning vision, whereas "near" could occur between entities with different morphology).

Graph Generation Predicate Classification +2

Multi-agent Soft Actor-Critic Based Hybrid Motion Planner for Mobile Robots

no code implementations13 Dec 2021 Zichen He, Lu Dong, Chunwei Song, Changyin Sun

In this paper, a novel hybrid multi-robot motion planner that can be applied under non-communication and local observable conditions is presented.

A coarse-to-fine approach for dynamic-to-static image translation

1 code implementation Pattern Recognition 2021 Teng Wang, Lin Wu, Changyin Sun

Using the coarse predicted image, we explicitly infer a more accurate dynamic mask to identify both dynamic objects and their shadows, so that the task could be effectively converted to an image inpainting problem.

Image Inpainting Image-to-Image Translation +2

Learning Temporally Causal Latent Processes from General Temporal Data

2 code implementations11 Oct 2021 Weiran Yao, Yuewen Sun, Alex Ho, Changyin Sun, Kun Zhang

In this work, we consider both a nonparametric, nonstationary setting and a parametric setting for the latent processes and propose two provable conditions under which temporally causal latent processes can be identified from their nonlinear mixtures.

Causal Discovery Representation Learning +1

Learning Temporally Latent Causal Processes from General Temporal Data

2 code implementations ICLR 2022 Weiran Yao, Yuewen Sun, Alex Ho, Changyin Sun, Kun Zhang

Our goal is to find time-delayed latent causal variables and identify their relations from temporal measured variables.

Causal Discovery Disentanglement +1

Multi-modal Visual Place Recognition in Dynamics-Invariant Perception Space

no code implementations17 May 2021 Lin Wu, Teng Wang, Changyin Sun

In this letter, we for the first time explore the use of multi-modal fusion of semantic and visual modalities in dynamics-invariant space to improve place recognition in dynamic environments.

Segmentation Semantic Segmentation +1

Crowd Counting via Weighted VLAD on Dense Attribute Feature Maps

no code implementations29 Apr 2016 Biyun Sheng, Chunhua Shen, Guosheng Lin, Jun Li, Wankou Yang, Changyin Sun

Crowd counting is an important task in computer vision, which has many applications in video surveillance.

Attribute Crowd Counting

Cannot find the paper you are looking for? You can Submit a new open access paper.