Search Results for author: Shaoyi Du

Found 29 papers, 13 papers with code

SoftHGNN: Soft Hypergraph Neural Networks for General Visual Recognition

1 code implementation21 May 2025 Mengqi Lei, Yihong Wu, Siqi Li, Xinhu Zheng, Juan Wang, Yue Gao, Shaoyi Du

However, existing hypergraph neural networks typically rely on static and hard hyperedge assignments, leading to excessive and redundant hyperedges with hard binary vertex memberships that overlook the continuity of visual semantics.

Hyper-RAG: Combating LLM Hallucinations using Hypergraph-Driven Retrieval-Augmented Generation

no code implementations30 Mar 2025 Yifan Feng, Hao Hu, Xingliang Hou, Shiquan Liu, Shihui Ying, Shaoyi Du, Han Hu, Yue Gao

Large language models (LLMs) have transformed various sectors, including education, finance, and medicine, by enhancing content generation and decision-making processes.

RAG Retrieval +1

ERetinex: Event Camera Meets Retinex Theory for Low-Light Image Enhancement

1 code implementation4 Mar 2025 Xuejian Guo, Zhiqiang Tian, Yuehang Wang, Siqi Li, Yu Jiang, Shaoyi Du, Yue Gao

Additionally, we propose an effective fusion strategy that combines the high dynamic range data from event cameras with the color information of traditional images to enhance image quality.

Image Restoration Low-Light Image Enhancement

Hypergraph Foundation Model

no code implementations3 Mar 2025 Yifan Feng, Shiquan Liu, Xiangmin Han, Shaoyi Du, Zongze Wu, Han Hu, Yue Gao

Hypergraph neural networks (HGNNs) effectively model complex high-order relationships in domains like protein interactions and social networks by connecting multiple vertices through hyperedges, enhancing modeling capabilities, and reducing information loss.

Diversity model

Jointly Understand Your Command and Intention:Reciprocal Co-Evolution between Scene-Aware 3D Human Motion Synthesis and Analysis

no code implementations1 Mar 2025 Xuehao Gao, Yang Yang, Shaoyi Du, Guo-Jun Qi, Junwei Han

As two intimate reciprocal tasks, scene-aware human motion synthesis and analysis require a joint understanding between multiple modalities, including 3D body motions, 3D scenes, and textual descriptions.

Diversity Motion Generation +1

EigenActor: Variant Body-Object Interaction Generation Evolved from Invariant Action Basis Reasoning

no code implementations1 Mar 2025 Xuehao Gao, Yang Yang, Shaoyi Du, Yang Wu, Yebin Liu, Guo-Jun Qi

Specifically, the first canonical body action inference stage focuses on learning intra-class shareable body motion priors and mapping given text-based semantics to action-specific canonical 3D body motions.

Human-Object Interaction Detection Object

Beyond Graphs: Can Large Language Models Comprehend Hypergraphs?

1 code implementation14 Oct 2024 Yifan Feng, Chengwu Yang, Xingliang Hou, Shaoyi Du, Shihui Ying, Zongze Wu, Yue Gao

Existing benchmarks like NLGraph and GraphQA evaluate LLMs on graphs by focusing mainly on pairwise relationships, overlooking the high-order correlations found in real-world data.

Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation

1 code implementation9 Aug 2024 Yifan Feng, Jiangang Huang, Shaoyi Du, Shihui Ying, Jun-Hai Yong, Yipeng Li, Guiguang Ding, Rongrong Ji, Yue Gao

We introduce Hyper-YOLO, a new object detection method that integrates hypergraph computations to capture the complex high-order correlations among visual features.

object-detection Object Detection

PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration

2 code implementations14 Jul 2024 Runzhao Yao, Shaoyi Du, Wenting Cui, Canhui Tang, Chengwu Yang

To further improve the distinctiveness of descriptors, we propose a position-aware convolution, which can better learn spatial information of local structures.

Inductive Bias Point Cloud Registration +1

Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction

no code implementations29 May 2024 Xuehao Gao, Yang Yang, Yang Wu, Shaoyi Du, Guo-Jun Qi

Inferring 3D human motion is fundamental in many applications, including understanding human activity and analyzing one's intention.

Human motion prediction motion prediction +1

Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos

1 code implementation8 Apr 2024 Fengrui Tian, Yueqi Duan, Angtian Wang, Jianfei Guo, Shaoyi Du

As there is 2D-to-3D ambiguity problem in the viewing direction when extracting 3D flow features from 2D video frames, we consider the volume densities as opacity priors that describe the contributions of flow features to the semantics on the frames.

NeRF

Learning a Category-level Object Pose Estimator without Pose Annotations

no code implementations8 Apr 2024 Fengrui Tian, Yaoyao Liu, Adam Kortylewski, Yueqi Duan, Shaoyi Du, Alan Yuille, Angtian Wang

Instead of using manually annotated images, we leverage diffusion models (e. g., Zero-1-to-3) to generate a set of images under controlled pose differences and propose to learn our object pose estimator with those images.

Object Pose Estimation

Hypergraph-based Multi-View Action Recognition using Event Cameras

no code implementations28 Mar 2024 Yue Gao, Jiaxuan Lu, Siqi Li, Yipeng Li, Shaoyi Du

By treating segments as vertices and constructing hyperedges using rule-based and KNN-based strategies, a multi-view hypergraph neural network that captures relationships across viewpoint and temporal features is established.

Action Recognition

ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition

1 code implementation27 Mar 2024 Weidong Xie, Lun Luo, Nanfei Ye, Yi Ren, Shaoyi Du, Minhang Wang, Jintao Xu, Rui Ai, Weihao Gu, Xieyuanli Chen

Experimental results on the KITTI dataset show that our proposed methods achieve state-of-the-art performance while running in real time.

Cross-modal place recognition Depth Estimation

GUESS:GradUally Enriching SyntheSis for Text-Driven Human Motion Generation

1 code implementation4 Jan 2024 Xuehao Gao, Yang Yang, Zhenyu Xie, Shaoyi Du, Zhongqian Sun, Yang Wu

The whole text-driven human motion synthesis problem is then divided into multiple abstraction levels and solved with a multi-stage generation framework with a cascaded latent diffusion model: an initial generator first generates the coarsest human motion guess from a given text description; then, a series of successive generators gradually enrich the motion details based on the textual description and the previous synthesized results.

Motion Generation Motion Synthesis

ColorPCR: Color Point Cloud Registration with Multi-Stage Geometric-Color Fusion

no code implementations CVPR 2024 Juncheng Mu, Lin Bie, Shaoyi Du, Yue Gao

In this way both geometric and color data can be used thus lead to robust performance even under extremely challenging scenarios such as low overlap between two point clouds.

Point Cloud Registration

3D Feature Tracking via Event Camera

1 code implementation CVPR 2024 Siqi Li, Zhikuan Zhou, Zhou Xue, Yipeng Li, Shaoyi Du, Yue Gao

To achieve this our method leverages a joint framework to predict the 2D feature motion offsets and the 3D feature spatial position simultaneously.

Motion Compensation Patch Matching +1

Learning Discriminative Features for Crowd Counting

no code implementations8 Nov 2023 Yuehai Chen, Qingzhong Wang, Jing Yang, Badong Chen, Haoyi Xiong, Shaoyi Du

Crowd counting models in highly congested areas confront two main challenges: weak localization ability and difficulty in differentiating between foreground and background, leading to inaccurate estimations.

Contrastive Learning Crowd Counting +2

Tolerating Annotation Displacement in Dense Object Counting via Point Annotation Probability Map

no code implementations29 Jul 2023 Yuehai Chen, Jing Yang, Badong Chen, Hua Gang, Shaoyi Du

To improve the robustness to annotation displacement, we design an effective transport cost function based on GGD.

Object Counting regression

HybridPoint: Point Cloud Registration Based on Hybrid Point Sampling and Matching

1 code implementation29 Mar 2023 Yiheng Li, Canhui Tang, Runzhao Yao, Aixue Ye, Feng Wen, Shaoyi Du

Firstly, we propose to use salient points with prominent local features as nodes to increase patch repeatability, and introduce some uniformly distributed points to complete the point cloud, thus constituting hybrid points.

Patch Matching Point Cloud Registration

Decompose More and Aggregate Better: Two Closer Looks at Frequency Representation Learning for Human Motion Prediction

no code implementations CVPR 2023 Xuehao Gao, Shaoyi Du, Yang Wu, Yang Yang

Encouraged by the effectiveness of encoding temporal dynamics within the frequency domain, recent human motion prediction systems prefer to first convert the motion representation from the original pose space into the frequency space.

Human motion prediction motion prediction +2

MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos

1 code implementation ICCV 2023 Fengrui Tian, Shaoyi Du, Yueqi Duan

More specifically, we learn an implicit velocity field to estimate point trajectory from temporal features with Neural ODE, which is followed by a flow-based feature aggregation module to obtain spatial features along the point trajectory.

NeRF

Counting Varying Density Crowds Through Density Guided Adaptive Selection CNN and Transformer Estimation

no code implementations21 Jun 2022 Yuehai Chen, Jing Yang, Badong Chen, Shaoyi Du

Thus, CNN could locate and estimate crowds accurately in low-density regions, while it is hard to properly perceive the densities in high-density regions.

Crowd Counting

Region-Aware Network: Model Human's Top-Down Visual Perception Mechanism for Crowd Counting

no code implementations23 Jun 2021 Yuehai Chen, Jing Yang, Dong Zhang, Kun Zhang, Badong Chen, Shaoyi Du

More specifically, we scan the whole input images and its priority maps in the form of column vector to obtain a relevance matrix estimating their similarity.

Crowd Counting

Color Point Cloud Registration Based on Supervoxel Correspondence

no code implementations IEEE Access 2020 YANG YANG, WEILE CHEN, Muyi Wang, DEXING ZHONG, Shaoyi Du

Different from traditional feature-based methods, we design a hybrid feature representation with color moments of the point, which could be applied naturally for any color point cloud.

Point Cloud Registration

An Effective Approach for Point Clouds Registration Based on the Hard and Soft Assignments

no code implementations1 Jun 2017 Congcong Jin, Jihua Zhu, Yaochen Li, Shaoyi Du, Zhongyu Li, Huimin Lu

For the registration of partially overlapping point clouds, this paper proposes an effective approach based on both the hard and soft assignments.

Cannot find the paper you are looking for? You can Submit a new open access paper.