Search Results for author: Haiyang Wang

Found 22 papers, 15 papers with code

RGB-Event based Pedestrian Attribute Recognition: A Benchmark Dataset and An Asymmetric RWKV Fusion Framework

1 code implementation14 Apr 2025 Xiao Wang, Haiyang Wang, Shiao Wang, Qiang Chen, Jiandong Jin, Haoyu Song, Bo Jiang, Chenglong Li

In this paper, we revisit these issues and propose a novel multi-modal RGB-Event attribute recognition task by drawing inspiration from the advantages of event cameras in low-light, high-speed, and low-power consumption.

Attribute Pedestrian Attribute Recognition

DeFine: A Decomposed and Fine-Grained Annotated Dataset for Long-form Article Generation

no code implementations10 Mar 2025 Ming Wang, Fang Wang, Minghao Hu, Li He, Haiyang Wang, Jun Zhang, Tianwei Yan, Li Li, Zhunchen Luo, Wei Luo, Xiaoying Bai, Guotong Geng

Long-form article generation (LFAG) presents challenges such as maintaining logical consistency, comprehensive topic coverage, and narrative coherence across extended articles.

Form Retrieval +1

UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface

1 code implementation3 Mar 2025 Hao Tang, ChenWei Xie, Haiyang Wang, Xiaoyi Bao, Tingyu Weng, Pandeng Li, Yun Zheng, LiWei Wang

Generalist models have achieved remarkable success in both language and vision-language tasks, showcasing the potential of unified modeling.

Instance Segmentation Reasoning Segmentation +2

TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

2 code implementations30 Oct 2024 Haiyang Wang, Yue Fan, Muhammad Ferjad Naeem, Yongqin Xian, Jan Eric Lenssen, LiWei Wang, Federico Tombari, Bernt Schiele

By treating model parameters as tokens, we replace all the linear projections in Transformers with our token-parameter attention layer, where input tokens act as queries and model parameters as keys and values.

model

SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks

1 code implementation10 Oct 2024 Haiyang Wang, Qian Zhu, Mowen She, Yabo Li, Haoyu Song, Minghe Xu, Xiao Wang

To address this issue, in this paper, we propose a Spiking Neural Network (SNN) based framework for energy-efficient attribute recognition.

Attribute Knowledge Distillation +1

Pedestrian Attribute Recognition: A New Benchmark Dataset and A Large Language Model Augmented Framework

2 code implementations19 Aug 2024 Jiandong Jin, Xiao Wang, Qian Zhu, Haiyang Wang, Chenglong Li

To address this issue, this paper proposes a new large-scale, cross-domain pedestrian attribute recognition dataset to fill the data gap, termed MSP60K.

Attribute Ensemble Learning +4

GiT: Towards Generalist Vision Transformer through Universal Language Interface

1 code implementation14 Mar 2024 Haiyang Wang, Hao Tang, Li Jiang, Shaoshuai Shi, Muhammad Ferjad Naeem, Hongsheng Li, Bernt Schiele, LiWei Wang

Due to its simple design, this paradigm holds promise for narrowing the architectural gap between vision and language.

Ranked #2 on Video Captioning on MSVD-CTN (using extra training data)

Language Modeling Language Modelling +1

GSINA: Improving Subgraph Extraction for Graph Invariant Learning via Graph Sinkhorn Attention

1 code implementation11 Feb 2024 Fangyu Ding, Haiyang Wang, Zhixuan Chu, Tianming Li, Zhaoping Hu, Junchi Yan

Many recent endeavors of GIL focus on extracting the invariant subgraph from the input graph for prediction as a regularization strategy to improve the generalization performance of graph learning.

Graph Attention Graph Learning

RBGNet: Ray-based Grouping for 3D Object Detection

1 code implementation CVPR 2022 Haiyang Wang, Shaoshuai Shi, Ze Yang, Rongyao Fang, Qi Qian, Hongsheng Li, Bernt Schiele, LiWei Wang

In order to learn better representations of object shape to enhance cluster features for predicting 3D boxes, we propose a ray-based feature grouping module, which aggregates the point-wise features on object surfaces using a group of determined rays uniformly emitted from cluster centers.

3D Object Detection Object +1

Full-attention based Neural Architecture Search using Context Auto-regression

no code implementations13 Nov 2021 Yuan Zhou, Haiyang Wang, Shuwei Huo, Boyu Wang

Thus, it is appropriate to consider using NAS methods to discover a better self-attention architecture automatically.

Fine-Grained Image Recognition Image Classification +5

Non-convex Distributionally Robust Optimization: Non-asymptotic Analysis

no code implementations NeurIPS 2021 Jikai Jin, Bohang Zhang, Haiyang Wang, LiWei Wang

Distributionally robust optimization (DRO) is a widely-used approach to learn models that are robust against distribution shift.

Drug-Target Interaction Prediction with Graph Attention networks

1 code implementation10 Jul 2021 Haiyang Wang, Guangyu Zhou, SiQi Liu, Jyun-Yu Jiang, Wei Wang

For better learning and interpreting the DTI topological structure and the similarity, it is desirable to have methods specifically for predicting interactions from the graph structure.

Graph Attention Prediction

Collaborative Visual Navigation

1 code implementation2 Jul 2021 Haiyang Wang, Wenguan Wang, Xizhou Zhu, Jifeng Dai, LiWei Wang

As a fundamental problem for Artificial Intelligence, multi-agent system (MAS) is making rapid progress, mainly driven by multi-agent reinforcement learning (MARL) techniques.

Multi-agent Reinforcement Learning Navigate +1

Anomaly Detection of Time Series with Smoothness-Inducing Sequential Variational Auto-Encoder

no code implementations2 Feb 2021 Longyuan Li, Junchi Yan, Haiyang Wang, Yaohui Jin

Our model is based on Variational Auto-Encoder (VAE), and its backbone is fulfilled by a Recurrent Neural Network to capture latent temporal structures of time series for both generative model and inference model.

Anomaly Detection Density Estimation +2

Explicit Shape Encoding for Real-Time Instance Segmentation

1 code implementation ICCV 2019 Wenqiang Xu, Haiyang Wang, Fubo Qi, Cewu Lu

In this paper, we propose a novel top-down instance segmentation framework based on explicit shape encoding, named \textbf{ESE-Seg}.

Object object-detection +4

Visual Rhythm Prediction with Feature-Aligning Network

no code implementations29 Jan 2019 Yutong Xie, Haiyang Wang, Yan Hao, Zihao Xu

In this paper, we propose a data-driven visual rhythm prediction method, which overcomes the previous works' deficiency that predictions are made primarily by human-crafted hard rules.

Optical Flow Estimation Prediction +1

Dense Adaptive Cascade Forest: A Self Adaptive Deep Ensemble for Classification Problems

no code implementations29 Apr 2018 Haiyang Wang, Yong Tang, Ziyang Jia, Fei Ye

Second, our model connects each layer to the subsequent ones in a feed-forward fashion, which enhances the capability of the model to resist performance degeneration.

Ensemble Learning General Classification

OMNIRank: Risk Quantification for P2P Platforms with Deep Learning

no code implementations27 Apr 2017 Honglun Zhang, Haiyang Wang, Xiaming Chen, Yongkun Wang, Yaohui Jin

P2P lending presents as an innovative and flexible alternative for conventional lending institutions like banks, where lenders and borrowers directly make transactions and benefit each other without complicated verifications.

Deep Learning Reading Comprehension +1

Cannot find the paper you are looking for? You can Submit a new open access paper.