Search Results for author: Guang Chen

Found 36 papers, 17 papers with code

Text with Knowledge Graph Augmented Transformer for Video Captioning

no code implementations22 Mar 2023 Xin Gu, Guang Chen, YuFei Wang, Libo Zhang, Tiejian Luo, Longyin Wen

Meanwhile, the internal stream is designed to exploit the multi-modality information in videos (e. g., the appearance of video frames, speech transcripts, and video captions) to ensure the quality of caption results.

Video Captioning

TMA: Temporal Motion Aggregation for Event-based Optical Flow

no code implementations21 Mar 2023 Haotian Liu, Guang Chen, Sanqing Qu, Yanping Zhang, Zhijun Li, Alois Knoll, Changjun Jiang

Event cameras have the ability to record continuous and detailed trajectories of objects with high temporal resolution, thereby providing intuitive motion cues for optical flow estimation.

Event-based Optical Flow Optical Flow Estimation

Upcycling Models under Domain and Category Shift

1 code implementation13 Mar 2023 Sanqing Qu, Tianpei Zou, Florian Roehrbein, Cewu Lu, Guang Chen, DaCheng Tao, Changjun Jiang

We examine the superiority of our GLC on multiple benchmarks with different category shift scenarios, including partial-set, open-set, and open-partial-set DA.

Source-Free Domain Adaptation Universal Domain Adaptation +1

Modality-Agnostic Debiasing for Single Domain Generalization

no code implementations13 Mar 2023 Sanqing Qu, Yingwei Pan, Guang Chen, Ting Yao, Changjun Jiang, Tao Mei

We validate the superiority of our MAD in a variety of single-DG scenarios with different modalities, including recognition on 1D texts, 2D images, 3D point clouds, and semantic segmentation on 2D images.

Data Augmentation Domain Generalization +1

SUPS: A Simulated Underground Parking Scenario Dataset for Autonomous Driving

1 code implementation25 Feb 2023 Jiawei Hou, Qi Chen, Yurong Cheng, Guang Chen, xiangyang xue, Taiping Zeng, Jian Pu

However, there is a lack of underground parking scenario datasets with multiple sensors and well-labeled images that support both SLAM tasks and perception tasks, such as semantic segmentation and parking slot detection.

3D Reconstruction Autonomous Driving +4

Dual-Stream Transformer for Generic Event Boundary Captioning

1 code implementation7 Jul 2022 Xin Gu, Hanhua Ye, Guang Chen, YuFei Wang, Libo Zhang, Longyin Wen

This paper describes our champion solution for the CVPR2022 Generic Event Boundary Captioning (GEBC) competition.

Boundary Captioning

A Review of Safe Reinforcement Learning: Methods, Theory and Applications

1 code implementation20 May 2022 Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang, Alois Knoll

To establish a good foundation for future research in this thread, in this paper, we provide a review for safe RL from the perspectives of methods, theory and applications.

Autonomous Driving Decision Making +3

BMD: A General Class-balanced Multicentric Dynamic Prototype Strategy for Source-free Domain Adaptation

1 code implementation6 Apr 2022 Sanqing Qu, Guang Chen, Jing Zhang, Zhijun Li, wei he, DaCheng Tao

Source-free Domain Adaptation (SFDA) aims to adapt a pre-trained source model to the unlabeled target domain without accessing the well-labeled source data, which is a much more practical setting due to the data privacy, security, and transmission issues.

Pseudo Label Source-Free Domain Adaptation

Unsupervised Domain Adaptation for Nighttime Aerial Tracking

2 code implementations CVPR 2022 Junjie Ye, Changhong Fu, Guangze Zheng, Danda Pani Paudel, Guang Chen

Previous advances in object tracking mostly reported on favorable illumination circumstances while neglecting performance at nighttime, which significantly impeded the development of related aerial robot applications.

Object Discovery Object Tracking +1

HRegNet: A Hierarchical Network for Large-scale Outdoor LiDAR Point Cloud Registration

1 code implementation ICCV 2021 Fan Lu, Guang Chen, Yinlong Liu, Lijun Zhang, Sanqing Qu, Shu Liu, Rongqi Gu

Extensive experiments are conducted on two large-scale outdoor LiDAR point cloud datasets to demonstrate the high accuracy and efficiency of the proposed HRegNet.

Point Cloud Registration

DMInet: An Accurate and Highly Flexible Deep Learning Framework for Drug Membrane Interaction with Membrane Selectivity

1 code implementation27 May 2021 Guang Chen

Inheriting from coarse-grained Martini representation of organic molecules and combined with deep learning, DMInet has the potential for more accelerated high throughput screening in drug discovery across a much larger chemical space than that can be explored by physics-based simulations alone.

Drug Discovery

Data Augmentation for Object Detection via Differentiable Neural Rendering

1 code implementation4 Mar 2021 Guanghan Ning, Guang Chen, Chaowei Tan, Si Luo, Liefeng Bo, Heng Huang

We propose a new offline data augmentation method for object detection, which semantically interpolates the training data with novel views.

Data Augmentation Neural Rendering +3

NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting

1 code implementation10 Feb 2021 Kai Chen, Guang Chen, Dan Xu, Lijun Zhang, Yuyao Huang, Alois Knoll

Although Transformer has made breakthrough success in widespread domains especially in Natural Language Processing (NLP), applying it to time series forecasting is still a great challenge.

Time Series Forecasting

Lightweight Convolutional Neural Network with Gaussian-based Grasping Representation for Robotic Grasping Detection

no code implementations25 Jan 2021 Hu Cao, Guang Chen, Zhijun Li, Jianjie Lin, Alois Knoll

Extensive experiments on two public grasping datasets, Cornell and Jacquard demonstrate the state-of-the-art performance of our method in balancing accuracy and inference speed.

object-detection Robotic Grasping

PointINet: Point Cloud Frame Interpolation Network

1 code implementation18 Dec 2020 Fan Lu, Guang Chen, Sanqing Qu, Zhijun Li, Yinlong Liu, Alois Knoll

Generally, the frame rates of mechanical LiDAR sensors are 10 to 20 Hz, which is much lower than other commonly used sensors like cameras.

MoNet: Motion-based Point Cloud Prediction Network

no code implementations21 Nov 2020 Fan Lu, Guang Chen, Yinlong Liu, Zhijun Li, Sanqing Qu, Tianpei Zou

3D point clouds accurately model 3D information of surrounding environment and are crucial for intelligent vehicles to perceive the scene.

Autonomous Driving

LAP-Net: Adaptive Features Sampling via Learning Action Progression for Online Action Detection

no code implementations16 Nov 2020 Sanqing Qu, Guang Chen, Dan Xu, Jinhu Dong, Fan Lu, Alois Knoll

At each time step, this sampling strategy first estimates current action progression and then decide what temporal ranges should be used to aggregate the optimal supplementary features.

Online Action Detection

RSKDD-Net: Random Sample-based Keypoint Detector and Descriptor

1 code implementation NeurIPS 2020 Fan Lu, Guang Chen, Yinlong Liu, Zhongnan Qu, Alois Knoll

To tackle the information loss of random sampling, we exploit a novel random dilation cluster strategy to enlarge the receptive field of each sampled point and an attention mechanism to aggregate the positions and features of neighbor points.

Point Cloud Registration Saliency Prediction

Efficient Pig Counting in Crowds with Keypoints Tracking and Spatial-aware Temporal Response Filtering

no code implementations27 May 2020 Guang Chen, Shiwen Shen, Longyin Wen, Si Luo, Liefeng Bo

Existing methods only focused on pig counting using single image, and its accuracy is challenged by several factors, including pig movements, occlusion and overlapping.

Edge-computing

Gen-LaneNet: A Generalized and Scalable Approach for 3D Lane Detection

1 code implementation ECCV 2020 Yuliang Guo, Guang Chen, Peitao Zhao, Weide Zhang, Jinghao Miao, Jingao Wang, Tae Eun Choe

The method, inspired by the latest state-of-the-art 3D-LaneNet, is a unified framework solving image encoding, spatial transform of features and 3D lane prediction in a single network.

3D Lane Detection Image Segmentation +1

Indirect and Direct Training of Spiking Neural Networks for End-to-End Control of a Lane-Keeping Vehicle

no code implementations10 Mar 2020 Zhenshan Bing, Claus Meschede, Guang Chen, Alois Knoll, Kai Huang

Building spiking neural networks (SNNs) based on biological synaptic plasticities holds a promising potential for accomplishing fast and energy-efficient computing, which is beneficial to mobile robotic applications.

Q-Learning

OVC-Net: Object-Oriented Video Captioning with Temporal Graph and Detail Enhancement

no code implementations8 Mar 2020 Fangyi Zhu, Jenq-Neng Hwang, Zhanyu Ma, Guang Chen, Jun Guo

Thereafter, we construct a new dataset, providing consistent object-sentence pairs, to facilitate effective cross-modal learning.

Video Captioning

Globally optimal vertical direction estimation in Atlanta World

1 code implementation29 Apr 2019 Yinlong Liu, Alois Knoll, Guang Chen

Accordingly, we propose a vertical direction estimation method by considering the relationship between the vertical frame and horizontal frames.

A Novel Method for the Absolute Pose Problem with Pairwise Constraints

no code implementations25 Mar 2019 Yinlong Liu, Xuechen Li, Manning Wang, Guang Chen, Zhijian Song, Alois Knoll

In this paper, we consider pairwise constraints and propose a globally optimal algorithm for solving the absolute pose estimation problem.

Pose Estimation Translation

Salience Biased Loss for Object Detection in Aerial Images

no code implementations18 Oct 2018 Peng Sun, Guang Chen, Guerdan Luke, Yi Shang

Experimental results show our proposed loss function with the RetinaNet architecture outperformed other state-of-art object detection models by at least 4. 31 mAP, and RetinaNet by 2. 26 mAP with the same inference speed of RetinaNet.

object-detection Object Detection In Aerial Images

Deep Anticipation: Light Weight Intelligent Mobile Sensing in IoT by Recurrent Architecture

no code implementations6 Dec 2017 Guang Chen, Shu Liu, Kejia Ren, Zhongnan Qu, Changhong Fu, Gereon Hinz, Alois Knoll

However, the mobile sensing perception brings new challenges for how to efficiently analyze and intelligently interpret the deluge of IoT data in mission- critical services.

Hierarchical Latent Semantic Mapping for Automated Topic Generation

no code implementations11 Nov 2015 Guorui Zhou, Guang Chen

Inspired by these algorithms, in this paper, we propose a novel method named Hierarchical Latent Semantic Mapping (HLSM), which automatically generates topics from corpus.

Association Community Detection

Large-Scale Visual Font Recognition

no code implementations CVPR 2014 Guang Chen, Jianchao Yang, Hailin Jin, Jonathan Brandt, Eli Shechtman, Aseem Agarwala, Tony X. Han

This paper addresses the large-scale visual font recognition (VFR) problem, which aims at automatic identification of the typeface, weight, and slope of the text in an image or photo without any knowledge of content.

Font Recognition Image Categorization +1

Detection Evolution with Multi-order Contextual Co-occurrence

no code implementations CVPR 2013 Guang Chen, Yuanyuan Ding, Jing Xiao, Tony X. Han

The so-called (1 st -order) context feature is computed as a set of randomized binary comparisons on the response map of the baseline object detector.

object-detection Object Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.