Search Results for author: Weidong Chen

Found 23 papers, 15 papers with code

Dual-path Collaborative Generation Network for Emotional Video Captioning

1 code implementation6 Aug 2024 Cheng Ye, Weidong Chen, Jingyu Li, Lei Zhang, Zhendong Mao

Emotional Video Captioning is an emerging task that aims to describe factual content with the intrinsic emotions expressed in videos.

Caption Generation Video Captioning

SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds

1 code implementation16 Jul 2024 Yanbo Wang, Wentao Zhao, Chuan Cao, Tianchen Deng, Jingchuan Wang, Weidong Chen

Although LiDAR semantic segmentation advances rapidly, state-of-the-art methods often incorporate specifically designed inductive bias derived from benchmarks originating from mechanical spinning LiDAR.

LIDAR Semantic Segmentation Semantic Segmentation

Sentiment-oriented Transformer-based Variational Autoencoder Network for Live Video Commenting

1 code implementation19 Apr 2024 Fengyi Fu, Shancheng Fang, Weidong Chen, Zhendong Mao

Furthermore, a batch attention module is also proposed in this paper to alleviate the problem of missing sentimental samples, caused by the data imbalance, which is common in live videos as the popularity of videos varies.

Diversity

NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising

1 code implementation29 Mar 2024 Tianchen Deng, Yanbo Wang, Hongle Xie, Hesheng Wang, Jingchuan Wang, Danwei Wang, Weidong Chen

Second, the occupancy scene representation is replaced with Signed Distance Field (SDF) hierarchical scene representation for high-quality reconstruction and view synthesis.

3D Reconstruction Denoising +3

Compact 3D Gaussian Splatting For Dense Visual SLAM

1 code implementation17 Mar 2024 Tianchen Deng, Yaohui Chen, Leyan Zhang, Jianfei Yang, Shenghai Yuan, Jiuming Liu, Danwei Wang, Hesheng Wang, Weidong Chen

Recent work has shown that 3D Gaussian-based SLAM enables high-quality reconstruction, accurate pose estimation, and real-time rendering of scenes.

Pose Estimation

PLGSLAM: Progressive Neural Scene Represenation with Local to Global Bundle Adjustment

no code implementations CVPR 2024 Tianchen Deng, Guole Shen, Tong Qin, Jianyu Wang, Wentao Zhao, Jingchuan Wang, Danwei Wang, Weidong Chen

To this end, we introduce PLGSLAM, a neural visual SLAM system capable of high-fidelity surface reconstruction and robust camera tracking in real-time.

Surface Reconstruction

ProSGNeRF: Progressive Dynamic Neural Scene Graph with Frequency Modulated Auto-Encoder in Urban Scenes

no code implementations14 Dec 2023 Tianchen Deng, Siyang Liu, Xuan Wang, Yejia Liu, Danwei Wang, Weidong Chen

Implicit neural representation has demonstrated promising results in view synthesis for large and complex scenes.

Improving Image Captioning via Predicting Structured Concepts

no code implementations14 Nov 2023 Ting Wang, Weidong Chen, Yuanhe Tian, Yan Song, Zhendong Mao

Having the difficulty of solving the semantic gap between images and texts for the image captioning task, conventional studies in this area paid some attention to treating semantic concepts as a bridge between the two modalities and improved captioning performance accordingly.

Image Captioning

Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition

1 code implementation20 Jul 2023 Weidong Chen, Xiaofen Xing, Peihao Chen, Xiangmin Xu

Although PTMs shed new light on artificial general intelligence, they are constructed with general tasks in mind, and thus, their efficacy for specific tasks can be further improved.

Speech Emotion Recognition

Emergent Bio-Functional Similarities in a Cortical-Spike-Train-Decoding Spiking Neural Network Facilitate Predictions of Neural Computation

no code implementations14 Mar 2023 Tengjun Liu, Yansong Chua, Yiwei Zhang, Yuxiao Ning, Pengfu Liu, Guihua Wan, Zijun Wan, Shaomin Zhang, Weidong Chen

Despite its better bio-plausibility, goal-driven spiking neural network (SNN) has not achieved applicable performance for classifying biological spike trains, and showed little bio-functional similarities compared to traditional artificial neural networks.

DWFormer: Dynamic Window transFormer for Speech Emotion Recognition

1 code implementation3 Mar 2023 Shuaiqi Chen, Xiaofen Xing, Weibin Zhang, Weidong Chen, Xiangmin Xu

Self-attention mechanism is applied within windows for capturing temporal important information locally in a fine-grained way.

Speech Emotion Recognition

SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech Processing

1 code implementation27 Feb 2023 Weidong Chen, Xiaofen Xing, Xiangmin Xu, Jianxin Pang, Lan Du

Paralinguistic speech processing is important in addressing many issues, such as sentiment and neurocognitive disorder analyses.

Alzheimer's Disease Detection Speech Emotion Recognition

Multi-Attention Network for Compressed Video Referring Object Segmentation

1 code implementation26 Jul 2022 Weidong Chen, Dexiang Hong, Yuankai Qi, Zhenjun Han, Shuhui Wang, Laiyun Qing, Qingming Huang, Guorong Li

To address this problem, we propose a multi-attention network which consists of dual-path dual-attention module and a query-based cross-modal Transformer module.

Object Referring Expression Segmentation +4

Unsupervised Learning of Monocular Depth and Ego-Motion Using Multiple Masks

1 code implementation1 Apr 2021 Guangming Wang, Hesheng Wang, Yiling Liu, Weidong Chen

A new unsupervised learning method of depth and ego-motion using multiple masks from monocular video is proposed in this paper.

Depth Estimation Motion Estimation

Sex Differences in Severity and Mortality Among Patients With COVID-19: Evidence from Pooled Literature Analysis and Insights from Integrated Bioinformatic Analysis

no code implementations30 Mar 2020 Xiyi Wei, Yu-Tian Xiao, Jian Wang, Rui Chen, Wei zhang, Yue Yang, Daojun Lv, Chao Qin, Di Gu, Bo Zhang, Weidong Chen, Jianquan Hou, Ninghong Song, Guohua Zeng, Shancheng Ren

Objective: To conduct a meta-analysis of current studies that examined sex differences in severity and mortality in patients with COVID-19, and identify potential mechanisms underpinning these differences.

Retrieval-based Localization Based on Domain-invariant Feature Learning under Changing Environments

1 code implementation23 Sep 2019 Hanjiang Hu, Hesheng Wang, Zhe Liu, Chenguang Yang, Weidong Chen, Le Xie

To retrieve a target image from the database, the query image is first encoded using the encoder belonging to the query domain to obtain a domain-invariant feature vector.

Autonomous Driving Retrieval +2

Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning

1 code implementation7 Jan 2019 Baoyuan Wu, Weidong Chen, Yanbo Fan, Yong Zhang, Jinlong Hou, Jie Liu, Tong Zhang

In this work, we propose to train CNNs from images annotated with multiple tags, to enhance the quality of visual representation of the trained CNN model.

Image Classification object-detection +5

Tagging like Humans: Diverse and Distinct Image Annotation

no code implementations CVPR 2018 Baoyuan Wu, Weidong Chen, Peng Sun, Wei Liu, Bernard Ghanem, Siwei Lyu

In D2IA, we generate a relevant and distinct tag subset, in which the tags are relevant to the image contents and semantically distinct to each other, using sequential sampling from a determinantal point process (DPP) model.

Generative Adversarial Network TAG

Saliency Fusion in Eigenvector Space with Multi-Channel Pulse Coupled Neural Network

no code implementations1 Mar 2017 Nevrez Imamoglu, Zhixuan Wei, Huangjun Shi, Yuki Yoshida, Myagmarbayar Nergui, Jose Gonzalez, Dongyun Gu, Weidong Chen, Kenzo Nonami, Wenwei Yu

Saliency computation has become a popular research field for many applications due to the useful information provided by saliency maps.

Cannot find the paper you are looking for? You can Submit a new open access paper.