Search Results for author: Wenkai Zhang

Found 9 papers, 5 papers with code

Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos

no code implementations EMNLP 2020 Nayu Liu, Xian Sun, Hongfeng Yu, Wenkai Zhang, Guangluan Xu

Multimodal summarization for open-domain videos is an emerging task, aiming to generate a summary from multisource information (video, audio, transcript).

Learning to Evaluate Performance of Multi-modal Semantic Localization

1 code implementation 14 Sep 2022 Zhiqiang Yuan, Wenkai Zhang, Chongyang Li, Zhaoying Pan, Yongqiang Mao, Jialiang Chen, Shouke Li, Hongqi Wang, Xian Sun

Finally, we analyze the SeLo performance of RS cross-modal retrieval models in detail, explore the impact of different variables on this task, and provide a complete benchmark for the SeLo task.

Cross-Modal Retrieval Referring Expression +2

Open-domain Dialogue Generation Grounded with Dynamic Multi-form Knowledge Fusion

no code implementations 24 Apr 2022 Feifei Xu, Shanlin Zhou, Xinpeng Wang, Yunpu Ma, Wenkai Zhang, Zhisong Li

To merge these two forms of knowledge into the dialogue effectively, we design a dynamic virtual knowledge selector and a controller that help to enrich and expand knowledge space.

Dialogue Generation Informativeness +1

Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information

1 code implementation 21 Apr 2022 Zhiqiang Yuan, Wenkai Zhang, Changyuan Tian, Xuee Rong, Zhengyuan Zhang, Hongqi Wang, Kun Fu, Xian Sun

In this article, we first propose a novel RSCTIR framework based on global and local information (GaLR), and design a multi-level information dynamic fusion (MIDF) module to efficaciously integrate features of different levels.

Cross-Modal Retrieval Image Retrieval +1

Disentangling and Vectorization: A 3D Visual Perception Approach for Autonomous Driving Based on Surround-View Fisheye Cameras

no code implementations 19 Jul 2021 Zizhang Wu, Wenkai Zhang, Jizheng Wang, Man Wang, Yuanzhu Gan, Xinchao Gou, Muqing Fang, Jing Song

The 3D visual perception for vehicles with the surround-view fisheye camera system is a critical and challenging task for low-cost urban autonomous driving.

Autonomous Driving Descriptive +2

Denoising Distantly Supervised Named Entity Recognition via a Hypergeometric Probabilistic Model

1 code implementation 17 Jun 2021 Wenkai Zhang, Hongyu Lin, Xianpei Han, Le Sun, Huidan Liu, Zhicheng Wei, Nicholas Jing Yuan

Specifically, during neural network training, we naturally model the noise samples in each batch following a hypergeometric distribution parameterized by the noise-rate.

Denoising named-entity-recognition +2

DeepWORD: A GCN-based Approach for Owner-Member Relationship Detection in Autonomous Driving

no code implementations 30 Mar 2021 Zizhang Wu, Man Wang, Jason Wang, Wenkai Zhang, Muqing Fang, Tianhao Xu

It's worth noting that the owner-member relationship between wheels and vehicles contributes significantly to the 3D perception of vehicles, especially in the embedded environment.

Autonomous Driving Graph Attention +1
