Search Results for author: Wei Song

Found 38 papers, 9 papers with code

Aligning Knowledge Graph with Visual Perception for Object-goal Navigation

1 code implementation29 Feb 2024 Nuo Xu, Wen Wang, Rong Yang, Mengjie Qin, Zheyuan Lin, Wei Song, Chunlong Zhang, Jason Gu, Chao Li

Object-goal navigation is a challenging task that requires guiding an agent to specific objects based on first-person visual observations.

Object

Target Recognition Algorithm for Monitoring Images in Electric Power Construction Process

no code implementations9 Feb 2024 Hao Song, Wei Lin, Wei Song, Man Wang

To enhance precision and comprehensiveness in identifying targets in electric power construction monitoring video, a novel target recognition algorithm utilizing infrared imaging is explored.

Transmission Line Detection Based on Improved Hough Transform

no code implementations5 Feb 2024 Wei Song, Pei Li, Man Wang

To address the challenges of low detection accuracy and high false positive rates of transmission lines in UAV (Unmanned Aerial Vehicle) images, we explore the linear features and spatial distribution.

Line Detection

M2ConceptBase: A Fine-grained Aligned Multi-modal Conceptual Knowledge Base

no code implementations16 Dec 2023 Zhiwei Zha, Jiaan Wang, Zhixu Li, Xiangru Zhu, Wei Song, Yanghua Xiao

To collect concept-image and concept-description alignments, we propose a context-aware multi-modal symbol grounding approach that considers context information in existing large-scale image-text pairs with respect to each concept.

Language Modelling Large Language Model +1

Intelligent Anomaly Detection for Lane Rendering Using Transformer with Self-Supervised Pre-Training and Customized Fine-Tuning

no code implementations7 Dec 2023 Yongqi Dong, Xingmin Lu, Ruohan Li, Wei Song, Bart van Arem, Haneen Farah

In conclusion, the proposed pipeline, with its incorporation of self-supervised pre-training using MiM and other advanced deep learning techniques, emerges as a robust solution for enhancing the accuracy and efficiency of lane rendering image anomaly detection in digital navigation systems.

Anomaly Detection

Retrieve-Rewrite-Answer: A KG-to-Text Enhanced LLMs Framework for Knowledge Graph Question Answering

1 code implementation20 Sep 2023 Yike Wu, Nan Hu, Sheng Bi, Guilin Qi, Jie Ren, Anhuan Xie, Wei Song

To this end, we propose an answer-sensitive KG-to-Text approach that can transform KG knowledge into well-textualized statements most informative for KGQA.

Graph Question Answering Language Modelling +2

Fine-grained Text and Image Guided Point Cloud Completion with CLIP Model

no code implementations17 Aug 2023 Wei Song, Jun Zhou, Mingjie Wang, Hongchen Tan, Nannan Li, Xiuping Liu

In this work, we propose a novel multimodal fusion network for point cloud completion, which can simultaneously fuse visual and textual information to predict the semantic and geometric characteristics of incomplete shapes effectively.

Language Modelling Point Cloud Completion

Multivariate Time Series characterization and forecasting of VoIP traffic in real mobile networks

no code implementations13 Jul 2023 Mario Di Mauro, Giovanni Galatro, Fabio Postiglione, Wei Song, Antonio Liotta

Predicting the behavior of real-time traffic (e. g., VoIP) in mobility scenarios could help the operators to better plan their network infrastructures and to optimize the allocation of resources.

Time Series Time Series Analysis

Robust Calibrate Proxy Loss for Deep Metric Learning

no code implementations6 Apr 2023 Xinyue Li, Jian Wang, Wei Song, Yanling Du, Zhixiang Liu

The mainstream researche in deep metric learning can be divided into two genres: proxy-based and pair-based methods.

Metric Learning Retrieval

Fast Contextual Scene Graph Generation With Unbiased Context Augmentation

no code implementations CVPR 2023 Tianlei Jin, Fangtai Guo, Qiwei Meng, Shiqiang Zhu, Xiangming Xi, Wen Wang, Zonghao Mu, Wei Song

Therefore, at the context level, we can produce diverse context descriptions by using a context augmentation method based on the original dataset.

Graph Generation Scene Graph Generation

End-To-End Audiovisual Feature Fusion for Active Speaker Detection

no code implementations27 Jul 2022 Fiseha B. Tesema, Zheyuan Lin, Shiqiang Zhu, Wei Song, Jason Gu, Hong Wu

After fusion, one BiGRU layer is attached to model the joint temporal dynamics.

TGRMPT: A Head-Shoulder Aided Multi-Person Tracker and a New Large-Scale Dataset for Tour-Guide Robot

1 code implementation8 Jul 2022 Wen Wang, Shunda Hu, Shiqiang Zhu, Wei Song, Zheyuan Lin, Tianlei Jin, Zonghao Mu, Yuanhai Zhou

A service robot serving safely and politely needs to track the surrounding people robustly, especially for Tour-Guide Robot (TGR).

Multi-Object Tracking

Verb Metaphor Detection via Contextual Relation Learning

no code implementations ACL 2021 Wei Song, Shuhui Zhou, Ruiji Fu, Ting Liu, Lizhen Liu

Correct natural language understanding requires computers to distinguish the literal and metaphorical senses of a word.

Natural Language Understanding Relation +1

Gated Transformer Networks for Multivariate Time Series Classification

2 code implementations26 Mar 2021 Minghao Liu, Shengqi Ren, Siyuan Ma, Jiahui Jiao, Yizhou Chen, Zhiguang Wang, Wei Song

In this work, we explored a simple extension of the current Transformer Networks with gating, named Gated Transformer Networks (GTN) for the multivariate time series classification problem.

Classification General Classification +3

Gravitational perturbations from NHEK to Kerr

no code implementations16 Feb 2021 Alejandra Castro, Victor Godet, Joan Simón, Wei Song, Boyang Yu

Our aim is to characterise those perturbations that are responsible for the deviations away from extremality, and to contrast them with the linearized perturbations treated in the Newman-Penrose formalism.

High Energy Physics - Theory General Relativity and Quantum Cosmology

Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis

no code implementations6 Nov 2020 Guanghui Xu, Wei Song, Zhengchen Zhang, Chao Zhang, Xiaodong He, BoWen Zhou

Despite prosody is related to the linguistic information up to the discourse structure, most text-to-speech (TTS) systems only take into account that within each sentence, which makes it challenging when converting a paragraph of texts into natural and expressive speech.

Sentence Sentence Embeddings +1

MAB-Malware: A Reinforcement Learning Framework for Attacking Static Malware Classifiers

3 code implementations6 Mar 2020 Wei Song, Xuezixiang Li, Sadia Afroz, Deepali Garg, Dmitry Kuznetsov, Heng Yin

However, it is well-known that machine learning models are vulnerable to adversarial examples (AEs).

Cryptography and Security

Neural Simile Recognition with Cyclic Multitask Learning and Local Attention

1 code implementation19 Dec 2019 Jiali Zeng, Linfeng Song, Jinsong Su, Jun Xie, Wei Song, Jiebo Luo

Simile recognition is to detect simile sentences and to extract simile components, i. e., tenors and vehicles.

Sentence Sentence Classification

An Experimental-based Review of Image Enhancement and Image Restoration Methods for Underwater Imaging

1 code implementation7 Jul 2019 Yan Wang, Wei Song, Giancarlo Fortino, Lizhe Qi, Wenqiang Zhang, Antonio Liotta

Underwater images play a key role in ocean exploration, but often suffer from severe quality degradation due to light absorption and scattering in water medium.

Image Enhancement Image Restoration

Building a mixed-lingual neural TTS system with only monolingual data

no code implementations12 Apr 2019 Liumeng Xue, Wei Song, Guanghui Xu, Lei Xie, Zhizheng Wu

When deploying a Chinese neural text-to-speech (TTS) synthesis system, one of the challenges is to synthesize Chinese utterances with English phrases or words embedded.

Exploiting Syntactic Structures for Humor Recognition

no code implementations COLING 2018 Lizhen Liu, Donghai Zhang, Wei Song

Humor recognition is an interesting and challenging task in natural language processing.

Discourse Mode Identification in Essays

no code implementations ACL 2017 Wei Song, Dong Wang, Ruiji Fu, Lizhen Liu, Ting Liu, Guoping Hu

Evaluation results show that discourse modes can be identified automatically with an average F1-score of 0. 7.

Anecdote Recognition and Recommendation

no code implementations COLING 2016 Wei Song, Ruiji Fu, Lizhen Liu, Hanshi Wang, Ting Liu

More importantly, we uncover the anecdote implication, which reveals the meaning and topic of an anecdote.

Empirical Studies on Symbolic Aggregation Approximation Under Statistical Perspectives for Knowledge Discovery in Time Series

no code implementations8 Jun 2015 Wei Song, Zhiguang Wang, Yangdong Ye, Ming Fan

Our work provides an analytical framework with several statistical tools to analyze, evaluate and further improve the symbolic dynamics for knowledge discovery in time series.

Time Series Time Series Analysis

Cannot find the paper you are looking for? You can Submit a new open access paper.