Search Results for author: Song Wang

Found 125 papers, 48 papers with code

OccFiner: Offboard Occupancy Refinement with Hybrid Propagation

no code implementations 13 Mar 2024 Hao Shi, Song Wang, Jiaming Zhang, Xiaoting Yin, Zhongdao Wang, Zhijian Zhao, Guangming Wang, Jianke Zhu, Kailun Yang, Kaiwei Wang

Vision-based occupancy prediction, also known as 3D Semantic Scene Completion (SSC), presents a significant challenge in computer vision.

3D Semantic Scene Completion

GraphRCG: Self-conditioned Graph Generation via Bootstrapped Representations

no code implementations 2 Mar 2024 Song Wang, Zhen Tan, Xinyu Zhao, Tianlong Chen, Huan Liu, Jundong Li

In contrast, in this work, we propose a novel self-conditioned graph generation framework designed to explicitly model graph distributions and employ these distributions to guide the generation process.

Graph Generation

Large Language Models for Data Annotation: A Survey

1 code implementation 21 Feb 2024 Zhen Tan, Alimohammad Beigi, Song Wang, Ruocheng Guo, Amrita Bhattacharjee, Bohan Jiang, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu

Furthermore, the paper includes an in-depth taxonomy of methodologies employing LLMs for data annotation, a comprehensive review of learning strategies for models incorporating LLM-generated annotations, and a detailed discussion on primary challenges and limitations associated with using LLMs for data annotation.

Bidirectional Autoregressive Diffusion Model for Dance Generation

no code implementations 6 Feb 2024 Canyu Zhang, YouBao Tang, Ning Zhang, Ruei-Sung Lin, Mei Han, Jing Xiao, Song Wang

To make the generated dance motion smoother, a local information decoder is built for local motion enhancement.

Unveiling the Power of Self-supervision for Multi-view Multi-human Association and Tracking

no code implementations 31 Jan 2024 Wei Feng, Feifan Wang, Ruize Han, Zekun Qian, Song Wang

Multi-view multi-human association and tracking (MvMHAT) is a new but important problem for multi-person scene video surveillance. It aims to track a group of people over time in each view and, at the same time, to identify the same person across different views, which differs from previous MOT and multi-camera MOT tasks that only consider over-time human tracking.

Self-Learning Self-Supervised Learning

Advancing EEG/MEG Source Imaging with Geometric-Informed Basis Functions

no code implementations 31 Jan 2024 Song Wang, Chen Wei, Kexin Lou, Dongfeng Gu, Quanying Liu

Here, we present a novel method which utilizes the Brain Geometric-informed Basis Functions (GBFs) as priors to enhance EEG/MEG source imaging.

EEG

I came, I saw, I certified: some perspectives on the safety assurance of cyber-physical systems

no code implementations 30 Jan 2024 Mithila Sivakumar, Alvine B. Belle, Kimya Khakzad Shahandashti, Oluwafemi Odu, Hadi Hemmati, Segla Kpodjedo, Song Wang, Opeyemi O. Adesina

In such contexts, detecting assurance deficits, relying on patterns to improve the structure of assurance cases, improving existing assurance case notations, and (semi-)automating the generation of assurance cases are key to developing compelling assurance cases and fostering consumer acceptance.

Autonomous Driving

A Survey on Query-based API Recommendation

no code implementations 17 Dec 2023 Moshi Wei, Nima Shiri Harzevili, Alvine Boaye Belle, Junjie Wang, Lin Shi, Jinqiu Yang, Song Wang, Ming Zhen, Jiang

We also investigate the typical data extraction procedures and collection approaches employed by the existing approaches.

Interpreting Pretrained Language Models via Concept Bottlenecks

1 code implementation 8 Nov 2023 Zhen Tan, Lu Cheng, Song Wang, Yuan Bo, Jundong Li, Huan Liu

Pretrained language models (PLMs) have made significant strides in various natural language processing tasks.

Noise-Robust Fine-Tuning of Pretrained Language Models via External Guidance

no code implementations 2 Nov 2023 Song Wang, Zhen Tan, Ruocheng Guo, Jundong Li

Adopting a two-stage paradigm of pretraining followed by fine-tuning, Pretrained Language Models (PLMs) have achieved substantial advancements in the field of natural language processing.

Knowledge Editing for Large Language Models: A Survey

no code implementations 24 Oct 2023 Song Wang, Yaochen Zhu, Haochen Liu, Zaiyi Zheng, Chen Chen, Jundong Li

Afterward, we provide an innovative taxonomy of KME techniques based on how the new knowledge is introduced into pre-trained LLMs, and investigate existing KME strategies while analyzing key insights, advantages, and limitations of methods from each category.

knowledge editing

Label-efficient Segmentation via Affinity Propagation

1 code implementation NeurIPS 2023 Wentong Li, Yuqian Yuan, Song Wang, Wenyu Liu, Dongqi Tang, Jian Liu, Jianke Zhu, Lei Zhang

In this work, we formulate the affinity modeling as an affinity propagation process, and propose local and global pairwise affinity terms to generate accurate soft pseudo labels.

Box-supervised Instance Segmentation Segmentation +2

SAIR: Learning Semantic-aware Implicit Representation

no code implementations 13 Oct 2023 Canyu Zhang, Xiaoguang Li, Qing Guo, Song Wang

To this end, we propose a framework with two modules: (1) building a semantic implicit representation (SIR) for a corrupted image in which large regions are missing.

Image Inpainting Image Reconstruction

Assessing Evaluation Metrics for Neural Test Oracle Generation

no code implementations 11 Oct 2023 Jiho Shin, Hadi Hemmati, Moshi Wei, Song Wang

We apply two different correlation analyses between these two different sets of metrics.

Toward Intelligent Emergency Control for Large-scale Power Systems: Convergence of Learning, Physics, Computing and Control

no code implementations 8 Oct 2023 Qiuhua Huang, Renke Huang, Tianzhixi Yin, Sohom Datta, Xueqing Sun, Jason Hou, Jie Tan, Wenhao Yu, YuAn Liu, Xinya Li, Bruce Palmer, Ang Li, Xinda Ke, Marianna Vaiman, Song Wang, Yousu Chen

Our developed methods and platform based on the convergence framework have been applied to a large (more than 3000 buses) Texas power system, and tested with 56000 scenarios.

SpikeMOT: Event-based Multi-Object Tracking with Sparse Motion Features

no code implementations 29 Sep 2023 Song Wang, Zhu Wang, Can Li, Xiaojuan Qi, Hayden Kwok-Hay So

In comparison to conventional RGB cameras, the superior temporal resolution of event cameras allows them to capture rich information between frames, making them prime candidates for object tracking.

Multi-Object Tracking Object

Fair Few-shot Learning with Auxiliary Sets

no code implementations 28 Aug 2023 Song Wang, Jing Ma, Lu Cheng, Jundong Li

These auxiliary sets contain several labeled training samples that can enhance the model performance regarding fairness in meta-test tasks, thereby allowing for the transfer of learned useful fairness-oriented knowledge to meta-test tasks.

Fairness Few-Shot Learning

Domain Adaptation for Deep Unit Test Case Generation

no code implementations 15 Aug 2023 Jiho Shin, Sepehr Hashtroudi, Hadi Hemmati, Song Wang

We compare our approach with (a) CodeT5 fine-tuned on the test generation without DA, (b) the A3Test tool, and (c) GPT-4, on 5 projects from the Defects4j dataset.

Domain Adaptation Language Modelling

Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport

1 code implementation ICCV 2023 Wentong Li, Yuqian Yuan, Song Wang, Jianke Zhu, Jianshu Li, Jian Liu, Lei Zhang

Weakly-supervised image segmentation has recently attracted increasing research attention, aiming to avoid expensive pixel-wise labeling.

Image Segmentation Panoptic Segmentation

CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification

no code implementations ICCV 2023 Rabab Abdelfattah, Qing Guo, Xiaoguang Li, XiaoFeng Wang, Song Wang

Using the aggregated similarity scores as the initial pseudo labels at the training stage, we propose an optimization framework to train the parameters of the classification network and refine pseudo labels for unobserved labels.

Classification Multi-Label Image Classification +2

SuperInpaint: Learning Detail-Enhanced Attentional Implicit Representation for Super-resolutional Image Inpainting

no code implementations 26 Jul 2023 Canyu Zhang, Qing Guo, Xiaoguang Li, Renjie Wan, Hongkai Yu, Ivor Tsang, Song Wang

Given the coordinates of a pixel we want to reconstruct, we first collect its neighboring pixels in the input image and extract their detail-enhanced semantic embeddings, unmask-attentional semantic embeddings, importance values, and spatial distances to the desired pixel.

Image Inpainting Image Reconstruction +2

Contrastive Meta-Learning for Few-shot Node Classification

1 code implementation 27 Jun 2023 Song Wang, Zhen Tan, Huan Liu, Jundong Li

First, we propose to enhance the intra-class generalizability by involving a contrastive two-step optimization in each episode to explicitly align node embeddings in the same classes.

Classification Graph Mining +2

An empirical study of using radiology reports and images to improve ICU mortality prediction

no code implementations 20 Jun 2023 Mingquan Lin, Song Wang, Ying Ding, Lihui Zhao, Fei Wang, Yifan Peng

Background: The predictive Intensive Care Unit (ICU) scoring system plays an important role in ICU management because it predicts important outcomes, especially mortality.

ICU Mortality Management +1

Federated Few-shot Learning

1 code implementation 17 Jun 2023 Song Wang, Xingbo Fu, Kaize Ding, Chen Chen, Huiyuan Chen, Jundong Li

In this way, the server can exploit the computational power of all clients and train the model on a larger set of data samples among all clients.

Federated Learning Few-Shot Learning

LMGQS: A Large-scale Dataset for Query-focused Summarization

no code implementations 22 May 2023 Ruochen Xu, Song Wang, Yang Liu, Shuohang Wang, Yichong Xu, Dan Iter, Chenguang Zhu, Michael Zeng

We hypothesize that there is a hidden query for each summary sentence in a generic summarization annotation, and we utilize a large-scale pretrained language model to recover it.

Language Modelling Query-focused Summarization +1

Learning Restoration is Not Enough: Transfering Identical Mapping for Single-Image Shadow Removal

no code implementations 18 May 2023 Xiaoguang Li, Qing Guo, Pingping Cai, Wei Feng, Ivor Tsang, Song Wang

State-of-the-art shadow removal methods train deep neural networks on collected shadow and shadow-free image pairs, and the networks are expected to complete two distinct tasks via shared weights, i.e., data restoration for shadow regions and identical mapping for non-shadow regions.

Image Shadow Removal Shadow Removal

Summarization with Precise Length Control

no code implementations 9 May 2023 Lesly Miculicich, Yujia Xie, Song Wang, Pengcheng He

Many applications of text generation such as summarization benefit from accurately controlling the text length.

Text Generation

LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation

1 code implementation CVPR 2023 Song Wang, Wentong Li, Wenyu Liu, Xiaolu Liu, Jianke Zhu

To mitigate the defects caused by lacking semantic cues in LiDAR data, we present an online Camera-to-LiDAR distillation scheme to facilitate the semantic learning from image to point cloud.

Autonomous Driving

Few-Shot 3D Point Cloud Semantic Segmentation via Stratified Class-Specific Attention Based Transformer Network

1 code implementation 28 Mar 2023 Canyu Zhang, Zhenyao Wu, Xinyi Wu, Ziyu Zhao, Song Wang

While a few-shot learning method was proposed recently to address these two problems, it suffers from high computational complexity caused by graph construction and inability to learn fine-grained relationships among points due to the use of pooling operations.

Few-shot 3D Point Cloud Semantic Segmentation Few-Shot Learning +4

Parametric Surface Constrained Upsampler Network for Point Cloud

1 code implementation 14 Mar 2023 Pingping Cai, Zhenyao Wu, Xinyi Wu, Song Wang

Designing a point cloud upsampler, which aims to generate a clean and dense point cloud given a sparse point representation, is a fundamental and challenging problem in computer vision.

Point Cloud Completion

Semi-supervised Large-scale Fiber Detection in Material Images with Synthetic Data

no code implementations 10 Feb 2023 Lan Fu, Zhiyuan Liu, Jinlong Li, Jeff Simmons, Hongkai Yu, Song Wang

Accurate detection of large-scale, elliptical-shape fibers, including their parameters of center, orientation and major/minor axes, on the 2D cross-sectioned image slices is very important for characterizing the underlying cylinder 3D structures in microscopic material images.

Domain Adaptation

Leveraging Inpainting for Single-Image Shadow Removal

1 code implementation ICCV 2023 Xiaoguang Li, Qing Guo, Rabab Abdelfattah, Di Lin, Wei Feng, Ivor Tsang, Song Wang

In this work, we find that pretraining shadow removal networks on the image inpainting dataset can reduce the shadow remnants significantly: a naive encoder-decoder network gets competitive restoration quality w.r.t.

Image Inpainting Image Shadow Removal +1

MS-DETR: Multispectral Pedestrian Detection Transformer with Loosely Coupled Fusion and Modality-Balanced Optimization

1 code implementation 1 Feb 2023 Yinghui Xing, Song Wang, Shizhou Zhang, Guoqiang Liang, Xiuwei Zhang, Yanning Zhang

Most of the available multispectral pedestrian detectors are based on non-end-to-end detectors, while in this paper, we propose MultiSpectral pedestrian DEtection TRansformer (MS-DETR), an end-to-end multispectral pedestrian detector, which extends DETR into the field of multi-modal detection.

Pedestrian Detection

Few-shot Node Classification with Extremely Weak Supervision

1 code implementation 6 Jan 2023 Song Wang, Yushun Dong, Kaize Ding, Chen Chen, Jundong Li

Recent few-shot node classification methods typically learn from classes with abundant labeled nodes (i.e., meta-training classes) and then generalize to classes with limited labeled nodes (i.e., meta-test classes).

Classification Meta-Learning +1

From a Bird's Eye View to See: Joint Camera and Subject Registration without the Camera Calibration

no code implementations 19 Dec 2022 Zekun Qian, Ruize Han, Wei Feng, Feifan Wang, Song Wang

We tackle a new problem of multi-view camera and subject registration in the bird's eye view (BEV) without pre-given camera calibration.

Camera Calibration Camera Localization

Transductive Linear Probing: A Novel Framework for Few-Shot Node Classification

1 code implementation 11 Dec 2022 Zhen Tan, Song Wang, Kaize Ding, Jundong Li, Huan Liu

More recently, inspired by the development of graph self-supervised learning, transferring pretrained node embeddings for few-shot node classification could be a promising alternative to meta-learning but remains unexplored.

Classification Contrastive Learning +4

Cross-domain Few-shot Segmentation with Transductive Fine-tuning

no code implementations 27 Nov 2022 Yuhang Lu, Xinyi Wu, Zhenyao Wu, Song Wang

Few-shot segmentation (FSS) expects models trained on base classes to work on novel classes with the help of a few support images.

Cross-Domain Few-Shot

Interpreting Unfairness in Graph Neural Networks via Training Node Attribution

1 code implementation 25 Nov 2022 Yushun Dong, Song Wang, Jing Ma, Ninghao Liu, Jundong Li

In this paper, we study a novel problem of interpreting GNN unfairness through attributing it to the influence of training nodes.

A Benchmark of Video-Based Clothes-Changing Person Re-Identification

1 code implementation 21 Nov 2022 Likai Wang, Xiangqun Zhang, Ruize Han, Jialin Yang, Xiaoyu Li, Wei Feng, Song Wang

In this paper, we focus on the relatively new yet practical problem of clothes-changing video-based person re-identification (CCVReID), which is less studied.

Clothes Changing Person Re-Identification Re-Ranking +1

From Indoor To Outdoor: Unsupervised Domain Adaptive Gait Recognition

no code implementations 21 Nov 2022 Likai Wang, Ruize Han, Wei Feng, Song Wang

In this paper, we study a new problem of unsupervised domain adaptive gait recognition (UDA-GR), which learns a gait identifier with supervised labels from indoor scenes (source domain) and applies it to outdoor wild scenes (target domain).

Gait Recognition

Style-Guided Shadow Removal

1 code implementation ECCV 2022 Jin Wan, Hui Yin, Zhenyao Wu, Xinyi Wu, Yanting Liu, Song Wang

To address this problem, we propose a style-guided shadow removal network (SG-ShadowNet) for better image-style consistency after shadow removal.

Image Restoration Shadow Removal

Graph Few-shot Learning with Task-specific Structures

1 code implementation 21 Oct 2022 Song Wang, Chen Chen, Jundong Li

Therefore, to adaptively learn node representations across meta-tasks, we propose a novel framework that learns a task-specific structure for each meta-task.

Classification Few-Shot Learning +2

G2NetPL: Generic Game-Theoretic Network for Partial-Label Image Classification

no code implementations 20 Oct 2022 Rabab Abdelfattah, Xin Zhang, Mostafa M. Fouda, XiaoFeng Wang, Song Wang

To effectively address partial-label classification, this paper proposes an end-to-end Generic Game-theoretic Network (G2NetPL) for partial-label learning, which can be applied to most partial-label settings, including a very challenging, but annotation-efficient case where only a subset of the training images are labeled, each with only one positive label, while the rest of the training images remain unlabeled.

Multi-Label Classification Multi-Label Image Classification +2

View-Invariant Skeleton-based Action Recognition via Global-Local Contrastive Learning

no code implementations 23 Sep 2022 Cunling Bian, Wei Feng, Fanbo Meng, Song Wang

Skeleton-based human action recognition has been drawing more interest recently due to its low sensitivity to appearance changes and the accessibility of more skeleton data.

Action Recognition Contrastive Learning +3

Automatic Comment Generation via Multi-Pass Deliberation

1 code implementation 14 Sep 2022 Fangwen Mu, Xiao Chen, Lin Shi, Song Wang, Qing Wang

Then, we treat the comment of the retrieved code as the initial draft and input it with the code and keywords into DECOM to start the iterative deliberation process.

Comment Generation

PC-GANs: Progressive Compensation Generative Adversarial Networks for Pan-sharpening

no code implementations 29 Jul 2022 Yinghui Xing, Shuyuan Yang, Song Wang, Yan Zhang, Yanning Zhang

Most of the available deep learning-based pan-sharpening methods sharpen the multispectral images through a one-step scheme, which strongly depends on the reconstruction ability of the network.

Generative Adversarial Network Pansharpening

CRFormer: A Cross-Region Transformer for Shadow Removal

no code implementations 4 Jul 2022 Jin Wan, Hui Yin, Zhenyao Wu, Xinyi Wu, Zhihao Liu, Song Wang

Aiming to restore the original intensity of shadow regions in an image and make them compatible with the remaining non-shadow regions without a trace, shadow removal is a very challenging problem that benefits many downstream image/video-related tasks.

Shadow Removal

Joint Generator-Ranker Learning for Natural Language Generation

2 code implementations 28 Jun 2022 Weizhou Shen, Yeyun Gong, Yelong Shen, Song Wang, Xiaojun Quan, Nan Duan, Weizhu Chen

Generate-then-rank is a widely used mechanism for text generation, where a generator produces multiple text candidates and a ranker chooses the best one among the text candidates.

Question Generation Question-Generation +2

On Structural Explanation of Bias in Graph Neural Networks

1 code implementation 24 Jun 2022 Yushun Dong, Song Wang, Yu Wang, Tyler Derr, Jundong Li

The low transparency on how the structure of the input network influences the bias in GNN outcome largely limits the safe adoption of GNNs in various decision-critical scenarios.

Decision Making Fairness

Task-Adaptive Few-shot Node Classification

1 code implementation 23 Jun 2022 Song Wang, Kaize Ding, Chuxu Zhang, Chen Chen, Jundong Li

Then we transfer such knowledge to the classes with limited labeled nodes via our proposed task-adaptive modules.

Classification Few-Shot Learning +2

FAITH: Few-Shot Graph Classification with Hierarchical Task Graphs

1 code implementation 5 May 2022 Song Wang, Yushun Dong, Xiao Huang, Chen Chen, Jundong Li

Specifically, these works propose to accumulate meta-knowledge across diverse meta-training tasks, and then generalize such meta-knowledge to the target task with a disjoint label set.

Few-Shot Learning Graph Classification

Fairness in Graph Mining: A Survey

2 code implementations 21 Apr 2022 Yushun Dong, Jing Ma, Song Wang, Chen Chen, Jundong Li

Recently, algorithmic fairness has been extensively studied in graph-based applications.

Fairness Graph Mining

Radiology Text Analysis System (RadText): Architecture and Evaluation

1 code implementation 19 Mar 2022 Song Wang, Mingquan Lin, Ying Ding, George Shih, Zhiyong Lu, Yifan Peng

Analyzing radiology reports is a time-consuming and error-prone task, which raises the need for an efficient automated radiology report analysis system to alleviate the workloads of radiologists and encourage precise diagnosis.

De-identification named-entity-recognition +5

Panoramic Human Activity Recognition

1 code implementation 8 Mar 2022 Ruize Han, Haomin Yan, Jiacheng Li, Songmiao Wang, Wei Feng, Song Wang

To obtain a more comprehensive activity understanding for a crowded scene, in this paper, we propose a new problem of panoramic human activity recognition (PAR), which aims to simultaneously achieve individual action, social group activity, and global activity recognition.

Human Activity Recognition

Self-supervised Social Relation Representation for Human Group Detection

1 code implementation 8 Mar 2022 Jiacheng Li, Ruize Han, Haomin Yan, Zekun Qian, Wei Feng, Song Wang

The core of human group detection is the human social relation representation and division. In this paper, we propose a new two-stage multi-head framework for human group detection.

Relation

Prior Knowledge Enhances Radiology Report Generation

no code implementations 11 Jan 2022 Song Wang, Liyan Tang, Mingquan Lin, George Shih, Ying Ding, Yifan Peng

In this work, we propose to mine and represent the associations among medical findings in an informative knowledge graph and incorporate this prior knowledge with radiology report generation to help improve the quality of generated reports.

Uncertainty-Aware Cascaded Dilation Filtering for High-Efficiency Deraining

1 code implementation 7 Jan 2022 Qing Guo, Jingyang Sun, Felix Juefei-Xu, Lei Ma, Di Lin, Wei Feng, Song Wang

First, we propose the uncertainty-aware cascaded predictive filtering (UC-PFilt) that can identify the difficulties of reconstructing clean pixels via predicted kernels and remove the residual rain traces effectively.

Data Augmentation Single Image Deraining +1

Connecting the Complementary-View Videos: Joint Camera Identification and Subject Association

1 code implementation CVPR 2022 Ruize Han, Yiyang Gan, Jiacheng Li, Feifan Wang, Wei Feng, Song Wang

In this paper, we develop a new approach that can simultaneously handle three tasks: i) localizing the side-view camera in the top view; ii) estimating the view direction of the side-view camera; iii) detecting and associating the same subjects on the ground across the complementary views.

Position

Style Mixing and Patchwise Prototypical Matching for One-Shot Unsupervised Domain Adaptive Semantic Segmentation

1 code implementation 9 Dec 2021 Xinyi Wu, Zhenyao Wu, Yuhang Lu, Lili Ju, Song Wang

In this paper, we tackle the problem of one-shot unsupervised domain adaptation (OSUDA) for semantic segmentation where the segmentors only see one unlabeled target image during training.

One-shot Unsupervised Domain Adaptation Semantic Segmentation +2

Benchmarking Shadow Removal for Facial Landmark Detection and Beyond

no code implementations 27 Nov 2021 Lan Fu, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Wei Feng, Yang Liu, Song Wang

The observation of this work motivates us to design a novel detection-aware shadow removal framework, which empowers shadow removal to achieve higher restoration quality and enhance the shadow robustness of deployed facial landmark detectors.

Benchmarking Blocking +2

ATLANTIS: A Benchmark for Semantic Segmentation of Waterbody Images

1 code implementation 22 Nov 2021 Seyed Mohammad Hassan Erfani, Zhenyao Wu, Xinyi Wu, Song Wang, Erfan Goharian

We claim that ATLANTIS is the largest waterbody image dataset for semantic segmentation providing a wide range of water and water-related classes and it will benefit researchers of both computer vision and water resources engineering.

Segmentation Semantic Segmentation

JPGNet: Joint Predictive Filtering and Generative Network for Image Inpainting

1 code implementation 9 Jul 2021 Qing Guo, Xiaoguang Li, Felix Juefei-Xu, Hongkai Yu, Yang Liu, Song Wang

In this paper, for the first time, we formulate image inpainting as a mix of two problems, predictive filtering and deep generation.

Image Inpainting

Long-Tailed Multi-Label Visual Recognition by Collaborative Training on Uniform and Re-Balanced Samplings

no code implementations CVPR 2021 Hao Guo, Song Wang

While re-balanced data sampling can improve the performance on tail classes, it may also hurt the performance on head classes in training due to label co-occurrence.

Long-tail Learning

Sparta: Spatially Attentive and Adversarially Robust Activation

no code implementations 18 May 2021 Qing Guo, Felix Juefei-Xu, Changqing Zhou, Wei Feng, Yang Liu, Song Wang

In both cases, Sparta leads to CNNs with higher robustness than the vanilla ReLU, verifying the flexibility and versatility of the proposed method.

Let There be Light: Improved Traffic Surveillance via Detail Preserving Night-to-Day Transfer

no code implementations 11 May 2021 Lan Fu, Hongkai Yu, Felix Juefei-Xu, Jinlong Li, Qing Guo, Song Wang

As one of the state-of-the-art perception approaches, detecting the objects of interest in each frame of video surveillance is widely desired by ITS.

Object object-detection +2

DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation

1 code implementation CVPR 2021 Xinyi Wu, Zhenyao Wu, Hao Guo, Lili Ju, Song Wang

We further design a re-weighting strategy to handle the inaccuracy caused by misalignment between day-night image pairs and wrong predictions of daytime images, as well as boost the prediction accuracy of small objects.

Autonomous Driving Domain Adaptation +2

From Shadow Generation to Shadow Removal

1 code implementation CVPR 2021 Zhihao Liu, Hui Yin, Xinyi Wu, Zhenyao Wu, Yang Mi, Song Wang

Shadow removal is a computer-vision task that aims to restore the image content in shadow regions.

Shadow Removal

Auto-Exposure Fusion for Single-Image Shadow Removal

2 code implementations CVPR 2021 Lan Fu, Changqing Zhou, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Wei Feng, Yang Liu, Song Wang

We conduct extensive experiments on the ISTD, ISTD+, and SRD datasets to validate our method's effectiveness and show better performance in shadow regions and comparable performance in non-shadow regions over the state-of-the-art methods.

Image Shadow Removal Shadow Removal

Multi-Domain Multi-Task Rehearsal for Lifelong Learning

no code implementations 14 Dec 2020 Fan Lyu, Shuai Wang, Wei Feng, Zihan Ye, Fuyuan Hu, Song Wang

Rehearsal, seeking to remind the model by storing old knowledge in lifelong learning, is one of the most effective ways to mitigate catastrophic forgetting, i.e., biased forgetting of previous knowledge when moving to new tasks.

Contour Transformer Network for One-shot Segmentation of Anatomical Structures

1 code implementation 2 Dec 2020 Yuhang Lu, Kang Zheng, Weijian Li, Yirui Wang, Adam P. Harrison, ChiHung Lin, Song Wang, Jing Xiao, Le Lu, Chang-Fu Kuo, Shun Miao

In this work, we present Contour Transformer Network (CTN), a one-shot anatomy segmentation method with a naturally built-in human-in-the-loop mechanism.

Anatomy One-Shot Learning +2

TTPLA: An Aerial-Image Dataset for Detection and Segmentation of Transmission Towers and Power Lines

1 code implementation 20 Oct 2020 Rabab Abdelfattah, XiaoFeng Wang, Song Wang

Accurate detection and segmentation of transmission towers (TTs) and power lines (PLs) from aerial images play a key role in protecting power-grid security and low-altitude UAV safety.

Instance Segmentation object-detection +3

The Statistical Characteristics of Power-Spectrum Subband Energy Ratios under Additive Gaussian White Noise

no code implementations 8 Jul 2020 Han Li, Yanzhu Hu, Song Wang, Zhen Meng

When Gaussian white noise was mixed with the known signal, the resulting PSER followed a doubly non-central beta distribution.

valid

Shadow Removal by a Lightness-Guided Network with Training on Unpaired Data

1 code implementation 28 Jun 2020 Zhihao Liu, Hui Yin, Yang Mi, Mengyang Pu, Song Wang

In this paper, we present a new Lightness-Guided Shadow Removal Network (LG-ShadowNet) for shadow removal by training on unpaired data.

Shadow Removal

Modeling Cross-view Interaction Consistency for Paired Egocentric Interaction Recognition

no code implementations 24 Mar 2020 Zhongguo Li, Fan Lyu, Wei Feng, Song Wang

Paired egocentric interaction recognition (PEIR) is the task of collaboratively recognizing the interactions between two persons using the videos from their corresponding views.

Action Recognition

MUTATT: Visual-Textual Mutual Guidance for Referring Expression Comprehension

no code implementations 18 Mar 2020 Shuai Wang, Fan Lyu, Wei Feng, Song Wang

In this paper, we argue that for REC the referring expression and the target region are semantically correlated, and that subject, location and relationship consistency exist between vision and language. On top of this, we propose a novel approach called MutAtt to construct mutual guidance between vision and language, which treats vision and language equally and thus yields compact information matching.

Referring Expression Referring Expression Comprehension

Label-guided Learning for Text Classification

no code implementations 25 Feb 2020 Xien Liu, Song Wang, Xiao Zhang, Xinxin You, Ji Wu, Dejing Dou

In this study, we propose a label-guided learning framework LguidedLearn for text representation and classification.

General Classification Representation Learning +2

Deep Poisoning: Towards Robust Image Data Sharing against Visual Disclosure

no code implementations 14 Dec 2019 Hao Guo, Brian Dolhansky, Eric Hsin, Phong Dinh, Cristian Canton Ferrer, Song Wang

Due to respectively limited training data, different entities addressing the same vision task based on certain sensitive images may not train a robust deep network.

Face Recognition Image Classification

Transductive Zero-Shot Hashing for Multilabel Image Retrieval

1 code implementation 17 Nov 2019 Qin Zou, Zheng Zhang, Ling Cao, Long Chen, Song Wang

Given semantic annotations such as class labels and pairwise similarities of the training data, hashing methods can learn and generate effective and compact binary codes.

Multi-Label Image Retrieval Quantization +1

An End-to-End Network for Co-Saliency Detection in One Single Image

no code implementations 25 Oct 2019 Yuanhao Yue, Qin Zou, Hongkai Yu, Qian Wang, Zhongyuan Wang, Song Wang

Co-saliency detection within a single image is a common vision problem that has received little attention and has not yet been well addressed.

Clustering Co-Salient Object Detection +1

Language-independent Cross-lingual Contextual Representations

no code implementations 25 Sep 2019 Xiao Zhang, Song Wang, Dejing Dou, Xien Liu, Thien Huu Nguyen, Ji Wu

Contextual representation models like BERT have achieved state-of-the-art performance on a diverse range of NLP tasks.

Transfer Learning Zero-Shot Cross-Lingual Transfer

Effects of Blur and Deblurring to Visual Object Tracking

no code implementations21 Aug 2019 Qing Guo, Wei Feng, Zhihao Chen, Ruijun Gao, Liang Wan, Song Wang

In this paper, we address these two problems by constructing a Blurred Video Tracking benchmark, which contains a variety of videos with different levels of motion blurs, as well as ground truth tracking results for evaluating trackers.

Deblurring Image Deblurring +1

Goal-Oriented End-to-End Conversational Models with Profile Features in a Real-World Setting

no code implementations NAACL 2019 Yichao Lu, Manisha Srivastava, Jared Kramer, Heba Elfardy, Andrea Kahn, Song Wang, Vikas Bhardwaj

To test our models, a customer service agent handles live contacts and at each turn we present the top four model responses and allow the agent to select (and optionally edit) one of the suggestions or to type their own.

Response Generation

Global and Local Sensitivity Guided Key Salient Object Re-augmentation for Video Saliency Detection

no code implementations19 Nov 2018 Ziqi Zhou, Zheng Wang, Huchuan Lu, Song Wang, Meijun Sun

In this paper, based on the fact that salient areas in videos are relatively small and concentrated, we propose a key salient object re-augmentation method (KSORA) using top-down semantic knowledge and bottom-up feature guidance to improve detection accuracy in video scenes.

Decision Making feature selection +2

Does Haze Removal Help CNN-based Image Classification?

no code implementations ECCV 2018 Yanting Pei, Yaping Huang, Qi Zou, Yuhang Lu, Song Wang

Typically, the goal of image dehazing is to produce clearer images from which human vision can better identify the object and structural details present in the images.

Classification General Classification +3

Effects of Image Degradations to CNN-based Image Classification

no code implementations12 Oct 2018 Yanting Pei, Yaping Huang, Qi Zou, Hao Zang, Xingyuan Zhang, Song Wang

In this paper, we empirically study this problem for four kinds of degraded images -- hazy images, underwater images, motion-blurred images and fish-eye images.

Classification General Classification +1

Interactive Binary Image Segmentation with Edge Preservation

no code implementations10 Sep 2018 Jianfeng Zhang, Liezhuo Zhang, Yuankai Teng, Xiao-Ping Zhang, Song Wang, Lili Ju

Binary image segmentation plays an important role in computer vision and has been widely used in many applications such as image and video editing, object extraction, and photo composition.

Image Segmentation Interactive Segmentation +4

Design Identification of Curve Patterns on Cultural Heritage Objects: Combining Template Matching and CNN-based Re-Ranking

no code implementations17 May 2018 Jun Zhou, Yuhang Lu, Kang Zheng, Karen Smith, Colin Wilder, Song Wang

The goal of this paper is to address the challenging problem of automatically identifying the underlying full design of curve patterns from such a sherd.

Re-Ranking Template Matching

Improved Deep Hashing with Soft Pairwise Similarity for Multi-label Image Retrieval

1 code implementation8 Mar 2018 Zheng Zhang, Qin Zou, Yuewei Lin, Long Chen, Song Wang

In this paper, a new deep hashing method is proposed for multi-label image retrieval by re-defining the pairwise similarity into an instance similarity, where the instance similarity is quantified into a percentage based on the normalized semantic labels.

Deep Hashing Multi-Label Image Retrieval

Object Detection with Mask-based Feature Encoding

no code implementations12 Feb 2018 Xiaochuan Fan, Hao Guo, Kang Zheng, Wei Feng, Song Wang

In particular, we design a Mask Weight Network (MWN) to learn a set of masks and then apply channel-wise masking operations to ROI feature map, followed by a global pooling and a cheap fully-connected layer.

Object object-detection +1

Curve-Structure Segmentation from Depth Maps: A CNN-based Approach and Its Application to Exploring Cultural Heritage Objects

no code implementations7 Nov 2017 Yuhang Lu, Jun Zhou, Jing Wang, Jun Chen, Karen Smith, Colin Wilder, Song Wang

Motivated by the important archaeological application of exploring cultural heritage objects, in this paper we study the challenging problem of automatically segmenting curve structures that are very weakly stamped or carved on an object surface in the form of a highly noisy depth map.

Image Segmentation Semantic Segmentation

Learning Dynamic Siamese Network for Visual Object Tracking

no code implementations ICCV 2017 Qing Guo, Wei Feng, Ce Zhou, Rui Huang, Liang Wan, Song Wang

How to effectively learn temporal variation of target appearance, to exclude the interference of cluttered background, while maintaining real-time response, is an essential problem of visual object tracking.

Object Visual Object Tracking

Deep Reinforcement Learning with Surrogate Agent-Environment Interface

no code implementations12 Sep 2017 Song Wang, Yu Jing

We introduce surrogate probability action and develop the probability surrogate action deterministic policy gradient (PSADPG) algorithm based on SAEI.

Continuous Control reinforcement-learning +1

On Study of the Reliable Fully Convolutional Networks with Tree Arranged Outputs (TAO-FCN) for Handwritten String Recognition

no code implementations10 Jul 2017 Song Wang, Jun Sun, Satoshi Naoi

Handwritten string recognition is still a challenging task, even though powerful deep learning tools have been introduced.

Robust Gait Recognition by Integrating Inertial and RGBD Sensors

no code implementations31 Oct 2016 Qin Zou, Lihao Ni, Qian Wang, Qingquan Li, Song Wang

We propose two new algorithms, namely EigenGait and TrajGait, to extract gait features from the inertial data and the RGBD (color and depth) data, respectively.

Gait Recognition Person Identification

Who Leads the Clothing Fashion: Style, Color, or Texture? A Computational Study

no code implementations26 Aug 2016 Qin Zou, Zheng Zhang, Qian Wang, Qingquan Li, Long Chen, Song Wang

Specifically, a classification-based model is proposed to quantify the influence of different visual stimuli, in which each visual stimulus's influence is quantified by its corresponding accuracy in fashion classification.

General Classification

Identifying Designs from Incomplete, Fragmented Cultural Heritage Objects by Curve-Pattern Matching

no code implementations5 Aug 2016 Jun Zhou, Haozhou Yu, Karen Smith, Colin Wilder, Hongkai Yu, Song Wang

The challenge to reconstruct and study complete designs is stymied because 1) most fragmentary cultural-heritage objects contain only a small portion of the underlying full design, 2) in the case of a stamping application, the same design may be applied multiple times with spatial overlap on one object, and 3) curve patterns detected on an object are usually incomplete and noisy.

Management Object

Groupwise Tracking of Crowded Similar-Appearance Targets From Low-Continuity Image Sequences

no code implementations CVPR 2016 Hongkai Yu, Youjie Zhou, Jeff Simmons, Craig P. Przybyla, Yuewei Lin, Xiaochuan Fan, Yang Mi, Song Wang

In particular, the within-group association is modeled by a nonrigid 2D Thin-Plate transform and a sequence of group shrinking, group growing and group merging operations are then developed to refine the composition of each group.

Cell Tracking

On Study of the Binarized Deep Neural Network for Image Classification

no code implementations24 Feb 2016 Song Wang, Dongchun Ren, Li Chen, Wei Fan, Jun Sun, Satoshi Naoi

Unlike those trials, in this paper, we focused on the basic propagation function of the artificial neural network and proposed the binarized deep neural network.

General Classification Image Classification

Unsupervised Cross-Domain Recognition by Identifying Compact Joint Subspaces

no code implementations5 Sep 2015 Yuewei Lin, Jing Chen, Yu Cao, Youjie Zhou, Lingfeng Zhang, Yuan Yan Tang, Song Wang

By adopting a natural and widely used assumption -- "the data samples from the same class should lie on a low-dimensional subspace, even if they come from different domains", the proposed method circumvents the limitation of the global domain shift, and solves cross-domain recognition by finding the compact joint subspaces of the source and target domains.

Domain Adaptation Object Recognition +2

Co-interest Person Detection from Multiple Wearable Camera Videos

no code implementations ICCV 2015 Yuewei Lin, Kareem Ezzeldeen, Youjie Zhou, Xiaochuan Fan, Hongkai Yu, Hui Qian, Song Wang

Wearable cameras, such as Google Glass and GoPro, enable video data collection over larger areas and from different views.

Human Detection

LooseCut: Interactive Image Segmentation with Loosely Bounded Boxes

no code implementations11 Jul 2015 Hongkai Yu, Youjie Zhou, Hui Qian, Min Xian, Yuewei Lin, Dazhou Guo, Kang Zheng, Kareem Abdelfatah, Song Wang

In this paper, we develop a new LooseCut algorithm that can handle cases where the input bounding box only loosely covers the foreground object.

Image Segmentation Object +5

Combining Local Appearance and Holistic View: Dual-Source Deep Neural Networks for Human Pose Estimation

1 code implementation CVPR 2015 Xiaochuan Fan, Kang Zheng, Yuewei Lin, Song Wang

We propose a new learning-based method for estimating 2D human pose from a single image, using Dual-Source Deep Convolutional Neural Networks (DS-CNN).

Pose Estimation

Feature Sampling Strategies for Action Recognition

no code implementations28 Jan 2015 Youjie Zhou, Hongkai Yu, Song Wang

Although dense local spatial-temporal features with bag-of-features representation achieve state-of-the-art performance for action recognition, the huge feature number and feature size prevent current methods from scaling up to real size problems.

Action Recognition Temporal Action Localization
