Search Results for author: Zheng Wang

Found 221 papers, 91 papers with code

NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval

1 code implementation13 Mar 2025 Zengrong Lin, Zheng Wang, Tianwen Qian, Pan Mu, Sixian Chan, Cong Bai

Cross-modal retrieval aims to bridge the semantic gap between different modalities, such as visual and textual data, enabling accurate retrieval across them.

Cross-Modal Retrieval Retrieval

Ensemble Learning for Large Language Models in Text and Code Generation: A Survey

no code implementations13 Mar 2025 Mari Ashiga, Wei Jie, Fan Wu, Vardan Voskanyan, Fateme Dinmohammadi, Paul Brookes, Jingzhi Gong, Zheng Wang

This information offers valuable insights for selecting models for various real-world tasks involving text and code generation, and potentially applying methods to multimodal LLMs.

SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements

no code implementations10 Mar 2025 Haiyang Xie, Xi Shen, Shihua Huang, Zheng Wang

Most visual models are designed for sRGB images, yet RAW data offers significant advantages for object detection by preserving sensor information before ISP processing.

Object object-detection +1

A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery

no code implementations6 Mar 2025 Yiheng Zhu, Mingyang Li, Junlong Liu, Kun fu, JianSheng Wu, Qiuyi Li, Mingze Yin, Jieping Ye, Jian Wu, Zheng Wang

To fill this gap, we propose a general-purpose foundation model named BIT (an abbreviation for Biomolecular Interaction Transformer), which is capable of encoding a range of biochemical entities, including small molecules, proteins, and protein-ligand complexes, as well as various data formats, encompassing both 2D and 3D structures.

Denoising Drug Discovery +2

Can Large Language Models Unveil the Mysteries? An Exploration of Their Ability to Unlock Information in Complex Scenarios

no code implementations27 Feb 2025 Chao Wang, Luning Zhang, Zheng Wang, Yang Zhou

Notably, our approach improves the performance of models on combinatorial reasoning, with a 22. 17% boost on CVQA and 9. 40% on CPVQA over the SOTA closed-source model, demonstrating its effectiveness in enhancing combinatorial reasoning with multiple perceptual inputs in complex scenarios.

Data Integration Question Answering +1

Less is More: Improving LLM Alignment via Preference Data Selection

no code implementations20 Feb 2025 Xun Deng, Han Zhong, Rui Ai, Fuli Feng, Zheng Wang, Xiangnan He

Direct Preference Optimization (DPO) has emerged as a promising approach for aligning large language models with human preferences.

PSCon: Toward Conversational Product Search

no code implementations19 Feb 2025 Jie Zou, Mohammad Aliannejadi, Evangelos Kanoulas, Shuxi Han, Heli Ma, Zheng Wang, Yang Yang, Heng Tao Shen

Conversational Product Search (CPS) is confined to simulated conversations due to the lack of real-world CPS datasets that reflect human-like language.

Intent Detection Keyword Extraction +2

Leader and Follower: Interactive Motion Generation under Trajectory Constraints

no code implementations17 Feb 2025 Runqi Wang, Caoyuan Ma, Jian Zhao, Hanrui Xu, Dongfang Sun, Haoyang Chen, Lin Xiong, Zheng Wang, Xuelong Li

To generate interactive motion following specified trajectories, this paper decouples complex motion into a Leader - Follower dynamic, inspired by role allocation in partner dancing.

Motion Generation

A Survey of LLM-based Agents in Medicine: How far are we from Baymax?

no code implementations16 Feb 2025 Wenxuan Wang, Zizhan Ma, Zheng Wang, Chenghan Wu, WenTing Chen, Xiang Li, Yixuan Yuan

Large Language Models (LLMs) are transforming healthcare through the development of LLM-based agents that can understand, reason about, and assist with medical tasks.

Hallucination Survey

GENERator: A Long-Context Generative Genomic Foundation Model

1 code implementation11 Feb 2025 Wei Wu, Qiuyi Li, Mingyang Li, Kun fu, Fuli Feng, Jieping Ye, Hui Xiong, Zheng Wang

Recent developments in genomic language models have underscored the potential of LLMs in deciphering DNA sequences.

model

AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders

1 code implementation28 Jan 2025 Zhengxuan Wu, Aryaman Arora, Atticus Geiger, Zheng Wang, Jing Huang, Dan Jurafsky, Christopher D. Manning, Christopher Potts

We introduce a novel weakly-supervised representational method (Rank-1 Representation Finetuning; ReFT-r1), which is competitive on both tasks while providing the interpretability advantages that prompting lacks.

Language Modeling Language Modelling

Spectrum Sharing in Satellite-Terrestrial Integrated Networks: Frameworks, Approaches, and Opportunities

no code implementations6 Jan 2025 Bodong Shang, Zheng Wang, Xiangyu Li, Chungang Yang, Chao Ren, Haijun Zhang

Furthermore, advanced SS approaches related to interference management in STINs and performance metrics of SS in STINs are introduced.

Management

An Investigation into Value Misalignment in LLM-Generated Texts for Cultural Heritage

no code implementations3 Jan 2025 Fan Bu, Zheng Wang, Siyi Wang, Ziyao Liu

As Large Language Models (LLMs) become increasingly prevalent in tasks related to cultural heritage, such as generating descriptions of historical monuments, translating ancient texts, preserving oral traditions, and creating educational content, their ability to produce accurate and culturally aligned texts is being increasingly relied upon by users and researchers.

Enhancing Deployment-Time Predictive Model Robustness for Code Analysis and Optimization

1 code implementation31 Dec 2024 Huanting Wang, Patrick Lenihan, Zheng Wang

By relabeling up to 5% of the Prom-identified samples through incremental learning, Prom can help a deployed model achieve a performance comparable to that attained during its model training phase.

Incremental Learning

ProtCLIP: Function-Informed Protein Multi-Modal Learning

no code implementations28 Dec 2024 Hanjing Zhou, Mingze Yin, Wei Wu, Mingyang Li, Kun fu, Jintai Chen, Jian Wu, Zheng Wang

However, these works were still unable to replicate the extraordinary success of language-supervised visual foundation models due to the ineffective usage of aligned protein-text paired data and the lack of an effective function-informed pre-training paradigm.

Protein Function Prediction Semantic Similarity +1

Rethinking Cancer Gene Identification through Graph Anomaly Analysis

1 code implementation23 Dec 2024 Yilong Zang, Lingfei Ren, Yue Li, Zhikang Wang, David Antony Selby, Zheng Wang, Sebastian Josef Vollmer, Hongzhi Yin, Jiangning Song, Junhang Wu

Graph neural networks (GNNs) have shown promise in integrating protein-protein interaction (PPI) networks for identifying cancer genes in recent studies.

Graph Neural Network

Local Features Meet Stochastic Anonymization: Revolutionizing Privacy-Preserving Face Recognition for Black-Box Models

no code implementations11 Dec 2024 Yuanwei Liu, Chengyu Jia, Ruqi Xiao, Xuemai Jia, Hui Wei, Kui Jiang, Zheng Wang

The task of privacy-preserving face recognition (PPFR) currently faces two major unsolved challenges: (1) existing methods are typically effective only on specific face recognition models and struggle to generalize to black-box face recognition models; (2) current methods employ data-driven reversible representation encoding for privacy protection, making them susceptible to adversarial learning and reconstruction of the original image.

Face Recognition Privacy Preserving

Optimizing Personalized Federated Learning through Adaptive Layer-Wise Learning

1 code implementation10 Dec 2024 Weihang Chen, Jie Ren, Zhiqiang Li, Ling Gao, Zheng Wang

Personalized FL (pFL) tackles these issues by tailoring local models to individual data sources and using weighted aggregation methods for client-specific learning.

Personalized Federated Learning

StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI Astrophysicist

2 code implementations9 Dec 2024 Cunshi Wang, Xinjie Hu, Yu Zhang, Xunhao Chen, Pengliang Du, Yiming Mao, Rui Wang, Yuyang Li, Ying Wu, Hang Yang, Yansong Li, Beichuan Wang, Haiyang Mu, Zheng Wang, Jianfeng Tian, Liang Ge, Yongna Mao, Shengming Li, Xiaomeng Lu, Jinhang Zou, Yang Huang, Ningchen Sun, Jie Zheng, Min He, Yu Bai, Junjie Jin, Hong Wu, Chaohui Shang, Jifeng Liu

Additionally, the integration of AI agents within the system provides online accessibility, saving astronomers' time and encouraging greater participation from amateur astronomers in the NGSS project.

Mix-Modality Person Re-Identification: A New and Practical Paradigm

no code implementations6 Dec 2024 Wei Liu, Xin Xu, Hua Chang, Xin Yuan, Zheng Wang

Current visible-infrared cross-modality person re-identification research has only focused on exploring the bi-modality mutual retrieval paradigm, and we propose a new and more practical mix-modality retrieval paradigm.

Cross-Modality Person Re-identification Person Re-Identification +1

Multi-Level Correlation Network For Few-Shot Image Classification

1 code implementation4 Dec 2024 Yunkai Dang, Min Zhang, Zhengyu Chen, Xinliang Zhang, Zheng Wang, Meijun Sun, Donglin Wang

In this paper, we argue that measure at such a level may not be effective enough to generalize from base to novel classes when using only a few images.

Few-Shot Image Classification Metric Learning +1

Prompt as Free Lunch: Enhancing Diversity in Source-Free Cross-domain Few-shot Learning through Semantic-Guided Prompting

no code implementations1 Dec 2024 Linhai Zhuo, Zheng Wang, Yuqian Fu, Tianwen Qian

After training of the first step, support samples with different diversity prompts are input into the CLIP backbone to generate enhanced features.

cross-domain few-shot learning Diversity

Online Adaptive Real-Time Beamforming Design for Dynamic Environments in Cell-Free Systems

no code implementations27 Nov 2024 Guanghui Chen, Zheng Wang, Hongxin Lin, Pengguang Du, Yongming Huang

In this paper, we consider real-time beamforming design for dynamic wireless environments with varying channels and different numbers of access points (APs) and users in cell-free systems.

TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection

no code implementations5 Nov 2024 Wei Wu, Zhuoshi Pan, Chao Wang, Liyi Chen, Yunchu Bai, Kun fu, Zheng Wang, Hui Xiong

With the development of large language models (LLMs), the ability to handle longer contexts has become a key capability for Web applications such as cross-document understanding and LLM-powered search systems.

document understanding

Bridge-IF: Learning Inverse Protein Folding with Markov Bridges

1 code implementation4 Nov 2024 Yiheng Zhu, Jialu Wu, Qiuyi Li, Jiahuan Yan, Mingze Yin, Wei Wu, Mingyang Li, Jieping Ye, Zheng Wang, Jian Wu

To fill these gaps, we propose Bridge-IF, a generative diffusion bridge model for inverse folding, which is designed to learn the probabilistic dependency between the distributions of backbone structures and protein sequences.

Protein Design Protein Folding

P4GCN: Vertical Federated Social Recommendation with Privacy-Preserving Two-Party Graph Convolution Network

no code implementations16 Oct 2024 Zheng Wang, Wanwan Wang, Yimin Huang, Zhaopeng Peng, Ziqi Yang, Ming Yao, Cheng Wang, Xiaoliang Fan

To address this issue, we propose a novel vertical federated social recommendation method leveraging privacy-preserving two-party graph convolution networks (P4GCN) to enhance recommendation accuracy without requiring direct access to sensitive social information.

Privacy Preserving Recommendation Systems

Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework

no code implementations14 Oct 2024 Zhengwei Yang, Yuke Li, Qiang Sun, Basura Fernando, Heng Huang, Zheng Wang

Most existing studies on few-shot learning focus on unimodal settings, where models are trained to generalize on unseen data using only a small number of labeled examples from the same modality.

Few-Shot Learning Transfer Learning

A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models

no code implementations5 Oct 2024 Zhihao Wang, Shiyu Liu, Jianheng Huang, Zheng Wang, Yixuan Liao, Xiaoxin Chen, Junfeng Yao, Jinsong Su

Preliminary experiments demonstrate that PTFS achieves better pre-training performance, while CPT has lower training cost.

Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding

no code implementations4 Oct 2024 Wei Wu, Chao Wang, Liyi Chen, Mingze Yin, Yiheng Zhu, Kun fu, Jieping Ye, Hui Xiong, Zheng Wang

Recent development of protein language models (pLMs) with supervised fine tuning provides a promising solution to this problem.

Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting

1 code implementation1 Oct 2024 Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap

In recent years, the rapid increase in online video content has underscored the limitations of static Video Question Answering (VideoQA) models trained on fixed datasets, as they struggle to adapt to new questions or tasks posed by newly available content.

Continual Learning Language Modeling +4

Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation

no code implementations30 Sep 2024 Yuran Wang, Zhijing Wan, Yansheng Qiu, Zheng Wang

To address the overlooked challenge, we introduce the Locality-Aware Diffusion (Lad), a novel method tailored for exquisite 3D abdominal CT volume generation.

Medical Image Analysis Organ Segmentation +1

Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs

no code implementations28 Sep 2024 Zheng Wang, Zhongyang Li, Zeren Jiang, Dandan Tu, Wei Shi

In the age of mobile internet, user data, often referred to as memories, is continuously generated on personal devices.

RAG Retrieval

GalleryGPT: Analyzing Paintings with Large Multimodal Models

1 code implementation1 Aug 2024 Yi Bin, Wenhao Shi, Yujuan Ding, Zhiqiang Hu, Zheng Wang, Yang Yang, See-Kiong Ng, Heng Tao Shen

Specifically, we first propose a task of composing paragraph analysis for artworks, i. e., painting in this paper, only focusing on visual characteristics to formulate more comprehensive understanding of artworks.

Art Analysis

360VFI: A Dataset and Benchmark for Omnidirectional Video Frame Interpolation

no code implementations19 Jul 2024 Wenxuan Lu, Mengshun Hu, Yansheng Qiu, Liang Liao, Zheng Wang

This paper introduces the benchmark dataset, 360VFI, for Omnidirectional Video Frame Interpolation.

Decoder ERP +1

A3S: A General Active Clustering Method with Pairwise Constraints

1 code implementation14 Jul 2024 Xun Deng, Junlong Liu, Han Zhong, Fuli Feng, Chen Shen, Xiangnan He, Jieping Ye, Zheng Wang

Active clustering aims to boost the clustering performance by integrating human-annotated pairwise constraints through strategic querying.

Clustering

Model Tells You Where to Merge: Adaptive KV Cache Merging for LLMs on Long-Context Tasks

no code implementations11 Jul 2024 Zheng Wang, Boxiao Jin, Zhongzhi Yu, Minjia Zhang

To mitigate computational costs, LLMs often employ the KV Cache technique to improve the generation speed.

Computational Efficiency

Zero-shot Object Counting with Good Exemplars

1 code implementation6 Jul 2024 Huilin Zhu, Jingling Yuan, Zhengwei Yang, Yu Guo, Zheng Wang, Xian Zhong, Shengfeng He

Zero-shot object counting (ZOC) aims to enumerate objects in images using only the names of object classes during testing, without the need for manual annotations.

Contrastive Learning Object +2

Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration

1 code implementation22 Jun 2024 Zhongzhi Yu, Zheng Wang, Yonggan Fu, Huihong Shi, Khalid Shaikh, Yingyan Celine Lin

Specifically, this work begins with comprehensive visualizations of the attention distributions in LLMs during inference across various inputs and tasks.

Semantic Enhanced Few-shot Object Detection

no code implementations19 Jun 2024 Zheng Wang, Yingjie Gao, Qingjie Liu, Yunhong Wang

As a result, our method allows each novel class to construct a compact feature space without being confused with similar base classes.

Few-Shot Object Detection Object +4

When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

1 code implementation11 Jun 2024 Haoran You, Yichao Fu, Zheng Wang, Amir Yazdanbakhsh, Yingyan Celine Lin

Autoregressive Large Language Models (LLMs) have achieved impressive performance in language tasks but face two significant bottlenecks: (1) quadratic complexity in the attention module as the number of tokens increases, and (2) limited efficiency due to the sequential processing nature of autoregressive LLMs during generation.

Decoder

Selective Knowledge Sharing for Personalized Federated Learning Under Capacity Heterogeneity

no code implementations31 May 2024 Zheng Wang, Zhaopeng Peng, Zihui Wang, Cheng Wang

Finally, we facilitate the implicit aggregation of personal parameters by aggregating client embeddings through a self-attention module.

Personalized Federated Learning

CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code

1 code implementation24 Apr 2024 Batu Guan, Yao Wan, Zhangqian Bi, Zheng Wang, Hongyu Zhang, Pan Zhou, Lichao Sun

Experiments conducted on a real-world dataset across five programming languages demonstrate the effectiveness of CodeIP in watermarking LLMs for code generation while maintaining the syntactical correctness of code.

Code Generation Diversity

A Massive MIMO Sampling Detection Strategy Based on Denoising Diffusion Model

no code implementations20 Apr 2024 Lanxin He, Zheng Wang, Yongming Huang

The Langevin sampling method relies on an accurate score matching while the existing massive multiple-input multiple output (MIMO) Langevin detection involves an inevitable singular value decomposition (SVD) to calculate the posterior score.

Denoising

FedPFT: Federated Proxy Fine-Tuning of Foundation Models

1 code implementation17 Apr 2024 Zhaopeng Peng, Xiaoliang Fan, Yufan Chen, Zheng Wang, Shirui Pan, Chenglu Wen, Ruisheng Zhang, Cheng Wang

Adapting Foundation Models (FMs) for downstream tasks through Federated Learning (FL) emerges a promising strategy for protecting data privacy and valuable FMs.

Federated Learning

ReFT: Representation Finetuning for Language Models

2 code implementations4 Apr 2024 Zhengxuan Wu, Aryaman Arora, Zheng Wang, Atticus Geiger, Dan Jurafsky, Christopher D. Manning, Christopher Potts

We define a strong instance of the ReFT family, Low-rank Linear Subspace ReFT (LoReFT), and we identify an ablation of this method that trades some performance for increased efficiency.

Arithmetic Reasoning

pyvene: A Library for Understanding and Improving PyTorch Models via Interventions

3 code implementations12 Mar 2024 Zhengxuan Wu, Atticus Geiger, Aryaman Arora, Jing Huang, Zheng Wang, Noah D. Goodman, Christopher D. Manning, Christopher Potts

Interventions on model-internal states are fundamental operations in many areas of AI, including model editing, steering, robustness, and interpretability.

Model Editing

Multimodal Query Suggestion with Multi-Agent Reinforcement Learning from Human Feedback

no code implementations7 Feb 2024 Zheng Wang, Bingzheng Gan, Wei Shi

In this paper, we introduce a novel Multimodal Query Suggestion (MMQS) task, which aims to generate query suggestions based on user query images to improve the intentionality and diversity of search results.

Diversity Information Retrieval +3

Mitigating Prior Shape Bias in Point Clouds via Differentiable Center Learning

no code implementations3 Feb 2024 Zhe Li, Ziyang Zhang, Jinglin Zhao, Zheng Wang, Bocheng Ren, Debin Liu, Laurence T. Yang

Experimental results demonstrate that our method enhances the expressive capacity of existing point cloud models and effectively addresses the issue of information leakage.

Contributing Dimension Structure of Deep Feature for Coreset Selection

1 code implementation29 Jan 2024 Zhijing Wan, Zhixiang Wang, Yuran Wang, Zheng Wang, Hongyuan Zhu, Shin'ichi Satoh

Existing methods typically measure both the representation and diversity of data based on similarity metrics, such as L2-norm.

Diversity

IQ-VFI: Implicit Quadratic Motion Estimation for Video Frame Interpolation

no code implementations CVPR 2024 Mengshun Hu, Kui Jiang, Zhihang Zhong, Zheng Wang, Yinqiang Zheng

To this end we propose a novel framework for implicit quadratic video frame interpolation (IQ-VFI) which explores latent acceleration information and accurate intermediate motions via knowledge distillation.

Knowledge Distillation Motion Estimation +1

SynSP: Synergy of Smoothness and Precision in Pose Sequences Refinement

1 code implementation CVPR 2024 Tao Wang, Lei Jin, Zheng Wang, Jianshu Li, Liang Li, Fang Zhao, Yu Cheng, Li Yuan, Li Zhou, Junliang Xing, Jian Zhao

To leverage this quality information we propose a motion refinement network termed SynSP to achieve a Synergy of Smoothness and Precision in the sequence refinement tasks.

Bi-Causal: Group Activity Recognition via Bidirectional Causality

no code implementations CVPR 2024 Youliang Zhang, Wenxuan Liu, Danni Xu, Zhuo Zhou, Zheng Wang

This network establishes a causality communication channel while modeling relations and interactions enabling reciprocal enhancement between human-object interactions and human relations ensuring their mutual consistency.

Group Activity Recognition Human-Object Interaction Detection +1

Ensemble Diversity Facilitates Adversarial Transferability

1 code implementation CVPR 2024 Bowen Tang, Zheng Wang, Yi Bin, Qi Dou, Yang Yang, Heng Tao Shen

With the advent of ensemble-based attacks the transferability of generated adversarial examples is elevated by a noticeable margin despite many methods only employing superficial integration yet ignoring the diversity between ensemble models.

Diversity reinforcement-learning +1

Open-Vocabulary Video Relation Extraction

1 code implementation25 Dec 2023 Wentao Tian, Zheng Wang, Yuqian Fu, Jingjing Chen, Lechao Cheng

A comprehensive understanding of videos is inseparable from describing the action with its contextual action-object interactions.

Action Classification Action Understanding +3

A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor

no code implementations27 Nov 2023 Jialin Liu, Lu Yan, Xiaowei Liu, Yuzhuo Dai, Fanggen Lu, Yuanting Ma, Muzhou Hou, Zheng Wang

We conducted image processing of HRM to predict the esophageal contraction vigor for assisting the evaluation of esophageal dynamic function.

POS

Diffusion-Generative Multi-Fidelity Learning for Physical Simulation

no code implementations9 Nov 2023 Zheng Wang, Shibo Li, Shikai Fang, Shandian Zhe

We propose a conditional score model to control the solution generation by the input parameters and the fidelity.

Denoising

Dynamic Association Learning of Self-Attention and Convolution in Image Restoration

no code implementations9 Nov 2023 Kui Jiang, Xuemei Jia, Wenxin Huang, Wenbin Wang, Zheng Wang, Junjun Jiang

Thus, we propose to refine background textures with the predicted degradation prior in an association learning manner.

Decoder Image Restoration +1

Functional Bayesian Tucker Decomposition for Continuous-indexed Tensor Data

1 code implementation8 Nov 2023 Shikai Fang, Xin Yu, Zheng Wang, Shibo Li, Mike Kirby, Shandian Zhe

To generalize Tucker decomposition to such scenarios, we propose Functional Bayesian Tucker Decomposition (FunBaT).

Gaussian Processes

Foundation Models Meet Visualizations: Challenges and Opportunities

no code implementations9 Oct 2023 Weikai Yang, Mengchen Liu, Zheng Wang, Shixia Liu

Recent studies have indicated that foundation models, such as BERT and GPT, excel in adapting to a variety of downstream tasks.

Fairness

Label Deconvolution for Node Representation Learning on Large-scale Attributed Graphs against Learning Bias

1 code implementation26 Sep 2023 Zhihao Shi, Jie Wang, Fanghua Lu, Hanzhu Chen, Defu Lian, Zheng Wang, Jieping Ye, Feng Wu

The inverse mapping leads to an objective function that is equivalent to that by the joint training, while it can effectively incorporate GNNs in the training phase of NEs against the learning bias.

Representation Learning

An Effective Two-stage Training Paradigm Detector for Small Dataset

no code implementations11 Sep 2023 Zheng Wang, Dong Xie, Hanzhi Wang, Jiang Tian

Learning from the limited amount of labeled data to the pre-train model has always been viewed as a challenging task.

object-detection Object Detection

Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph Reasoning

no code implementations29 Aug 2023 Zhuo Zhou, Wenxuan Liu, Danni Xu, Zheng Wang, Jian Zhao

HID presents a unique challenge in that hidden intentions lack the obvious visual representations to distinguish them from normal intentions.

Intent Detection

A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions

no code implementations25 Aug 2023 Tianyi Zhang, Zheng Wang, Jing Huang, Mohiuddin Muhammad Tasnim, Wei Shi

Fortunately, the availability of open-source stable diffusion models and their underlying mathematical principles has enabled the academic community to extensively analyze the performance of current image generation models and make improvements based on this stable diffusion framework.

Image Generation

A Generalized Physical-knowledge-guided Dynamic Model for Underwater Image Enhancement

1 code implementation10 Aug 2023 Pan Mu, Hanning Xu, Zheyuan Liu, Zheng Wang, Sixian Chan, Cong Bai

To tackle these challenges, we design a Generalized Underwater image enhancement method via a Physical-knowledge-guided Dynamic Model (short for GUPDM), consisting of three parts: Atmosphere-based Dynamic Structure (ADS), Transmission-guided Dynamic Structure (TDS), and Prior-based Multi-scale Structure (PMS).

Image Enhancement

NNVISR: Bring Neural Network Video Interpolation and Super Resolution into Video Processing Framework

1 code implementation6 Aug 2023 Yuan Tong, Mengshun Hu, Zheng Wang

We present NNVISR - an open-source filter plugin for the VapourSynth video processing framework, which facilitates the application of neural networks for various kinds of video enhancing tasks, including denoising, super resolution, interpolation, and spatio-temporal super-resolution.

Denoising Super-Resolution +1

From Generation to Suppression: Towards Effective Irregular Glow Removal for Nighttime Visibility Enhancement

no code implementations31 Jul 2023 Wanyu Wu, Wei Wang, Zheng Wang, Kui Jiang, Xin Xu

Most existing Low-Light Image Enhancement (LLIE) methods are primarily designed to improve brightness in dark regions, which suffer from severe degradation in nighttime images.

Low-Light Image Enhancement Zero-Shot Learning

Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling

1 code implementation ICCV 2023 Zhuoxiao Chen, Yadan Luo, Zheng Wang, Mahsa Baktashmotlagh, Zi Huang

Unsupervised domain adaptation (DA) with the aid of pseudo labeling techniques has emerged as a crucial approach for domain-adaptive 3D object detection.

3D Object Detection object-detection +1

TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting

1 code implementation20 Jun 2023 Taorong Liu, Liang Liao, Delin Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

For precise utilization of the reference features for guidance, a reference-patch alignment (Ref-PA) module is proposed to align the patch features of the reference and corrupted images and harmonize their style differences, while a reference-patch transformer (Ref-PT) module is proposed to refine the embedded reference feature.

Decoder Image Inpainting +1

Spatial-Temporal Graph Learning with Adversarial Contrastive Adaptation

1 code implementation19 Jun 2023 Qianru Zhang, Chao Huang, Lianghao Xia, Zheng Wang, SiuMing Yiu, Ruihua Han

In addition, we introduce a cross-view contrastive learning paradigm to model the inter-dependencies across view-specific region representations and preserve underlying relation heterogeneity.

Contrastive Learning Graph Learning +1

One for All: Unified Workload Prediction for Dynamic Multi-tenant Edge Cloud Platforms

1 code implementation2 Jun 2023 Shaoyuan Huang, Zheng Wang, Heng Zhang, Xiaofei Wang, Cheng Zhang, Wenyu Wang

In this paper, we propose an end-to-end framework with global pooling and static content awareness, DynEformer, to provide a unified workload prediction scheme for dynamic MT-ECP.

All Prediction +2

Group Activity Recognition via Dynamic Composition and Interaction

no code implementations9 May 2023 Youliang Zhang, Zhuo Zhou, Wenxuan Liu, Danni Xu, Zheng Wang

Group composition tells us the location of people and their relations inside the group, while interaction reflects the relation between humans and objects outside the group.

Group Activity Recognition Human-Object Interaction Detection +1

Automated Spatio-Temporal Graph Contrastive Learning

1 code implementation6 May 2023 Qianru Zhang, Chao Huang, Lianghao Xia, Zheng Wang, Zhonghang Li, SiuMing Yiu

In this paper, we tackle the above challenges by exploring the Automated Spatio-Temporal graph contrastive learning paradigm (AutoST) over the heterogeneous region graph generated from multi-view data sources.

Contrastive Learning

ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding

1 code implementation23 Mar 2023 Ziyang Lu, Yunqiang Pei, Guoqing Wang, Yang Yang, Zheng Wang, Heng Tao Shen

Despite their effectiveness, existing methods suffer from the difficulty of low recognition accuracy in cases of multiple adjacent objects with similar appearances. To address this issue, this work intuitively introduces the human-robot interaction as a cue to facilitate the development of 3D visual grounding.

3D visual grounding

Adaptive Modeling of Uncertainties for Traffic Forecasting

no code implementations16 Mar 2023 Ying Wu, Yongchao Ye, Adnan Zeb, James J. Q. Yu, Zheng Wang

We evaluated QuanTraffic by applying it to five representative DNN models for traffic forecasting across seven public datasets.

Management Prediction +2

DiffusionAD: Norm-guided One-step Denoising Diffusion for Anomaly Detection

1 code implementation15 Mar 2023 HUI ZHANG, Zheng Wang, Zuxuan Wu, Yu-Gang Jiang

Anomaly detection has garnered extensive applications in real industrial manufacturing due to its remarkable effectiveness and efficiency.

Denoising Unsupervised Anomaly Detection

Unleashing the Potential of Acquisition Functions in High-Dimensional Bayesian Optimization

1 code implementation16 Feb 2023 Jiayu Zhao, Renyu Yang, Shenghao Qiu, Zheng Wang

Bayesian optimization (BO) is widely used to optimize expensive-to-evaluate black-box functions. BO first builds a surrogate model to represent the objective function and assesses its uncertainty.

Bayesian Optimization

Multilateral Semantic Relations Modeling for Image Text Retrieval

no code implementations CVPR 2023 Zheng Wang, Zhenwei Gao, Kangshuai Guo, Yang Yang, Xiaoming Wang, Heng Tao Shen

Specifically, a given query is first mapped as a probabilistic embedding to learn its true semantic distribution based on Mahalanobis distance.

Image-text Retrieval Text Retrieval

Scratch Each Other's Back: Incomplete Multi-Modal Brain Tumor Segmentation via Category Aware Group Self-Support Learning

1 code implementation ICCV 2023 Yansheng Qiu, Delin Chen, Hongdou Yao, Yongchao Xu, Zheng Wang

In this paper, considering the sensitivity of different modalities to diverse tumor regions, we propose a Category Aware Group Self-Support Learning framework, called GSS, to make up for the information deficit among the modalities in the individual modal feature extraction phase.

Brain Tumor Segmentation Tumor Segmentation

Good Is Bad: Causality Inspired Cloth-Debiasing for Cloth-Changing Person Re-Identification

1 code implementation CVPR 2023 Zhengwei Yang, Meng Lin, Xian Zhong, Yu Wu, Zheng Wang

Entangled representation of clothing and identity (ID)-intrinsic clues are potentially concomitant in conventional person Re-IDentification (ReID).

Cloth-Changing Person Re-Identification

Error-aware Quantization through Noise Tempering

no code implementations11 Dec 2022 Zheng Wang, Juncheng B Li, Shuhui Qu, Florian Metze, Emma Strubell

In this work, we incorporate exponentially decaying quantization-error-aware noise together with a learnable scale of task loss gradient to approximate the effect of a quantization operator.

Model Compression Quantization

Prototypical Residual Networks for Anomaly Detection and Localization

no code implementations CVPR 2023 HUI ZHANG, Zuxuan Wu, Zheng Wang, Zhineng Chen, Yu-Gang Jiang

Anomaly detection and localization are widely used in industrial manufacturing for its efficiency and effectiveness.

Ranked #4 on Supervised Anomaly Detection on MVTec AD (using extra training data)

Supervised Anomaly Detection

Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval

1 code implementation IEEE Transactions on Multimedia 2020 Fan Yang, Yang Wu, Zheng Wang, Xiang Li, Sakriani Sakti, Satoshi Nakamura

Therefore, previous works pre-train their models on rich-labeled photo retrieval data (i. e., source domain) and then fine-tune them on the limited-labeled sketch-to-photo retrieval data (i. e., target domain).

Domain Adaptation Image Retrieval +1

FedGS: Federated Graph-based Sampling with Arbitrary Client Availability

1 code implementation25 Nov 2022 Zheng Wang, Xiaoliang Fan, Jianzhong Qi, Haibing Jin, Peizhen Yang, Siqi Shen, Cheng Wang

Second, constrained by the far-distance in data distribution of the sampled clients, we further minimize the variance of the numbers of times that the clients are sampled, to mitigate long-term bias.

Federated Learning

On Inferring User Socioeconomic Status with Mobility Records

1 code implementation15 Nov 2022 Zheng Wang, Mingrui Liu, Cheng Long, Qianru Zhang, Jiangneng Li, Chunyan Miao

The DeepSEI model incorporates two networks called deep network and recurrent network, which extract the features of the mobility records from three aspects, namely spatiality, temporality and activity, one at a coarse level and the other at a detailed level.

Management

UmeTrack: Unified multi-view end-to-end hand tracking for VR

no code implementations31 Oct 2022 Shangchen Han, Po-Chen Wu, Yubo Zhang, Beibei Liu, Linguang Zhang, Zheng Wang, Weiguang Si, Peizhao Zhang, Yujun Cai, Tomas Hodan, Randi Cabezas, Luan Tran, Muzaffer Akbay, Tsz-Ho Yu, Cem Keskin, Robert Wang

In this paper, we present a unified end-to-end differentiable framework for multi-view, multi-frame hand tracking that directly predicts 3D hand pose in world space.

A Survey of Dataset Refinement for Problems in Computer Vision Datasets

2 code implementations21 Oct 2022 Zhijing Wan, Zhixiang Wang, CheukTing Chung, Zheng Wang

In addition, we organize these dataset refinement methods according to the addressed data problems and provide a systematic comparative description.

Active Learning

Context-Enhanced Stereo Transformer

1 code implementation21 Oct 2022 Weiyu Guo, Zhaoshuo Li, Yongkui Yang, Zheng Wang, Russell H. Taylor, Mathias Unberath, Alan Yuille, Yingwei Li

We construct our stereo depth estimation model, Context Enhanced Stereo Transformer (CSTR), by plugging CEP into the state-of-the-art stereo depth estimation method Stereo Transformer.

Stereo Depth Estimation Stereo Matching

SQuAT: Sharpness- and Quantization-Aware Training for BERT

no code implementations13 Oct 2022 Zheng Wang, Juncheng B Li, Shuhui Qu, Florian Metze, Emma Strubell

Quantization is an effective technique to reduce memory footprint, inference latency, and power consumption of deep learning models.

Quantization

Physical Adversarial Attack meets Computer Vision: A Decade Survey

1 code implementation30 Sep 2022 Hui Wei, Hao Tang, Xuemei Jia, Zhixiang Wang, Hanxun Yu, Zhubo Li, Shin'ichi Satoh, Luc van Gool, Zheng Wang

Building upon this foundation, we uncover the pervasive role of artifacts carrying adversarial perturbations in the physical world.

Adversarial Attack Medical Diagnosis +1

Faith: An Efficient Framework for Transformer Verification on GPUs

1 code implementation23 Sep 2022 Boyuan Feng, Tianqi Tang, yuke wang, Zhaodong Chen, Zheng Wang, Shu Yang, Yuan Xie, Yufei Ding

In this paper, we propose Faith, an efficient framework for transformer verification on GPUs.

Sentence

MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Multi-GPU Platforms

1 code implementation14 Sep 2022 yuke wang, Boyuan Feng, Zheng Wang, Tong Geng, Kevin Barker, Ang Li, Yufei Ding

For irregularly sparse and fine-grained GNN workloads, such solutions miss the opportunity to jointly schedule/optimize the computation and communication operations for high-performance delivery.

Layout Design Management

Detecting Algorithmically Generated Domains Using a GCNN-LSTM Hybrid Neural Network

no code implementations6 Aug 2022 Zheng Wang

Domain generation algorithm (DGA) is used by botnets to build a stealthy command and control (C&C) communication channel between the C&C server and the bots.

Understanding Adversarial Robustness of Vision Transformers via Cauchy Problem

1 code implementation1 Aug 2022 Zheng Wang, Wenjie Ruan

Recent research on the robustness of deep learning has shown that Vision Transformers (ViTs) surpass the Convolutional Neural Networks (CNNs) under some perturbations, e. g., natural corruption, adversarial attacks, etc.

Adversarial Robustness

Reference-Guided Texture and Structure Inference for Image Inpainting

1 code implementation29 Jul 2022 Taorong Liu, Liang Liao, Zheng Wang, Shin'ichi Satoh

Existing learning-based image inpainting methods are still in challenge when facing complex semantic environments and diverse hole patterns.

Decoder Image Inpainting

Magic ELF: Image Deraining Meets Association Learning and Transformer

1 code implementation21 Jul 2022 Kui Jiang, Zhongyuan Wang, Chen Chen, Zheng Wang, Laizhong Cui, Chia-Wen Lin

Convolutional neural network (CNN) and Transformer have achieved great success in multimedia applications.

Rain Removal

You Only Align Once: Bidirectional Interaction for Spatial-Temporal Video Super-Resolution

no code implementations13 Jul 2022 Mengshun Hu, Kui Jiang, Zhixiang Nie, Zheng Wang

Spatial-Temporal Video Super-Resolution (ST-VSR) technology generates high-quality videos with higher resolution and higher frame rates.

Video Super-Resolution

Nonparametric Factor Trajectory Learning for Dynamic Tensor Decomposition

1 code implementation6 Jul 2022 Zheng Wang, Shandian Zhe

In practice, tensor data is often accompanied by temporal information, namely the time points when the entry values were generated.

Tensor Decomposition

Infinite-Fidelity Coregionalization for Physical Simulation

no code implementations1 Jul 2022 Shibo Li, Zheng Wang, Robert M. Kirby, Shandian Zhe

Our model can interpolate and/or extrapolate the predictions to novel fidelities, which can be even higher than the fidelities of training data.

Gaussian Processes

Towards Generalizable Person Re-identification with a Bi-stream Generative Model

no code implementations19 Jun 2022 Xin Xu, Wei Liu, Zheng Wang, Ruiming Hu, Qi Tian

Guided by original pedestrian images, one stream is employed to learn a camera-invariant global feature for the CC problem via filtering cross-camera interference factors.

Domain Generalization Generalizable Person Re-identification

Improving Generalization of Metric Learning via Listwise Self-distillation

1 code implementation17 Jun 2022 Zelong Zeng, Fan Yang, Zheng Wang, Shin'ichi Satoh

Most deep metric learning (DML) methods employ a strategy that forces all positive samples to be close in the embedding space while keeping them away from negative ones.

Metric Learning

Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion

1 code implementation10 Jun 2022 Liang Liao, WenYi Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

Specifically, based on the two discoveries of local spatial similarity and adjacent temporal correspondence of the sequential image data, we propose a novel Target-Domain driven pseudo label Diffusion (TDo-Dif) scheme.

Autonomous Driving Pseudo Label +4

Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval

1 code implementation22 May 2022 Zelong Zeng, Zheng Wang, Fan Yang, Shin'ichi Satoh

The large variation of viewpoint and irrelevant content around the target always hinder accurate image retrieval and its subsequent tasks.

geo-localization Image Retrieval +2

Spatial-Temporal Space Hand-in-Hand: Spatial-Temporal Video Super-Resolution via Cycle-Projected Mutual Learning

no code implementations CVPR 2022 Mengshun Hu, Kui Jiang, Liang Liao, Jing Xiao, Junjun Jiang, Zheng Wang

Specifically, we propose to exploit the mutual information among them via iterative up-and-down projections, where the spatial and temporal features are fully fused and distilled, helping the high-quality video reconstruction.

Video Reconstruction Video Super-Resolution

Deep Quality Assessment of Compressed Videos: A Subjective and Objective Study

no code implementations7 May 2022 Liqun Lin, Zheng Wang, Jiachen He, Weiling Chen, Yiwen Xu, Tiesong Zhao

In this work, a semi-automatic labeling method is adopted to build a large-scale compressed video quality database, which allows us to label a large number of compressed videos with manageable human workload.

Video Quality Assessment Visual Question Answering (VQA)

A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges

no code implementations11 Apr 2022 Junyun Cui, Xiaoyu Shen, Feiping Nie, Zheng Wang, Jinglong Wang, Yulong Chen

In this paper, to address the current lack of comprehensive survey of existing LJP tasks, datasets, models and evaluations, (1) we analyze 31 LJP datasets in 6 languages, present their construction process and define a classification method of LJP with 3 different attributes; (2) we summarize 14 evaluation metrics under four categories for different outputs of LJP tasks; (3) we review 12 legal-domain pretrained models in 3 languages and highlight 3 major research directions for LJP; (4) we show the state-of-art results for 8 representative datasets from different court cases and discuss the open challenges.

Unsupervised Manga Character Re-identification via Face-body and Spatial-temporal Associated Clustering

no code implementations10 Apr 2022 Zhimin Zhang, Zheng Wang, Wei Hu

In the past few years, there has been a dramatic growth in e-manga (electronic Japanese-style comics).

Clustering Triplet

Exploring the Impact of Negative Samples of Contrastive Learning: A Case Study of Sentence Embedding

1 code implementation Findings (ACL) 2022 Rui Cao, Yihao Wang, Yuxin Liang, Ling Gao, Jie Zheng, Jie Ren, Zheng Wang

We define a maximum traceable distance metric, through which we learn to what extent the text contrastive learning benefits from the historical information of negative samples.

Contrastive Learning Sentence +4

Visual-Tactile Sensing for Real-time Liquid Volume Estimation in Grasping

no code implementations23 Feb 2022 Fan Zhu, Ruixing Jia, Lei Yang, Youcan Yan, Zheng Wang, Jia Pan, Wenping Wang

We propose a deep visuo-tactile model for realtime estimation of the liquid inside a deformable container in a proprioceptive way. We fuse two sensory modalities, i. e., the raw visual inputs from the RGB camera and the tactile cues from our specific tactile sensor without any extra sensor calibrations. The robotic system is well controlled and adjusted based on the estimation model in real time.

Multi-Task Learning

Dynamic GPU Energy Optimization for Machine Learning Training Workloads

1 code implementation5 Jan 2022 Farui Wang, Weizhe Zhang, Shichao Lai, Meng Hao, Zheng Wang

This paper presents GPOEO, an online GPU energy optimization framework for machine learning training workloads.

BIG-bench Machine Learning Scheduling

TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs

2 code implementations3 Dec 2021 yuke wang, Boyuan Feng, Zheng Wang, Guyue Huang, Yufei Ding

Recently, graph neural networks (GNNs), as the backbone of graph-based machine learning, demonstrate great success in various domains (e. g., e-commerce).

Translation

Self-Adaptable Point Processes with Nonparametric Time Decays

no code implementations NeurIPS 2021 Zhimeng Pan, Zheng Wang, Jeff M. Phillips, Shandian Zhe

Specifically, we use an embedding to represent each event type and model the event influence as an unknown function of the embeddings and time span.

Point Processes

Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding

no code implementations CVPR 2022 Xianzheng Ma, Zhixiang Wang, Yacheng Zhan, Yinqiang Zheng, Zheng Wang, Dengxin Dai, Chia-Wen Lin

Unlike previous methods that mainly focus on closing the domain gap caused by fog -- defogging the foggy images or fogging the clear images, we propose to alleviate the domain gap by considering fog influence and style variation simultaneously.

Disentanglement Domain Adaptation +1

Stable and Compact Face Recognition via Unlabeled Data Driven Sparse Representation-Based Classification

no code implementations4 Nov 2021 XiaoHui Yang, Zheng Wang, Huan Wu, Licheng Jiao, Yiming Xu, Haolin Chen

The proposed model aims to mine the hidden semantic information and intrinsic structure information of all available data, which is suitable for few labeled samples and proportion imbalance between labeled samples and unlabeled samples problems in frontal face recognition.

Face Recognition Sparse Representation-based Classification

Optimizing Sparse Matrix Multiplications for Graph Neural Networks

no code implementations30 Oct 2021 Shenghao Qiu, You Liang, Zheng Wang

Our model is first trained offline using training matrix samples, and the trained model can be applied to any input matrix and GNN kernels with SpMM computation.

Nonparametric Sparse Tensor Factorization with Hierarchical Gamma Processes

no code implementations19 Oct 2021 Conor Tillinghast, Zheng Wang, Shandian Zhe

Compared with the existent works, our model not only leverages the structural information underlying the observed entry indices, but also provides extra interpretability and flexibility -- it can simultaneously estimate a set of location factors about the intrinsic properties of the tensor nodes, and another set of sociability factors reflecting their extrovert activity in interacting with others; users are free to choose a trade-off between the two types of factors.

Meta-Learning with Adjoint Methods

no code implementations16 Oct 2021 Shibo Li, Zheng Wang, Akil Narayan, Robert Kirby, Shandian Zhe

the initialization, we only need to run the standard ODE solver twice -- one is forward in time that evolves a long trajectory of gradient flow for the sampled task; the other is backward and solves the adjoint ODE.

Meta-Learning

GANet: Glyph-Attention Network for Few-Shot Font Generation

no code implementations29 Sep 2021 Mingtao Guo, Wei Xiong, Zheng Wang, Yong Tang, Ting Wu

Font generation is a valuable but challenging task, it is time consuming and costly to design font libraries which cover all glyphs with various styles.

Decoder Font Generation

DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation

1 code implementation27 Jul 2021 Yilin Wen, Xiangyu Li, Hao Pan, Lei Yang, Zheng Wang, Taku Komura, Wenping Wang

Scalable 6D pose estimation for rigid objects from RGB images aims at handling multiple objects and generalizing to novel objects.

6D Pose Estimation Metric Learning +2

Spectrum Gaussian Processes Based On Tunable Basis Functions

no code implementations14 Jul 2021 Wenqi Fang, Guanlin Wu, Jingjing Li, Zheng Wang, Jiang Cao, Yang Ping

Spectral approximation and variational inducing learning for the Gaussian process are two popular methods to reduce computational complexity.

Gaussian Processes

Reinforcement Learning-based Dialogue Guided Event Extraction to Exploit Argument Relations

1 code implementation23 Jun 2021 Qian Li, Hao Peng, JianXin Li, Jia Wu, Yuanxing Ning, Lihong Wang, Philip S. Yu, Zheng Wang

Our approach leverages knowledge of the already extracted arguments of the same sentence to determine the role of arguments that would be difficult to decide individually.

Event Extraction Incremental Learning +3

Effect of Adaptive and Fixed Shared Steering Control on Distracted Driver Behavior

no code implementations7 Jun 2021 Zheng Wang, Satoshi Suga, Edric John Cruz Nacpil, Bo Yang, Kimihiko Nakano

Evaluation results indicated that, for both attentive and distracted drivers, haptic guidance with adaptive authority yielded lower driver workload and reduced lane departure risk than manual driving and fixed authority.

Steering Control

CNTLS: A Benchmark Dataset for Abstractive or Extractive Chinese Timeline Summarization

no code implementations29 May 2021 Qianren Mao, Jiazheng Wang, Zheng Wang, Xi Li, Bo Li, JianXin Li

We meticulously analyze the corpus using well-known metrics, focusing on the style of the summaries and the complexity of the summarization task.

Information Retrieval Retrieval +3

Degrade is Upgrade: Learning Degradation for Low-light Image Enhancement

1 code implementation19 Mar 2021 Kui Jiang, Zhongyuan Wang, Zheng Wang, Chen Chen, Peng Yi, Tao Lu, Chia-Wen Lin

Different from existing methods tending to accomplish the relighting task directly by ignoring the fidelity and naturalness recovery, we investigate the intrinsic degradation and relight the low-light image while refining the details and color in two steps.

Low-Light Image Enhancement

Model Rectification via Unknown Unknowns Extraction from Deployment Samples

no code implementations8 Feb 2021 Bruno Abrahao, Zheng Wang, Haider Ahmed, Yuchen Zhu

Model deficiency that results from incomplete training data is a form of structural blindness that leads to costly errors, oftentimes with high confidence.

Active Learning

Image Inpainting Guided by Coherence Priors of Semantics and Textures

no code implementations CVPR 2021 Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

In this paper, we introduce coherence priors between the semantics and textures which make it possible to concentrate on completing separate textures in a semantic-wise manner.

Image Inpainting Semantic Segmentation

Re-identification = Retrieval + Verification: Back to Essence and Forward with a New Metric

1 code implementation23 Nov 2020 Zheng Wang, Xin Yuan, Toshihiko Yamasaki, Yutian Lin, Xin Xu, Wenjun Zeng

In essence, current re-ID overemphasizes the importance of retrieval but underemphasizes that of verification, \textit{i. e.}, all returned images are considered as the target.

Image Retrieval Retrieval

Intention-Based Lane Changing and Lane Keeping Haptic Guidance Steering System

no code implementations15 Nov 2020 Zhanhong Yan, Kaiming Yang, Zheng Wang, Bo Yang, Tsutomu Kaizuka, Kimihiko Nakano

By exerting continuous torque on the steering wheel, both the driver and support system can share lateral control of the vehicle.

Steering Control

Towards Context-Aware Code Comment Generation

no code implementations Findings of the Association for Computational Linguistics 2020 Xiaohan Yu, Quzhe Huang, Zheng Wang, Yansong Feng, Dongyan Zhao

Code comments are vital for software maintenance and comprehension, but many software projects suffer from the lack of meaningful and up-to-date comments in practice.

Code Comment Generation Comment Generation +1

Learning Personalized Discretionary Lane-Change Initiation for Fully Autonomous Driving Based on Reinforcement Learning

no code implementations29 Oct 2020 Zhuoxi Liu, Zheng Wang, Bo Yang, Kimihiko Nakano

In this article, the authors present a novel method to learn the personalized tactic of discretionary lane-change initiation for fully autonomous vehicles through human-computer interactions.

Autonomous Driving Reinforcement Learning (RL)

Universal Weighting Metric Learning for Cross-Modal Matching

1 code implementation CVPR 2020 Jiwei Wei, Xing Xu, Yang Yang, Yanli Ji, Zheng Wang, Heng Tao Shen

Furthermore, we introduce a new polynomial loss under the universal weighting framework, which defines a weight function for the positive and negative informative pairs respectively.

Image-text matching Metric Learning +1

Uncertainty-aware Attention Graph Neural Network for Defending Adversarial Attacks

no code implementations22 Sep 2020 Boyuan Feng, Yuke Wang, Zheng Wang, Yufei Ding

With the increasing popularity of graph-based learning, graph neural networks (GNNs) emerge as the essential tool for gaining insights from graphs.

Graph Neural Network

DTDN: Dual-task De-raining Network

1 code implementation21 Aug 2020 Zheng Wang, Jianwu Li, Ge Song

Removing rain streaks from rainy images is necessary for many tasks in computer vision, such as object detection and recognition.

Generative Adversarial Network object-detection +2

Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer

no code implementations12 Aug 2020 Yuting Liu, Zheng Wang, Miaojing Shi, Shin'ichi Satoh, Qijun Zhao, Hongyu Yang

We formulate the mutual transformations between the outputs of regression- and detection-based models as two scene-agnostic transformers which enable knowledge distillation between the two models.

Crowd Counting Knowledge Distillation +3

Lifelong Property Price Prediction: A Case Study for the Toronto Real Estate Market

1 code implementation12 Aug 2020 Hao Peng, Jian-Xin Li, Zheng Wang, Renyu Yang, Mingzhe Liu, Mingming Zhang, Philip S. Yu, Lifang He

As a departure from prior work, Luce organizes the house data in a heterogeneous information network (HIN) where graph nodes are house entities and attributes that are important for house price valuation.

3D Spectrum Mapping Based on ROI-Driven UAV Deployment

no code implementations6 Aug 2020 Qihui Wu, Feng Shen, Zheng Wang, Guoru Ding

Given the explosive growth of Internet of Things (IoT) devices ranging from the two-dimensional (2D) ground to the three-dimensional (3D) space, it is a necessity to establish a 3D spectrum map to comprehensively present and effectively manage the 3D spatial spectrum resources in smart city infrastructures.

Exploring Image Enhancement for Salient Object Detection in Low Light Images

no code implementations31 Jul 2020 Xin Xu, Shiqin Wang, Zheng Wang, Xiaolong Zhang, Ruimin Hu

Low light images captured in a non-uniform illumination environment usually are degraded with the scene depth and the corresponding environment lights.

Image Enhancement Object +3

Streaming Probabilistic Deep Tensor Factorization

1 code implementation14 Jul 2020 Shikai Fang, Zheng Wang, Zhimeng Pan, Ji Liu, Shandian Zhe

Our algorithm provides responsive incremental updates for the posterior of the latent factors and NN weights upon receiving new tensor entries, and meanwhile select and inhibit redundant/useless weights.

Network Embedding with Completely-imbalanced Labels

2 code implementations IEEE Transactions on Knowledge and Data Engineering 2020 Zheng Wang, Xiaojun Ye, Chaokun Wang, Jian Cui, Philip S. Yu

Network embedding, aiming to project a network into a low-dimensional space, is increasingly becoming a focus of network research.

Network Embedding

Road Network Metric Learning for Estimated Time of Arrival

no code implementations24 Jun 2020 Yiwen Sun, Kun fu, Zheng Wang, Chang-Shui Zhang, Jieping Ye

To address the data sparsity problem, we propose the Road Network Metric Learning framework for ETA (RNML-ETA).

Metric Learning

Sparse Gaussian Process Based On Hat Basis Functions

no code implementations15 Jun 2020 Wenqi Fang, Huiyun Li, Hui Huang, Shaobo Dang, Zhejun Huang, Zheng Wang

Based on hat basis functions, we propose a new sparse Gaussian process method to solve the unconstrained regression problem.

regression

Multi-Fidelity High-Order Gaussian Processes for Physical Simulation

1 code implementation8 Jun 2020 Zheng Wang, Wei Xing, Robert Kirby, Shandian Zhe

To address these issues, we propose Multi-Fidelity High-Order Gaussian Process (MFHoGP) that can capture complex correlations both between the outputs and between the fidelities to enhance solution estimation, and scale to large numbers of outputs.

Gaussian Processes Vocal Bursts Intensity Prediction

Physics Informed Deep Kernel Learning

no code implementations8 Jun 2020 Zheng Wang, Wei Xing, Robert Kirby, Shandian Zhe

Deep kernel learning is a promising combination of deep neural networks and nonparametric function learning.

Gaussian Processes Uncertainty Quantification

FMA-ETA: Estimating Travel Time Entirely Based on FFN With Attention

no code implementations7 Jun 2020 Yiwen Sun, Yulu Wang, Kun fu, Zheng Wang, Ziang Yan, Chang-Shui Zhang, Jieping Ye

Estimated time of arrival (ETA) is one of the most important services in intelligent transportation systems and becomes a challenging spatial-temporal (ST) data mining task in recent years.

Fusion Recurrent Neural Network

no code implementations7 Jun 2020 Yiwen Sun, Yulu Wang, Kun fu, Zheng Wang, Chang-Shui Zhang, Jieping Ye

Furthermore, in order to evaluate Fusion RNN's sequence feature extraction capability, we choose a representative data mining task for sequence data, estimated time of arrival (ETA) and present a novel model based on Fusion RNN.

Neighborhood Matching Network for Entity Alignment

1 code implementation ACL 2020 Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, Dongyan Zhao

This paper presents Neighborhood Matching Network (NMN), a novel entity alignment framework for tackling the structural heterogeneity challenge.

Entity Alignment Graph Sampling +1

Constructing Geographic and Long-term Temporal Graph for Traffic Forecasting

no code implementations23 Apr 2020 Yiwen Sun, Yulu Wang, Kun fu, Zheng Wang, Chang-Shui Zhang, Jieping Ye

Recently, deep learning based methods have achieved promising results by adopting graph convolutional network (GCN) to extract the spatial correlations and recurrent neural network (RNN) to capture the temporal dependencies.

Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures: A Machine Learning Based Approach

1 code implementation5 Mar 2020 Peng Zhang, Jianbin Fang, Canqun Yang, Chun Huang, Tao Tang, Zheng Wang

This article presents an automatic approach to quickly derive a good solution for hardware resource partition and task granularity for task-based parallel applications on heterogeneous many-core architectures.

BIG-bench Machine Learning

Robotic Cane as a Soft SuperLimb for Elderly Sit-to-Stand Assistance

no code implementations29 Feb 2020 Xia Wu, Haiyuan Liu, Ziqi Liu, Mingdong Chen, Fang Wan, Chenglong Fu, Harry Asada, Zheng Wang, Chaoyang Song

Many researchers have identified robotics as a potential solution to the aging population faced by many developed and developing countries.

Curriculum Audiovisual Learning

no code implementations26 Jan 2020 Di Hu, Zheng Wang, Haoyi Xiong, Dong Wang, Feiping Nie, Dejing Dou

Associating sound and its producer in complex audiovisual scene is a challenging task, especially when we are lack of annotated training data.

Clustering cross-modal alignment

Learning Canonical Shape Space for Category-Level 6D Object Pose and Size Estimation

no code implementations CVPR 2020 Dengsheng Chen, Jun Li, Zheng Wang, Kai Xu

To tackle intra-class shape variations, we learn canonical shape space (CASS), a unified representation for a large variety of instances of a certain object category.

3D Shape Representation Generating 3D Point Clouds +1

Characterizing Scalability of Sparse Matrix-Vector Multiplications on Phytium FT-2000+ Many-cores

no code implementations20 Nov 2019 Donglin Chen, Jianbin Fang, Chuanfu Xu, Shizhao Chen, Zheng Wang

Understanding the scalability of parallel programs is crucial for software optimization and hardware architecture design.

Optimizing Deep Learning Inference on Embedded Systems Through Adaptive Model Selection

no code implementations9 Nov 2019 Vicent Sanz Marco, Ben Taylor, Zheng Wang, Yehia Elkhatib

For image classification, we achieve a 1. 8x reduction in inference time with a 7. 52% improvement in accuracy, over the most-capable single DNN model.

Deep Learning Image Classification +3

Conditional Expectation Propagation

no code implementations27 Oct 2019 Zheng Wang, Shandian Zhe

Expectation propagation (EP) is a powerful approximate inference algorithm.

Computational Efficiency

BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning

1 code implementation NeurIPS 2020 Xinyue Chen, Zijian Zhou, Zheng Wang, Che Wang, Yanqiu Wu, Keith Ross

There has recently been a surge in research in batch Deep Reinforcement Learning (DRL), which aims for learning a high-performing policy from a given dataset without additional interactions with the environment.

Deep Reinforcement Learning Imitation Learning +4

Jointly Learning Entity and Relation Representations for Entity Alignment

1 code implementation IJCNLP 2019 Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, Dongyan Zhao

Entity alignment is a viable means for integrating heterogeneous knowledge among different knowledge graphs (KGs).

Ranked #18 on Entity Alignment on DBP15k zh-en (using extra training data)

Entity Alignment Entity Embeddings +2

Relation-Aware Entity Alignment for Heterogeneous Knowledge Graphs

1 code implementation22 Aug 2019 Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, Rui Yan, Dongyan Zhao

Entity alignment is the task of linking entities with the same real-world identity from different knowledge graphs (KGs), which has been recently dominated by embedding-based methods.

Ranked #20 on Entity Alignment on DBP15k zh-en (using extra training data)

Entity Alignment Entity Embeddings +2

Zero-Shot Feature Selection via Transferring Supervised Knowledge

no code implementations9 Aug 2019 Zheng Wang, Qiao Wang, Tingzhang Zhao, Xiaojun Ye

Feature selection, an effective technique for dimensionality reduction, plays an important role in many machine learning systems.

Dimensionality Reduction feature selection +1

Robust Linear Discriminant Analysis Using Ratio Minimization of L1,2-Norms

no code implementations29 Jun 2019 Feiping Nie, Hua Wang, Zheng Wang, Heng Huang

In this paper, we propose a novel robust linear discriminant analysis method based on the L1, 2-norm ratio minimization.

Beyond Intra-modality: A Survey of Heterogeneous Person Re-identification

no code implementations24 May 2019 Zheng Wang, Zhixiang Wang, Yinqiang Zheng, Yang Wu, Wen-Jun Zeng, Shin'ichi Satoh

An efficient and effective person re-identification (ReID) system relieves the users from painful and boring video watching and accelerates the process of video analysis.

Person Re-Identification

Less Memory, Faster Speed: Refining Self-Attention Module for Image Reconstruction

no code implementations20 May 2019 Zheng Wang, Jianwu Li, Ge Song, Tieling Li

Self-attention (SA) mechanisms can capture effectively global dependencies in deep neural networks, and have been applied to natural language processing and image processing successfully.

Image Reconstruction

DotSCN: Group Re-identification via Domain-Transferred Single and Couple Representation Learning

no code implementations13 May 2019 Ziling Huang, Zheng Wang, Chung-Chi Tsai, Shin'ichi Satoh, Chia-Wen Lin

To gain the superiority of deep learning models, we treat a group as multiple persons and transfer the domain of a labeled ReID dataset to a G-ReID target dataset style to learn single representations.

Person Re-Identification Representation Learning

Ensemble Super-Resolution with A Reference Dataset

1 code implementation12 May 2019 Junjun Jiang, Yi Yu, Zheng Wang, Suhua Tang, Ruimin Hu, Jiayi Ma

In this paper, we present a simple but effective single image SR method based on ensemble learning, which can produce a better performance than that could be obtained from any of SR methods to be ensembled (or called component super-resolvers).

Ensemble Learning Image Super-Resolution

Illumination-Adaptive Person Re-identification

no code implementations11 May 2019 Zelong Zeng, Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Yung-Yu Chuang, Shin'ichi Satoh

To demonstrate the illumination issue and to evaluate our model, we construct two large-scale simulated datasets with a wide range of illumination variations.

Disentanglement Person Re-Identification +2

Robust Semantic Segmentation By Dense Fusion Network On Blurred VHR Remote Sensing Images

no code implementations7 Mar 2019 Yi Peng, Shihao Sun, Zheng Wang, Yining Pan, Ruirui Li

Robust semantic segmentation of VHR remote sensing images from UAV sensors is critical for earth observation, land use, land cover or mapping applications.

Decoder Earth Observation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.