Search Results for author: Wei Huang

Found 114 papers, 43 papers with code

Exploring Label Hierarchy in a Generative Way for Hierarchical Text Classification

no code implementations COLING 2022 Wei Huang, Chen Liu, Bo Xiao, Yihua Zhao, Zhaoming Pan, Zhimin Zhang, Xinyun Yang, Guiquan Liu

Hierarchical Text Classification (HTC), which aims to predict text labels organized in hierarchical space, is a significant task lacking in investigation in natural language processing.

text-classification Text Classification

Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning

no code implementations16 Apr 2024 Zhengyang Liang, Meiyu Liang, Wei Huang, Yawen Li, Zhe Xue

Our methodology streamlines pre-trained multimodal large models using only their output features and original image-level information, requiring minimal computational resources.

Cross-Modal Retrieval Representation Learning

Privacy-Preserving End-to-End Spoken Language Understanding

no code implementations22 Mar 2024 Yinggui Wang, Wei Huang, Le Yang

Thus, the SLU system needs to ensure that a potential malicious attacker cannot deduce the sensitive attributes of the users, while it should avoid greatly compromising the SLU accuracy.

Privacy Preserving speech-recognition +2

DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

no code implementations2 Mar 2024 Junwen Xiong, Peng Zhang, Tao You, Chuanyue Li, Wei Huang, Yufei zha

Audio-visual saliency prediction can draw support from diverse modality complements, but further performance enhancement is still challenged by customized architectures as well as task-specific loss functions.

Denoising Saliency Prediction

Global and Local Prompts Cooperation via Optimal Transport for Federated Learning

1 code implementation29 Feb 2024 Hongxia Li, Wei Huang, Jingya Wang, Ye Shi

Specifically, for each client, we learn a global prompt to extract consensus knowledge among clients, and a local prompt to capture client-specific category characteristics.

Federated Learning

A Unified Causal View of Instruction Tuning

no code implementations9 Feb 2024 Lu Chen, Wei Huang, Ruqing Zhang, Wei Chen, Jiafeng Guo, Xueqi Cheng

The key idea is to learn task-required causal factors and only use those to make predictions for a given task.

BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

1 code implementation6 Feb 2024 Wei Huang, Yangdong Liu, Haotong Qin, Ying Li, Shiming Zhang, Xianglong Liu, Michele Magno, Xiaojuan Qi

Pretrained large language models (LLMs) exhibit exceptional general language processing capabilities but come with significant demands on memory and computational resources.

Binarization Quantization

DaFoEs: Mixing Datasets towards the generalization of vision-state deep-learning Force Estimation in Minimally Invasive Robotic Surgery

1 code implementation17 Jan 2024 Mikel De Iturrate Reyzabal, Mingcong Chen, Wei Huang, Sebastien Ourselin, Hongbin Liu

In this paper, we present a new vision-haptic dataset (DaFoEs) with variable soft environments for the training of deep neural models.

Graph Relation Distillation for Efficient Biomedical Instance Segmentation

2 code implementations12 Jan 2024 Xiaoyu Liu, Yueyi Zhang, Zhiwei Xiong, Wei Huang, Bo Hu, Xiaoyan Sun, Feng Wu

IGD constructs a graph representing instance features and relations, transferring these two types of knowledge by enforcing instance graph consistency.

Instance Segmentation Knowledge Distillation +2

Maximum Likelihood CFO Estimation for High-Mobility OFDM Systems: A Chinese Remainder Theorem Based Method

no code implementations27 Dec 2023 Wei Huang, Jun Wang, Xiaoping Li, Qihang Peng

Orthogonal frequency division multiplexing (OFDM) is a widely adopted wireless communication technique but is sensitive to the carrier frequency offset (CFO).

Fed-CO2: Cooperation of Online and Offline Models for Severe Data Heterogeneity in Federated Learning

1 code implementation NeurIPS 2023 Zhongyi Cai, Ye Shi, Wei Huang, Jingya Wang

Specifically, the online model learns general knowledge that is shared among all clients, while the offline model is trained locally to learn the specialized knowledge of each individual client.

Domain Generalization General Knowledge +2

Earthfarseer: Versatile Spatio-Temporal Dynamical Systems Modeling in One Model

no code implementations13 Dec 2023 Hao Wu, Shilong Wang, Yuxuan Liang, Zhengyang Zhou, Wei Huang, Wei Xiong, Kun Wang

Efficiently modeling spatio-temporal (ST) physical processes and observations presents a challenging problem for the deep learning community.

A Simple Framework to Enhance the Adversarial Robustness of Deep Learning-based Intrusion Detection System

no code implementations6 Dec 2023 Xinwei Yuan, Shu Han, Wei Huang, Hongliang Ye, Xianglong Kong, Fan Zhang

In this paper, we propose a novel IDS architecture that can enhance the robustness of IDS against adversarial attacks by combining conventional machine learning (ML) models and Deep Learning models.

Adversarial Attack Adversarial Robustness +1

Interpretable Modeling of Single-cell perturbation Responses to Novel Drugs Using Cycle Consistence Learning

no code implementations17 Nov 2023 Wei Huang, Aichun Zhu, Hui Liu

Phenotype-based screening has attracted much attention for identifying cell-active compounds.

Generator Identification for Linear SDEs with Additive and Multiplicative Noise

no code implementations NeurIPS 2023 Yuanyuan Wang, Xi Geng, Wei Huang, Biwei Huang, Mingming Gong

In this paper, we present conditions for identifying the generator of a linear stochastic differential equation (SDE) from the distribution of its solution process with a given fixed initial state.

Causal Inference

Experimental Results of Underwater Sound Speed Profile Inversion by Few-shot Multi-task Learning

no code implementations18 Oct 2023 Wei Huang, Fan Gao, Junting Wang, Hao Zhang

Underwater Sound Speed Profile (SSP) distribution has great influence on the propagation mode of acoustic signal, thus the fast and accurate estimation of SSP is of great importance in building underwater observation systems.

Compressive Sensing Few-Shot Learning +1

Fast Ray-Tracing-Based Precise Underwater Acoustic Localization without Prior Acknowledgment of Target Depth

no code implementations12 Oct 2023 Wei Huang, Hao Zhang, Kaitao Meng, Fan Gao, Wenzhou Sun, Jianxu Shu, Tianhe Xu, Deshi Li

To tackle this issue, we propose an iterative ray tracing 3D underwater localization (IRTUL) method for stratification compensation.

Underwater Sound Speed Profile Construction: A Review

no code implementations12 Oct 2023 Wei Huang, Jixuan Zhou, Fan Gao, Jiajun Lu, Sijia Li, Pengfei Wu, Junting Wang, Hao Zhang, Tianhe Xu

The proposal of SSP inversion method greatly improves the convenience and real--time performance, but the accuracy is not as good as the direct measurement method.

Compressive Sensing

Rethinking Large-scale Pre-ranking System: Entire-chain Cross-domain Models

1 code implementation12 Oct 2023 Jinbo Song, Ruoran Huang, Xinyang Wang, Wei Huang, Qian Yu, Mingming Chen, Yafei Yao, Chaosheng Fan, Changping Peng, Zhangang Lin, Jinghe Hu, Jingping Shao

Industrial systems such as recommender systems and online advertising, have been widely equipped with multi-stage architectures, which are divided into several cascaded modules, including matching, pre-ranking, ranking and re-ranking.

Recommendation Systems Re-Ranking +1

Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning

1 code implementation6 Oct 2023 Yinda Chen, Wei Huang, Shenglong Zhou, Qi Chen, Zhiwei Xiong

By extracting semantic information from unlabeled data, self-supervised methods can improve the performance of downstream tasks, among which the mask image model (MIM) has been widely used due to its simplicity and effectiveness in recovering original information from masked images.

Multi-agent Reinforcement Learning reinforcement-learning +2

UniST: Towards Unifying Saliency Transformer for Video Saliency Prediction and Detection

no code implementations15 Sep 2023 Junwen Xiong, Peng Zhang, Chuanyue Li, Wei Huang, Yufei zha, Tao You

While many approaches have crafted task-specific training paradigms for either video saliency prediction or video salient object detection tasks, few attention has been devoted to devising a generalized saliency modeling framework that seamlessly bridges both these distinct tasks.

object-detection Saliency Prediction +3

DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial Attention Detection

no code implementations7 Sep 2023 Cunhang Fan, Hongyu Zhang, Wei Huang, Jun Xue, JianHua Tao, Jiangyan Yi, Zhao Lv, Xiaopei Wu

Specifically, to effectively represent the non-Euclidean properties of EEG signals, dynamical graph convolutional networks are applied to represent the graph structure of EEG signals, which can also extract crucial features related to auditory spatial attention in EEG signals.

EEG

OHQ: On-chip Hardware-aware Quantization

no code implementations5 Sep 2023 Wei Huang, Haotong Qin, Yangdong Liu, Jingzhuo Liang, Yulun Zhang, Ying Li, Xianglong Liu

Mixed-precision quantization leverages multiple bit-width architectures to unleash the accuracy and efficiency potential of quantized models.

Quantization

Domain Adaptive Synapse Detection with Weak Point Annotations

no code implementations31 Aug 2023 Qi Chen, Wei Huang, Yueyi Zhang, Zhiwei Xiong

In the second stage, we improve model generalizability on target data by regenerating square masks to get high-quality pseudo labels.

Segmentation

DARWIN Series: Domain Specific Large Language Models for Natural Science

2 code implementations25 Aug 2023 Tong Xie, Yuwei Wan, Wei Huang, Zhenyu Yin, Yixuan Liu, Shaozhou Wang, Qingyuan Linghu, Chunyu Kit, Clara Grazian, Wenjie Zhang, Imran Razzak, Bram Hoex

To add new capabilities in natural science, enabling the acceleration and enrichment of automation of the discovery process, we present DARWIN, a series of tailored LLMs for natural science, mainly in physics, chemistry, and material science.

Knowledge Graphs

Inducing Causal Structure for Abstractive Text Summarization

1 code implementation24 Aug 2023 Lu Chen, Ruqing Zhang, Wei Huang, Wei Chen, Jiafeng Guo, Xueqi Cheng

The key idea is to reformulate the Variational Auto-encoder (VAE) to fit the joint distribution of the document and summary variables from the training corpus.

Abstractive Text Summarization

Minimalist Traffic Prediction: Linear Layer Is All You Need

no code implementations20 Aug 2023 Wenying Duan, HONG RAO, Wei Huang, Xiaoxi He

Traffic prediction is essential for the progression of Intelligent Transportation Systems (ITS) and the vision of smart cities.

Time Series Traffic Prediction

Induction Network: Audio-Visual Modality Gap-Bridging for Self-Supervised Sound Source Localization

1 code implementation9 Aug 2023 Tianyu Liu, Peng Zhang, Wei Huang, Yufei zha, Tao You, Yanning Zhang

By decoupling the gradients of visual and audio modalities, the discriminative visual representations of sound sources can be learned with the designed Induction Vector in a bootstrap manner, which also enables the audio modality to be aligned with the visual modality consistently.

Contrastive Learning

Multi-scale Alternated Attention Transformer for Generalized Stereo Matching

no code implementations6 Aug 2023 Wei Miao, Hong Zhao, Tongjia Chen, Wei Huang, Changyan Xiao

Recent stereo matching networks achieves dramatic performance by introducing epipolar line constraint to limit the matching range of dual-view.

Stereo Matching

What, Indeed, is an Achievable Provable Guarantee for Learning-Enabled Safety Critical Systems

no code implementations20 Jul 2023 Saddek Bensalem, Chih-Hong Cheng, Wei Huang, Xiaowei Huang, Changshun Wu, Xingyu Zhao

Machine learning has made remarkable advancements, but confidently utilising learning-enabled components in safety-critical domains still poses challenges.

Neural Causal Graph Collaborative Filtering

1 code implementation10 Jul 2023 Xiangmeng Wang, Qian Li, Dianer Yu, Wei Huang, Guandong Xu

In this work, we propose to integrate causal modeling with the learning process of GCN-based GCF models, leveraging causality-aware graph embeddings to capture complex causal relations in recommendations.

Collaborative Filtering Graph Learning +3

FTFDNet: Learning to Detect Talking Face Video Manipulation with Tri-Modality Interaction

no code implementations8 Jul 2023 Ganglai Wang, Peng Zhang, Junwen Xiong, Feihan Yang, Wei Huang, Yufei zha

DeepFake based digital facial forgery is threatening public media security, especially when lip manipulation has been used in talking face generation, and the difficulty of fake video detection is further improved.

Face Detection Face Swapping +2

Graph Neural Networks Provably Benefit from Structural Information: A Feature Learning Perspective

no code implementations24 Jun 2023 Wei Huang, Yuan Cao, Haonan Wang, Xin Cao, Taiji Suzuki

Graph neural networks (GNNs) have pioneered advancements in graph representation learning, exhibiting superior feature learning and performance over multilayer perceptrons (MLPs) when handling graph inputs.

Graph Representation Learning Learning Theory +1

Generative Text-Guided 3D Vision-Language Pretraining for Unified Medical Image Segmentation

no code implementations7 Jun 2023 Yinda Chen, Che Liu, Wei Huang, Sibo Cheng, Rossella Arcucci, Zhiwei Xiong

To address these challenges, we present Generative Text-Guided 3D Vision-Language Pretraining for Unified Medical Image Segmentation (GTGM), a framework that extends of VLP to 3D medical images without relying on paired textual descriptions.

Computed Tomography (CT) Contrastive Learning +4

Transforming Geospatial Ontologies by Homomorphisms

no code implementations22 May 2023 Xiuzhan Guo, Wei Huang, Min Luo, Priya Rangarajan

Geospatial ontology merging systems, natural partial orders on the systems, and geospatial ontology merging closures in the systems are then transformed under geospatial ontology system homomorphisms that are given by quotients and embeddings.

Clustering

A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation

no code implementations19 May 2023 Xiaowei Huang, Wenjie Ruan, Wei Huang, Gaojie Jin, Yi Dong, Changshun Wu, Saddek Bensalem, Ronghui Mu, Yi Qi, Xingyu Zhao, Kaiwen Cai, Yanghao Zhang, Sihao Wu, Peipei Xu, Dengyu Wu, Andre Freitas, Mustafa A. Mustafa

Large Language Models (LLMs) have exploded a new heatwave of AI for their ability to engage end-users in human-level conversations with detailed and articulate answers across many knowledge domains.

Understanding and Improving Feature Learning for Out-of-Distribution Generalization

1 code implementation NeurIPS 2023 Yongqiang Chen, Wei Huang, Kaiwen Zhou, Yatao Bian, Bo Han, James Cheng

Moreover, when fed the ERM learned features to the OOD objectives, the invariant feature learning quality significantly affects the final OOD performance, as OOD objectives rarely learn new features.

Out-of-Distribution Generalization

Large Language Models as Master Key: Unlocking the Secrets of Materials Science with GPT

no code implementations5 Apr 2023 Tong Xie, Yuwei Wan, Wei Huang, Yufei Zhou, Yixuan Liu, Qingyuan Linghu, Shaozhou Wang, Chunyu Kit, Clara Grazian, Wenjie Zhang, Bram Hoex

The amount of data has growing significance in exploring cutting-edge materials and a number of datasets have been generated either by hand or automated approaches.

feature selection Language Modelling

Communication under Mixed Gaussian-Impulsive Channel: An End-to-End Framework

no code implementations19 Jan 2023 Chengjie Zhao, Jun Wang, Wei Huang, Xiaonan Chen, Tianfu Qi

Under MGIN channel, classical communication signal schemes and corresponding detection methods usually can not achieve desirable performance as they are optimized with respect to WGN.

Style Projected Clustering for Domain Generalized Semantic Segmentation

no code implementations CVPR 2023 Wei Huang, Chang Chen, Yong Li, Jiacheng Li, Cheng Li, Fenglong Song, Youliang Yan, Zhiwei Xiong

In contrast to existing methods, we instead utilize the difference between images to build a better representation space, where the distinct style features are extracted and stored as the bases of representation.

Clustering Semantic Segmentation

Learning Cross-Representation Affinity Consistency for Sparsely Supervised Biomedical Instance Segmentation

1 code implementation ICCV 2023 Xiaoyu Liu, Wei Huang, Zhiwei Xiong, Shenglong Zhou, Yueyi Zhang, Xuejin Chen, Zheng-Jun Zha, Feng Wu

Sparse instance-level supervision has recently been explored to address insufficient annotation in biomedical instance segmentation, which is easier to annotate crowded instances and better preserves instance completeness for 3D volumetric datasets compared to common semi-supervision. In this paper, we propose a sparsely supervised biomedical instance segmentation framework via cross-representation affinity consistency regularization.

Instance Segmentation Pseudo Label +1

A Soma Segmentation Benchmark in Full Adult Fly Brain

1 code implementation CVPR 2023 Xiaoyu Liu, Bo Hu, Mingxing Li, Wei Huang, Yueyi Zhang, Zhiwei Xiong

Finally, we provide quantitative and qualitative benchmark comparisons on the testset to validate the superiority of the proposed method, as well as preliminary statistics of the reconstructed somas in the full adult fly brain from the biological perspective.

TCFimt: Temporal Counterfactual Forecasting from Individual Multiple Treatment Perspective

no code implementations17 Dec 2022 Pengfei Xi, Guifeng Wang, Zhipeng Hu, Yu Xiong, Mingming Gong, Wei Huang, Runze Wu, Yu Ding, Tangjie Lv, Changjie Fan, Xiangnan Feng

TCFimt constructs adversarial tasks in a seq2seq framework to alleviate selection and time-varying bias and designs a contrastive learning-based block to decouple a mixed treatment effect into separated main treatment effects and causal interactions which further improves estimation accuracy.

Contrastive Learning counterfactual +3

Single-Pass Contrastive Learning Can Work for Both Homophilic and Heterophilic Graph

1 code implementation20 Nov 2022 Haonan Wang, Jieyu Zhang, Qi Zhu, Wei Huang, Kenji Kawaguchi, Xiaokui Xiao

To answer this question, we theoretically study the concentration property of features obtained by neighborhood aggregation on homophilic and heterophilic graphs, introduce the single-pass augmentation-free graph contrastive learning loss based on the property, and provide performance guarantees for the minimizer of the loss on downstream tasks.

Contrastive Learning

ViT-LSLA: Vision Transformer with Light Self-Limited-Attention

no code implementations31 Oct 2022 Zhenzhe Hechen, Wei Huang, Yixin Zhao

Consequently, this paper presents a light self-limited-attention (LSLA) consisting of a light self-attention mechanism (LSA) to save the computation cost and the number of parameters, and a self-limited-attention mechanism (SLA) to improve the performance.

Position

Variance reduced Shapley value estimation for trustworthy data valuation

no code implementations30 Oct 2022 Mengmeng Wu, Ruoxi Jia, Changle lin, Wei Huang, Xiangyu Chang

Data valuation, especially quantifying data value in algorithmic prediction and decision-making, is a fundamental problem in data trading scenarios.

Data Valuation Decision Making

Anisotropic multiresolution analyses for deepfake detection

no code implementations26 Oct 2022 Wei Huang, Michelangelo Valsecchi, Michael Multerer

We employ the fully separable wavelet transform and multiwavelets to obtain the anisotropic features to feed to standard CNN classifiers.

Audio Synthesis DeepFake Detection +1

Joint Microstrip Selection and Beamforming Design for MmWave Systems with Dynamic Metasurface Antennas

no code implementations22 Oct 2022 Wei Huang, Haiyang Zhang, Nir Shlezinger, Yonina C. Eldar

Dynamic metasurface antennas (DMAs) provide a new paradigm to realize large-scale antenna arrays for future wireless systems.

Identifiability and Asymptotics in Learning Homogeneous Linear ODE Systems from Discrete Observations

no code implementations12 Oct 2022 Yuanyuan Wang, Wei Huang, Mingming Gong, Xi Geng, Tongliang Liu, Kun Zhang, DaCheng Tao

This paper derives a sufficient condition for the identifiability of homogeneous linear ODE systems from a sequence of equally-spaced error-free observations sampled from a single trajectory.

Solving combinational optimization problems with evolutionary single-pixel imaging

no code implementations12 Oct 2022 Wei Huang, Jiaxiang Li, Shuming Jiao, Zibang Zhang

Single-pixel imaging (SPI) is a novel optical imaging technique by replacing the pixelated sensor array in a conventional camera with a single-pixel detector.

VLSNR:Vision-Linguistics Coordination Time Sequence-aware News Recommendation

1 code implementation6 Oct 2022 Songhao Han, Wei Huang, Xiaotian Luan

In our work, we propose a vision-linguistics coordinate time sequence news recommendation.

News Recommendation

On the Sparse DAG Structure Learning Based on Adaptive Lasso

no code implementations7 Sep 2022 Danru Xu, Erdun Gao, Wei Huang, Menghan Wang, Andy Song, Mingming Gong

Learning the underlying Bayesian Networks (BNs), represented by directed acyclic graphs (DAGs), of the concerned events from purely-observational data is a crucial part of evidential reasoning.

Parameter Estimation of Mixed Gaussian-Impulsive Noise: An U-net++ Based Method

no code implementations6 Sep 2022 Tianfu Qi, Jun Wang, Xiaonan Chen, Wei Huang

In many scenarios, the communication system suffers from both Gaussian white noise and non-Gaussian impulsive noise.

blind source separation

SAFARI: Versatile and Efficient Evaluations for Robustness of Interpretability

1 code implementation ICCV 2023 Wei Huang, Xingyu Zhao, Gaojie Jin, Xiaowei Huang

Finally, we demonstrate two applications of our methods: ranking robust XAI methods and selecting training schemes to improve both classification and interpretation robustness.

Explainable Artificial Intelligence (XAI)

HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling

1 code implementation30 May 2022 Xiaosong Zhang, Yunjie Tian, Wei Huang, Qixiang Ye, Qi Dai, Lingxi Xie, Qi Tian

A key idea of efficient implementation is to discard the masked image patches (or tokens) throughout the target network (encoder), which requires the encoder to be a plain vision transformer (e. g., ViT), albeit hierarchical vision transformers (e. g., Swin Transformer) have potentially better properties in formulating vision inputs.

Transfer Learning

MissDAG: Causal Discovery in the Presence of Missing Data with Continuous Additive Noise Models

1 code implementation27 May 2022 Erdun Gao, Ignavier Ng, Mingming Gong, Li Shen, Wei Huang, Tongliang Liu, Kun Zhang, Howard Bondell

In this paper, we develop a general method, which we call MissDAG, to perform causal discovery from data with incomplete observations.

Causal Discovery Imputation +1

Hierarchical Distribution-Aware Testing of Deep Learning

1 code implementation17 May 2022 Wei Huang, Xingyu Zhao, Alec Banks, Victoria Cox, Xiaowei Huang

In this paper, we propose a new robustness testing approach for detecting AEs that considers both the feature level distribution and the pixel level distribution, capturing the perceptual quality of adversarial perturbations.

Adversarial Robustness Data Compression

Deep Architecture Connectivity Matters for Its Convergence: A Fine-Grained Analysis

2 code implementations11 May 2022 Wuyang Chen, Wei Huang, Xinyu Gong, Boris Hanin, Zhangyang Wang

Advanced deep neural networks (DNNs), designed by either human or AutoML algorithms, are growing increasingly complex.

Neural Architecture Search

Playing Tic-Tac-Toe Games with Intelligent Single-pixel Imaging

no code implementations7 May 2022 Shuming Jiao, Jiaxiang Li, Wei Huang, Zibang Zhang

Single-pixel imaging (SPI) is a novel optical imaging technique by replacing a two-dimensional pixelated sensor with a single-pixel detector and pattern illuminations.

Augmentation-Free Graph Contrastive Learning with Performance Guarantee

no code implementations11 Apr 2022 Haonan Wang, Jieyu Zhang, Qi Zhu, Wei Huang

Graph contrastive learning (GCL) is the most representative and prevalent self-supervised learning approach for graph-structured data.

Contrastive Learning Self-Supervised Learning

Enhancing Adversarial Training with Second-Order Statistics of Weights

1 code implementation CVPR 2022 Gaojie Jin, Xinping Yi, Wei Huang, Sven Schewe, Xiaowei Huang

In this paper, we show that treating model weights as random variables allows for enhancing adversarial training through \textbf{S}econd-Order \textbf{S}tatistics \textbf{O}ptimization (S$^2$O) with respect to the weights.

An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection

no code implementations10 Mar 2022 Ganglai Wang, Peng Zhang, Lei Xie, Wei Huang, Yufei zha, Yanning Zhang

DeepFake based digital facial forgery is threatening the public media security, especially when lip manipulation has been used in talking face generation, the difficulty of fake video detection is further improved.

Decision Making Face Detection +2

Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild

no code implementations8 Mar 2022 Ganglai Wang, Peng Zhang, Lei Xie, Wei Huang, Yufei zha

Rather than focusing on the unimportant regions of the face image, the proposed AttnWav2Lip model is able to pay more attention on the lip region reconstruction.

Talking Face Generation

Audio-visual speech separation based on joint feature representation with cross-modal attention

no code implementations5 Mar 2022 Junwen Xiong, Peng Zhang, Lei Xie, Wei Huang, Yufei zha, Yanning Zhang

Multi-modal based speech separation has exhibited a specific advantage on isolating the target character in multi-talker noisy environments.

Optical Flow Estimation Speech Separation

Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement

1 code implementation4 Mar 2022 Junwen Xiong, Yu Zhou, Peng Zhang, Lei Xie, Wei Huang, Yufei zha

Active speaker detection and speech enhancement have become two increasingly attractive topics in audio-visual scenario understanding.

Multi-Task Learning Speech Enhancement

Auto-scaling Vision Transformers without Training

1 code implementation ICLR 2022 Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wang, Denny Zhou

The motivation comes from two pain spots: 1) the lack of efficient and principled methods for designing and scaling ViTs; 2) the tremendous computational cost of training ViT that is much heavier than its convolution counterpart.

Demystify Optimization and Generalization of Over-parameterized PAC-Bayesian Learning

no code implementations4 Feb 2022 Wei Huang, Chunrui Liu, Yilan Chen, Tianyu Liu, Richard Yi Da Xu

In addition to being a pure generalization bound analysis tool, PAC-Bayesian bound can also be incorporated into an objective function to train a probabilistic neural network, making them a powerful and relevant framework that can numerically provide a tight generalization bound for supervised learning.

Achieving Efficient Distributed Machine Learning Using a Novel Non-Linear Class of Aggregation Functions

no code implementations29 Jan 2022 Haizhou Du, Ryan Yang, Yijian Chen, Qiao Xiang, Andre Wibisono, Wei Huang

In this paper, we analyze properties of the WPM and rigorously prove convergence properties of our aggregation mechanism.

Autonomous Driving

Knowledge Graph Based Waveform Recommendation: A New Communication Waveform Design Paradigm

no code implementations24 Jan 2022 Wei Huang, Tianfu Qi, Yundi Guan, Qihang Peng, Jun Wang

In this paper, we investigate the waveform design from a novel perspective and propose a new waveform design paradigm with the knowledge graph (KG)-based intelligent recommendation system.

Collaborative Filtering Intelligent Communication +1

THE Benchmark: Transferable Representation Learning for Monocular Height Estimation

no code implementations30 Dec 2021 Zhitong Xiong, Wei Huang, Jingtao Hu, Xiao Xiang Zhu

Therefore, we propose a new benchmark dataset to study the transferability of height estimation models in a cross-dataset setting.

Representation Learning Transfer Learning

Reliability Assessment and Safety Arguments for Machine Learning Components in System Assurance

no code implementations30 Nov 2021 Yi Dong, Wei Huang, Vibhav Bharti, Victoria Cox, Alec Banks, Sen Wang, Xingyu Zhao, Sven Schewe, Xiaowei Huang

The increasing use of Machine Learning (ML) components embedded in autonomous systems -- so-called Learning-Enabled Systems (LESs) -- has resulted in the pressing need to assure their functional safety.

Semantic-Aware Generation for Self-Supervised Visual Representation Learning

1 code implementation25 Nov 2021 Yunjie Tian, Lingxi Xie, Xiaopeng Zhang, Jiemin Fang, Haohang Xu, Wei Huang, Jianbin Jiao, Qi Tian, Qixiang Ye

In this paper, we propose a self-supervised visual representation learning approach which involves both generative and discriminative proxies, where we focus on the former part by requiring the target network to recover the original image based on the mid-level features.

Representation Learning Semantic Segmentation

On the Equivalence between Neural Network and Support Vector Machine

1 code implementation NeurIPS 2021 Yilan Chen, Wei Huang, Lam M. Nguyen, Tsui-Wei Weng

Therefore, in this work, we propose to establish the equivalence between NN and SVM, and specifically, the infinitely wide NN trained by soft margin loss and the standard soft margin SVM with NTK trained by subgradient descent.

regression

Deep Active Learning by Leveraging Training Dynamics

no code implementations16 Oct 2021 Haonan Wang, Wei Huang, Ziwei Wu, Andrew Margenot, Hanghang Tong, Jingrui He

Active learning theories and methods have been extensively studied in classical statistical learning settings.

Active Learning

Long-tailed Distribution Adaptation

1 code implementation6 Oct 2021 Zhiliang Peng, Wei Huang, Zonghao Guo, Xiaosong Zhang, Jianbin Jiao, Qixiang Ye

We propose to jointly optimize empirical risks of the unbalanced and balanced domains and approximate their domain divergence by intra-class and inter-class distances, with the aim to adapt models trained on the long-tailed distribution to general distributions in an interpretable way.

Domain Adaptation Instance Segmentation +3

Hierarchy-Aware T5 with Path-Adaptive Mask Mechanism for Hierarchical Text Classification

no code implementations17 Sep 2021 Wei Huang, Chen Liu, Yihua Zhao, Xinyun Yang, Zhaoming Pan, Zhimin Zhang, Guiquan Liu

Hierarchical Text Classification (HTC), which aims to predict text labels organized in hierarchical space, is a significant task lacking in investigation in natural language processing.

text-classification Text Classification

Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking

1 code implementation31 Jul 2021 Jingxian Sun, Lichao Zhang, Yufei zha, Abel Gonzalez-Garcia, Peng Zhang, Wei Huang, Yanning Zhang

To solve this problem, we propose to distill representations of the TIR modality from the RGB modality with Cross-Modal Distillation (CMD) on a large amount of unlabeled paired RGB-TIR data.

Transfer Learning

Heterogeneous network-based drug repurposing for COVID-19

1 code implementation20 Jul 2021 Shuting Jin, Xiangxiang Zeng, Wei Huang, Feng Xia, Changzhi Jiang, Xiangrong Liu, Shaoliang Peng

The Corona Virus Disease 2019 (COVID-19) belongs to human coronaviruses (HCoVs), which spreads rapidly around the world.

Connection Sensitivity Matters for Training-free DARTS: From Architecture-Level Scoring to Operation-Level Sensitivity Analysis

no code implementations22 Jun 2021 Miao Zhang, Wei Huang, Li Wang

We investigate this question through the lens of edge connectivity, and provide an affirmative answer by defining a connectivity concept, ZERo-cost Operation Sensitivity (ZEROS), to score the importance of candidate operations in DARTS at initialization.

Computational Efficiency Network Pruning +1

Conformer: Local Features Coupling Global Representations for Visual Recognition

4 code implementations ICCV 2021 Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, YaoWei Wang, Jianbin Jiao, Qixiang Ye

Within Convolutional Neural Network (CNN), the convolution operations are good at extracting local features but experience difficulty to capture global representations.

Image Classification Instance Segmentation +4

PD-GAN: Probabilistic Diverse GAN for Image Inpainting

1 code implementation CVPR 2021 Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao

To this end, we propose spatially probabilistic diversity normalization (SPDNorm) inside the modulation to model the probability of generating a pixel conditioned on the context information.

Image Inpainting Image Restoration

Advanced Deep Networks for 3D Mitochondria Instance Segmentation

1 code implementation16 Apr 2021 Mingxing Li, Chang Chen, Xiaoyu Liu, Wei Huang, Yueyi Zhang, Zhiwei Xiong

Mitochondria instance segmentation from electron microscopy (EM) images has seen notable progress since the introduction of deep learning methods.

3D Instance Segmentation Denoising +2

Detecting Operational Adversarial Examples for Reliable Deep Learning

no code implementations13 Apr 2021 Xingyu Zhao, Wei Huang, Sven Schewe, Yi Dong, Xiaowei Huang

The utilisation of Deep Learning (DL) raises new challenges regarding its dependability in critical applications.

Towards Optimal Algorithms for Multi-Player Bandits without Collision Sensing Information

no code implementations24 Mar 2021 Wei Huang, Richard Combes, Cindy Trinh

We propose a novel algorithm for multi-player multi-armed bandits without collision sensing information.

Multi-Armed Bandits

DeFLOCNet: Deep Image Editing via Flexible Low-level Controls

1 code implementation CVPR 2021 Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao, Bing Jiang, Wei Liu

While existing methods combine an input image and these low-level controls for CNN inputs, the corresponding feature representations are not sufficient to convey user intentions, leading to unfaithfully generated content.

Texture Synthesis

Sequential Random Network for Fine-grained Image Classification

no code implementations12 Mar 2021 Chaorong Li, Malu Zhang, Wei Huang, Fengqing Qin, Anping Zeng, Yuanyuan Huang

To address this issue, we use the proposed SRN which composed of BiLSTM and several Tanh-Dropout blocks (called BiLSTM-TDN), to further process DCNN one-dimensional features for highlighting the detail information of image.

Classification

Towards Deepening Graph Neural Networks: A GNTK-based Optimization Perspective

no code implementations ICLR 2022 Wei Huang, Yayong Li, Weitao Du, Jie Yin, Richard Yi Da Xu, Ling Chen, Miao Zhang

Inspired by our theoretical insights on trainability, we propose Critical DropEdge, a connectivity-aware and graph-adaptive sampling method, to alleviate the exponential decay problem more fundamentally.

A Unified Framework for Specification Tests of Continuous Treatment Effect Models

no code implementations16 Feb 2021 Wei Huang, Oliver Linton, Zheng Zhang

We propose a general framework for the specification testing of continuous treatment effect models.

Fairness and Accuracy in Federated Learning

no code implementations18 Dec 2020 Wei Huang, Tianrui Li, Dexian Wang, Shengdong Du, Junbo Zhang

An appropriate weight selection algorithm that combines the information quantity of training accuracy and training frequency to measure the weights is proposed.

Fairness Federated Learning

BayLIME: Bayesian Local Interpretable Model-Agnostic Explanations

2 code implementations5 Dec 2020 Xingyu Zhao, Wei Huang, Xiaowei Huang, Valentin Robu, David Flynn

Given the pressing need for assuring algorithmic transparency, Explainable AI (XAI) has emerged as one of the key areas of AI research.

Explainable Artificial Intelligence (XAI)

Implicit bias of deep linear networks in the large learning rate phase

no code implementations25 Nov 2020 Wei Huang, Weitao Du, Richard Yi Da Xu, Chunrui Liu

We claim that depending on the separation conditions of data, the gradient descent iterates will converge to a flatter minimum in the catapult phase.

Binary Classification

Embedding and Extraction of Knowledge in Tree Ensemble Classifiers

2 code implementations16 Oct 2020 Wei Huang, Xingyu Zhao, Xiaowei Huang

Whilst, as the increasing use of machine learning models in security-critical applications, the embedding and extraction of malicious knowledge are equivalent to the notorious backdoor attack and its defence, respectively.

Backdoor Attack BIG-bench Machine Learning

Rethinking Image Inpainting via a Mutual Encoder-Decoder with Feature Equalizations

1 code implementation ECCV 2020 Hongyu Liu, Bin Jiang, Yibing Song, Wei Huang, Chao Yang

We use CNN features from the deep and shallow layers of the encoder to represent structures and textures of an input image, respectively.

Image Inpainting

Achievable Rate Region of MISO Interference Channel Aided by Intelligent Reflecting Surface

no code implementations19 May 2020 Wei Huang, Yong Zeng, Yongming Huang

This paper investigates the achievable rate region of the multiple-input single-output (MISO) interference channel aided by intelligent reflecting surfaces (IRSs).

On the Neural Tangent Kernel of Deep Networks with Orthogonal Initialization

2 code implementations13 Apr 2020 Wei Huang, Weitao Du, Richard Yi Da Xu

The prevailing thinking is that orthogonal weights are crucial to enforcing dynamical isometry and speeding up training.

Gaussian Process Latent Variable Model Factorization for Context-aware Recommender Systems

2 code implementations19 Dec 2019 Wei Huang, Richard Yi Da Xu

Our work is primarily inspired by the Gaussian Process Latent Variable Model (GPLVM), which was a non-linear dimensionality reduction method.

Dimensionality Reduction Recommendation Systems

Mean field theory for deep dropout networks: digging up gradient backpropagation deeply

1 code implementation19 Dec 2019 Wei Huang, Richard Yi Da Xu, Weitao Du, Yutian Zeng, Yunce Zhao

In recent years, the mean field theory has been applied to the study of neural networks and has achieved a great deal of success.

One-Stage Inpainting with Bilateral Attention and Pyramid Filling Block

no code implementations18 Dec 2019 Hongyu Liu, Bin Jiang, Wei Huang, Chao Yang

However, the two-stage architecture is time-consuming, the contextual information lack high-level semantics and ignores both the semantic relevance and distance information of hole's feature patches, these limitations result in blurry textures and distorted structures of final result.

Image Inpainting Texture Synthesis

Coverage Guided Testing for Recurrent Neural Networks

1 code implementation5 Nov 2019 Wei Huang, Youcheng Sun, Xingyu Zhao, James Sharp, Wenjie Ruan, Jie Meng, Xiaowei Huang

The test metrics and test case generation algorithm are implemented into a tool TestRNN, which is then evaluated on a set of LSTM benchmarks.

Defect Detection Drug Discovery +3

Transferability of Spectral Graph Convolutional Neural Networks

no code implementations30 Jul 2019 Ron Levie, Wei Huang, Lorenzo Bucci, Michael M. Bronstein, Gitta Kutyniok

Transferability, which is a certain type of generalization capability, can be loosely defined as follows: if two graphs describe the same phenomenon, then a single filter or ConvNet should have similar repercussions on both graphs.

testRNN: Coverage-guided Testing on Recurrent Neural Networks

1 code implementation20 Jun 2019 Wei Huang, Youcheng Sun, Xiaowei Huang, James Sharp

Recurrent neural networks (RNNs) have been widely applied to various sequential tasks such as text processing, video recognition, and molecular property prediction.

Molecular Property Prediction Property Prediction +1

Attributes-aided Part Detection and Refinement for Person Re-identification

no code implementations27 Feb 2019 Shuzhao Li, Huimin Yu, Wei Huang, Jing Zhang

Person attributes are often exploited as mid-level human semantic information to help promote the performance of person re-identification task.

Attribute Person Re-Identification

Multiobjective Optimization Differential Evolution Enhanced with Principle Component Analysis for Constrained Optimization

no code implementations1 May 2018 Wei Huang, Tao Xu, Kangshun Li, Jun He

PMODE and HECO-PDE are compared with the algorithms from the IEEE CEC 2018 competition and another recent MOEA for constrained optimisation.

Evolutionary Algorithms Multiobjective Optimization

Cannot find the paper you are looking for? You can Submit a new open access paper.