Search Results for author: Di Wang

Found 231 papers, 81 papers with code

p-Norm Flow Diffusion for Local Graph Clustering

1 code implementation ICML 2020 Kimon Fountoulakis, Di Wang, Shenghao Yang

Local graph clustering and the closely related seed set expansion problem are primitives on graphs that are central to a wide range of analytic and learning tasks such as local clustering, community detection, nodes ranking and feature inference.

Clustering Community Detection +1

TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series

1 code implementation13 May 2025 Xiaolei Qin, Di Wang, Jing Zhang, Fengxiang Wang, Xin Su, Bo Du, Liangpei Zhang

Satellite image time series (SITS) provide continuous observations of the Earth's surface, making them essential for applications such as environmental management and disaster assessment.

Temporal Sequences Time Series

DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration

1 code implementation30 Apr 2025 Hebaixu Wang, Jing Zhang, HaoNan Guo, Di Wang, Jiayi Ma, Bo Du

To address these challenges, we introduce \textbf{DGSolver}, a diffusion generalist solver with universal posterior sampling.

Image Restoration Noise Estimation

Understanding the Repeat Curse in Large Language Models from a Feature Perspective

no code implementations19 Apr 2025 Junchi Yao, Shu Yang, Jianhua Xu, Lijie Hu, Mengdi Li, Di Wang

Large language models (LLMs) have made remarkable progress in various domains, yet they often suffer from repetitive text generation, a phenomenon we refer to as the "Repeat Curse".

Text Generation

Understanding Aha Moments: from External Observations to Internal Mechanisms

no code implementations3 Apr 2025 Shu Yang, Junchao Wu, Xin Chen, Yunze Xiao, Xinyi Yang, Derek F. Wong, Di Wang

We demonstrate that the "aha moment" is externally manifested in a more frequent use of anthropomorphic tones for self-reflection and an adaptive adjustment of uncertainty based on problem difficulty.

TransMamba: Flexibly Switching between Transformer and Mamba

no code implementations31 Mar 2025 Yixing Li, Ruobing Xie, Zhen Yang, Xingwu Sun, Shuaipeng Li, Weidong Han, Zhanhui Kang, Yu Cheng, Chengzhong Xu, Di Wang, Jie Jiang

Transformers are the cornerstone of modern large language models, but their quadratic computational complexity limits efficiency in long-sequence processing.

Mamba Scheduling

XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?

no code implementations31 Mar 2025 Fengxiang Wang, Hongzhen Wang, Mingshuo Chen, Di Wang, Yulin Wang, Zonghao Guo, Qiang Ma, Long Lan, Wenjing Yang, Jing Zhang, Zhiyuan Liu, Maosong Sun

On top of the XLRS-Bench, 16 sub-tasks are defined to evaluate MLLMs' 10 kinds of perceptual capabilities and 6 kinds of reasoning capabilities, with a primary emphasis on advanced cognitive processes that facilitate real-world decision-making and the capture of spatiotemporal changes.

Improved Rates of Differentially Private Nonconvex-Strongly-Concave Minimax Optimization

no code implementations24 Mar 2025 Ruijia Zhang, Mingxi Lei, Meng Ding, Zihang Xiang, Jinhui Xu, Di Wang

In this paper, we study the problem of (finite sum) minimax optimization in the Differential Privacy (DP) model.

ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory

1 code implementation16 Mar 2025 Liangyu Wang, Jie Ren, Hang Xu, Junxiao Wang, Huanyi Xie, David E. Keyes, Di Wang

Alternatively, zeroth-order (ZO) techniques can compute gradients using just forward operations, eliminating the need to store activations.

RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing

1 code implementation13 Mar 2025 Fengxiang Wang, Hongzhen Wang, Yulin Wang, Di Wang, Mingshuo Chen, Haiyan Zhao, Yangang Sun, Shuo Wang, Long Lan, Wenjing Yang, Jing Zhang

Recent advances in self-supervised learning for Vision Transformers (ViTs) have fueled breakthroughs in remote sensing (RS) foundation models.

Computational Efficiency Mamba +5

C^2 ATTACK: Towards Representation Backdoor on CLIP via Concept Confusion

no code implementations12 Mar 2025 Lijie Hu, Junchi Liao, Weimin Lyu, Shaopeng Fu, Tianhao Huang, Shu Yang, Guimin Hu, Di Wang

Backdoor attacks pose a significant threat to deep learning models, enabling adversaries to embed hidden triggers that manipulate the behavior of the model during inference.

Backdoor Attack

Nearly Optimal Differentially Private ReLU Regression

no code implementations8 Mar 2025 Meng Ding, Mingxi Lei, Shaowei Wang, Tianhang Zheng, Di Wang, Jinhui Xu

We first show that when $\varepsilon = \tilde{O}(\sqrt{\frac{1}{N}})$ and there is some public data, it is possible to achieve an upper bound of $\Tilde{O}(\frac{d^2}{N^2 \varepsilon^2})$ for the excess population risk in $(\epsilon, \delta)$-DP, where $d$ is the dimension and $N$ is the number of data samples.

regression

Online Bidding under RoS Constraints without Knowing the Value

no code implementations5 Mar 2025 Sushant Vijayan, Zhe Feng, Swati Padmanabhan, Karthikeyan Shanmugam, Arun Suggala, Di Wang

This establishes the first optimal regret and constraint violation bounds for bidding in the online setting with unknown impression values.

UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering

1 code implementation26 Feb 2025 Langming Liu, Shilei Liu, Yujin Yuan, Yizhen Zhang, Bencheng Yan, Zhiyuan Zeng, ZiHao Wang, Jiaqi Liu, Di Wang, Wenbo Su, Pengjie Wang, Jian Xu, Bo Zheng

To address this concern, we propose \name, a benchmark designed to evaluate the effectiveness of user embeddings in prompting LLMs for personalization.

Question Answering

Can Large Language Models Identify Implicit Suicidal Ideation? An Empirical Evaluation

no code implementations25 Feb 2025 Tong Li, Shu Yang, Junchao Wu, Jiyao Wei, Lijie Hu, Mengdi Li, Derek F. Wong, Joshua R. Oltmanns, Di Wang

We present a comprehensive evaluation framework for assessing Large Language Models' (LLMs) capabilities in suicide prevention, focusing on two critical aspects: the Identification of Implicit Suicidal ideation (IIS) and the Provision of Appropriate Supportive responses (PAS).

Towards User-level Private Reinforcement Learning with Human Feedback

no code implementations22 Feb 2025 Jiaming Zhang, Mingxi Lei, Meng Ding, Mengdi Li, Zihang Xiang, Difei Xu, Jinhui Xu, Di Wang

We first show that the classical random response algorithm, which achieves an acceptable performance in item-level privacy, leads to suboptimal utility when in the user-level settings.

reinforcement-learning Reinforcement Learning

Mechanistic Unveiling of Transformer Circuits: Self-Influence as a Key to Model Reasoning

no code implementations13 Feb 2025 Lin Zhang, Lijie Hu, Di Wang

Transformer-based language models have achieved notable success, yet their internal reasoning mechanisms remain largely opaque due to complex non-linear interactions and high-dimensional operations.

EAP-GP: Mitigating Saturation Effect in Gradient-based Automated Circuit Identification

no code implementations7 Feb 2025 Lin Zhang, Wenshuo Dong, Zhuoran Zhang, Shu Yang, Lijie Hu, Ninghao Liu, Pan Zhou, Di Wang

In this paper, we revisit existing gradient-based circuit identification methods and find that their performance is either affected by the zero-gradient problem or saturation effects, where edge attribution scores become insensitive to input changes, resulting in noisy and unreliable attribution evaluations for circuit components.

"Short-length" Adversarial Training Helps LLMs Defend "Long-length" Jailbreak Attacks: Theoretical and Empirical Evidence

1 code implementation6 Feb 2025 Shaopeng Fu, Liang Ding, Di Wang

This paper focuses on adversarial suffix jailbreak attacks and unveils that to defend against a jailbreak attack with an adversarial suffix of length $\Theta(M)$, it is enough to align LLMs on prompts with adversarial suffixes of length $\Theta(\sqrt{M})$.

In-Context Learning

Evaluating Data Influence in Meta Learning

no code implementations27 Jan 2025 Chenyang Ren, Huanyi Xie, Shu Yang, Meng Ding, Lijie Hu, Di Wang

As one of the most fundamental models, meta learning aims to effectively address few-shot learning challenges.

Bilevel Optimization Computational Efficiency +1

Autonomy-of-Experts Models

no code implementations22 Jan 2025 Ang Lv, Ruobing Xie, Yining Qian, Songhao Wu, Xingwu Sun, Zhanhui Kang, Di Wang, Rui Yan

We argue that the separation between the router's decision-making and the experts' execution is a critical yet overlooked issue, leading to suboptimal expert selection and ineffective learning.

Decision Making Mixture-of-Experts

Infrared and Visible Image Fusion: From Data Compatibility to Task Adaption

1 code implementation18 Jan 2025 JinYuan Liu, Guanyao Wu, Zhu Liu, Di Wang, Zhiying Jiang, Long Ma, Wei Zhong, Xin Fan, Risheng Liu

We introduce a multi-dimensional framework to elucidate common learning-based IVIF methods, from visual enhancement strategies to data compatibility and task adaptability.

Infrared And Visible Image Fusion

Plug-and-Play DISep: Separating Dense Instances for Scene-to-Pixel Weakly-Supervised Change Detection in High-Resolution Remote Sensing Images

1 code implementation9 Jan 2025 Zhenghui Zhao, Chen Wu, Lixiang Ru, Di Wang, Hongruixuan Chen, Cuiqun Chen

Existing Weakly-Supervised Change Detection (WSCD) methods often encounter the problem of "instance lumping" under scene-level supervision, particularly in scenarios with a dense distribution of changed instances (i. e., changed objects).

Change Detection

Advancing the Understanding of Fine-Grained 3D Forest Structures using Digital Cousins and Simulation-to-Reality: Methods and Datasets

no code implementations7 Jan 2025 Jing Liu, Duanchu Wang, Haoran Gong, Chongyu Wang, Jihua Zhu, Di Wang

The Boreal3D dataset, and more broadly, the synthetic data augmentation framework, is poised to become a critical resource for advancing research in large-scale 3D forest scene understanding and structural parameter estimation.

Data Augmentation parameter estimation +2

Understanding the Dark Side of LLMs' Intrinsic Self-Correction

no code implementations19 Dec 2024 Qingjie Zhang, Han Qiu, Di Wang, Haoting Qian, Yiming Li, Tianwei Zhang, Minlie Huang

Intrinsic self-correction was proposed to improve LLMs' responses via feedback prompts solely based on their inherent capability.

FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection

no code implementations12 Dec 2024 Ke Li, Di Wang, Zhangyuan Hu, Shaofeng Li, Weiping Ni, Lin Zhao, Quan Wang

Infrared-visible object detection (IVOD) seeks to harness the complementary information in infrared and visible images, thereby enhancing the performance of detectors in complex environments.

object-detection Object Detection

SAMCL: Empowering SAM to Continually Learn from Dynamic Domains

no code implementations6 Dec 2024 Zeqing Wang, Kangye Ji, Di Wang, Fei Cheng

SAMCL decouples stability and plasticity during CS by two components: $\textit{AugModule}$ and $\textit{Module Selector}$.

HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset

1 code implementation3 Dec 2024 Zedong Chu, Feng Xiong, Meiduo Liu, Jinzhi Zhang, Mingqi Shao, Zhaoxu Sun, Di Wang, Mu Xu

With the rapid evolution of 3D generation algorithms, the cost of producing 3D humanoid character models has plummeted, yet the field is impeded by the lack of a comprehensive dataset for automatic rigging, which is a pivotal step in character animation.

3D Generation

Data Lineage Inference: Uncovering Privacy Vulnerabilities of Dataset Pruning

no code implementations24 Nov 2024 Qi Li, Cheng-Long Wang, Yinzhi Cao, Di Wang

Our findings reveal, for the first time, that even if data in the redundant set is solely used before model training, its pruning-phase membership status can still be detected through attacks.

Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning

no code implementations20 Nov 2024 Zhi Luo, Xiyuan Yang, Pan Zhou, Di Wang

Manipulating the interaction trajectories between the intelligent agent and the environment can control the agent's training and behavior, exposing the potential vulnerabilities of reinforcement learning (RL).

Autonomous Driving reinforcement-learning +2

Dissecting Representation Misalignment in Contrastive Learning via Influence Function

no code implementations18 Nov 2024 Lijie Hu, Chenyang Ren, Huanyi Xie, Khouloud Saadi, Shu Yang, Zhen Tan, Jingfeng Zhang, Di Wang

ECIF considers both positive and negative samples and provides a closed-form approximation of contrastive learning models, eliminating the need for retraining.

Contrastive Learning Data Valuation

More Expressive Attention with Negative Weights

1 code implementation11 Nov 2024 Ang Lv, Ruobing Xie, Shuaipeng Li, Jiayi Liao, Xingwu Sun, Zhanhui Kang, Di Wang, Rui Yan

We propose a novel attention mechanism, named Cog Attention, that enables attention weights to be negative for enhanced expressiveness, which stems from two key factors: (1) Cog Attention enhances parameter flexibility.

Decoder Image Generation +2

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

3 code implementations4 Nov 2024 Xingwu Sun, Yanfeng Chen, Yiqing Huang, Ruobing Xie, Jiaqi Zhu, Kai Zhang, Shuaipeng Li, Zhen Yang, Jonny Han, Xiaobo Shu, Jiahao Bu, Zhongzhi Chen, Xuemeng Huang, Fengzong Lian, Saiyong Yang, Jianfeng Yan, Yuyuan Zeng, Xiaoqin Ren, Chao Yu, Lulu Wu, Yue Mao, Jun Xia, Tao Yang, Suncong Zheng, Kan Wu, Dian Jiao, Jinbao Xue, Xipeng Zhang, Decheng Wu, Kai Liu, Dengpeng Wu, Guanghui Xu, Shaohua Chen, Shuang Chen, Xiao Feng, Yigeng Hong, Junqiang Zheng, Chengcheng Xu, Zongwei Li, Xiong Kuang, Jianglu Hu, Yiqi Chen, Yuchi Deng, Guiyang Li, Ao Liu, Chenchen Zhang, Shihui Hu, Zilong Zhao, Zifan Wu, Yao Ding, Weichao Wang, Han Liu, Roberts Wang, Hao Fei, Peijie Yu, Ze Zhao, Xun Cao, Hai Wang, Fusheng Xiang, Mengyuan Huang, Zhiyuan Xiong, Bin Hu, Xuebin Hou, Lei Jiang, Jianqiang Ma, Jiajia Wu, Yaping Deng, Yi Shen, Qian Wang, Weijie Liu, Jie Liu, Meng Chen, Liang Dong, Weiwen Jia, Hu Chen, Feifei Liu, Rui Yuan, Huilin Xu, Zhenxiang Yan, Tengfei Cao, Zhichao Hu, Xinhua Feng, Dong Du, TingHao Yu, Yangyu Tao, Feng Zhang, Jianchen Zhu, Chengzhong Xu, Xirui Li, Chong Zha, Wen Ouyang, Yinben Xia, Xiang Li, Zekun He, Rongpeng Chen, Jiawei Song, Ruibin Chen, Fan Jiang, Chongqing Zhao, Bo wang, Hao Gong, Rong Gan, Winston Hu, Zhanhui Kang, Yong Yang, Yuhong Liu, Di Wang, Jie Jiang

In this paper, we introduce Hunyuan-Large, which is currently the largest open-source Transformer-based mixture of experts model, with a total of 389 billion parameters and 52 billion activation parameters, capable of handling up to 256K tokens.

Logical Reasoning Mathematical Problem-Solving +1

Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection

1 code implementation31 Oct 2024 Ke Li, Fuyu Dong, Di Wang, Shaofeng Li, Quan Wang, Xinbo Gao, Tat-Seng Chua

Furthermore, we present VisTA, a simple yet effective baseline method that unifies the tasks of question answering and grounding by delivering both visual and textual answers.

Change Detection Question Answering +1

CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation

1 code implementation30 Oct 2024 Ziyang Gong, Zhixiang Wei, Di Wang, Xianzheng Ma, Hongruixuan Chen, Yuru Jia, Yupeng Deng, Zhenming Ji, Xiangwei Zhu, Naoto Yokoya, Jing Zhang, Bo Du, Liangpei Zhang

The field of Remote Sensing Domain Generalization (RSDG) has emerged as a critical and valuable research frontier, focusing on developing models that generalize effectively across diverse scenarios.

Domain Generalization Segmentation +1

TAAD: Time-varying adversarial anomaly detection in dynamic graphs

no code implementations Information Processing and Management 2024 Guanghua Liu, Jia Zhang, Peng Lv, Chenlong Wang, Huan Wang, Di Wang

The time-varying discriminator cooperates with the feature extractor to conduct adversarial training, which decreases the distributional differences in the feature representations of nodes between historical and newly emerged moments to learn transferable features.

Anomaly Detection

Private Language Models via Truncated Laplacian Mechanism

no code implementations10 Oct 2024 Tianhao Huang, Tao Yang, Ivan Habernal, Lijie Hu, Di Wang

To prevent privacy leakage, researchers have investigated word-level perturbations, relying on the formal guarantees of differential privacy (DP) in the embedding space.

Dissecting Fine-Tuning Unlearning in Large Language Models

1 code implementation9 Oct 2024 Yihuai Hong, Yuelin Zou, Lijie Hu, Ziqian Zeng, Di Wang, Haiqin Yang

Our findings reveal that these methods alter the model's knowledge retrieval process, providing further evidence that they do not genuinely erase the problematic knowledge embedded in the model parameters.

Retrieval

Faithful Interpretation for Graph Neural Networks

no code implementations9 Oct 2024 Lijie Hu, Tianhao Huang, Lu Yu, WanYu Lin, Tianhang Zheng, Di Wang

In this paper, we propose a solution to this problem by introducing a novel notion called Faithful Graph Attention-based Interpretation (FGAI).

Graph Attention

Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing

no code implementations8 Oct 2024 Zhuoran Zhang, Yongxiang Li, Zijian Kan, Keyuan Cheng, Lijie Hu, Di Wang

In this paper, leveraging tools in mechanistic interpretability, we first identify that in multi-hop tasks, LLMs tend to retrieve knowledge with implicit subject information from deeper MLP layers, unlike single-hop tasks, which rely on shallow layers.

knowledge editing

What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMs

1 code implementation7 Oct 2024 Shu Yang, Shenzhe Zhu, Ruoxuan Bao, Liang Liu, Yu Cheng, Lijie Hu, Mengdi Li, Di Wang

Large language models (LLMs) have demonstrated remarkable capabilities in generating human-like text and exhibiting personality traits similar to those in humans.

Understanding Reasoning in Chain-of-Thought from the Hopfieldian View

no code implementations4 Oct 2024 Lijie Hu, Liang Liu, Shu Yang, Xin Chen, Zhen Tan, Muhammad Asif Ali, Mengdi Li, Di Wang

Moreover, we propose the Representation-of-Thought (RoT) framework, which leverages the robustness of low-dimensional representation spaces to enhance the robustness of the reasoning process in CoTs.

Selective Transformer for Hyperspectral Image Classification

no code implementations4 Oct 2024 Yichu Xu, Di Wang, Lefei Zhang, Liangpei Zhang

The SFormer is designed to dynamically select receptive fields for capturing both spatial and spectral contextual information, while mitigating the impact of redundant data by prioritizing the most relevant features.

Classification Hyperspectral Image Classification

Multilateral Cascading Network for Semantic Segmentation of Large-Scale Outdoor Point Clouds

no code implementations21 Sep 2024 Haoran Gong, Haodong Wang, Di Wang

Semantic segmentation of large-scale outdoor point clouds is of significant importance in environment perception and scene understanding.

Scene Understanding Semantic Segmentation

Component-based Sketching for Deep ReLU Nets

no code implementations21 Sep 2024 Di Wang, Shao-Bo Lin, Deyu Meng, Feilong Cao

To address this issue, we develop a novel sketching scheme based on deep net components for various tasks.

Philosophy

HMoE: Heterogeneous Mixture of Experts for Language Modeling

no code implementations20 Aug 2024 An Wang, Xingwu Sun, Ruobing Xie, Shuaipeng Li, Jiaqi Zhu, Zhen Yang, Pinxue Zhao, J. N. Han, Zhanhui Kang, Di Wang, Naoaki Okazaki, Cheng-Zhong Xu

To address the imbalance in expert activation, we propose a novel training objective that encourages the frequent activation of smaller experts, enhancing computational efficiency and parameter utilization.

Computational Efficiency Language Modeling +2

Incremental Structure Discovery of Classification via Sequential Monte Carlo

no code implementations15 Aug 2024 Changze Huang, Di Wang

To alleviate the requirement of prior knowledge used in GPs and learn new features from data that arrive successively, this paper presents a novel method to automatically discover models of classification on complex data with little prior knowledge.

Classification Gaussian Processes

Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services

1 code implementation5 Aug 2024 Shaopeng Fu, Xuexue Sun, Ke Qing, Tianhang Zheng, Di Wang

Experiments show that the PEI attack succeeds in revealing the hidden encoder in most cases and seldom makes mistakes even when the hidden encoder is not in the candidate set.

Image Classification text-classification +2

XTraffic: A Dataset Where Traffic Meets Incidents with Explainability and More

no code implementations16 Jul 2024 Xiaochuan Gou, Ziyue Li, Tian Lan, Junpeng Lin, Zhishuai Li, Bingyu Zhao, Chen Zhang, Di Wang, Xiangliang Zhang

Our data can revolutionalize traditional traffic-related tasks towards higher interpretability and practice: instead of traditional prediction or classification tasks, we conduct: (1) post-incident traffic forecasting to quantify the impact of different incidents on traffic indexes; (2) incident classification using traffic indexes to determine the incidents types for precautions measures; (3) global causal analysis among the traffic indexes, meta-attributes, and incidents to give high-level guidance of the interrelations of various factors; (4) local causal analysis within road nodes to examine how different incidents affect the road segments' relations.

Releasing Malevolence from Benevolence: The Menace of Benign Data on Machine Unlearning

no code implementations6 Jul 2024 Binhao Ma, Tianhang Zheng, Hongsheng Hu, Di Wang, Shuo Wang, Zhongjie Ba, Zhan Qin, Kui Ren

Our evaluation demonstrates that unlearning this benign data, comprising no more than 1% of the total training data, can reduce model accuracy by up to 50%.

Data Poisoning Machine Unlearning

Semi-supervised Concept Bottleneck Models

no code implementations27 Jun 2024 Lijie Hu, Tianhao Huang, Huanyi Xie, Chenyang Ren, Zhengyu Hu, Lu Yu, Di Wang

Concept Bottleneck Models (CBMs) have garnered increasing attention due to their ability to provide concept-based explanations for black-box deep learning models while achieving high final prediction accuracy using human-like concepts.

A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning

no code implementations18 Jun 2024 Lijie Hu, Liang Liu, Shu Yang, Xin Chen, Hongru Xiao, Mengdi Li, Pan Zhou, Muhammad Asif Ali, Di Wang

Chain-of-Thought (CoT) holds a significant place in augmenting the reasoning performance for large language models (LLMs).

Retrieval

HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model

1 code implementation17 Jun 2024 Di Wang, Meiqi Hu, Yao Jin, Yuchun Miao, Jiaqi Yang, Yichu Xu, Xiaolei Qin, Jiaqi Ma, Lingyu Sun, Chenxing Li, Chuan Fu, Hongruixuan Chen, Chengxi Han, Naoto Yokoya, Jing Zhang, Minqiang Xu, Lin Liu, Lefei Zhang, Chen Wu, Bo Du, DaCheng Tao, Liangpei Zhang

Accurate hyperspectral image (HSI) interpretation is critical for providing valuable insights into various earth observation-related applications such as urban planning, precision agriculture, and environmental monitoring.

Computational Efficiency Earth Observation +1

Scaling Efficient Masked Image Modeling on Large Remote Sensing Dataset

1 code implementation17 Jun 2024 Fengxiang Wang, Hongzhen Wang, Di Wang, Zonghao Guo, Zhenyu Zhong, Long Lan, Jing Zhang, Zhiyuan Liu, Maosong Sun

To address these issues, we present a new pre-training pipeline for RS models, featuring the creation of a large-scale RS dataset and an efficient MIM approach.

Aerial Scene Classification Diversity +4

Improving Generalization of Neural Vehicle Routing Problem Solvers Through the Lens of Model Architecture

1 code implementation10 Jun 2024 Yubin Xiao, Di Wang, Xuan Wu, Yuesong Wu, Boyang Li, Wei Du, Liupu Wang, You Zhou

The DS decoder explicitly models VRPs of multiple training distribution patterns through multiple auxiliary light decoders, expanding the model representation space to encompass a broader range of distributional scenarios.

Benchmarking Decoder

How to Strategize Human Content Creation in the Era of GenAI?

no code implementations7 Jun 2024 Seyed A. Esmaeili, Kevin Lim, Kshipra Bhawalkar, Zhe Feng, Di Wang, Haifeng Xu

In time-sensitive content domains (e. g., news or pop music creation) where contents' value diminishes over time, we show that there is no polynomial time algorithm for finding the human's optimal (dynamic) strategy, unless the randomized exponential time hypothesis is false.

Neural Combinatorial Optimization Algorithms for Solving Vehicle Routing Problems: A Comprehensive Survey with Perspectives

1 code implementation1 Jun 2024 Xuan Wu, Di Wang, Lijie Wen, Yubin Xiao, Chunguo Wu, Yuesong Wu, Chaoyu Yu, Douglas L. Maskell, You Zhou

Although several surveys on Neural Combinatorial Optimization (NCO) solvers specifically designed to solve Vehicle Routing Problems (VRPs) have been conducted.

Combinatorial Optimization

Text Guided Image Editing with Automatic Concept Locating and Forgetting

no code implementations30 May 2024 Jia Li, Lijie Hu, Zhixian He, Jingfeng Zhang, Tianhang Zheng, Di Wang

With the advancement of image-to-image diffusion models guided by text, significant progress has been made in image editing.

text-guided-image-editing

A Real-Time Voice Activity Detection Based On Lightweight Neural

no code implementations27 May 2024 Jidong Jia, Pei Zhao, Di Wang

Voice activity detection (VAD) is the task of detecting speech in an audio stream, which is challenging due to numerous unseen noises and low signal-to-noise ratios in real environments.

Action Detection Activity Detection

Understanding Forgetting in Continual Learning with Linear Regression

no code implementations27 May 2024 Meng Ding, Kaiyi Ji, Di Wang, Jinhui Xu

In this paper, we provide a general theoretical analysis of forgetting in the linear regression model via Stochastic Gradient Descent (SGD) applicable to both underparameterized and overparameterized regimes.

Continual Learning regression

Editable Concept Bottleneck Models

no code implementations24 May 2024 Lijie Hu, Chenyang Ren, Zhengyu Hu, Hongbin Lin, Cheng-Long Wang, Hui Xiong, Jingfeng Zhang, Di Wang

Concept Bottleneck Models (CBMs) have garnered much attention for their ability to elucidate the prediction process through a humanunderstandable concept layer.

Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top

no code implementations24 May 2024 Keyuan Cheng, Muhammad Asif Ali, Shu Yang, Gang Lin, Yuxuan zhai, Haoyang Fei, Ke Xu, Lu Yu, Lijie Hu, Di Wang

To address these issues, in this paper, we propose a novel framework named RULE-KE, i. e., RULE based Knowledge Editing, which is a cherry on the top for augmenting the performance of all existing MQA methods under KE.

knowledge editing Multi-hop Question Answering +2

Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling

no code implementations23 May 2024 Shuaipeng Li, Penghao Zhao, Hailin Zhang, Xingwu Sun, Hao Wu, Dian Jiao, Weiyan Wang, Chengjun Liu, Zheng Fang, Jinbao Xue, Yangyu Tao, Bin Cui, Di Wang

First, we raise the scaling law between batch sizes and optimal learning rates in the sign of gradient case, in which we prove that the optimal learning rate first rises and then falls as the batch size increases.

LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation

1 code implementation16 May 2024 Wentao Jiang, Jing Zhang, Di Wang, Qiming Zhang, Zengmao Wang, Bo Du

Experimental results in classification and dense prediction tasks show that LeMeViT has a significant $1. 7 \times$ speedup, fewer parameters, and competitive performance compared to the baseline models, and achieves a better trade-off between efficiency and performance.

Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization

no code implementations3 May 2024 Mudit Gaur, Amrit Singh Bedi, Di Wang, Vaneet Aggarwal

The current state-of-the-art theoretical analysis of Actor-Critic (AC) algorithms significantly lags in addressing the practical aspects of AC implementations.

Auctions with LLM Summaries

no code implementations11 Apr 2024 Kumar Avinava Dubey, Zhe Feng, Rahul Kidambi, Aranyak Mehta, Di Wang

We study an auction setting in which bidders bid for placement of their content within a summary generated by a large language model (LLM), e. g., an ad auction in which the display is a summary paragraph of multiple ads.

Language Modeling Language Modelling +2

Multi-hop Question Answering under Temporal Knowledge Editing

no code implementations30 Mar 2024 Keyuan Cheng, Gang Lin, Haoyang Fei, Yuxuan zhai, Lu Yu, Muhammad Asif Ali, Lijie Hu, Di Wang

Multi-hop question answering (MQA) under knowledge editing (KE) has garnered significant attention in the era of large language models.

knowledge editing Multi-hop Question Answering +3

Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs

no code implementations30 Mar 2024 Shu Yang, Jiayuan Su, Han Jiang, Mengdi Li, Keyuan Cheng, Muhammad Asif Ali, Lijie Hu, Di Wang

With the rise of large language models (LLMs), ensuring they embody the principles of being helpful, honest, and harmless (3H), known as Human Alignment, becomes crucial.

knowledge editing Navigate +2

Prompt-SAW: Leveraging Relation-Aware Graphs for Textual Prompt Compression

no code implementations30 Mar 2024 Muhammad Asif Ali, ZhengPing Li, Shu Yang, Keyuan Cheng, Yang Cao, Tianhao Huang, Guimin Hu, Weimin Lyu, Lijie Hu, Lu Yu, Di Wang

We also propose GSM8K-aug, i. e., an extended version of the existing GSM8K benchmark for task-agnostic prompts in order to provide a comprehensive evaluation platform.

GSM8K Relation

Towards Lifecycle Unlearning Commitment Management: Measuring Sample-level Approximate Unlearning Completeness

no code implementations19 Mar 2024 Cheng-Long Wang, Qi Li, Zihang Xiang, Yinzhi Cao, Di Wang

Our analysis, conducted across multiple unlearning benchmarks, reveals that these algorithms inconsistently fulfill their unlearning commitments due to two main issues: 1) unlearning new data can significantly affect the unlearning utility of previously requested data, and 2) approximate algorithms fail to ensure equitable unlearning utility across different groups.

Computational Efficiency Machine Unlearning +1

Revisiting Differentially Private Hyper-parameter Tuning

no code implementations20 Feb 2024 Zihang Xiang, Tianhao Wang, Chenglong Wang, Di Wang

Recent works propose a generic private selection solution for the tuning process, yet a fundamental question persists: is this privacy bound tight?

Privacy-Preserving Low-Rank Adaptation against Membership Inference Attacks for Latent Diffusion Models

1 code implementation19 Feb 2024 Zihao Luo, Xilie Xu, Feng Liu, Yun Sing Koh, Di Wang, Jingfeng Zhang

To mitigate this issue, we further propose a Stable Membership-Privacy-preserving LoRA (SMP-LoRA) that adapts the LDM by minimizing the ratio of the adaptation loss to the MI gain.

Privacy Preserving

MONAL: Model Autophagy Analysis for Modeling Human-AI Interactions

no code implementations17 Feb 2024 Shu Yang, Muhammad Asif Ali, Lu Yu, Lijie Hu, Di Wang

The increasing significance of large models and their multi-modal variants in societal information processing has ignited debates on social safety and ethics.

Diversity Ethics

MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning

no code implementations17 Feb 2024 Shu Yang, Muhammad Asif Ali, Cheng-Long Wang, Lijie Hu, Di Wang

Adapting large language models (LLMs) to new domains/tasks and enabling them to be efficient lifelong learners is a pivotal challenge.

Lifelong learning Mixture-of-Experts

NIV-SSD: Neighbor IoU-Voting Single-Stage Object Detector From Point Cloud

1 code implementation23 Jan 2024 Shuai Liu, Di Wang, Quan Wang, Kai Huang

NIV strategy can serve as a bridge between classification and regression branches by calculating two types of statistical data from the regression output to correct the classification confidence.

Classification Data Augmentation +1

Communication Efficient and Provable Federated Unlearning

1 code implementation19 Jan 2024 Youming Tao, Cheng-Long Wang, Miao Pan, Dongxiao Yu, Xiuzhen Cheng, Di Wang

We start by giving a rigorous definition of \textit{exact} federated unlearning, which guarantees that the unlearned model is statistically indistinguishable from the one trained without the deleted data.

Federated Learning

Antonym vs Synonym Distinction using InterlaCed Encoder NETworks (ICE-NET)

no code implementations18 Jan 2024 Muhammad Asif Ali, Yan Hu, Jianbin Qin, Di Wang

In this paper, we propose InterlaCed Encoder NETworks (i. e., ICE-NET) for antonym vs synonym distinction, that aim to capture and model the relation-specific properties of the antonyms and synonyms pairs in order to perform the classification task in a performance-enhanced manner.

Relation

Weighted Spectral Filters for Kernel Interpolation on Spheres: Estimates of Prediction Accuracy for Noisy Data

no code implementations16 Jan 2024 Xiaotong Liu, Jinxin Wang, Di Wang, Shao-Bo Lin

In this paper, we introduce a weighted spectral filter approach to reduce the condition number of the kernel matrix and then stabilize kernel interpolation.

Image Reconstruction

Unleashing Channel Potential: Space-Frequency Selection Convolution for SAR Object Detection

no code implementations CVPR 2024 Ke Li, Di Wang, Zhangyuan Hu, Wenxuan Zhu, Shaofeng Li, Quan Wang

In this paper we propose an efficient convolution module for SAR object detection called SFS-Conv which increases feature diversity within each convolutional layer through a shunt-perceive-select strategy.

feature selection Model Compression +3

A comprehensive framework for occluded human pose estimation

1 code implementation30 Dec 2023 Linhao Xu, Lin Zhao, Xinxin Sun, Di Wang, Guangyu Li, Kedong Yan

The challenges posed by occlusion can be attributed to the following factors: 1) Data: The collection and annotation of occluded human pose samples are relatively challenging.

Data Augmentation Pose Estimation

Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning

1 code implementation29 Dec 2023 Zhongzhi Chen, Xingwu Sun, Xianfeng Jiao, Fengzong Lian, Zhanhui Kang, Di Wang, Cheng-Zhong Xu

We introduce Truth Forest, a method that enhances truthfulness in LLMs by uncovering hidden truth representations using multi-dimensional orthogonal probes.

TruthfulQA

Refining Latent Homophilic Structures over Heterophilic Graphs for Robust Graph Convolution Networks

no code implementations27 Dec 2023 Chenyang Qiu, Guoshun Nan, Tianyu Xiong, Wendi Deng, Di Wang, Zhiyang Teng, Lijuan Sun, Qimei Cui, Xiaofeng Tao

This finding motivates us to present a novel method that aims to harden GCNs by automatically learning Latent Homophilic Structures over heterophilic graphs.

Contrastive Learning Node Classification

Anchoring Path for Inductive Relation Prediction in Knowledge Graphs

1 code implementation21 Dec 2023 Zhixiang Su, Di Wang, Chunyan Miao, Lizhen Cui

To address this challenge, we propose Anchoring Path Sentence Transformer (APST) by introducing Anchoring Paths (APs) to alleviate the reliance of CPs.

Inductive Relation Prediction Knowledge Graphs +6

SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation

1 code implementation17 Dec 2023 Xiaoqi An, Lin Zhao, Chen Gong, Nannan Wang, Di Wang, Jian Yang

In this paper, we address the following question: "Only sparse human keypoint locations are detected for human pose estimation, is it really necessary to describe the whole image in a dense, high-resolution manner?"

Pose Estimation

Improving Interpretation Faithfulness for Vision Transformers

no code implementations29 Nov 2023 Lijie Hu, Yixin Liu, Ninghao Liu, Mengdi Huai, Lichao Sun, Di Wang

However, ViTs suffer from issues with explanation faithfulness, as their focal points are fragile to adversarial attacks and can be easily changed with even slight perturbations on the input image.

Denoising

Fair Text-to-Image Diffusion via Fair Mapping

no code implementations29 Nov 2023 Jia Li, Lijie Hu, Jingfeng Zhang, Tianhang Zheng, Hua Zhang, Di Wang

In this paper, we address the limitations of existing text-to-image diffusion models in generating demographically fair results when given human-related descriptions.

Fairness Text-to-Image Generation

Preserving Node-level Privacy in Graph Neural Networks

no code implementations12 Nov 2023 Zihang Xiang, Tianhao Wang, Di Wang

In this study, we propose a solution that specifically addresses the issue of node-level privacy.

Distributed Uncertainty Quantification of Kernel Interpolation on Spheres

no code implementations25 Oct 2023 Shao-Bo Lin, Xingping Sun, Di Wang

For radial basis function (RBF) kernel interpolation of scattered data, Schaback in 1995 proved that the attainable approximation error and the condition number of the underlying interpolation matrix cannot be made small simultaneously.

Uncertainty Quantification

GARI: Graph Attention for Relative Isomorphism of Arabic Word Embeddings

1 code implementation19 Oct 2023 Muhammad Asif Ali, Maha Alshmrani, Jianbin Qin, Yan Hu, Di Wang

Bilingual Lexical Induction (BLI) is a core challenge in NLP, it relies on the relative isomorphism of individual embedding spaces.

Graph Attention Word Embeddings

GRI: Graph-based Relative Isomorphism of Word Embedding Spaces

1 code implementation18 Oct 2023 Muhammad Asif Ali, Yan Hu, Jianbin Qin, Di Wang

Automated construction of bilingual dictionaries using monolingual embedding spaces is a core challenge in machine translation.

Machine Translation

Worst-Case Analysis is Maximum-A-Posteriori Estimation

no code implementations15 Oct 2023 Hongjun Wu, Di Wang

The worst-case resource usage of a program can provide useful information for many software-engineering tasks, such as performance optimization and algorithmic-complexity-vulnerability discovery.

Probabilistic Programming

Differentially Private Non-convex Learning for Multi-layer Neural Networks

no code implementations12 Oct 2023 Hanpu Shen, Cheng-Long Wang, Zihang Xiang, Yiming Ying, Di Wang

This paper focuses on the problem of Differentially Private Stochastic Optimization for (multi-layer) fully connected neural networks with a single output node.

Stochastic Optimization

Improved Analysis of Sparse Linear Regression in Local Differential Privacy Model

no code implementations11 Oct 2023 Liyang Zhu, Meng Ding, Vaneet Aggarwal, Jinhui Xu, Di Wang

To address these issues, we first consider the problem in the $\epsilon$ non-interactive LDP model and provide a lower bound of $\Omega(\frac{\sqrt{dk\log d}}{\sqrt{n}\epsilon})$ on the $\ell_2$-norm estimation error for sub-Gaussian data, where $n$ is the sample size and $d$ is the dimension of the space.

regression

Theoretical Analysis of Robust Overfitting for Wide DNNs: An NTK Approach

1 code implementation9 Oct 2023 Shaopeng Fu, Di Wang

Adversarial training (AT) is a canonical method for enhancing the robustness of deep neural networks (DNNs).

Fake News Detectors are Biased against Texts Generated by Large Language Models

no code implementations15 Sep 2023 Jinyan Su, Terry Yue Zhuo, Jonibek Mansurov, Di Wang, Preslav Nakov

The spread of fake news has emerged as a critical challenge, undermining trust and posing threats to society.

Misinformation

Adaptive Distributed Kernel Ridge Regression: A Feasible Distributed Learning Scheme for Data Silos

no code implementations8 Sep 2023 Di Wang, Xiaotong Liu, Shao-Bo Lin, Ding-Xuan Zhou

Data silos, mainly caused by privacy and interoperability, significantly constrain collaborations among different organizations with similar data for the same purpose.

Decision Making regression

On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization

no code implementations18 Jun 2023 Mudit Gaur, Amrit Singh Bedi, Di Wang, Vaneet Aggarwal

To achieve that, we propose a Natural Actor-Critic algorithm with 2-Layer critic parametrization (NAC2L).

Decision Making

Generalization Guarantees of Gradient Descent for Multi-Layer Neural Networks

no code implementations26 May 2023 Puyu Wang, Yunwen Lei, Di Wang, Yiming Ying, Ding-Xuan Zhou

This sheds light on sufficient or necessary conditions for under-parameterized and over-parameterized NNs trained by GD to attain the desired risk rate of $O(1/\sqrt{n})$.

DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text

1 code implementation23 May 2023 Jinyan Su, Terry Yue Zhuo, Di Wang, Preslav Nakov

One is called DetectLLM-LRR, which is fast and efficient, and the other is called DetectLLM-NPR, which is more accurate, but slower due to the need for perturbations.

Misinformation

An Interactively Reinforced Paradigm for Joint Infrared-Visible Image Fusion and Saliency Object Detection

1 code implementation17 May 2023 Di Wang, JinYuan Liu, Risheng Liu, Xin Fan

Their common characteristic of seeking complementary cues from different source images motivates us to explore the collaborative relationship between Fusion and Salient object detection tasks on infrared and visible images via an Interactively Reinforced multi-task paradigm for the first time, termed IRFS.

Infrared And Visible Image Fusion object-detection +2

SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model

2 code implementations NeurIPS 2023 Di Wang, Jing Zhang, Bo Du, Minqiang Xu, Lin Liu, DaCheng Tao, Liangpei Zhang

In this study, we leverage SAM and existing RS object detection datasets to develop an efficient pipeline for generating a large-scale RS segmentation dataset, dubbed SAMRS.

Instance Segmentation Object +4

Empowering Agrifood System with Artificial Intelligence: A Survey of the Progress, Challenges and Opportunities

1 code implementation3 May 2023 Tao Chen, Liang Lv, Di Wang, Jing Zhang, Yue Yang, Zeyang Zhao, Chen Wang, Xiaowei Guo, Hao Chen, Qingye Wang, Yufei Xu, Qiming Zhang, Bo Du, Liangpei Zhang, DaCheng Tao

With the world population rapidly increasing, transforming our agrifood systems to be more productive, efficient, safe, and sustainable is crucial to mitigate potential food shortages.

Survey

HKNAS: Classification of Hyperspectral Imagery Based on Hyper Kernel Neural Architecture Search

1 code implementation23 Apr 2023 Di Wang, Bo Du, Liangpei Zhang, DaCheng Tao

Recent neural architecture search (NAS) based approaches have made great progress in hyperspectral image (HSI) classification tasks.

Neural Architecture Search

DCN-T: Dual Context Network with Transformer for Hyperspectral Image Classification

2 code implementations19 Apr 2023 Di Wang, Jing Zhang, Bo Du, Liangpei Zhang, DaCheng Tao

Hyperspectral image (HSI) classification is challenging due to spatial variability caused by complex imaging conditions.

Hyperspectral Image Classification Image Generation

Practical Differentially Private and Byzantine-resilient Federated Learning

1 code implementation15 Apr 2023 Zihang Xiang, Tianhao Wang, WanYu Lin, Di Wang

In contrast, we leverage the random noise to construct an aggregation that effectively rejects many existing Byzantine attacks.

Federated Learning Privacy Preserving

Inductive Graph Unlearning

1 code implementation6 Apr 2023 Cheng-Long Wang, Mengdi Huai, Di Wang

To extend machine unlearning to graph data, \textit{GraphEraser} has been proposed.

Fairness Graph Learning +2

Differentially Private Stochastic Convex Optimization in (Non)-Euclidean Space Revisited

no code implementations31 Mar 2023 Jinyan Su, Changhong Zhao, Di Wang

In this paper, we revisit the problem of Differentially Private Stochastic Convex Optimization (DP-SCO) in Euclidean and general $\ell_p^d$ spaces.

DIME-Net: Neural Network-Based Dynamic Intrinsic Parameter Rectification for Cameras with Optical Image Stabilization System

no code implementations20 Mar 2023 Shu-Hao Yeh, Shuangyu Xie, Di Wang, Wei Yan, Dezhen Song

Here we propose a novel neural network-based approach that estimates $\mathrm{K}$ matrix in real-time so that pose estimation or scene reconstruction can be run at camera native resolution for the highest accuracy on mobile devices.

3D Reconstruction Camera Pose Estimation +1

Sketching with Spherical Designs for Noisy Data Fitting on Spheres

no code implementations8 Mar 2023 Shao-Bo Lin, Di Wang, Ding-Xuan Zhou

These interesting findings show that the proposed sketching strategy is capable of fitting massive and noisy data on spheres.

Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes

no code implementations21 Feb 2023 Di Wang, Yao Wang, Shaojie Tang, Shao-Bo Lin

The novelties of our research are as follows: 1) From a methodological perspective, we present a novel and scalable approach for generating DTRs by combining distributed learning with Q-learning.

Learning Theory Medical Diagnosis +2

Personalized and privacy-preserving federated heterogeneous medical image analysis with PPPML-HMI

1 code implementation20 Feb 2023 Juexiao Zhou, Longxi Zhou, Di Wang, Xiaopeng Xu, Haoyang Li, Yuetan Chu, Wenkai Han, Xin Gao

However, there are few open-source frameworks for federated heterogeneous medical image analysis with personalization and privacy protection simultaneously without the demand to modify the existing model structures or to share any private data.

Medical Image Analysis Privacy Preserving

Robust Budget Pacing with a Single Sample

no code implementations3 Feb 2023 Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang

Given the inherent non-stationarity in an advertiser's value and also competing advertisers' values over time, a commonly used approach is to learn a target expenditure plan that specifies a target spend as a function of time, and then run a controller that tracks this plan.

Quantum Heavy-tailed Bandits

no code implementations23 Jan 2023 Yulian Wu, Chaowen Guan, Vaneet Aggarwal, Di Wang

In this paper, we study multi-armed bandits (MAB) and stochastic linear bandits (SLB) with heavy-tailed rewards and quantum reward oracle.

Multi-Armed Bandits

Differentially Private Natural Language Models: Recent Advances and Future Directions

no code implementations22 Jan 2023 Lijie Hu, Ivan Habernal, Lei Shen, Di Wang

In this paper, we provide the first systematic review of recent advances in DP deep learning models in NLP.

Deep Learning

USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval

1 code implementation17 Jan 2023 Yan Zhang, Zhong Ji, Di Wang, Yanwei Pang, Xuelong Li

(2) It limits the scale of negative sample pairs by employing the mini-batch based end-to-end training mechanism.

Contrastive Learning Image-text Retrieval +3

Loss-Controlling Calibration for Predictive Models

no code implementations11 Jan 2023 Di Wang, Junzhi Shi, PingPing Wang, Shuo Zhuang, Hongyue Li

By comparison, the predictors built by the proposed loss-controlling approach are not limited to set predictors, and the loss function can be any measurable function without the monotone assumption.

Prediction Weather Forecasting

Conformal Loss-Controlling Prediction

no code implementations6 Jan 2023 Di Wang, Ping Wang, Zhong Ji, Xiaojun Yang, Hongyue Li

Conformal prediction is a learning framework controlling prediction coverage of prediction sets, which can be built on any learning algorithm for point prediction.

Conformal Prediction Prediction +1

Multi-Aspect Explainable Inductive Relation Prediction by Sentence Transformer

1 code implementation4 Jan 2023 Zhixiang Su, Di Wang, Chunyan Miao, Lizhen Cui

Recent studies on knowledge graphs (KGs) show that path-based methods empowered by pre-trained language models perform well in the provision of inductive and explainable relation predictions.

Inductive Relation Prediction Knowledge Graphs +4

Broad Learning System with Takagi-Sugeno Fuzzy Subsystem for Tobacco Origin Identification based on Near Infrared Spectroscopy

no code implementations31 Dec 2022 Di Wang, Simon X. Yang

In this paper, a novel broad learning system with Takagi-Sugeno (TS) fuzzy subsystem is proposed for rapid identification of tobacco origin.

Incremental Learning

Quantizing Heavy-tailed Data in Statistical Estimation: (Near) Minimax Rates, Covariate Quantization, and Uniform Recovery

no code implementations30 Dec 2022 Junren Chen, Michael K. Ng, Di Wang

Our major standpoint is that (near) minimax rates of estimation error are achievable merely from the quantized data produced by the proposed scheme.

compressed sensing Matrix Completion +1

Intelligent Feature Extraction, Data Fusion and Detection of Concrete Bridge Cracks: Current Development and Challenges

no code implementations24 Dec 2022 Di Wang, Simon X. Yang

As a common appearance defect of concrete bridges, cracks are important indices for bridge structure health assessment.

SEAT: Stable and Explainable Attention

no code implementations23 Nov 2022 Lijie Hu, Yixin Liu, Ninghao Liu, Mengdi Huai, Lichao Sun, Di Wang

Results show that SEAT is more stable against different perturbations and randomness while also keeps the explainability of attention, which indicates it is a more faithful explanation.

Semantic-aware Texture-Structure Feature Collaboration for Underwater Image Enhancement

1 code implementation19 Nov 2022 Di Wang, Long Ma, Risheng Liu, Xin Fan

To address the above limitations, we develop an efficient and compact enhancement network in collaboration with a high-level semantic-aware pretrained model, aiming to exploit its hierarchical feature representation as an auxiliary for the low-level underwater image enhancement.

Image Enhancement object-detection +2

1st ICLR International Workshop on Privacy, Accountability, Interpretability, Robustness, Reasoning on Structured Data (PAIR^2Struct)

no code implementations7 Oct 2022 Hao Wang, WanYu Lin, Hao He, Di Wang, Chengzhi Mao, Muhan Zhang

Recent years have seen advances on principles and guidance relating to accountable and ethical use of artificial intelligence (AI) spring up around the globe.

On Stability and Generalization of Bilevel Optimization Problem

no code implementations3 Oct 2022 Meng Ding, Mingxi Lei, Yunwen Lei, Di Wang, Jinhui Xu

In this paper, we conduct a thorough analysis on the generalization of first-order (gradient-based) methods for the bilevel optimization problem.

Bilevel Optimization Meta-Learning

On PAC Learning Halfspaces in Non-interactive Local Privacy Model with Public Unlabeled Data

no code implementations17 Sep 2022 Jinyan Su, Jinhui Xu, Di Wang

In this paper, we study the problem of PAC learning halfspaces in the non-interactive local differential privacy model (NLDP).

PAC learning Self-Supervised Learning

Truthful Generalized Linear Models

no code implementations16 Sep 2022 Yuan Qiu, Jinyan Liu, Di Wang

In the first part of the paper, we consider the case where the covariates are sub-Gaussian and the responses are heavy-tailed where they only have the finite fourth moments.

regression

Online Bidding Algorithms for Return-on-Spend Constrained Advertisers

no code implementations29 Aug 2022 Zhe Feng, Swati Padmanabhan, Di Wang

We contribute a simple online algorithm that achieves near-optimal regret in expectation while always respecting the specified RoS constraint when the input sequence of queries are i. i. d.

KL-divergence Based Deep Learning for Discrete Time Model

no code implementations10 Aug 2022 Li Liu, Xiangeng Fang, Di Wang, Weijing Tang, Kevin He

Neural Network (Deep Learning) is a modern model in Artificial Intelligence and it has been exploited in Survival Analysis.

Deep Learning Survival Analysis +1

Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model

2 code implementations8 Aug 2022 Di Wang, Qiming Zhang, Yufei Xu, Jing Zhang, Bo Du, DaCheng Tao, Liangpei Zhang

Large-scale vision foundation models have made significant progress in visual tasks on natural images, with vision transformers being the primary choice due to their good scalability and representation ability.

Aerial Scene Classification Few-Shot Learning +2

Deep neural network heatmaps capture Alzheimer's disease patterns reported in a large meta-analysis of neuroimaging studies

no code implementations22 Jul 2022 Di Wang, Nicolas Honnorat, Peter T. Fox, Kerstin Ritter, Simon B. Eickhoff, Sudha Seshadri, Mohamad Habes

Deep neural networks currently provide the most advanced and accurate machine learning models to distinguish between structural MRI scans of subjects with Alzheimer's disease and healthy controls.

Unsupervised Misaligned Infrared and Visible Image Fusion via Cross-Modality Image Generation and Registration

1 code implementation24 May 2022 Di Wang, JinYuan Liu, Xin Fan, Risheng Liu

Moreover, to better fuse the registered infrared images and visible images, we present a feature Interaction Fusion Module (IFM) to adaptively select more meaningful features for fusion in the Dual-path Interaction Fusion Network (DIFN).

Image Generation Infrared And Visible Image Fusion +1

An Empirical Study of Remote Sensing Pretraining

2 code implementations6 Apr 2022 Di Wang, Jing Zhang, Bo Du, Gui-Song Xia, DaCheng Tao

To this end, we train different networks from scratch with the help of the largest RS scene recognition dataset up to now -- MillionAID, to obtain a series of RS pretrained backbones, including both convolutional neural networks (CNN) and vision transformers such as Swin and ViTAE, which have shown promising performance on computer vision tasks.

Aerial Scene Classification Building change detection for remote sensing images +5

High Dimensional Statistical Estimation under Uniformly Dithered One-bit Quantization

no code implementations26 Feb 2022 Junren Chen, Cheng-Long Wang, Michael K. Ng, Di Wang

In heavy-tailed regime, while the rates of our estimators become essentially slower, these results are either the first ones in an 1-bit quantized and heavy-tailed setting, or already improve on existing comparable results from some respect.

compressed sensing Low-Rank Matrix Completion +2

Differentially Private $\ell_1$-norm Linear Regression with Heavy-tailed Data

no code implementations10 Jan 2022 Di Wang, Jinhui Xu

Firstly, we study the case where the $\ell_2$ norm of data has bounded second order moment.

regression

VDPC: Variational Density Peak Clustering Algorithm

no code implementations29 Dec 2021 Yizhang Wang, Di Wang, You Zhou, Xiaofeng Zhang, Chai Quek

Furthermore, we divide all data points into different levels according to their local density and propose a unified clustering framework by combining the advantages of both DPC and DBSCAN.

Clustering

Fed2: Feature-Aligned Federated Learning

no code implementations28 Nov 2021 Fuxun Yu, Weishan Zhang, Zhuwei Qin, Zirui Xu, Di Wang, ChenChen Liu, Zhi Tian, Xiang Chen

Federated learning learns from scattered data by fusing collaborative models from local nodes.

Federated Learning

A Survey of Large-Scale Deep Learning Serving System Optimization: Challenges and Opportunities

no code implementations28 Nov 2021 Fuxun Yu, Di Wang, Longfei Shangguan, Minjia Zhang, Xulong Tang, ChenChen Liu, Xiang Chen

With both scaling trends, new problems and challenges emerge in DL inference serving systems, which gradually trends towards Large-scale Deep learning Serving systems (LDS).

Deep Learning Survey

Carousel Memory: Rethinking the Design of Episodic Memory for Continual Learning

1 code implementation14 Oct 2021 Soobee Lee, Minindu Weerakoon, Jonghyun Choi, Minjia Zhang, Di Wang, Myeongjae Jeon

In particular, in mobile and IoT devices, real-time data can be stored not just in high-speed RAMs but in internal storage devices as well, which offer significantly larger capacity than the RAMs.

Continual Learning Management

Incorporating Surprisingly Popular Algorithm and Euclidean Distance-based Adaptive Topology into PSO

1 code implementation25 Aug 2021 Xuan Wu, Jizong Han, Di Wang, Pengyue Gao, Quanlong Cui, Liang Chen, Yanchun Liang, Han Huang, Heow Pueh Lee, Chunyan Miao, You Zhou, Chunguo Wu

While many Particle Swarm Optimization (PSO) algorithms only use fitness to assess the performance of particles, in this work, we adopt Surprisingly Popular Algorithm (SPA) as a complementary metric in addition to fitness.

Diversity Single Particle Analysis

PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction

1 code implementation ACL 2021 Shulin Liu, Tao Yang, Tianchi Yue, Feng Zhang, Di Wang

In this paper, we propose a Pre-trained masked Language model with Misspelled knowledgE (PLOME) for CSC, which jointly learns how to understand language and correct spelling errors.

Language Modeling Language Modelling +1

Concept-Based Label Embedding via Dynamic Routing for Hierarchical Text Classification

1 code implementation ACL 2021 Xuepeng Wang, Li Zhao, Bing Liu, Tao Chen, Feng Zhang, Di Wang

In this paper, we propose a novel concept-based label embedding method that can explicitly represent the concept and model the sharing mechanism among classes for the hierarchical text classification.

text-classification Text Classification

Faster Rates of Private Stochastic Convex Optimization

no code implementations31 Jul 2021 Jinyan Su, Lijie Hu, Di Wang

Specifically, we first show that under some mild assumptions on the loss functions, there is an algorithm whose output could achieve an upper bound of $\tilde{O}((\frac{1}{\sqrt{n}}+\frac{\sqrt{d\log \frac{1}{\delta}}}{n\epsilon})^\frac{\theta}{\theta-1})$ for $(\epsilon, \delta)$-DP when $\theta\geq 2$, here $n$ is the sample size and $d$ is the dimension of the space.

High Dimensional Differentially Private Stochastic Optimization with Heavy-tailed Data

no code implementations23 Jul 2021 Lijie Hu, Shuo Ni, Hanshen Xiao, Di Wang

To better understand the challenges arising from irregular data distribution, in this paper we provide the first study on the problem of DP-SCO with heavy-tailed data in the high dimensional space.

Sparse Learning Stochastic Optimization +1

Spectral-Spatial Global Graph Reasoning for Hyperspectral Image Classification

2 code implementations26 Jun 2021 Di Wang, Bo Du, Liangpei Zhang

To tackle these problems, in this paper, different from previous approaches, we perform the superpixel generation on intermediate features during network training to adaptively produce homogeneous regions, obtain graph structures, and further generate spatial descriptors, which are served as graph nodes.

Classification Hyperspectral Image Classification

UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction

1 code implementation Findings (ACL) 2021 Huanqin Wu, Wei Liu, Lei LI, Dan Nie, Tao Chen, Feng Zhang, Di Wang

Keyphrase Prediction (KP) task aims at predicting several keyphrases that can summarize the main idea of the given document.

Diversity Relation

Optimal Rates of (Locally) Differentially Private Heavy-tailed Multi-Armed Bandits

no code implementations4 Jun 2021 Youming Tao, Yulian Wu, Peng Zhao, Di Wang

Finally, we establish the lower bound to show that the instance-dependent regret of our improved algorithm is optimal.

Multi-Armed Bandits

$\ell_2$-norm Flow Diffusion in Near-Linear Time

no code implementations30 May 2021 Li Chen, Richard Peng, Di Wang

Diffusion is a fundamental graph procedure and has been a basic building block in a wide range of theoretical and empirical applications such as graph partitioning and semi-supervised learning on graphs.

Clustering Graph Clustering +3

GSA-Forecaster: Forecasting Graph-Based Time-Dependent Data with Graph Sequence Attention

no code implementations13 Apr 2021 Yang Li, Di Wang, José M. F. Moura

This task is challenging as models need not only to capture spatial dependency and temporal dependency within the data, but also to leverage useful auxiliary information for accurate predictions.

3DMNDT:3D multi-view registration method based on the normal distributions transform

no code implementations20 Mar 2021 Jihua Zhu, Di Wang, Jiaxi Mu, Huimin Lu, Zhiqiang Tian, Zhongyu Li

Under the NDT framework, this paper proposes a novel multi-view registration method, named 3D multi-view registration based on the normal distributions transform (3DMNDT), which integrates the K-means clustering and Lie algebra solver to achieve multi-view registration.

Clustering

Minimum Cost Flows, MDPs, and $\ell_1$-Regression in Nearly Linear Time for Dense Instances

no code implementations14 Jan 2021 Jan van den Brand, Yin Tat Lee, Yang P. Liu, Thatchaphol Saranurak, Aaron Sidford, Zhao Song, Di Wang

In the special case of the minimum cost flow problem on $n$-vertex $m$-edge graphs with integer polynomially-bounded costs and capacities we obtain a randomized method which solves the problem in $\tilde{O}(m+n^{1. 5})$ time.

Data Structures and Algorithms Optimization and Control

CARE: Commonsense-Aware Emotional Response Generation with Latent Concepts

no code implementations15 Dec 2020 Peixiang Zhong, Di Wang, Pengfei Li, Chen Zhang, Hao Wang, Chunyan Miao

Experimental results on two large-scale datasets support our hypothesis and show that our model can produce more accurate and commonsense-aware emotional responses and achieve better human ratings than state-of-the-art models that only specialize in one aspect.

Response Generation

Third ArchEdge Workshop: Exploring the Design Space of Efficient Deep Neural Networks

no code implementations22 Nov 2020 Fuxun Yu, Dimitrios Stamoulis, Di Wang, Dimitrios Lymberopoulos, Xiang Chen

This paper gives an overview of our ongoing work on the design space exploration of efficient deep neural networks (DNNs).

Empirical Risk Minimization in the Non-interactive Local Model of Differential Privacy

no code implementations11 Nov 2020 Di Wang, Marco Gaboardi, Adam Smith, Jinhui Xu

In our second attempt, we show that for any $1$-Lipschitz generalized linear convex loss function, there is an $(\epsilon, \delta)$-LDP algorithm whose sample complexity for achieving error $\alpha$ is only linear in the dimensionality $p$.

Deep Learning Analysis and Age Prediction from Shoeprints

1 code implementation7 Nov 2020 Muhammad Hassan, Yan Wang, Di Wang, Daixi Li, Yanchun Liang, You Zhou, Dong Xu

We collected 100, 000 shoeprints of subjects ranging from 7 to 80 years old and used the data to develop a deep learning end-to-end model ShoeNet to analyze age-related patterns and predict age.

Deep Learning Gender Classification +1

Differentially Private (Gradient) Expectation Maximization Algorithm with Statistical Guarantees

no code implementations22 Oct 2020 Di Wang, Jiahao Ding, Lijie Hu, Zejun Xie, Miao Pan, Jinhui Xu

To address this issue, we propose in this paper the first DP version of (Gradient) EM algorithm with statistical guarantees.

On Differentially Private Stochastic Convex Optimization with Heavy-tailed Data

no code implementations ICML 2020 Di Wang, Hanshen Xiao, Srini Devadas, Jinhui Xu

For this case, we propose a method based on the sample-and-aggregate framework, which has an excess population risk of $\tilde{O}(\frac{d^3}{n\epsilon^4})$ (after omitting other factors), where $n$ is the sample size and $d$ is the dimensionality of the data.

Estimating Stochastic Linear Combination of Non-linear Regressions Efficiently and Scalably

no code implementations19 Oct 2020 Di Wang, Xiangyu Guo, Chaowen Guan, Shi Li, Jinhui Xu

To the best of our knowledge, this is the first work that studies and provides theoretical guarantees for the stochastic linear combination of non-linear regressions model.

LEMMA

Robust High Dimensional Expectation Maximization Algorithm via Trimmed Hard Thresholding

no code implementations19 Oct 2020 Di Wang, Xiangyu Guo, Shi Li, Jinhui Xu

In this paper, we study the problem of estimating latent variable models with arbitrarily corrupted samples in high dimensional space ({\em i. e.,} $d\gg n$) where the underlying parameter is assumed to be sparse.

Vocal Bursts Intensity Prediction

Learning Robust Algorithms for Online Allocation Problems Using Adversarial Training

no code implementations16 Oct 2020 Goran Zuzic, Di Wang, Aranyak Mehta, D. Sivakumar

In this paper, we focus on the AdWords problem, which is a classical online budgeted matching problem of both theoretical and practical significance.

ECG Beats Fast Classification Base on Sparse Dictionaries

1 code implementation8 Sep 2020 Nanyu Li, Yujuan Si, Di Wang, Tong Liu, Jinrun Yu

In VQ method, a set of dictionaries corresponding to segments of ECG beats is trained, and VQ codes are used to represent each heartbeat.

Classification Dictionary Learning +3

Heterogeneous Federated Learning

no code implementations15 Aug 2020 Fuxun Yu, Weishan Zhang, Zhuwei Qin, Zirui Xu, Di Wang, ChenChen Liu, Zhi Tian, Xiang Chen

Specifically, we design a feature-oriented regulation method ({$\Psi$-Net}) to ensure explicit feature information allocation in different neural network structures.

Federated Learning

AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency

no code implementations14 Aug 2020 Fuxun Yu, ChenChen Liu, Di Wang, Yanzhi Wang, Xiang Chen

Based on the neural network attention mechanism, we propose a comprehensive dynamic optimization framework including (1) testing-phase channel and column feature map pruning, as well as (2) training-phase optimization by targeted dropout.

Raising Expectations: Automating Expected Cost Analysis with Types

no code implementations24 Jun 2020 Di Wang, David M Kahn, Jan Hoffmann

The effectiveness of the technique is evaluated by analyzing the sample complexity of discrete distributions and with a novel average-case estimation for deterministic programs that combines expected cost analysis with statistical methods.

Programming Languages

$p$-Norm Flow Diffusion for Local Graph Clustering

2 code implementations20 May 2020 Kimon Fountoulakis, Di Wang, Shenghao Yang

Local graph clustering and the closely related seed set expansion problem are primitives on graphs that are central to a wide range of analytic and learning tasks such as local clustering, community detection, nodes ranking and feature inference.

Clustering Community Detection +1

Towards Assessment of Randomized Smoothing Mechanisms for Certifying Adversarial Robustness

no code implementations15 May 2020 Tianhang Zheng, Di Wang, Baochun Li, Jinhui Xu

Based on our framework, we assess the Gaussian and Exponential mechanisms by comparing the magnitude of additive noise required by these mechanisms and the lower bounds (criteria).

Adversarial Robustness

Distributed Kernel Ridge Regression with Communications

no code implementations27 Mar 2020 Shao-Bo Lin, Di Wang, Ding-Xuan Zhou

This paper focuses on generalization performance analysis for distributed algorithms in the framework of learning theory.

Learning Theory regression

Robust Feature-Based Point Registration Using Directional Mixture Model

no code implementations25 Nov 2019 Saman Fahandezh-Saadi, Di Wang, Masayoshi Tomizuka

This paper presents a robust probabilistic point registration method for estimating the rigid transformation (i. e. rotation matrix and translation vector) between two pointcloud dataset.

Translation

Unsupervised Domain Adaptation for Object Detection via Cross-Domain Semi-Supervised Learning

1 code implementation17 Nov 2019 Fuxun Yu, Di Wang, Yinpeng Chen, Nikolaos Karianakis, Tong Shen, Pei Yu, Dimitrios Lymberopoulos, Sidi Lu, Weisong Shi, Xiang Chen

In this work, we show that such adversarial-based methods can only reduce the domain style gap, but cannot address the domain content distribution gap that is shown to be important for object detectors.

Object object-detection +2

Facility Location Problem in Differential Privacy Model Revisited

no code implementations NeurIPS 2019 Yunus Esencayi, Marco Gaboardi, Shi Li, Di Wang

On the negative side, we show that the approximation ratio of any $\epsilon$-DP algorithm is lower bounded by $\Omega(\frac{1}{\sqrt{\epsilon}})$, even for instances on HST metrics with uniform facility cost, under the super-set output setting.

model

Estimating Smooth GLM in Non-interactive Local Differential Privacy Model with Public Unlabeled Data

no code implementations1 Oct 2019 Di Wang, Lijie Hu, Huanyu Zhang, Marco Gaboardi, Jinhui Xu

In the second part of the paper, we extend our idea to the problem of estimating non-linear regressions and show similar results as in GLMs for both multivariate Gaussian and sub-Gaussian cases.

LEMMA

Faster width-dependent algorithm for mixed packing and covering LPs

no code implementations NeurIPS 2019 Digvijay Boob, Saurabh Sawlani, Di Wang

As a special case of our result, we report a $1+\eps$ approximation algorithm for the densest subgraph problem which runs in time $O(md/ \eps)$, where $m$ is the number of edges in the graph and $d$ is the maximum graph degree.

Combinatorial Optimization

A Unified framework for randomized smoothing based certified defenses

no code implementations25 Sep 2019 Tianhang Zheng, Di Wang, Baochun Li, Jinhui Xu

We answer the above two questions by first demonstrating that Gaussian mechanism and Exponential mechanism are the (near) optimal options to certify the $\ell_2$ and $\ell_\infty$-normed robustness.

YaoGAN: Learning Worst-case Competitive Algorithms from Self-generated Inputs

no code implementations25 Sep 2019 Goran Zuzic, Di Wang, Aranyak Mehta, D. Sivakumar

To answer this question, we draw insights from classic results in game theory, analysis of algorithms, and online learning to introduce a novel framework.

Combinatorial Optimization Generative Adversarial Network

Cannot find the paper you are looking for? You can Submit a new open access paper.