Search Results for author: Xiao Zhang

Found 202 papers, 81 papers with code

RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax

1 code implementation ECCV 2020 Xiao Zhang, Rui Zhao, Yu Qiao, Hongsheng Li

To address this problem, this paper introduces a novel Radial Basis Function (RBF) distances to replace the commonly used inner products in the softmax loss function, such that it can adaptively assign losses to regularize the intra-class and inter-class distances by reshaping the relative differences, and thus creating more representative prototypes of classes to improve optimization.

软件标识符的自然语言规范性研究(Research on the Natural Language Normalness of Software Identifiers)

no code implementations CCL 2021 Dongzhen Wen, Fan Zhang, Xiao Zhang, Liang Yang, Yuan Lin, Bo Xu, Hongfei Lin

“软件源代码的理解则是软件协同开发与维护的核心, 而源代码中占半数以上的标识符的理解则在软件理解中起到重要作用, 传统软件工程主要研究通过命名规范限制标识符的命名过程以构造更易理解和交流的标识符。本文则在梳理分析常见编程语言命名规范的基础上, 提出一种全新的标识符可理解性评价标准。具体而言, 本文首先总结梳理了常见主流编程语言中的命名规范并类比自然语言语素概念本文提出基于软件语素的标识符构成过程, 即标识符的构成可被视为软件语素的生成、排列和连接过程。在此基础上, 本文提出一种结合自然语料库的软件标识符规范性评价方法, 用来衡量软件标识符是否易于理解。最后, 本文通过源代码理解数据集和乇乩乴乨乵乢平台中开源项目对规范性指标进行了验证性实验, 结果表明本文提出的规范性分数能够很好衡量软件项目的可理解性。”

TSVC:Tripartite Learning with Semantic Variation Consistency for Robust Image-Text Retrieval

no code implementations19 Jan 2025 Shuai Lyu, Zijing Tian, Zhonghong Ou, Yifan Zhu, Xiao Zhang, Qiankun Ha, Haoran Luo, Meina Song

In order to resolve this problem, we introduce a Tripartite learning with Semantic Variation Consistency (TSVC) for robust image-text retrieval.

ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding

no code implementations14 Jan 2025 Zhongxiang Sun, QiPeng Wang, Weijie Yu, Xiaoxue Zang, Kai Zheng, Jun Xu, Xiao Zhang, Song Yang, Han Li

ReARTeR addresses three core challenges: (1) misalignment between PRM and PEM, tackled through off-policy preference learning; (2) bias in PRM training data, mitigated by balanced annotation methods and stronger annotations for challenging examples; and (3) early-step bias in PRM, resolved through a temporal-difference-based look-ahead search strategy.

RAG Retrieval

DivTrackee versus DynTracker: Promoting Diversity in Anti-Facial Recognition against Dynamic FR Strategy

no code implementations11 Jan 2025 Wenshu Fan, Minxing Zhang, Hongwei Li, Wenbo Jiang, Hanxiao Chen, Xiangyu Yue, Michael Backes, Xiao Zhang

Through comprehensive experiments on various facial image benchmarks and feature extractors, we demonstrate DynTracker's strength in breaking existing AFR methods and the superiority of DivTrackee in preventing user facial images from being identified by dynamic FR strategies.

Diversity Image Generation

HELPNet: Hierarchical Perturbations Consistency and Entropy-guided Ensemble for Scribble Supervised Medical Image Segmentation

1 code implementation25 Dec 2024 Xiao Zhang, Shaoxuan Wu, Peilin Zhang, Zhuo Jin, Xiaosong Xiong, Qirong Bu, Jingkun Chen, Jun Feng

Experimental results on three public datasets ACDC, MSCMRseg, and CHAOS show that HELPNet significantly outperforms state-of-the-art methods for scribble-based weakly supervised segmentation and achieves performance comparable to fully supervised methods.

Image Segmentation Medical Image Segmentation +4

Trigger$^3$: Refining Query Correction via Adaptive Model Selector

no code implementations17 Dec 2024 Kepu Zhang, Zhongxiang Sun, Xiao Zhang, Xiaoxue Zang, Kai Zheng, Yang song, Jun Xu

To tackle these, we propose Trigger$^3$, a large-small model collaboration framework that integrates the traditional correction model and LLM for query correction, capable of adaptively choosing the appropriate correction method based on the query and the correction results from the traditional correction model and LLM.

Retrieval-Augmented Semantic Parsing: Using Large Language Models to Improve Generalization

no code implementations13 Dec 2024 Xiao Zhang, Qianru Meng, Johan Bos

Open-domain semantic parsing remains a challenging task, as models often rely on heuristics and struggle to handle unseen concepts.

Decoder Retrieval +1

Nested Diffusion Models Using Hierarchical Latent Priors

no code implementations8 Dec 2024 Xiao Zhang, Ruoxi Jiang, Rebecca Willett, Michael Maire

Our approach employs a series of diffusion models to progressively generate latent variables at different semantic levels.

Dimensionality Reduction Image Generation

Can Targeted Clean-Label Poisoning Attacks Generalize?

1 code implementation5 Dec 2024 Zhizhen Chen, Subrat Kishore Dutta, Zhengyu Zhao, Chenhao Lin, Chao Shen, Xiao Zhang

In a common clean-label setting, they are achieved by slightly perturbing a subset of training samples given access to those specific targets.

LampMark: Proactive Deepfake Detection via Training-Free Landmark Perceptual Watermarks

no code implementations26 Nov 2024 Tianyi Wang, Mengxiao Huang, Harry Cheng, Xiao Zhang, Zhiqi Shen

Relying on promising watermark recovery accuracies, Deepfake detection is accomplished by assessing the consistency between the content-matched landmark perceptual watermark and the robustly recovered watermark of the suspect image.

DeepFake Detection Face Swapping

PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement

no code implementations26 Nov 2024 Tewodros Ayalew, Xiao Zhang, Kevin Yuanbo Wu, Tianchong Jiang, Michael Maire, Matthew R. Walter

Utilizing this progress prediction as a dense reward together with an adversarial push-back, we show that PROGRESSOR enables robots to learn complex behaviors without any external supervision.

Offline RL Reinforcement Learning (RL)

GASP: Efficient Black-Box Generation of Adversarial Suffixes for Jailbreaking LLMs

1 code implementation21 Nov 2024 Advik Raj Basani, Xiao Zhang

Large Language Models (LLMs) have shown impressive proficiency across a range of natural language processing tasks yet remain vulnerable to adversarial prompts, known as jailbreak attacks, carefully designed to elicit harmful responses from LLMs.

Bayesian Optimization Red Teaming

Length-Induced Embedding Collapse in Transformer-based Models

no code implementations31 Oct 2024 Yuqi Zhou, Sunhao Dai, Zhanshuo Cao, Xiao Zhang, Jun Xu

In this paper, we find that the performance degradation is due to a phenomenon called Length Collapse, where longer text embeddings collapse into a narrow space.

DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination

1 code implementation31 Oct 2024 Jia Fu, Xiao Zhang, Sepideh Pashami, Fatemeh Rahimian, Anders Holst

In the ever-evolving adversarial machine learning landscape, developing effective defenses against patch attacks has become a critical challenge, necessitating reliable solutions to safeguard real-world AI systems.

Adversarial Robustness Binarization +3

Predicting time-varying flux and balance in metabolic systems using structured neural-ODE processes

1 code implementation18 Oct 2024 Santanu Rathod, Pietro Lio, Xiao Zhang

We develop a novel data-driven framework as an alternative to dynamic flux balance analysis, bypassing the demand for deep domain knowledge and manual efforts to formulate the optimization problem.

Decoder

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

1 code implementation17 Oct 2024 Yuzhe Yang, Yifei Zhang, Yan Hu, Yilin Guo, Ruoli Gan, Yueru He, Mingcong Lei, Xiao Zhang, Haining Wang, Qianqian Xie, Jimin Huang, Honghai Yu, Benyou Wang

Our results show a significant alignment between benchmark scores and human preferences, with a Pearson correlation coefficient of 0. 78, confirming the effectiveness of the UCFE dataset and our evaluation approach.

Benchmarking

ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability

no code implementations15 Oct 2024 Zhongxiang Sun, Xiaoxue Zang, Kai Zheng, Jun Xu, Xiao Zhang, Weijie Yu, Yang song, Han Li

We discover hallucinations occur when the Knowledge FFNs in LLMs overemphasize parametric knowledge in the residual stream, while Copying Heads fail to effectively retain or integrate external knowledge from retrieved content.

Hallucination RAG +1

LargePiG: Your Large Language Model is Secretly a Pointer Generator

no code implementations15 Oct 2024 Zhongxiang Sun, Zihua Si, Xiaoxue Zang, Kai Zheng, Yang song, Xiao Zhang, Jun Xu

To validate the effectiveness of LargePiG, we constructed two datasets for assessing the hallucination problems in query generation, covering both document and video scenarios.

Hallucination Language Modeling +3

Fast Second-Order Online Kernel Learning through Incremental Matrix Sketching and Decomposition

no code implementations15 Oct 2024 Dongxie Wen, Xiao Zhang, Zhewei Wei

Moreover, the absence of incremental updates to manage approximate kernel space causes these algorithms to perform poorly in adversarial environments and real-world streaming recommendation datasets.

Recommendation Systems

Generating Model Parameters for Controlling: Parameter Diffusion for Controllable Multi-Task Recommendation

no code implementations14 Oct 2024 Chenglei Shen, Jiahao Zhao, Xiao Zhang, Weijie Yu, Ming He, Jianping Fan

To address this issue, we propose a novel controllable learning approach via Parameter Diffusion for controllable multi-task Recommendation (PaDiRec), which allows the customization and adaptation of recommendation model parameters to new task requirements without retraining.

Recommendation Systems Test-time Adaptation

Matrix Sketching in Bandits: Current Pitfalls and New Framework

no code implementations14 Oct 2024 Dongxie Wen, Hanyan Yin, Xiao Zhang, Zhewei Wei

To prevent this issue, we propose Dyadic Block Sketching, an innovative streaming matrix sketching approach that adaptively manages sketch size to constrain global spectral loss.

Unity is Power: Semi-Asynchronous Collaborative Training of Large-Scale Models with Structured Pruning in Resource-Limited Clients

no code implementations11 Oct 2024 Yan Li, Mingyi Li, Xiao Zhang, Guangwei Xu, Feng Chen, Yuan Yuan, Yifei Zou, Mengying Zhao, Jianbo Lu, Dongxiao Yu

In this work, we study to release the potential of massive heterogeneous weak computing power to collaboratively train large-scale models on dispersed datasets.

Transfer Learning Unity

Understanding Adversarially Robust Generalization via Weight-Curvature Index

no code implementations10 Oct 2024 Yuelin Xu, Xiao Zhang

Despite extensive research on adversarial examples, the underlying mechanisms of adversarially robust generalization, a critical yet challenging task for deep learning, remain largely unknown.

Adversarial Robustness Generalization Bounds

CreDes: Causal Reasoning Enhancement and Dual-End Searching for Solving Long-Range Reasoning Problems using LLMs

no code implementations2 Oct 2024 Kangsheng Wang, Xiao Zhang, Hao liu, Songde Han, Huimin Ma, Tianyu Hu

Large language models (LLMs) have demonstrated limitations in handling combinatorial optimization problems involving long-range reasoning, partially due to causal hallucinations and huge search space.

Combinatorial Optimization

Enhancing elusive clues in knowledge learning by contrasting attention of language models

1 code implementation26 Sep 2024 Jian Gao, Xiao Zhang, Ji Wu, Miao Li

Causal language models acquire vast amount of knowledge from general text corpus during pretraining, but the efficiency of knowledge learning is known to be unsatisfactory, especially when learning from knowledge-dense and small-sized corpora.

Data Augmentation Language Modeling +2

Reliable and diverse evaluation of LLM medical knowledge mastery

no code implementations22 Sep 2024 Yuxuan Zhou, Xien Liu, Chen Ning, Xiao Zhang, Ji Wu

Finally, these produced predicate variants are converted into textual language, resulting in a series of reliable and diverse test samples to evaluate whether LLMs fully master the given medical factual knowledge point.

Diversity MedQA

Co-occurrence is not Factual Association in Language Models

1 code implementation21 Sep 2024 Xiao Zhang, Miao Li, Ji Wu

Pretrained language models can encode a large amount of knowledge and utilize it for various reasoning tasks, yet they can still struggle to learn novel factual knowledge effectively from finetuning on limited textual demonstrations.

Multi-hop Question Answering Question Answering

CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Casual Significance and Consistency

no code implementations20 Sep 2024 Kangsheng Wang, Xiao Zhang, Zizheng Guo, Tianyu Hu, Huimin Ma

Chain-based reasoning methods like chain of thought (CoT) play a rising role in solving reasoning tasks for large language models (LLMs).

Enhancing Sequential Recommendations through Multi-Perspective Reflections and Iteration

no code implementations10 Sep 2024 Weicong Qin, Yi Xu, Weijie Yu, Chenglei Shen, Xiao Zhang, Ming He, Jianping Fan, Jun Xu

Specifically, MoRE introduces three reflectors for generating LLM-based reflections on explicit preferences, implicit preferences, and collaborative signals.

Collaborative Filtering

Stream-Based Ground Segmentation for Real-Time LiDAR Point Cloud Processing on FPGA

no code implementations19 Aug 2024 Xiao Zhang, Zhanhong Huang, Garcia Gonzalez Antony, Witek Jachimczyk, Xinming Huang

This paper presents a novel and fast approach for ground plane segmentation in a LiDAR point cloud, specifically optimized for processing speed and hardware efficiency on FPGA hardware platforms.

Segmentation

Accelerating Point Cloud Ground Segmentation: From Mechanical to Solid-State Lidars

no code implementations19 Aug 2024 Xiao Zhang, Zhanhong Huang, Garcia Gonzalez Antony, Xinming Huang

In this study, we propose a novel parallel processing method for point cloud ground segmentation, aimed at the technology evolution from mechanical to solid-state Lidar (SSL).

Autonomous Vehicles Segmentation

Modeling Domain and Feedback Transitions for Cross-Domain Sequential Recommendation

no code implementations15 Aug 2024 Changshuo Zhang, Teng Shi, Xiao Zhang, Qi Liu, Ruobing Xie, Jun Xu, Ji-Rong Wen

In this paper, we propose $\text{Transition}^2$, a novel method to model transitions across both domains and types of user feedback.

Sequential Recommendation

InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models

no code implementations15 Jul 2024 Nirat Saini, Navaneeth Bodla, Ashish Shrivastava, Avinash Ravichandran, Xiao Zhang, Abhinav Shrivastava, Bharat Singh

This process begins with inserting the object into a single frame using a ControlNet-based inpainting diffusion model, and then generating subsequent frames conditioned on features from an inpainted frame as an anchor to minimize the domain gap between the background and the object.

Object Video Editing

Preference-Guided Reinforcement Learning for Efficient Exploration

1 code implementation9 Jul 2024 GuoJian Wang, Faguo Wu, Xiao Zhang, Tianyuan Chen, Xuyang Chen, Lin Zhao

To tackle this issue, we introduce LOPE: Learning Online with trajectory Preference guidancE, an end-to-end preference-guided RL framework that enhances exploration efficiency in hard-exploration tasks.

Efficient Exploration reinforcement-learning +2

A Survey of Controllable Learning: Methods and Applications in Information Retrieval

no code implementations4 Jul 2024 Chenglei Shen, Xiao Zhang, Teng Shi, Changshuo Zhang, Guofu Xie, Jun Xu

Controllability has become a crucial aspect of trustworthy machine learning, enabling learners to meet predefined targets and adapt dynamically at test time without requiring retraining as the targets shift.

Information Retrieval Retrieval

Gaze-directed Vision GNN for Mitigating Shortcut Learning in Medical Image

1 code implementation20 Jun 2024 Shaoxuan Wu, Xiao Zhang, Bin Wang, Zhuo Jin, Hansheng Li, Jun Feng

In this paper, we propose a novel gaze-directed Vision GNN (called GD-ViG) to leverage the visual patterns of radiologists from gaze as expert knowledge, directing the network toward disease-relevant regions, and thereby mitigating shortcut learning.

Medical Image Analysis

Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification

no code implementations9 Jun 2024 Yuxin Hong, Xiao Zhang, Xin Zhang, Joey Tianyi Zhou

In the medical field, managing high-dimensional massive medical imaging data and performing reliable medical analysis from it is a critical challenge, especially in resource-limited environments such as remote medical facilities and mobile devices.

Image Classification Medical Image Analysis +1

QAGCF: Graph Collaborative Filtering for Q&A Recommendation

no code implementations7 Jun 2024 Changshuo Zhang, Teng Shi, Xiao Zhang, Yanping Zheng, Ruobing Xie, Qi Liu, Jun Xu, Ji-Rong Wen

Traditional recommendation methods treat the question-answer pair as a whole or only consider the answer as a single item, which overlooks the two challenges and cannot effectively model user interests.

Collaborative Filtering Contrastive Learning +1

Conditional Language Learning with Context

1 code implementation4 Jun 2024 Xiao Zhang, Miao Li, Ji Wu

In this fashion, conditional finetuning achieves selective learning from a corpus, learning knowledge useful for downstream tasks while avoiding learning useless corpus statistics like topic biases.

Causal Language Modeling Language Modeling +1

Latent Intrinsics Emerge from Training to Relight

no code implementations31 May 2024 Xiao Zhang, William Gao, Seemandhar Jain, Michael Maire, David. A. Forsyth, Anand Bhattad

Image relighting is the task of showing what a scene from a source image would look like if illuminated differently.

Image Relighting

ReCODE: Modeling Repeat Consumption with Neural ODE

1 code implementation26 May 2024 Sunhao Dai, Changle Qu, Sirui Chen, Xiao Zhang, Jun Xu

In real-world recommender systems, such as in the music domain, repeat consumption is a common phenomenon where users frequently listen to a small set of preferred songs or artists repeatedly.

Recommendation Systems

Leveraging Unknown Objects to Construct Labeled-Unlabeled Meta-Relationships for Zero-Shot Object Navigation

no code implementations24 May 2024 Yanwei Zheng, Changrui Li, Chuanlin Lan, Yaling Li, Xiao Zhang, Yifei Zou, Dongxiao Yu, Zhipeng Cai

Furthermore, we propose the label-wise meta-correlation module (LWMCM) to harness relationships among objects with and without labels, and obtain enhanced objects information.

Object

Optimal Matrix Sketching over Sliding Windows

no code implementations13 May 2024 Hanyan Yin, Dongxie Wen, Jiajun Li, Zhewei Wei, Xiao Zhang, Zengfeng Huang, Feifei Li

Matrix sketching, aimed at approximating a matrix $\boldsymbol{A} \in \mathbb{R}^{N\times d}$ consisting of vector streams of length $N$ with a smaller sketching matrix $\boldsymbol{B} \in \mathbb{R}^{\ell\times d}, \ell \ll N$, has garnered increasing attention in fields such as large-scale data analytics and machine learning.

Evaluation of Machine Translation Based on Semantic Dependencies and Keywords

no code implementations20 Apr 2024 Kewei Yuan, Qiurong Zhao, Yang Xu, Xiao Zhang, Huansheng Ning

To achieve a comprehensive and in-depth evaluation of the semantic correctness of sentences, the experimental results show that the accuracy of the evaluation algorithm has been improved compared with similar methods, and it can more accurately measure the semantic correctness of machine translation.

Information Retrieval Machine Translation +2

Neural Semantic Parsing with Extremely Rich Symbolic Meaning Representations

1 code implementation19 Apr 2024 Xiao Zhang, Gosse Bouma, Johan Bos

We introduce a neural "taxonomical" semantic parser to utilize this new representation system of predicates, and compare it with a standard neural semantic parser trained on the traditional meaning representation format, employing a novel challenge set and evaluation metric for evaluation.

Semantic Parsing

Residual Connections Harm Generative Representation Learning

no code implementations16 Apr 2024 Xiao Zhang, Ruoxi Jiang, William Gao, Rebecca Willett, Michael Maire

We show that introducing a weighting factor to reduce the influence of identity shortcuts in residual networks significantly enhances semantic feature learning in generative representation learning frameworks, such as masked autoencoders (MAEs) and diffusion models.

Representation Learning

UniSAR: Modeling User Transition Behaviors between Search and Recommendation

1 code implementation15 Apr 2024 Teng Shi, Zihua Si, Jun Xu, Xiao Zhang, Xiaoxue Zang, Kai Zheng, Dewei Leng, Yanan Niu, Yang song

In this paper, we propose a framework named UniSAR that effectively models the different types of fine-grained behavior transitions for providing users a Unified Search And Recommendation service.

Contrastive Learning

PMB5: Gaining More Insight into Neural Semantic Parsing with Challenging Benchmarks

no code implementations12 Apr 2024 Xiao Zhang, Chunliu Wang, Rik van Noord, Johan Bos

The Parallel Meaning Bank (PMB) serves as a corpus for semantic processing with a focus on semantic parsing and text generation.

Semantic Parsing Text Generation

To Search or to Recommend: Predicting Open-App Motivation with Neural Hawkes Process

1 code implementation4 Apr 2024 Zhongxiang Sun, Zihua Si, Xiao Zhang, Xiaoxue Zang, Yang song, Hongteng Xu, Jun Xu

The model, referred to as Neural Hawkes Process-based Open-App Motivation prediction model (NHP-OAM), employs a hierarchical transformer and a novel intensity function to encode multiple factors, and open-app motivation prediction layer to integrate time and user-specific information for predicting users' open-app motivations.

Large Language Models Enhanced Collaborative Filtering

no code implementations26 Mar 2024 Zhongxiang Sun, Zihua Si, Xiaoxue Zang, Kai Zheng, Yang song, Xiao Zhang, Jun Xu

In this paper, drawing inspiration from the in-context learning and chain of thought reasoning in LLMs, we propose the Large Language Models enhanced Collaborative Filtering (LLM-CF) framework, which distils the world knowledge and reasoning capabilities of LLMs into collaborative filtering.

Collaborative Filtering In-Context Learning +2

IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History

no code implementations24 Mar 2024 Yi Xu, Weiran Shen, Xiao Zhang, Jun Xu

This poses a new challenge for existing imitation learning approaches that can only utilize data from experienced experts.

Imitation Learning Out-of-Distribution Generalization +1

An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control

1 code implementation7 Mar 2024 Aosong Feng, Weikang Qiu, Jinbin Bai, Xiao Zhang, Zhen Dong, Kaicheng Zhou, Rex Ying, Leandros Tassiulas

Building on the success of text-to-image diffusion models (DPMs), image editing is an important application to enable human interaction with AI-generated content.

Descriptive

A Comprehensive Survey of Federated Transfer Learning: Challenges, Methods and Applications

no code implementations3 Mar 2024 Wei Guo, Fuzhen Zhuang, Xiao Zhang, Yiqi Tong, Jin Dong

However, since FL enables a continuous share of knowledge among participants with each communication round while not allowing local data to be accessed by other participants, FTL faces many unique challenges that are not present in TL.

Federated Learning Survey +1

AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks

1 code implementation2 Mar 2024 Yifan Zeng, Yiran Wu, Xiao Zhang, Huazheng Wang, Qingyun Wu

In this paper, we propose AutoDefense, a multi-agent defense framework that filters harmful responses from LLMs.

Instruction Following LLM real-life tasks +1

On the Decision-Making Abilities in Role-Playing using Large Language Models

no code implementations29 Feb 2024 Chenglei Shen, Guofu Xie, Xiao Zhang, Jun Xu

Large language models (LLMs) are now increasingly utilized for role-playing tasks, especially in impersonating domain-specific experts, primarily through role-playing prompts.

Decision Making

RFI-DRUnet: Restoring dynamic spectra corrupted by radio frequency interference -- Application to pulsar observations

no code implementations21 Feb 2024 Xiao Zhang, Ismaël Cognard, Nicolas Dobigeon

Conversely, this work proposes to tackle RFI mitigation as a joint detection and restoration that allows parts of the dynamic spectrum affected by RFI to be not only identified but also recovered.

Astronomy Image Denoising

FinBen: A Holistic Financial Benchmark for Large Language Models

2 code implementations20 Feb 2024 Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu, Jiajia Huang, Xiao-Yang Liu, Alejandro Lopez-Lira, Benyou Wang, Yanzhao Lai, Hao Wang, Min Peng, Sophia Ananiadou, Jimin Huang

Our evaluation of 15 representative LLMs, including GPT-4, ChatGPT, and the latest Gemini, reveals several key findings: While LLMs excel in IE and textual analysis, they struggle with advanced reasoning and complex tasks like text generation and forecasting.

Question Answering RAG +2

FairSync: Ensuring Amortized Group Exposure in Distributed Recommendation Retrieval

1 code implementation16 Feb 2024 Chen Xu, Jun Xu, Yiming Ding, Xiao Zhang, Qi Qi

Specifically, FairSync resolves the issue by moving it to the dual space, where a central node aggregates historical fairness data into a vector and distributes it to all servers.

Distributed Optimization Fairness +2

Dólares or Dollars? Unraveling the Bilingual Prowess of Financial LLMs Between Spanish and English

1 code implementation12 Feb 2024 Xiao Zhang, Ruoyu Xiang, Chenhan Yuan, Duanyu Feng, Weiguang Han, Alejandro Lopez-Lira, Xiao-Yang Liu, Sophia Ananiadou, Min Peng, Jimin Huang, Qianqian Xie

We evaluate our model and existing LLMs using FLARE-ES, the first comprehensive bilingual evaluation benchmark with 21 datasets covering 9 tasks.

Learning Diverse Policies with Soft Self-Generated Guidance

no code implementations7 Feb 2024 GuoJian Wang, Faguo Wu, Xiao Zhang, Jianxiang Liu

However, existing methods often require these experiences to be successful and may overly exploit them, which can cause the agent to adopt suboptimal behaviors.

continuous-control Continuous Control +2

UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems

1 code implementation17 Jan 2024 Changshuo Zhang, Sirui Chen, Xiao Zhang, Sunhao Dai, Weijie Yu, Jun Xu

Reinforcement learning (RL) has gained traction for enhancing user long-term experiences in recommender systems by effectively exploring users' interests.

Diversity Fairness +2

Curator: Efficient Indexing for Multi-Tenant Vector Databases

no code implementations13 Jan 2024 Yicheng Jin, Yongji Wu, WenJun Hu, Bruce M. Maggs, Xiao Zhang, Danyang Zhuo

Vector databases have emerged as key enablers for bridging intelligent applications with unstructured data, providing generic search and management support for embedding vectors extracted from the raw unstructured data.

Clustering

Leveraging Frequency Domain Learning in 3D Vessel Segmentation

no code implementations11 Jan 2024 Xinyuan Wang, Chengwei Pan, Hongming Dai, Gangming Zhao, Jinpeng Li, Xiao Zhang, Yizhou Yu

In this study, we leverage Fourier domain learning as a substitute for multi-scale convolutional kernels in 3D hierarchical segmentation models, which can reduce computational expenses while preserving global receptive fields within the network.

Segmentation

Trajectory-Oriented Policy Optimization with Sparse Rewards

no code implementations4 Jan 2024 GuoJian Wang, Faguo Wu, Xiao Zhang

The proposed algorithm undergoes evaluation across extensive discrete and continuous control tasks with sparse and misleading rewards.

continuous-control Continuous Control +1

Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations

no code implementations30 Dec 2023 GuoJian Wang, Faguo Wu, Xiao Zhang, Tianyuan Chen, Zhiming Zheng

The sparsity of reward feedback remains a challenging problem in online deep reinforcement learning (DRL).

Deep Reinforcement Learning

Adaptive trajectory-constrained exploration strategy for deep reinforcement learning

1 code implementation27 Dec 2023 GuoJian Wang, Faguo Wu, Xiao Zhang, Ning Guo, Zhiming Zheng

Deep reinforcement learning (DRL) faces significant challenges in addressing the hard-exploration problems in tasks with sparse or deceptive rewards and large state spaces.

Deep Reinforcement Learning Multi-agent Reinforcement Learning +1

Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations

1 code implementation CVPR 2024 Xiao Zhang, David Yunis, Michael Maire

We present an approach for analyzing grouping information contained within a neural network's activations, permitting extraction of spatial layout and semantic segmentation from the behavior of large pre-trained vision models.

Semantic Segmentation

Neural Retrievers are Biased Towards LLM-Generated Content

2 code implementations31 Oct 2023 Sunhao Dai, Yuqi Zhou, Liang Pang, Weihao Liu, Xiaolin Hu, Yong liu, Xiao Zhang, Gang Wang, Jun Xu

Surprisingly, our findings indicate that neural retrieval models tend to rank LLM-generated documents higher.

Information Retrieval Retrieval +1

Integrating View Conditions for Image Synthesis

1 code implementation24 Oct 2023 Jinbin Bai, Zhen Dong, Aosong Feng, Xiao Zhang, Tian Ye, Kaicheng Zhou

In the field of image processing, applying intricate semantic modifications within existing images remains an enduring challenge.

Image Generation Object

Provably Robust Cost-Sensitive Learning via Randomized Smoothing

1 code implementation12 Oct 2023 Yuan Xin, Michael Backes, Xiao Zhang

Built upon the notion of cost-sensitive certified radius, we first illustrate how to adapt the standard certification algorithm of randomized smoothing to produce tight robustness certificates for any binary cost matrix, and then develop a robust training method to promote certified cost-sensitive robustness while maintaining the model's overall accuracy.

Transferable Availability Poisoning Attacks

1 code implementation8 Oct 2023 Yiyong Liu, Michael Backes, Xiao Zhang

We consider availability data poisoning attacks, where an adversary aims to degrade the overall test accuracy of a machine learning model by crafting small perturbations to its training data.

Contrastive Learning Data Poisoning +1

Generating Less Certain Adversarial Examples Improves Robust Generalization

2 code implementations6 Oct 2023 Minxing Zhang, Michael Backes, Xiao Zhang

Therefore, we hypothesize that generating less certain adversarial examples improves robust generalization, and propose a formal definition of adversarial certainty that captures the variance of the model's predicted logits on adversarial examples.

Structural Adversarial Objectives for Self-Supervised Representation Learning

1 code implementation30 Sep 2023 Xiao Zhang, Michael Maire

Within the framework of generative adversarial networks (GANs), we propose objectives that task the discriminator for self-supervised representation learning via additional structural modeling responsibilities.

Contrastive Learning Data Augmentation +1

PINF: Continuous Normalizing Flows for Physics-Constrained Deep Learning

no code implementations26 Sep 2023 Feng Liu, Faguo Wu, Xiao Zhang

The normalization constraint on probability density poses a significant challenge for solving the Fokker-Planck equation.

Deep Learning

Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Learning

1 code implementation23 Sep 2023 Zihua Si, Zhongxiang Sun, Jiale Chen, Guozhang Chen, Xiaoxue Zang, Kai Zheng, Yang song, Xiao Zhang, Jun Xu, Kun Gai

To obtain efficiency and effectiveness, this paper introduces a generative retrieval framework, namely SEATER, which learns SEmAntic Tree-structured item identifiERs via contrastive learning.

Contrastive Learning Recommendation Systems +2

HyperBandit: Contextual Bandit with Hypernewtork for Time-Varying User Preferences in Streaming Recommendation

no code implementations14 Aug 2023 Chenglei Shen, Xiao Zhang, Wei Wei, Jun Xu

In real-world streaming recommender systems, user preferences often dynamically change over time (e. g., a user may have different preferences during weekdays and weekends).

Recommendation Systems

SciMRC: Multi-perspective Scientific Machine Reading Comprehension

no code implementations25 Jun 2023 Xiao Zhang, Heqi Zheng, Yuxiang Nie, Heyan Huang, Xian-Ling Mao

However, the dataset has ignored the fact that different readers may have different levels of understanding of the text, and only includes single-perspective question-answer pairs, leading to a lack of consideration of different perspectives.

Machine Reading Comprehension

KuaiSAR: A Unified Search And Recommendation Dataset

no code implementations13 Jun 2023 Zhongxiang Sun, Zihua Si, Xiaoxue Zang, Dewei Leng, Yanan Niu, Yang song, Xiao Zhang, Jun Xu

We believe this dataset will serve as a catalyst for innovative research and bridge the gap between academia and industry in understanding the S&R services in practical, real-world applications.

Multi-Task Learning Recommendation Systems

Controllable Multi-Objective Re-ranking with Policy Hypernetworks

1 code implementation8 Jun 2023 Sirui Chen, YuAn Wang, Zijing Wen, Zhiyu Li, Changshuo Zhang, Xiao Zhang, Quan Lin, Cheng Zhu, Jun Xu

In this paper, we propose a framework called controllable multi-objective re-ranking (CMR) which incorporates a hypernetwork to generate parameters for a re-ranking model according to different preference weights.

Recommendation Systems Re-Ranking

PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance

2 code implementations8 Jun 2023 Qianqian Xie, Weiguang Han, Xiao Zhang, Yanzhao Lai, Min Peng, Alejandro Lopez-Lira, Jimin Huang

This paper introduces PIXIU, a comprehensive framework including the first financial LLM based on fine-tuning LLaMA with instruction data, the first instruction data with 136K data samples to support the fine-tuning, and an evaluation benchmark with 5 tasks and 9 datasets.

Conversational Question Answering Language Modeling +6

When Search Meets Recommendation: Learning Disentangled Search Representation for Recommendation

1 code implementation18 May 2023 Zihua Si, Zhongxiang Sun, Xiao Zhang, Jun Xu, Xiaoxue Zang, Yang song, Kun Gai, Ji-Rong Wen

In our paper, we propose a Search-Enhanced framework for the Sequential Recommendation (SESRec) that leverages users' search interests for recommendation, by disentangling similar and dissimilar representations within S&R behaviors.

Contrastive Learning Disentanglement +1

Uncovering ChatGPT's Capabilities in Recommender Systems

1 code implementation3 May 2023 Sunhao Dai, Ninglu Shao, Haiyuan Zhao, Weijie Yu, Zihua Si, Chen Xu, Zhongxiang Sun, Xiao Zhang, Jun Xu

The debut of ChatGPT has recently attracted the attention of the natural language processing (NLP) community and beyond.

Explainable Recommendation Information Retrieval +2

A Lightweight Recurrent Learning Network for Sustainable Compressed Sensing

1 code implementation23 Apr 2023 Yu Zhou, Yu Chen, Xiao Zhang, Pan Lai, Lei Huang, Jianmin Jiang

While the initial reconstruction sub-network has a hierarchical structure to progressively recover the image, reducing the number of parameters, the residual reconstruction sub-network facilitates recurrent residual feature extraction via recurrent learning to perform both feature fusion and deep reconstructions across different scales.

Transition System Representation of Boolean Control Networks

no code implementations22 Apr 2023 Daizhan Cheng, Xiao Zhang, Zhengping Ji

The first kind of representation is state-based, which converts a BCN into a TS with either distinct control or non-distinct control.

Aggregated (Bi-)Simulation of Finite Valued Networks

no code implementations25 Mar 2023 Zhengping Ji, Xiao Zhang, Daizhan Cheng

Then the overall network can be approximated by the quotient systems of each blocks, which is called the aggregated simulation.

P-MMF: Provider Max-min Fairness Re-ranking in Recommender System

1 code implementation12 Mar 2023 Chen Xu, Sirui Chen, Jun Xu, Weiran Shen, Xiao Zhang, Gang Wang, Zhenghua Dong

In this paper, we proposed an online re-ranking model named Provider Max-min Fairness Re-ranking (P-MMF) to tackle the problem.

Fairness Recommendation Systems +1

Semi-Tensor Product of Hypermatrices with Application to Compound Hypermatrices

no code implementations11 Mar 2023 Daizhan Cheng, Xiao Zhang, Zhengping Ji

Some basic properties of the STP of matrices are extended to the STP of hypermatrices.

Analysis of Discrete-Time Switched Linear Systems under Logic Dynamic Switchings

no code implementations23 Nov 2022 Xiao Zhang, Min Meng, Zhengping Ji

The control properties of discrete-time switched linear systems (SLS) with switching signals generated by logical dynamic systems are studied using the semi-tensor product (STP) approach.

PseudoAugment: Learning to Use Unlabeled Data for Data Augmentation in Point Clouds

no code implementations24 Oct 2022 Zhaoqi Leng, Shuyang Cheng, Benjamin Caine, Weiyue Wang, Xiao Zhang, Jonathon Shlens, Mingxing Tan, Dragomir Anguelov

To alleviate the cost of hyperparameter tuning and iterative pseudo labeling, we develop a population-based data augmentation framework for 3D detection, named AutoPseudoAugment.

Data Augmentation Pseudo Label

Law Article-Enhanced Legal Case Matching: a Causal Learning Approach

1 code implementation20 Oct 2022 Zhongxiang Sun, Jun Xu, Xiao Zhang, Zhenhua Dong, Ji-Rong Wen

We show that the framework is model-agnostic, and a number of legal case matching models can be applied as the underlying models.

Semantic Text Matching Text Matching

ET5: A Novel End-to-end Framework for Conversational Machine Reading Comprehension

1 code implementation COLING 2022 Xiao Zhang, Heyan Huang, Zewen Chi, Xian-Ling Mao

Conversational machine reading comprehension (CMRC) aims to assist computers to understand an natural language text and thereafter engage in a multi-turn conversation to answer questions related to the text.

Decision Making Machine Reading Comprehension

MICO: Selective Search with Mutual Information Co-training

1 code implementation COLING 2022 Zhanyu Wang, Xiao Zhang, Hyokun Yun, Choon Hui Teo, Trishul Chilimbi

In contrast to traditional exhaustive search, selective search first clusters documents into several groups before all the documents are searched exhaustively by a query, to limit the search executed within one group or only a few groups.

Retrieval

Invariant and Dual Invariant Subspaces of $k$-valued Networks

no code implementations1 Sep 2022 Daizhan Cheng, HongSheng Qi, Xiao Zhang, Zhengping Ji

Finally, the relationship between state invariant subspace and dual invariant subspace of a network is investigated.

Handling Data Heterogeneity in Federated Learning via Knowledge Distillation and Fusion

1 code implementation23 Jul 2022 Xu Zhou, Xinyu Lei, Cong Yang, Yichun Shi, Xiao Zhang, Jingwen Shi

The key idea in FedKF is to let the server return the global knowledge to be fused with the local knowledge in each training round so that the local model can be regularized towards the global optima.

Data-free Knowledge Distillation Fairness +2

Feature Forgetting in Continual Representation Learning

no code implementations26 May 2022 Xiao Zhang, Dejing Dou, Ji Wu

To study the feature forgetting problem, we create a synthetic dataset to identify and visualize the prevalence of feature forgetting in neural networks.

Continual Learning Representation Learning

Cross-Lingual Phrase Retrieval

1 code implementation ACL 2022 Heqi Zheng, Xiao Zhang, Zewen Chi, Heyan Huang, Tan Yan, Tian Lan, Wei Wei, Xian-Ling Mao

In this paper, we propose XPR, a cross-lingual phrase retriever that extracts phrase representations from unlabeled example sentences.

Retrieval Sentence

Reinforcement Re-ranking with 2D Grid-based Recommendation Panels

no code implementations11 Apr 2022 Sirui Chen, Xiao Zhang, Xu Chen, Zhiyu Li, YuAn Wang, Quan Lin, Jun Xu

Then, it defines \emph{the MDP discrete time steps as the ranks in the initial ranking list, and the actions as the prediction of the user-item preference and the selection of the slots}.

Recommendation Systems Re-Ranking

A Model-Agnostic Causal Learning Framework for Recommendation using Search Data

1 code implementation9 Feb 2022 Zihua Si, Xueran Han, Xiao Zhang, Jun Xu, Yue Yin, Yang song, Ji-Rong Wen

In this paper, we propose a model-agnostic framework named IV4Rec that can effectively decompose the embedding vectors into these two parts, hence enhancing recommendation results.

Recommendation Systems

Combining Reinforcement Learning and Inverse Reinforcement Learning for Asset Allocation Recommendations

no code implementations6 Jan 2022 Igor Halperin, Jiayu Liu, Xiao Zhang

We suggest a simple practical method to combine the human and artificial intelligence to both learn best investment practices of fund managers, and provide recommendations to improve them.

reinforcement-learning Reinforcement Learning +1

Hidden Order of Boolean Networks

no code implementations25 Nov 2021 Xiao Zhang, Zhengping Ji, Daizhan Cheng

It is a common belief that the order of a Boolean network is mainly determined by its attractors, including fixed points and cycles.

Understanding Intrinsic Robustness Using Label Uncertainty

1 code implementation ICLR 2022 Xiao Zhang, David Evans

A fundamental question in adversarial machine learning is whether a robust classifier exists for a given task.

Adversarial Robustness Classification +1

Practical Assessment of Generalization Performance Robustness for Deep Networks via Contrastive Examples

no code implementations20 Jun 2021 Xuanyu Wu, Xuhong LI, Haoyi Xiong, Xiao Zhang, Siyu Huang, Dejing Dou

Incorporating with a set of randomized strategies for well-designed data transformations over the training set, ContRE adopts classification errors and Fisher ratios on the generated contrastive examples to assess and analyze the generalization performance of deep models in complement with a testing set.

Contrastive Learning

Refining Pseudo Labels with Clustering Consensus over Generations for Unsupervised Object Re-identification

1 code implementation CVPR 2021 Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li

Unsupervised object re-identification targets at learning discriminative representations for object retrieval without any annotations.

Clustering Pseudo Label +1

Optimization Variance: Exploring Generalization Properties of DNNs

1 code implementation3 Jun 2021 Xiao Zhang, Dongrui Wu, Haoyi Xiong, Bo Dai

Unlike the conventional wisdom in statistical learning theory, the test error of a deep neural network (DNN) often demonstrates double descent: as the model complexity increases, it first follows a classical U-shaped curve and then shows a second descent.

Diversity Learning Theory

Multi-Grained Knowledge Distillation for Named Entity Recognition

no code implementations NAACL 2021 Xuan Zhou, Xiao Zhang, Chenyang Tao, Junya Chen, Bing Xu, Wei Wang, Jing Xiao

To maximally assimilate knowledge into the student model, we propose a multi-grained distillation scheme, which integrates cross entropy involved in conditional random field (CRF) and fuzzy learning. To validate the effectiveness of our proposal, we conducted a comprehensive evaluation on five NER benchmarks, reporting cross-the-board performance gains relative to competing prior-arts.

Knowledge Distillation named-entity-recognition +2

DCAP: Deep Cross Attentional Product Network for User Response Prediction

1 code implementation18 May 2021 Zekai Chen, Fangtian Zhong, Zhumin Chen, Xiao Zhang, Robert Pless, Xiuzhen Cheng

Prior studies in predicting user response leveraged the feature interactions by enhancing feature vectors with products of features to model second-order or high-order cross features, either explicitly or implicitly.

Recommendation Systems