Search Results for author: Xu Zhang

Found 116 papers, 54 papers with code

Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion

no code implementations13 May 2025 Anle Ke, Xu Zhang, Tong Chen, Ming Lu, Chao Zhou, Jiawen Gu, Zhan Ma

Existing multimodal large model-based image compression frameworks often rely on a fragmented integration of semantic retrieval, latent compression, and generative models, resulting in suboptimal performance in both reconstruction fidelity and coding efficiency.

Image Compression Retrieval +1

Histomorphology-driven multi-instance learning for breast cancer WSI classification

no code implementations23 Mar 2025 Baizhi Wang, Rui Yan, Wenxin Ma, Xu Zhang, Yuhao Wang, Xiaolong Li, Yunjie Gu, Zihang Jiang, S. Kevin Zhou

With the incorporation of histomorphological information, our framework strengthens the model's ability to capture key and fine-grained pathological patterns, thereby enhancing WSI classification performance.

Classification Diagnostic

Enabling Versatile Controls for Video Diffusion Models

1 code implementation21 Mar 2025 Xu Zhang, Hao Zhou, Haoming Qin, Xiaobin Lu, Jiaxing Yan, Guanzhong Wang, Zeyu Chen, Yi Liu

Despite substantial progress in text-to-video generation, achieving precise and flexible control over fine-grained spatiotemporal attributes remains a significant unresolved challenge in video generation research.

Text-to-Video Generation Video Generation

AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP

1 code implementation9 Mar 2025 Wenxin Ma, Xu Zhang, Qingsong Yao, Fenghe Tang, Chenxu Wu, Yingtai Li, Rui Yan, Zihang Jiang, S. Kevin Zhou

To address this problem, we propose Anomaly-Aware CLIP (AA-CLIP), which enhances CLIP's anomaly discrimination ability in both text and visual spaces while preserving its generalization capability.

Anomaly Detection Anomaly Localization +2

Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach

no code implementations5 Feb 2025 Xu Zhang, Kaidi Xu, Ziqing Hu, Ren Wang

To push the boundaries of robustness and accuracy, we propose a novel joint training strategy JTDMoE for the dual-model.

Adversarial Robustness Mixture-of-Experts

UniUIR: Considering Underwater Image Restoration as An All-in-One Learner

no code implementations22 Jan 2025 Xu Zhang, huan zhang, Guoli Wang, Qian Zhang, Lefei Zhang, Bo Du

Existing underwater image restoration (UIR) methods generally only handle color distortion or jointly address color and haze issues, but they often overlook the more complex degradations that can occur in underwater scenes.

All Depth Estimation +4

A Low-cost and Ultra-lightweight Binary Neural Network for Traffic Signal Recognition

no code implementations14 Jan 2025 Mingke Xiao, Yue Su, Liang Yu, Guanglong Qu, Yutong Jia, Yukuan Chang, Xu Zhang

Herein, we propose an ultra-lightweight binary neural network (BNN) model designed for hardware deployment, and conduct image classification research based on the German Traffic Sign Recognition Benchmark (GTSRB) dataset.

Autonomous Driving Image Classification +1

MobileNetV2: A lightweight classification model for home-based sleep apnea screening

1 code implementation28 Dec 2024 Hui Pan, Yanxuan Yu, Jilun Ye, Xu Zhang

For sleep stage classification, in UCDDB dataset, the ROC-AUC exceeded 0. 85 across all stages, with recall for Sleep reaching 0. 906 and specificity for REM and Wake states at 0. 956 and 0. 937, respectively.

Specificity

FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning

1 code implementation19 Nov 2024 Qingsong Lv, Jiasheng Sun, Sheng Zhou, Xu Zhang, Liangcheng Li, Yun Gao, Sun Qiao, Jie Song, Jiajun Bu

To reduce computational overhead while maintaining model performance, model pruning techniques have been proposed.

Computational Efficiency

QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou

no code implementations18 Nov 2024 Xinchen Luo, Jiangxia Cao, Tianyu Sun, Jinkai Yu, Rui Huang, Wei Yuan, Hezheng Lin, Yichen Zheng, Shiyao Wang, Qigen Hu, Changqing Qiu, JiaQi Zhang, Xu Zhang, Zhiheng Yan, Jingming Zhang, Simin Zhang, Mingxing Wen, Zhaojie Liu, Kun Gai, Guorui Zhou

In recent years, with the significant evolution of multi-modal large models, many recommender researchers realized the potential of multi-modal information for user interest modeling.

Multi-modal Recommendation

Joint multi-dimensional dynamic attention and transformer for general image restoration

1 code implementation12 Nov 2024 huan zhang, Xu Zhang, Nian Cai, Jianglei Di, Yun Zhang

Outdoor images often suffer from severe degradation due to rain, haze, and noise, impairing image quality and challenging high-level tasks.

Deblurring Decoder +3

Similarity and Dissimilarity Guided Co-association Matrix Construction for Ensemble Clustering

1 code implementation1 Nov 2024 Xu Zhang, Yuheng Jia, Mofei Song, Ran Wang

Finally, the adversarial relationship between the similarity matrix and the dissimilarity matrix is utilized to construct a promoted CA matrix for ensemble clustering.

Clustering

EMWaveNet: Physically Explainable Neural Network Based on Electromagnetic Propagation for SAR Target Recognition

no code implementations13 Oct 2024 Zhuoxuan Li, Xu Zhang, Shumeng Yu, Haipeng Wang

Deep learning technologies have significantly improved performance in the field of synthetic aperture radar (SAR) image target recognition compared to traditional methods.

De-aliasing

All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation

1 code implementation29 Sep 2024 Xu Zhang, Peiyao Guo, Ming Lu, Zhan Ma

Experimental results show that MPA achieves performance comparable to state-of-the-art methods in both task-specific and multi-objective optimization across human viewing and machine analysis tasks.

All Data Compression +4

Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration

1 code implementation28 Aug 2024 Xu Zhang, Jiaqi Ma, Guoli Wang, Qian Zhang, huan zhang, Lefei Zhang

Existing All-in-One image restoration methods often fail to perceive degradation types and severity levels simultaneously, overlooking the importance of fine-grained quality perception.

All Image Restoration +1

RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models

1 code implementation27 Aug 2024 Junyao Ge, Xu Zhang, Yang Zheng, Kaitai Guo, Jimin Liang

Abundant, well-annotated multimodal data in remote sensing are pivotal for aligning complex visual remote sensing (RS) scenes with human language, enabling the development of specialized vision language models across diverse RS interpretation tasks.

Descriptive Language Modeling +2

Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion

1 code implementation25 Aug 2024 Xu Zhang, Zhipeng Xie, Haiyang Yu, Qitong Wang, Peng Wang, Wei Wang

Based on this observation, we introduce the Collaborative Decision Making (CDM) module, which fuses the multiple classifier heads to enhance the inference performance of adaptive deep networks.

Decision Making Image Classification

Multi-Agent Continuous Control with Generative Flow Networks

1 code implementation13 Aug 2024 Shuang Luo, Yinchuan Li, Shunyu Liu, Xu Zhang, Yunfeng Shao, Chao Wu

Generative Flow Networks (GFlowNets) aim to generate diverse trajectories from a distribution in which the final states of the trajectories are proportional to the reward, serving as a powerful alternative to reinforcement learning for exploratory control tasks.

continuous-control Continuous Control

3D-UGCN: A Unified Graph Convolutional Network for Robust 3D Human Pose Estimation from Monocular RGB Images

no code implementations23 Jul 2024 Jie Zhao, Jianing Li, Weihan Chen, Wentong Wang, Pengfei Yuan, Xu Zhang, Deshu Peng

Human pose estimation remains a multifaceted challenge in computer vision, pivotal across diverse domains such as behavior recognition, human-computer interaction, and pedestrian tracking.

3D Human Pose Estimation

Probing many-body Bell correlation depth with superconducting qubits

no code implementations25 Jun 2024 Ke Wang, Weikang Li, Shibo Xu, Mengyao Hu, Jiachen Chen, Yaozu Wu, Chuanyu Zhang, Feitong Jin, Xuhao Zhu, Yu Gao, Ziqi Tan, Aosai Zhang, Ning Wang, Yiren Zou, TingTing Li, Fanhao Shen, Jiarun Zhong, Zehang Bao, Zitian Zhu, Zixuan Song, Jinfeng Deng, Hang Dong, Xu Zhang, Pengfei Zhang, Wenjie Jiang, Zhide Lu, Zheng-Zhi Sun, Hekang Li, Qiujiang Guo, Zhen Wang, Patrick Emonts, Jordi Tura, Chao Song, H. Wang, Dong-Ling Deng

As an illustrating example, we variationally prepare the low-energy state of a two-dimensional honeycomb model with 73 qubits and certify its Bell correlations by measuring an energy that surpasses the corresponding classical bound with up to 48 standard deviations.

MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency

no code implementations19 Jun 2024 Junzhe Zhang, Huixuan Zhang, Xunjian Yin, Baizhou Huang, Xu Zhang, Xinyu Hu, Xiaojun Wan

Our benchmark facilitates independent correction of misreading and misrecognition errors by editing the corresponding knowledge component.

knowledge editing

ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions

no code implementations13 Jun 2024 Xu Zhang, Xunjian Yin, Xiaojun Wan

While substantial advancements have been made in developing large language models (LLMs), achieving control over their behavior can be difficult.

Anomaly Detection Utilizing a Riemann Metric for Robust Myoelectric Pattern Recognition

no code implementations12 Jun 2024 ZongYe Hu, Ge Gao, Xiang Chen, Xu Zhang

Traditional myoelectric pattern recognition (MPR) systems excel within controlled laboratory environments but they are interfered when confronted with anomaly or novel motions not encountered during the training phase.

Anomaly Detection Motion Detection

Fast networked data selection via distributed smoothed quantile estimation

1 code implementation4 Jun 2024 Xu Zhang, Marcos M. Vasconcelos

Leveraging the piecewise linearity of the local objective functions in quantile estimation, we characterize the iteration complexity required to achieve top-$k$ selection, a challenging task due to the lack of strong convexity.

DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models

1 code implementation27 May 2024 Hengkang Wang, Xu Zhang, Taihui Li, Yuxiang Wan, Tiancong Chen, Ju Sun

However, such interleaving methods struggle to produce final results that look like natural objects of interest (i. e., manifold feasibility) and fit the measurement (i. e., measurement feasibility), especially for nonlinear IPs.

DFGNN: Dual-frequency Graph Neural Network for Sign-aware Feedback

no code implementations24 May 2024 Yiqing Wu, Ruobing Xie, Zhao Zhang, Xu Zhang, Fuzhen Zhuang, Leyu Lin, Zhanhui Kang, Yongjun Xu

Based on the two observations, we propose a novel model that models positive and negative feedback from a frequency filter perspective called Dual-frequency Graph Neural Network for Sign-aware Recommendation (DFGNN).

Graph Neural Network

StyleSeg V2: Towards Robust One-shot Segmentation of Brain Tissue via Optimization-free Registration Error Perception

no code implementations6 May 2024 Zhiwei Wang, Xiaoyu Zeng, Chongwei Wu, Jinxin Lv, Xu Zhang, Wei Fang, Qiang Li

One-shot segmentation of brain tissue requires training registration-segmentation (reg-seg) dual-model iteratively, where reg-model aims to provide pseudo masks of unlabeled images for seg-model by warping a carefully-labeled atlas.

One-Shot Segmentation

ID-centric Pre-training for Recommendation

no code implementations6 May 2024 Yiqing Wu, Ruobing Xie, Zhao Zhang, Fuzhen Zhuang, Xu Zhang, Leyu Lin, Zhanhui Kang, Yongjun Xu

Specifically, in pre-training stage, besides the ID-based sequential model for recommendation, we also build a Cross-domain ID-matcher (CDIM) learned by both behavioral and modality information.

Language Modelling Sequential Recommendation

DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly

no code implementations1 Apr 2024 Fenggen Yu, Yiming Qian, Xu Zhang, Francisca Gil-Ureta, Brian Jackson, Eric Bennett, Hao Zhang

We present a differentiable rendering framework to learn structured 3D abstractions in the form of primitive assemblies from sparse RGB images capturing a 3D object.

NeRF Test-time Adaptation

Projected Gradient Descent for Spectral Compressed Sensing via Symmetric Hankel Factorization

1 code implementation14 Mar 2024 Jinsheng Li, Wei Cui, Xu Zhang

Current spectral compressed sensing methods via Hankel matrix completion employ symmetric factorization to demonstrate the low-rank property of the Hankel matrix.

compressed sensing Matrix Completion

Enhancing Jailbreak Attacks with Diversity Guidance

no code implementations1 Mar 2024 Xu Zhang, Dinghao Jing, Xiaojun Wan

Therefore, we propose DPP-based Stochastic Trigger Searching (DSTS), a new optimization algorithm for jailbreak attacks.

Diversity Language Modelling +2

Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation

1 code implementation18 Feb 2024 Xunjian Yin, Xu Zhang, Jie Ruan, Xiaojun Wan

In recent years, substantial advancements have been made in the development of large language models, achieving remarkable performance across diverse tasks.

Benchmarking Language Modeling +3

UFO: A UI-Focused Agent for Windows OS Interaction

1 code implementation8 Feb 2024 Chaoyun Zhang, Liqun Li, Shilin He, Xu Zhang, Bo Qiao, Si Qin, Minghua Ma, Yu Kang, QIngwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision.

Navigate

Retrosynthesis prediction enhanced by in-silico reaction data augmentation

no code implementations31 Jan 2024 Xu Zhang, Yiming Mo, Wenguan Wang, Yi Yang

As a response, we exploit easy-to-access unpaired data (i. e., one component of product-reactant(s) pair) for generating in-silico paired data to facilitate model training.

Data Augmentation Prediction +1

UniVG: Towards UNIfied-modal Video Generation

no code implementations17 Jan 2024 Ludan Ruan, Lei Tian, Chuanwei Huang, Xu Zhang, Xinyan Xiao

This cannot fully meet the needs of real-world application scenarios, as users are likely to input images and text conditions in a flexible manner, either individually or in combination.

Video Generation

Plug-in Diffusion Model for Sequential Recommendation

1 code implementation5 Jan 2024 Haokai Ma, Ruobing Xie, Lei Meng, Xin Chen, Xu Zhang, Leyu Lin, Zhanhui Kang

To address this issue, this paper presents a novel Plug-in Diffusion Model for Recommendation (PDRec) framework, which employs the diffusion model as a flexible plugin to jointly take full advantage of the diffusion-generating user preferences on all items.

Image Generation model +2

Learning Surface Scattering Parameters From SAR Images Using Differentiable Ray Tracing

no code implementations2 Jan 2024 Jiangtao Wei, Yixiang Luomei, Xu Zhang, Feng Xu

Furthermore, a differentiable ray tracing (DRT) engine based on SAR images was constructed for CSVBSDF surface scattering parameter learning.

Negative Pre-aware for Noisy Cross-modal Matching

2 code implementations10 Dec 2023 Xu Zhang, Hao Li, Mang Ye

Since clean samples are easier distinguished by GMM with increasing noise, the memory bank can still maintain high quality at a high noise ratio.

Cross-modal retrieval with noisy correspondence Image-text matching +3

TaskWeaver: A Code-First Agent Framework

1 code implementation29 Nov 2023 Bo Qiao, Liqun Li, Xu Zhang, Shilin He, Yu Kang, Chaoyun Zhang, Fangkai Yang, Hang Dong, Jue Zhang, Lu Wang, Minghua Ma, Pu Zhao, Si Qin, Xiaoting Qin, Chao Du, Yong Xu, QIngwei Lin, Saravan Rajmohan, Dongmei Zhang

TaskWeaver provides support for rich data structures, flexible plugin usage, and dynamic plugin selection, and leverages LLM coding capabilities for complex logic.

Natural Language Understanding

Semantically Grounded QFormer for Efficient Vision Language Understanding

no code implementations13 Nov 2023 Moulik Choraria, Xinbo Wu, Sourya Basu, Nitesh Sekhar, Yue Wu, Xu Zhang, Prateek Singhal, Lav R. Varshney

Consequently, instead of using QFormer latents as inputs to the LLM, we alter the framework by using the latents to directly condition the LLM latent space for image-to-text generation.

Diversity Image to text +2

An Empirical Study of Instruction-tuning Large Language Models in Chinese

1 code implementation11 Oct 2023 Qingyi Si, Tong Wang, Zheng Lin, Xu Zhang, Yanan Cao, Weiping Wang

This paper will release a powerful Chinese LLMs that is comparable to ChatGLM.

Multiagent Reinforcement Learning with an Attention Mechanism for Improving Energy Efficiency in LoRa Networks

no code implementations16 Sep 2023 Xu Zhang, Ziqi Lin, Shimin Gong, Bo Gu, Dusit Niyato

Long Range (LoRa) wireless technology, characterized by low power consumption and a long communication range, is regarded as one of the enabling technologies for the Industrial Internet of Things (IIoT).

MulMarker: a comprehensive framework for identifying multi-gene prognostic signatures

1 code implementation22 Aug 2023 Xu Zhang, Lei Chen

Prognostic signatures play an important role in clinical research, offering insights into the potential health outcomes of patients and guiding therapeutic decisions.

Chatbot

FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory

no code implementations ICCV 2023 Anwesan Pal, Sahil Wadhwa, Ayush Jaiswal, Xu Zhang, Yue Wu, Rakesh Chada, Pradeep Natarajan, Henrik I. Christensen

Extensive evaluation results show that our proposed method outperforms the previous state-of-the-art algorithm by 50. 5%, on Multi-turn FashionIQ -- the only existing multi-turn fashion dataset currently, in addition to having a relative improvement of 12. 6% on Multi-turn Shoes -- an extension of the single-turn Shoes dataset that we created in this work.

Image Retrieval Retrieval

TVPR: Text-to-Video Person Retrieval and a New Benchmark

no code implementations14 Jul 2023 Xu Zhang, Fan Ni, Guan-Nan Dong, Aichun Zhu, Jianhui Wu, Mingcheng Ni, Hui Liu

To the best of our knowledge, MFGF is the first successful attempt to use video for text-based person retrieval task and has achieved state-of-the-art performance on TVPReid dataset.

Person Retrieval Retrieval +3

Improved NL2SQL based on Multi-layer Expert Network

no code implementations30 Jun 2023 Chenduo Hao, Xu Zhang

The Natural Language to SQL (NL2SQL) technique is used to convert natural language queries into executable SQL statements.

Classification Natural Language Queries +2

Feature Representation Learning for NL2SQL Generation Based on Coupling and Decoupling

no code implementations30 Jun 2023 Chenduo Hao, Xu Zhang, Chuanbao Gao, Deyu Zhou

To address this issue, we propose the Clause Feature Correlation Decoupling and Coupling (CFCDC) model, which uses a feature representation decoupling method to separate the SELECT and WHERE clauses at the parameter level.

Feature Correlation Multi-Task Learning +3

Meta Generative Flow Networks with Personalization for Task-Specific Adaptation

no code implementations16 Jun 2023 Xinyuan Ji, Xu Zhang, Wei Xi, Haozhi Wang, Olga Gadyatskaya, Yinchuan Li

Multi-task reinforcement learning and meta-reinforcement learning have been developed to quickly adapt to new tasks, but they tend to focus on tasks with higher rewards and more frequent occurrences, leading to poor performance on tasks with sparse rewards.

Meta-Learning Meta Reinforcement Learning +2

PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation

2 code implementations11 Jun 2023 Xu Zhang, Kailun Yang, Jiacheng Lin, Jin Yuan, Zhiyong Li, Shutao Li

To tackle this problem, this paper proposes a simple yet effective Probabilistic Visual Prompt Unified Transformer (PVPUFormer) for interactive image segmentation, which allows users to flexibly input diverse visual prompts with the probabilistic prompt encoding and feature post-processing to excavate sufficient and robust prompt features for performance boosting.

Image Segmentation Interactive Segmentation +2

GRACE: Loss-Resilient Real-Time Video through Neural Codecs

no code implementations21 May 2023 Yihua Cheng, Ziyi Zhang, Hanchen Li, Anton Arapin, Yue Zhang, Qizheng Zhang, YuHan Liu, Xu Zhang, Francis Y. Yan, Amrita Mazumdar, Nick Feamster, Junchen Jiang

In real-time video communication, retransmitting lost packets over high-latency networks is not viable due to strict latency requirements.

Decoder

Image-text Retrieval via Preserving Main Semantics of Vision

1 code implementation20 Apr 2023 Xu Zhang, Xinzheng Niu, Philippe Fournier-Viger, Xudong Dai

To address this issue, this paper presents a semantic optimization approach, implemented as a Visual Semantic Loss (VSL), to assist the model in focusing on an image's main content.

Cross-Modal Retrieval Image-text Retrieval +1

Triple Sequence Learning for Cross-domain Recommendation

no code implementations11 Apr 2023 Haokai Ma, Ruobing Xie, Lei Meng, Xin Chen, Xu Zhang, Leyu Lin, Jie zhou

To address this issue, we present a novel framework, termed triple sequence learning for cross-domain recommendation (Tri-CDR), which jointly models the source, target, and mixed behavior sequences to highlight the global and target preference and precisely model the triple correlation in CDR.

Contrastive Learning

Federated Learning via Variational Bayesian Inference: Personalization, Sparsity and Clustering

no code implementations8 Mar 2023 Xu Zhang, Wenpeng Li, Yunfeng Shao, Yinchuan Li

data, we propose a clustered Bayesian FL model named cFedbayes by learning different prior distributions for different clients.

Bayesian Inference Clustering +1

Robust one-shot estimation over shared networks in the presence of denial-of-service attacks

1 code implementation28 Feb 2023 Xu Zhang, Marcos M. Vasconcelos

We consider the following scenario: multiple pairs of agents communicating strategically over shared communication networks in the presence of a jammer who may launch a denial-of-service.

Blocking

Online Decomposition of Surface Electromyogram into Individual Motor Unit Activities Using Progressive FastICA Peel-off

no code implementations5 Jan 2023 Haowen Zhao, Xu Zhang, Maoqi Chen, Ping Zhou

For decomposing experimental SEMG data, the proposed online method was able to extract an average of 12. 00 +- 3. 46 MUs per trial, with a matching rate of 90. 38% compared with results from the expert-guided offline decomposition.

Surveillance Face Anti-spoofing

no code implementations3 Jan 2023 Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Chenxu Zhao, Xu Zhang, Stan Z. Li, Zhen Lei

In order to promote relevant research and fill this gap in the community, we collect a large-scale Surveillance High-Fidelity Mask (SuHiFiMask) dataset captured under 40 surveillance scenes, which has 101 subjects from different age groups with 232 3D attacks (high-fidelity masks), 200 2D attacks (posters, portraits, and screens), and 2 adversarial attacks.

Contrastive Learning Face Anti-Spoofing +2

Efficient Visual Computing with Camera RAW Snapshots

1 code implementation15 Dec 2022 Zhihao LI, Ming Lu, Xu Zhang, Xin Feng, M. Salman Asif, Zhan Ma

Conventional cameras capture image irradiance on a sensor and convert it to RGB images using an image signal processor (ISP).

Autonomous Driving Image Compression +2

Avoiding spurious correlations via logit correction

1 code implementation2 Dec 2022 Sheng Liu, Xu Zhang, Nitesh Sekhar, Yue Wu, Prateek Singhal, Carlos Fernandez-Granda

Empirical studies suggest that machine learning models trained with empirical risk minimization (ERM) often rely on attributes that may be spuriously correlated with the class labels.

Attribute

An Empirical Study of Automatic Post-Editing

no code implementations16 Sep 2022 Xu Zhang, Xiaojun Wan

In view of the importance of data augmentation in APE, we separately study the impact of the construction method of artificial corpora and artificial data domain on the performance of APE models.

Automatic Post-Editing Data Augmentation

CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose

1 code implementation CVPR 2023 Xu Zhang, Wen Wang, Zhe Chen, Yufei Xu, Jing Zhang, DaCheng Tao

Motivated by the progress of visual-language research, we propose that pre-trained language models (e. g., CLIP) can facilitate animal pose estimation by providing rich prior knowledge for describing animal keypoints in text.

Animal Pose Estimation Contrastive Learning

Personalized Federated Learning via Variational Bayesian Inference

1 code implementation16 Jun 2022 Xu Zhang, Yinchuan Li, Wenpeng Li, Kaiyang Guo, Yunfeng Shao

Federated learning faces huge challenges from model overfitting due to the lack of data and statistical diversity among clients.

Bayesian Inference Diversity +2

Personalized Prompt for Sequential Recommendation

no code implementations19 May 2022 Yiqing Wu, Ruobing Xie, Yongchun Zhu, Fuzhen Zhuang, Xu Zhang, Leyu Lin, Qing He

Specifically, we build the personalized soft prefix prompt via a prompt generator based on user profiles and enable a sufficient training of prompts via a prompt-oriented contrastive learning with both prompt- and behavior-based augmentations.

Contrastive Learning Sequential Recommendation

Selective Fairness in Recommendation via Prompts

1 code implementation10 May 2022 Yiqing Wu, Ruobing Xie, Yongchun Zhu, Fuzhen Zhuang, Xiang Ao, Xu Zhang, Leyu Lin, Qing He

In this work, we define the selective fairness task, where users can flexibly choose which sensitive attributes should the recommendation model be bias-free.

Attribute Fairness +1

Experimental quantum adversarial learning with programmable superconducting qubits

no code implementations4 Apr 2022 Wenhui Ren, Weikang Li, Shibo Xu, Ke Wang, Wenjie Jiang, Feitong Jin, Xuhao Zhu, Jiachen Chen, Zixuan Song, Pengfei Zhang, Hang Dong, Xu Zhang, Jinfeng Deng, Yu Gao, Chuanyu Zhang, Yaozu Wu, Bing Zhang, Qiujiang Guo, Hekang Li, Zhen Wang, Jacob Biamonte, Chao Song, Dong-Ling Deng, H. Wang

Our results reveal experimentally a crucial vulnerability aspect of quantum learning systems under adversarial scenarios and demonstrate an effective defense strategy against adversarial attacks, which provide a valuable guide for quantum artificial intelligence applications with both near-term and future quantum devices.

BIG-bench Machine Learning Quantum Machine Learning

Robust remote estimation over the collision channel in the presence of an intelligent jammer

1 code implementation1 Apr 2022 Xu Zhang, Marcos M. Vasconcelos

We consider a sensor-receiver pair communicating over a wireless channel in the presence of a jammer who may launch a denial-of-service attack.

Multi-view Multi-behavior Contrastive Learning in Recommendation

1 code implementation20 Mar 2022 Yiqing Wu, Ruobing Xie, Yongchun Zhu, Xiang Ao, Xin Chen, Xu Zhang, Fuzhen Zhuang, Leyu Lin, Qing He

We argue that MBR models should: (1) model the coarse-grained commonalities between different behaviors of a user, (2) consider both individual sequence view and global graph view in multi-behavior modeling, and (3) capture the fine-grained differences between multiple behaviors of a user.

Contrastive Learning

SMDT: Selective Memory-Augmented Neural Document Translation

no code implementations5 Jan 2022 Xu Zhang, Jian Yang, Haoyang Huang, Shuming Ma, Dongdong Zhang, Jinlong Li, Furu Wei

Existing document-level neural machine translation (NMT) models have sufficiently explored different context settings to provide guidance for target generation.

Document Level Machine Translation Document Translation +4

Large-Scale Video Panoptic Segmentation in the Wild: A Benchmark

1 code implementation CVPR 2022 Jiaxu Miao, Xiaohan Wang, Yu Wu, Wei Li, Xu Zhang, Yunchao Wei, Yi Yang

In contrast, our large-scale VIdeo Panoptic Segmentation in the Wild (VIPSeg) dataset provides 3, 536 videos and 84, 750 frames with pixel-level panoptic annotations, covering a wide range of real-world scenarios and categories.

Segmentation Video Panoptic Segmentation

Topological and Algebraic Structures of Atanassov's Intuitionistic Fuzzy-Values Space

no code implementations17 Nov 2021 Xinxing Wu, Tao Wang, Qian Liu, Peide Liu, Guanrong Chen, Xu Zhang

By introducing a new operator for IFVs via the linear order based on a score function and an accuracy function, we show that such an operator is a strong negation on IFVs.

Negation

V2iFi: in-Vehicle Vital Sign Monitoring via Compact RF Sensing

no code implementations28 Oct 2021 Tianyue Zheng, Zhe Chen, Chao Cai, Jun Luo, Xu Zhang

Given the significant amount of time people spend in vehicles, health issues under driving condition have become a major concern.

Heart Rate Variability

Personalized Transfer of User Preferences for Cross-domain Recommendation

1 code implementation21 Oct 2021 Yongchun Zhu, Zhenwei Tang, Yudan Liu, Fuzhen Zhuang, Ruobing Xie, Xu Zhang, Leyu Lin, Qing He

Specifically, a meta network fed with users' characteristic embeddings is learned to generate personalized bridge functions to achieve personalized transfer of preferences for each user.

Recommendation Systems

Sparse Personalized Federated Learning

1 code implementation12 Jul 2021 Xiaofeng Liu, Yinchuan Li, Qing Wang, Xu Zhang, Yunfeng Shao, Yanhui Geng

By incorporating an approximated L1-norm and the correlation between client models and global model into standard FL loss function, the performance on statistical diversity data is improved and the communicational and computational loads required in the network are reduced compared with non-sparse FL.

Diversity Personalized Federated Learning

Learning to Expand Audience via Meta Hybrid Experts and Critics for Recommendation and Advertising

3 code implementations31 May 2021 Yongchun Zhu, Yudan Liu, Ruobing Xie, Fuzhen Zhuang, Xiaobo Hao, Kaikai Ge, Xu Zhang, Leyu Lin, Juan Cao

Besides, MetaHeac has been successfully deployed in WeChat for the promotion of both contents and advertisements, leading to great improvement in the quality of marketing.

Marketing Meta-Learning +1

Transfer-Meta Framework for Cross-domain Recommendation to Cold-Start Users

no code implementations11 May 2021 Yongchun Zhu, Kaikai Ge, Fuzhen Zhuang, Ruobing Xie, Dongbo Xi, Xu Zhang, Leyu Lin, Qing He

With the advantage of meta learning which has good generalization ability to novel tasks, we propose a transfer-meta framework for CDR (TMCDR) which has a transfer stage and a meta stage.

Meta-Learning Recommendation Systems

Optimal exit decision of venture capital under time-inconsistent preferences

no code implementations22 Mar 2021 Yanzhao Li, Ju'e Guo, Yongwu Li, Xu Zhang

Based on venture capitalists' understanding of future preferences, we consider four types of venture capitalists, namely time-consistent venture capitalists, venture capitalists who only realize critical time point inconsistency, naive venture capitalists and sophisticated venture capitalists, of which the latter three are time-inconsistent.

Understanding WeChat User Preferences and "Wow" Diffusion

1 code implementation4 Mar 2021 Fanjin Zhang, Jie Tang, Xueyi Liu, Zhenyu Hou, Yuxiao Dong, Jing Zhang, Xiao Liu, Ruobing Xie, Kai Zhuang, Xu Zhang, Leyu Lin, Philip S. Yu

"Top Stories" is a novel friend-enhanced recommendation engine in WeChat, in which users can read articles based on preferences of both their own and their friends.

Graph Representation Learning Social and Information Networks

UPRec: User-Aware Pre-training for Recommender Systems

no code implementations22 Feb 2021 Chaojun Xiao, Ruobing Xie, Yuan YAO, Zhiyuan Liu, Maosong Sun, Xu Zhang, Leyu Lin

Existing sequential recommendation methods rely on large amounts of training data and usually suffer from the data sparsity problem.

Self-Supervised Learning Sequential Recommendation

FGraDA: A Dataset and Benchmark for Fine-Grained Domain Adaptation in Machine Translation

1 code implementation LREC 2022 Wenhao Zhu, ShuJian Huang, Tong Pu, Pingxuan Huang, Xu Zhang, Jian Yu, Wei Chen, Yanfeng Wang, Jiajun Chen

Previous research for adapting a general neural machine translation (NMT) model into a specific domain usually neglects the diversity in translation within the same domain, which is a core problem for domain adaptation in real-world scenarios.

Autonomous Vehicles Diversity +4

Generalized Relation Learning with Semantic Correlation Awareness for Link Prediction

no code implementations22 Dec 2020 Yao Zhang, Xu Zhang, Jun Wang, Hongru Liang, Wenqiang Lei, Zhe Sun, Adam Jatowt, Zhenglu Yang

The current methods for the link prediction taskhavetwonaturalproblems:1)the relation distributions in KGs are usually unbalanced, and 2) there are many unseen relations that occur in practical situations.

Knowledge Graphs Link Prediction +2

Learning to Build User-tag Profile in Recommendation System (UTPM)

1 code implementation ACM International Conference on Information and Knowledge Management 2020 Su Yan, Xin Chen, Ran Huo, Xu Zhang, Leyu Lin

User profiling is one of the most important components in recommendation systems, where a user is profiled using demographic (e. g. gender, age, and location) and user behavior information (e. g. browsing and search history).

Multi-Label Classification MUlTI-LABEL-ClASSIFICATION +2

MLBF-Net: A Multi-Lead-Branch Fusion Network for Multi-Class Arrhythmia Classification Using 12-Lead ECG

no code implementations17 Aug 2020 Jing Zhang, Deng Liang, Aiping Liu, Min Gao, Xiang Chen, Xu Zhang, Xun Chen

MLBF-Net is composed of three components: 1) multiple lead-specific branches for learning the diversity of multi-lead ECG; 2) cross-lead features fusion by concatenating the output feature maps of all branches for learning the integrity of multi-lead ECG; 3) multi-loss co-optimization for all the individual branches and the concatenated network.

Arrhythmia Detection Diversity

Deep Learning Guided Building Reconstruction from Satellite Imagery-derived Point Clouds

no code implementations19 May 2020 Bo Xu, Xu Zhang, Zhixin Li, Matt Leotta, Shih-Fu Chang, Jie Shan

For points that belong to the same roof shape, a multi-cue, hierarchical RANSAC approach is proposed for efficient and reliable segmenting and reconstructing the building point cloud.

3D Reconstruction

Learning Contextualized Sentence Representations for Document-Level Neural Machine Translation

no code implementations30 Mar 2020 Pei Zhang, Xu Zhang, Wei Chen, Jian Yu, Yan-Feng Wang, Deyi Xiong

In this paper, we propose a new framework to model cross-sentence dependencies by training neural machine translation (NMT) to predict both the target translation and surrounding sentences of a source sentence.

Document Level Machine Translation Machine Translation +4

Detecting and Simulating Artifacts in GAN Fake Images

1 code implementation15 Jul 2019 Xu Zhang, Svebor Karaman, Shih-Fu Chang

By using the simulated images to train a spectrum based classifier, even without seeing the fake images produced by the targeted GAN model during training, our approach achieves state-of-the-art performances on detecting fake images generated by popular GAN models such as CycleGAN.

GAN image forensics

Real-time Attention Based Look-alike Model for Recommender System

1 code implementation12 Jun 2019 Yudan Liu, Kaikai Ge, Xu Zhang, Leyu Lin

Recently, deep learning models play more and more important roles in contents recommender systems.

Clustering Recommendation Systems +1

Super Interaction Neural Network

1 code implementation29 May 2019 Yang Yao, Xu Zhang, Baile Xu, Furao Shen, Jian Zhao

Recent studies have demonstrated that the convolutional networks heavily rely on the quality and quantity of generated features.

Label Mapping Neural Networks with Response Consolidation for Class Incremental Learning

no code implementations20 May 2019 Xu Zhang, Yang Yao, Baile Xu, Lekun Mao, Furao Shen, Jian Zhao, QIngwei Lin

In this paper, it is the first time to discuss the difficulty without support of old classes in class incremental learning, which is called as softmax suppression problem.

class-incremental learning Class Incremental Learning +2

Unsupervised Embedding Learning via Invariant and Spreading Instance Feature

1 code implementation CVPR 2019 Mang Ye, Xu Zhang, Pong C. Yuen, Shih-Fu Chang

This paper studies the unsupervised embedding learning problem, which requires an effective similarity measurement between samples in low-dimensional embedding space.

Data Augmentation

Exact Controllability for a Refined Stochastic Wave Equation

1 code implementation18 Jan 2019 Qi Lü, Xu Zhang

By means of a new global Carleman estimate, we establish the exact controllability of our stochastic wave equation with three controls.

Optimization and Control 93B05, 60H15, 93B07, 35B45

FARSA: Fully Automated Roadway Safety Assessment

1 code implementation17 Jan 2019 Weilian Song, Scott Workman, Armin Hadzic, Xu Zhang, Eric Green, Mei Chen, Reginald Souleyrette, Nathan Jacobs

An emerging approach for conducting such assessments in the United States is through the US Road Assessment Program (usRAP), which rates roads from highest risk (1 star) to lowest (5 stars).

Assessment of central serous chorioretinopathy (CSC) depicted on color fundus photographs using deep Learning

no code implementations14 Jan 2019 Yi Zhen, Hang Chen, Xu Zhang, Meng Liu, Xin Meng, Jian Zhang, Jiantao Pu

To investigate whether and to what extent central serous chorioretinopathy (CSC) depicted on color fundus photographs can be assessed using deep learning technology.

Deep Learning

Optimal Feedback for Stochastic Linear Quadratic Control and Backward Stochastic Riccati Equations in Infinite Dimensions

1 code implementation4 Jan 2019 Qi Lu, Xu Zhang

It is a longstanding unsolved problem to characterize the optimal feedbacks for general SLQs (i. e., stochastic linear quadratic control problems) with random coefficients in infinite dimensions; while the same problem but in finite dimensions was just addressed in a recent work [36].

Optimization and Control Probability 60H15, 93E20, 60H25, 49J30

Second Order Optimality Conditions for Optimal Control Problems of Stochastic Evolution Equations

1 code implementation18 Nov 2018 Qi Lu, Haisen Zhang, Xu Zhang

In this paper, we establish some second order necessary/sufficient optimality conditions for optimal control problems of stochastic evolution equations in infinite dimensions.

Optimization and Control Primary 93E20, Secondary, 60H07, 60H15

SCMA based resource management of D2D communications for maximum sum-revenue

no code implementations12 Oct 2018 Linglin Kong, Li Ling, Xu Zhang

This problem is NP-hard, so we propose a heuristic algorithm based on semi-definite relaxation (SDR) programming to solve it.

Signal Processing

Heated-Up Softmax Embedding

1 code implementation ICLR 2019 Xu Zhang, Felix Xinnan Yu, Svebor Karaman, Wei zhang, Shih-Fu Chang

Metric learning aims at learning a distance which is consistent with the semantic meaning of the samples.

Metric Learning

Tropical Principal Component Analysis and its Application to Phylogenetics

1 code implementation7 Oct 2017 Ruriko Yoshida, Leon Zhang, Xu Zhang

Principal component analysis is a widely-used method for the dimensionality reduction of a given data set in a high-dimensional Euclidean space.

Combinatorics Populations and Evolution

Learning Spread-out Local Feature Descriptors

2 code implementations ICCV 2017 Xu Zhang, Felix X. Yu, Sanjiv Kumar, Shih-Fu Chang

We propose a simple, yet powerful regularization technique that can be used to significantly improve both the pairwise and triplet losses in learning local feature descriptors.

Triplet

Learning discriminative and transformation covariant local feature detectors.

1 code implementation Computer Vision and Pattern Recognition 2017 Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang

Specifically, we extend the covariant constraint proposed by Lenc and Vedaldi [8] by defining the concepts of “standard patch” and “canonical feature” and leverage these to train a novel robust covariant detector.

Image Retrieval

Learning Discriminative and Transformation Covariant Local Feature Detectors

1 code implementation CVPR 2017 Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang

Specifically, we extend the covariant constraint proposed by Lenc and Vedaldi by defining the concepts of "standard patch" and "canonical feature" and leverage these to train a novel robust covariant detector.

Image Retrieval

Spatio-temporal Aware Non-negative Component Representation for Action Recognition

no code implementations27 Aug 2016 Jianhong Wang, Tian Lan, Xu Zhang, Limin Luo

This paper presents a novel mid-level representation for action recognition, named spatio-temporal aware non-negative component representation (STANNCR).

Action Recognition Temporal Action Localization

Fast Orthogonal Projection Based on Kronecker Product

no code implementations ICCV 2015 Xu Zhang, Felix X. Yu, Ruiqi Guo, Sanjiv Kumar, Shengjin Wang, Shi-Fu Chang

We propose a family of structured matrices to speed up orthogonal projections for high-dimensional data commonly seen in computer vision applications.

Image Retrieval Quantization

Deep Transfer Network: Unsupervised Domain Adaptation

no code implementations2 Mar 2015 Xu Zhang, Felix Xinnan Yu, Shih-Fu Chang, Shengjin Wang

In this paper, we propose a new domain adaptation framework named Deep Transfer Network (DTN), where the highly flexible deep neural networks are used to implement such a distribution matching process.

Unsupervised Domain Adaptation

Efficient classification using parallel and scalable compressed model and Its application on intrusion detection

no code implementations14 May 2014 Tieming Chen, Xu Zhang, Shichao Jin, Okhee Kim

In order to achieve high efficiency of classification in intrusion detection, a compressed model is proposed in this paper which combines horizontal compression with vertical compression.

Attribute Classification +3

Cannot find the paper you are looking for? You can Submit a new open access paper.