Search Results for author: Xing Zhang

Found 31 papers, 11 papers with code

AnimeDL-2M: Million-Scale AI-Generated Anime Image Detection and Localization in Diffusion Era

no code implementations15 Apr 2025 Chenyang Zhu, Xing Zhang, Yuyang Sun, Ching-Chun Chang, Isao Echizen

Recent advances in image generation, particularly diffusion models, have significantly lowered the barrier for creating sophisticated forgeries, making image manipulation detection and localization (IMDL) increasingly challenging.

Image Manipulation Image Manipulation Detection

Doing Less for More: Consumer Search and Undertreatment in Credence Service Markets

no code implementations27 Mar 2025 Xiaoyan Xu, Weishi Lim, Xing Zhang, Jeff Cai

This would arise when the cost of revisiting the initial provider is lower than that of searching for a new one.

Diagnostic

MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering

no code implementations23 Feb 2025 Caixiong Li, Xiongwei Zhao, Jinhang Zhang, Xing Zhang, Zhou Wu

Open-vocabulary detection (OVD) is a challenging task to detect and classify objects from an unrestricted set of categories, including those unseen during training.

Object object-detection +3

Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation

no code implementations30 Oct 2024 Wei Dong, Yuan Sun, Yiting Yang, Xing Zhang, Zhijun Lin, Qingsen Yan, Haokui Zhang, Peng Wang, Yang Yang, HengTao Shen

A common strategy for Parameter-Efficient Fine-Tuning (PEFT) of pre-trained Vision Transformers (ViTs) involves adapting the model to downstream tasks by learning a low-rank adaptation matrix.

parameter-efficient fine-tuning

Logic Synthesis with Generative Deep Neural Networks

no code implementations7 Jun 2024 Xihan Li, Xing Li, Lei Chen, Xing Zhang, Mingxuan Yuan, Jun Wang

While deep learning has achieved significant success in various domains, its application to logic circuit design has been limited due to complex constraints and strict feasibility requirement.

CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization

no code implementations8 May 2024 Zheyan Qu, Lu Yin, Zitong Yu, Wenbo Wang, Xing Zhang

Moreover, considering the alignment of LLM responses with user needs, a novel method for discrete prompt optimization based on LLM-as-Judge is introduced.

Diversity Knowledge Distillation +5

Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach

1 code implementation CVPR 2024 Wei Dong, Xing Zhang, Bihui Chen, Dawei Yan, Zhijun Lin, Qingsen Yan, Peng Wang, Yang Yang

Parameter-efficient fine-tuning for pre-trained Vision Transformers aims to adeptly tailor a model to downstream tasks by learning a minimal set of new adaptation parameters while preserving the frozen majority of pre-trained parameters.

Image Classification parameter-efficient fine-tuning

Circuit Transformer: A Transformer That Preserves Logical Equivalence

1 code implementation14 Mar 2024 Xihan Li, Xing Li, Lei Chen, Xing Zhang, Mingxuan Yuan, Jun Wang

In this study, we introduce a generative neural model, the "Circuit Transformer", which eliminates such wrong predictions and produces logic circuits strictly equivalent to given Boolean functions.

Hallucination

Computation Rate Maximization for Wireless Powered Edge Computing With Multi-User Cooperation

1 code implementation22 Jan 2024 Yang Li, Xing Zhang, Bo Lei, Qianying Zhao, Min Wei, Zheyan Qu, Wenbo Wang

Simulation results show that the performance of the proposed algorithms is comparable to that of the exhaustive search method, and the deep learning-based algorithm significantly reduces the execution time of the algorithm.

Edge-computing

MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing

1 code implementation29 Nov 2023 Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Qingping Zheng, Zuxuan Wu, Hang Xu, Yu-Gang Jiang

The high-fidelity alignment is developed to further enhance the fidelity of both video generation and editing by taking the subject image as an additional model input.

Denoising Image to Video Generation +1

Communication Efficiency Optimization of Federated Learning for Computing and Network Convergence of 6G Networks

no code implementations28 Nov 2023 Yizhuo Cai, Bo Lei, Qianying Zhao, Jing Peng, Min Wei, Yushun Zhang, Xing Zhang

In this paper, to improve the communication efficiency of federated learning in complex networks, we study the communication efficiency optimization of federated learning for computing and network convergence of 6G networks, methods that gives decisions on its training process for different network conditions and arithmetic power of participating devices in federated learning.

Federated Learning

Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models

1 code implementation25 Oct 2023 Tianyi Lu, Xing Zhang, Jiaxi Gu, Renjing Pei, Songcen Xu, Xingjun Ma, Hang Xu, Zuxuan Wu

This paper is the first to reveal that T2I and T2V LDMs can complement each other in terms of structure and temporal consistency, ultimately generating high-quality videos.

Denoising Video Editing

Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation

1 code implementation7 Sep 2023 Jiaxi Gu, Shicong Wang, Haoyu Zhao, Tianyi Lu, Xing Zhang, Zuxuan Wu, Songcen Xu, Wei zhang, Yu-Gang Jiang, Hang Xu

Conditioned on an initial video clip with a small number of frames, additional frames are iteratively generated by reusing the original latent features and following the previous diffusion process.

Action Recognition Decoder +4

DocDiff: Document Enhancement via Residual Diffusion Models

2 code implementations6 May 2023 Zongyuan Yang, Baolin Liu, Yongping Xiong, Lan Yi, Guibin Wu, Xiaojun Tang, Ziqi Liu, Junjie Zhou, Xing Zhang

Removing degradation from document images not only improves their visual quality and readability, but also enhances the performance of numerous automated document analysis and recognition tasks.

Deblurring Denoising +1

Hardware Implementation of Task-based Quantization in Multi-user Signal Recovery

no code implementations27 Jan 2023 Xing Zhang, Haiyang Zhang, Nimrod Glazer, Oded Cohen, Eliya Reznitskiy, Shlomi Savariego, Moshe Namer, Yonina C. Eldar

In this work, we apply task-based quantization to multi-user signal recovery and present a hardware prototype implementation.

Quantization

Near-Field Sparse Channel Representation and Estimation in 6G Wireless Communications

no code implementations27 Dec 2022 Xing Zhang, Haiyang Zhang, Yonina C. Eldar

In this case, the spherical wave assumption which takes into account both the user angle and distance is more accurate than the conventional planar one that is only related to the user angle.

Dictionary Learning

Toward Robust Diagnosis: A Contour Attention Preserving Adversarial Defense for COVID-19 Detection

1 code implementation30 Nov 2022 Kun Xiang, Xing Zhang, Jinwen She, Jinpeng Liu, Haohan Wang, Shiqi Deng, Shancheng Jiang

As the COVID-19 pandemic puts pressure on healthcare systems worldwide, the computed tomography image based AI diagnostic system has become a sustainable solution for early diagnosis.

Adversarial Defense Adversarial Robustness +1

BiTAT: Neural Network Binarization with Task-dependent Aggregated Transformation

no code implementations4 Jul 2022 Geon Park, Jaehong Yoon, Haiyang Zhang, Xing Zhang, Sung Ju Hwang, Yonina C. Eldar

Neural network quantization aims to transform high-precision weights and activations of a given neural network into low-precision weights/activations for reduced memory usage and computation, while preserving the performance of the original model.

Binarization Quantization

Nonlinear ICA Using Volume-Preserving Transformations

no code implementations ICLR 2022 Xiaojiang Yang, Yi Wang, Jiacheng Sun, Xing Zhang, Shifeng Zhang, Zhenguo Li, Junchi Yan

Nonlinear ICA is a fundamental problem in machine learning, aiming to identify the underlying independent components (sources) from data which is assumed to be a nonlinear function (mixing function) of these sources.

VideoLT: Large-scale Long-tailed Video Recognition

1 code implementation ICCV 2021 Xing Zhang, Zuxuan Wu, Zejia Weng, Huazhu Fu, Jingjing Chen, Yu-Gang Jiang, Larry Davis

In this paper, we introduce VideoLT, a large-scale long-tailed video recognition dataset, as a step toward real-world video recognition.

Image Classification Video Recognition

Hierarchical Neural Architecture Search via Operator Clustering

1 code implementation26 Sep 2019 Guilin Li, Xing Zhang, Zitong Wang, Matthias Tan, Jiashi Feng, Zhenguo Li, Tong Zhang

Recently, the efficiency of automatic neural architecture design has been significantly improved by gradient-based search methods such as DARTS.

Clustering Neural Architecture Search

Formulating Camera-Adaptive Color Constancy as a Few-shot Meta-Learning Problem

no code implementations28 Nov 2018 Steven McDonagh, Sarah Parisot, Fengwei Zhou, Xing Zhang, Ales Leonardis, Zhenguo Li, Gregory Slabaugh

In this work, we propose a new approach that affords fast adaptation to previously unseen cameras, and robustness to changes in capture device by leveraging annotated samples across different cameras and datasets.

Few-Shot Camera-Adaptive Color Constancy Meta-Learning

Non-local NetVLAD Encoding for Video Classification

no code implementations29 Sep 2018 Yongyi Tang, Xing Zhang, Jingwen Wang, Shaoxiang Chen, Lin Ma, Yu-Gang Jiang

This paper describes our solution for the 2$^\text{nd}$ YouTube-8M video understanding challenge organized by Google AI.

Classification General Classification +3

Social Computing for Mobile Big Data in Wireless Networks

no code implementations30 Sep 2016 Xing Zhang, Zhenglei Yi, Zhi Yan, Geyong Min, Wenbo Wang, Sabita Maharjan, Yan Zhang

Mobile big data contains vast statistical features in various dimensions, including spatial, temporal, and the underlying social domain.

Marketing

Cannot find the paper you are looking for? You can Submit a new open access paper.