Search Results for author: Xing Zhang

Found 20 papers, 6 papers with code

VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model

1 code implementation29 Nov 2023 Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Zuxuan Wu, Hang Xu, Yu-Gang Jiang

Identity-consistent video generation seeks to synthesize videos that are guided by both textual prompts and reference images of entities.

Denoising Image to Video Generation +1

Communication Efficiency Optimization of Federated Learning for Computing and Network Convergence of 6G Networks

no code implementations28 Nov 2023 Yizhuo Cai, Bo Lei, Qianying Zhao, Jing Peng, Min Wei, Yushun Zhang, Xing Zhang

In this paper, to improve the communication efficiency of federated learning in complex networks, we study the communication efficiency optimization of federated learning for computing and network convergence of 6G networks, methods that gives decisions on its training process for different network conditions and arithmetic power of participating devices in federated learning.

Federated Learning

Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models

no code implementations25 Oct 2023 Tianyi Lu, Xing Zhang, Jiaxi Gu, Hang Xu, Renjing Pei, Songcen Xu, Zuxuan Wu

In this way, temporal consistency can be kept with video LDM while high-fidelity from the image LDM can also be exploited.

Denoising Video Editing

Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation

no code implementations7 Sep 2023 Jiaxi Gu, Shicong Wang, Haoyu Zhao, Tianyi Lu, Xing Zhang, Zuxuan Wu, Songcen Xu, Wei zhang, Yu-Gang Jiang, Hang Xu

Conditioned on an initial video clip with a small number of frames, additional frames are iteratively generated by reusing the original latent features and following the previous diffusion process.

Action Recognition Denoising +3

DocDiff: Document Enhancement via Residual Diffusion Models

1 code implementation6 May 2023 Zongyuan Yang, Baolin Liu, Yongping Xiong, Lan Yi, Guibin Wu, Xiaojun Tang, Ziqi Liu, Junjie Zhou, Xing Zhang

Removing degradation from document images not only improves their visual quality and readability, but also enhances the performance of numerous automated document analysis and recognition tasks.

Deblurring Denoising +1

Hardware Implementation of Task-based Quantization in Multi-user Signal Recovery

no code implementations27 Jan 2023 Xing Zhang, Haiyang Zhang, Nimrod Glazer, Oded Cohen, Eliya Reznitskiy, Shlomi Savariego, Moshe Namer, Yonina C. Eldar

In this work, we apply task-based quantization to multi-user signal recovery and present a hardware prototype implementation.


Near-Field Sparse Channel Representation and Estimation in 6G Wireless Communications

no code implementations27 Dec 2022 Xing Zhang, Haiyang Zhang, Yonina C. Eldar

In this case, the spherical wave assumption which takes into account both the user angle and distance is more accurate than the conventional planar one that is only related to the user angle.

Dictionary Learning

Toward Robust Diagnosis: A Contour Attention Preserving Adversarial Defense for COVID-19 Detection

1 code implementation30 Nov 2022 Kun Xiang, Xing Zhang, Jinwen She, Jinpeng Liu, Haohan Wang, Shiqi Deng, Shancheng Jiang

As the COVID-19 pandemic puts pressure on healthcare systems worldwide, the computed tomography image based AI diagnostic system has become a sustainable solution for early diagnosis.

Adversarial Defense Adversarial Robustness

BiTAT: Neural Network Binarization with Task-dependent Aggregated Transformation

no code implementations4 Jul 2022 Geon Park, Jaehong Yoon, Haiyang Zhang, Xing Zhang, Sung Ju Hwang, Yonina C. Eldar

Neural network quantization aims to transform high-precision weights and activations of a given neural network into low-precision weights/activations for reduced memory usage and computation, while preserving the performance of the original model.

Binarization Quantization

Nonlinear ICA Using Volume-Preserving Transformations

no code implementations ICLR 2022 Xiaojiang Yang, Yi Wang, Jiacheng Sun, Xing Zhang, Shifeng Zhang, Zhenguo Li, Junchi Yan

Nonlinear ICA is a fundamental problem in machine learning, aiming to identify the underlying independent components (sources) from data which is assumed to be a nonlinear function (mixing function) of these sources.

VideoLT: Large-scale Long-tailed Video Recognition

1 code implementation ICCV 2021 Xing Zhang, Zuxuan Wu, Zejia Weng, Huazhu Fu, Jingjing Chen, Yu-Gang Jiang, Larry Davis

In this paper, we introduce VideoLT, a large-scale long-tailed video recognition dataset, as a step toward real-world video recognition.

Image Classification Video Recognition

Hierarchical Neural Architecture Search via Operator Clustering

1 code implementation26 Sep 2019 Guilin Li, Xing Zhang, Zitong Wang, Matthias Tan, Jiashi Feng, Zhenguo Li, Tong Zhang

Recently, the efficiency of automatic neural architecture design has been significantly improved by gradient-based search methods such as DARTS.

Clustering Neural Architecture Search

Formulating Camera-Adaptive Color Constancy as a Few-shot Meta-Learning Problem

no code implementations28 Nov 2018 Steven McDonagh, Sarah Parisot, Fengwei Zhou, Xing Zhang, Ales Leonardis, Zhenguo Li, Gregory Slabaugh

In this work, we propose a new approach that affords fast adaptation to previously unseen cameras, and robustness to changes in capture device by leveraging annotated samples across different cameras and datasets.

Few-Shot Camera-Adaptive Color Constancy Meta-Learning

Non-local NetVLAD Encoding for Video Classification

no code implementations29 Sep 2018 Yongyi Tang, Xing Zhang, Jingwen Wang, Shaoxiang Chen, Lin Ma, Yu-Gang Jiang

This paper describes our solution for the 2$^\text{nd}$ YouTube-8M video understanding challenge organized by Google AI.

Classification General Classification +4

Social Computing for Mobile Big Data in Wireless Networks

no code implementations30 Sep 2016 Xing Zhang, Zhenglei Yi, Zhi Yan, Geyong Min, Wenbo Wang, Sabita Maharjan, Yan Zhang

Mobile big data contains vast statistical features in various dimensions, including spatial, temporal, and the underlying social domain.


Cannot find the paper you are looking for? You can Submit a new open access paper.