Search Results for author: Shuai Yang

Found 83 papers, 43 papers with code

CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation

no code implementations 24 Jun 2025 Hao Li, Shuai Yang, Yilun Chen, Yang Tian, Xiaoda Yang, Xinyi Chen, Hanqing Wang, Tai Wang, Feng Zhao, Dahua Lin, Jiangmiao Pang

We propose CronusVLA, a unified framework that extends single-frame VLA models to the multi-frame paradigm through an efficient post-training stage.

Chunking Vision-Language-Action

Video World Models with Long-term Spatial Memory

no code implementations 5 Jun 2025 Tong Wu, Shuai Yang, Ryan Po, Yinghao Xu, Ziwei Liu, Dahua Lin, Gordon Wetzstein

Emerging world models autoregressively generate video frames in response to actions, such as camera movements and text prompts, among other control signals.

Training-Free Watermarking for Autoregressive Image Generation

1 code implementation 20 May 2025 Yu Tong, Zihao Pan, Shuai Yang, Kaiyang Zhou

However, existing generative watermarking methods are mainly designed for diffusion models while watermarking for autoregressive image generation models remains largely underexplored.

Image Generation

Lessons from Deploying Learning-based CSI Localization on a Large-Scale ISAC Platform

no code implementations 24 Apr 2025 Tianyu Zhang, Dongheng Zhang, Ruixu Geng, Xuecheng Xie, Shuai Yang, Yan Chen

In recent years, Channel State Information (CSI), recognized for its fine-grained spatial characteristics, has attracted increasing attention in WiFi-based indoor localization.

Indoor Localization Integrated sensing and communication +1

WORLDMEM: Long-term Consistent World Simulation with Memory

no code implementations 16 Apr 2025 Zeqi Xiao, Yushi Lan, Yifan Zhou, Wenqi Ouyang, Shuai Yang, Yanhong Zeng, Xingang Pan

World simulation has gained increasing popularity due to its ability to model virtual environments and predict the consequences of actions.

Scene Generation

Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation

1 code implementation 13 Apr 2025 Yongchao Feng, Yajie Liu, Shuai Yang, Wenrui Cai, Jinqing Zhang, Qiqi Zhan, Ziyue Huang, Hongxi Yan, Qiao Wan, ChenGuang Liu, Junzhe Wang, Jiahui Lv, Ziqi Liu, Tengyuan Shi, Qingjie Liu, Yunhong Wang

In this work, we present the systematic review of VLM-based detection and segmentation, view VLM as the foundational model and conduct comprehensive evaluations across multiple downstream tasks for the first time: 1) The evaluation spans eight detection scenarios (closed-set detection, domain adaptation, crowded objects, etc.)

Domain Adaptation Language Modeling +3

HeteRAG: A Heterogeneous Retrieval-augmented Generation Framework with Decoupled Knowledge Representations

no code implementations 12 Apr 2025 Peiru Yang, Xintian Li, Zhiyang Hu, Jiapeng Wang, Jinhua Yin, Huili Wang, Lizhi He, Shuai Yang, Shangguang Wang, Yongfeng Huang, Tao Qi

The retrieval step benefits from comprehensive information to improve retrieval accuracy, whereas excessively long chunks may introduce redundant contextual information, thereby diminishing both the effectiveness and efficiency of the generation process.

RAG Retrieval +1

OmniCam: Unified Multimodal Video Generation via Camera Control

no code implementations 3 Apr 2025 Xiaoda Yang, Jiayang Xu, Kaixuan Luan, Xinyu Zhan, Hongshun Qiu, Shijun Shi, Hao Li, Shuai Yang, Li Zhang, Checheng Yu, Cewu Lu, Lixin Yang

Camera control, which achieves diverse visual effects by changing camera position and pose, has attracted widespread attention.

Video Generation

A Survey on Remote Sensing Foundation Models: From Vision to Multimodality

1 code implementation 28 Mar 2025 Ziyue Huang, Hongxi Yan, Qiqi Zhan, Shuai Yang, Mingming Zhang, Chenkai Zhang, Yiming Lei, Zeming Liu, Qingjie Liu, Yunhong Wang

This paper provides a comprehensive review of the state-of-the-art in vision and multimodal foundation models for remote sensing, focusing on their architecture, training methods, datasets and application scenarios.

Change Detection Land Cover Classification +1

Language-based Image Colorization: A Benchmark and Beyond

1 code implementation 19 Mar 2025 YiFan Li, Shuai Yang, Jiaying Liu

In view of the lack of a comprehensive review of language-based colorization literature, we conduct a thorough analysis and benchmarking.

Benchmarking Colorization +2

Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space

1 code implementation 12 Mar 2025 Yifan Zhou, Zeqi Xiao, Shuai Yang, Xingang Pan

Latent Diffusion Models (LDMs) are known to have an unstable generation process, where even small perturbations or shifts in the input noise can lead to significantly different outputs.

Image-to-Image Translation Video Editing

MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention

1 code implementation CVPR 2025 YuHan Wang, Fangzhou Hong, Shuai Yang, Liming Jiang, Wayne Wu, Chen Change Loy

In this paper, we explore human multiview diffusion models at the megapixel level and introduce a solution called mesh attention to enable training at 1024x1024 resolution.

3D Generation Image to 3D

Balanced Image Stylization with Style Matching Score

no code implementations 10 Mar 2025 Yuxin Jiang, Liming Jiang, Shuai Yang, Jia-Wei Liu, Ivor Tsang, Mike Zheng Shou

We present Style Matching Score (SMS), a novel optimization method for image stylization with diffusion models.

Image Stylization

PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model

1 code implementation CVPR 2025 Xiang Gao, Shuai Yang, Jiaying Liu

At the heart of our method is a plug-and-play phase transfer mechanism that dynamically and progressively transplants diffusion features' phase spectrum from the denoising process to reconstruct the reference image into the one to sample the generated illusion image, realizing deep fusion of the reference structural information and the textual semantic information in the diffusion model latent space.

Denoising Image Generation

OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images

no code implementations 8 Mar 2025 Ziyue Huang, Yongchao Feng, Shuai Yang, Ziqi Liu, Qingjie Liu, Yunhong Wang

However, existing OVD methods for remote sensing (RS) images are constrained by small-scale datasets and fail to address the unique challenges of remote sensing interpretation, including oriented object detection and the need for both high precision and real-time performance in diverse scenarios.

Object object-detection +3

Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support

1 code implementation 25 Feb 2025 Guoxin Wang, Minyu Gao, Shuai Yang, Ya Zhang, Lizhi He, Liang Huang, Hanlin Xiao, Yexuan Zhang, Wanyue Li, Lu Chen, Jintao Fei, Xin Li

Large language models (LLMs), particularly those with reasoning capabilities, have rapidly advanced in recent years, demonstrating significant potential across a wide range of applications.

Decision Making Diagnostic +3

Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space

no code implementations CVPR 2025 Yifan Zhou, Zeqi Xiao, Shuai Yang, Xingang Pan

Latent Diffusion Models (LDMs) are known to have an unstable generation process, where even small perturbations or shifts in the input noise can lead to significantly different outputs.

Image-to-Image Translation Video Editing

Imagine360: Immersive 360 Video Generation from Perspective Anchor

no code implementations 4 Dec 2024 Jing Tan, Shuai Yang, Tong Wu, Jingwen He, Yuwei Guo, Ziwei Liu, Dahua Lin

$360^\circ$ videos offer a hyper-immersive experience that allows the viewers to explore a dynamic scene from full 360 degrees.

Denoising Video Denoising +1

Trajectory Attention for Fine-grained Video Motion Control

no code implementations 28 Nov 2024 Zeqi Xiao, Wenqi Ouyang, Yifan Zhou, Shuai Yang, Lei Yang, Jianlou Si, Xingang Pan

This paper introduces trajectory attention, a novel approach that performs attention along available pixel trajectories for fine-grained camera motion control.

Inductive Bias Video Editing +1

GaussianAnything: Interactive Point Cloud Flow Matching For 3D Object Generation

no code implementations 12 Nov 2024 Yushi Lan, Shangchen Zhou, Zhaoyang Lyu, Fangzhou Hong, Shuai Yang, Bo Dai, Xingang Pan, Chen Change Loy

While 3D content generation has advanced significantly, existing methods still face challenges with input formats, latent space design, and output representations.

3D Generation Disentanglement

Unified Generative and Discriminative Training for Multi-modal Large Language Models

no code implementations 1 Nov 2024 Wei Chow, Juncheng Li, Qifan Yu, Kaihang Pan, Hao Fei, Zhiqi Ge, Shuai Yang, Siliang Tang, Hanwang Zhang, Qianru Sun

Discriminative training, exemplified by models like CLIP, excels in zero-shot image-text classification and retrieval, yet struggles with complex scenarios requiring fine-grained semantic differentiation.

Dynamic Time Warping Image-text Classification +6

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

no code implementations 23 Aug 2024 Shuai Yang, Jing Tan, Mengchen Zhang, Tong Wu, Yixuan Li, Gordon Wetzstein, Ziwei Liu, Dahua Lin

A desired virtual 3D scene should 1) exhibit omnidirectional view consistency, and 2) allow for free exploration in complex scene hierarchies.

Scene Generation

Deep Uncertainty-Based Explore for Index Construction and Retrieval in Recommendation System

no code implementations 22 Jul 2024 Xin Jiang, Kaiqiang Wang, Yinlong Wang, Fengchang Lv, Taiyang Peng, Shuai Yang, Xianteng Wu, Pengye Zhang, Shuo Yuan, Yifan Zeng

In recommendation systems, the relevance and novelty of the final results are selected through a cascade system of Matching -> Ranking -> Strategy.

Recommendation Systems Retrieval

SEED-Story: Multimodal Long Story Generation with Large Language Model

1 code implementation 11 Jul 2024 Shuai Yang, Yuying Ge, Yang Li, Yukang Chen, Yixiao Ge, Ying Shan, Yingcong Chen

We further propose a multimodal attention sink mechanism to enable the generation of stories with up to 25 sequences (only 10 for training) in a highly efficient autoregressive manner.

Image Generation Language Modeling +4

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations

1 code implementation 13 Jun 2024 Ruiyuan Lyu, Jingli Lin, Tai Wang, Shuai Yang, Xiaohan Mao, Yilun Chen, Runsen Xu, Haifeng Huang, Chenming Zhu, Dahua Lin, Jiangmiao Pang

With the emergence of LLMs and their integration with other data modalities, multi-modal 3D perception attracts more attention due to its connectivity to the physical world and makes rapid progress.

3D visual grounding Attribute +1

Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and Beyond

1 code implementation 5 Jun 2024 Jiahang Zhang, Lilang Lin, Shuai Yang, Jiaying Liu

Following the taxonomy of context-based, generative learning, and contrastive learning approaches, we make a thorough review and benchmark of existing works and shed light on the future possible directions.

Action Recognition Action Understanding +4

Video Diffusion Models are Training-free Motion Interpreter and Controller

1 code implementation 23 May 2024 Zeqi Xiao, Yifan Zhou, Shuai Yang, Xingang Pan

MOFT provides a distinct set of benefits, including the ability to encode comprehensive motion information with clear interpretability, extraction without the need for training, and generalizability across diverse architectures.

Video Generation

Grounded 3D-LLM with Referent Tokens

1 code implementation 16 May 2024 Yilun Chen, Shuai Yang, Haifeng Huang, Tai Wang, Runsen Xu, Ruiyuan Lyu, Dahua Lin, Jiangmiao Pang

To facilitate the use of referent tokens in subsequent language modeling, we provide a large-scale, automatically curated grounded scene-text dataset with over 1 million phrase-to-region correspondences and introduce Contrastive Language-Scene Pre-training (CLASP) to perform phrase-level scene-text alignment using this data.

Dense Captioning Diversity +7

FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

1 code implementation CVPR 2024 Shuai Yang, Yifan Zhou, Ziwei Liu, Chen Change Loy

In this paper, we introduce FRESCO, intra-frame correspondence alongside inter-frame correspondence to establish a more robust spatial-temporal constraint.

Translation valid

Forward Learning of Graph Neural Networks

1 code implementation 16 Mar 2024 Namyong Park, Xing Wang, Antoine Simoulin, Shuai Yang, Grey Yang, Ryan Rossi, Puja Trivedi, Nesreen Ahmed

To address these limitations, the forward-forward algorithm (FF) was recently proposed as an alternative to BP in the image classification domain, which trains NNs by performing two forward passes over positive and negative data.

Drug Discovery Graph Learning +3

Causal Multi-Label Feature Selection in Federated Setting

no code implementations 11 Mar 2024 Yukun Song, Dayuan Cao, Jiali Miao, Shuai Yang, Kui Yu

Multi-label feature selection serves as an effective means for dealing with high-dimensional multi-label data.

feature selection

3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors

1 code implementation 4 Mar 2024 Fangzhou Hong, Jiaxiang Tang, Ziang Cao, Min Shi, Tong Wu, Zhaoxi Chen, Shuai Yang, Tengfei Wang, Liang Pan, Dahua Lin, Ziwei Liu

Specifically, it is powered by a text-conditioned tri-plane latent diffusion model, which quickly generates coarse 3D samples for fast prototyping.

3D Generation Text to 3D +1

PRIME: Protect Your Videos From Malicious Editing

1 code implementation 2 Feb 2024 Guanlin Li, Shuai Yang, Jie Zhang, Tianwei Zhang

With the development of generative models, the quality of generated content keeps increasing.

A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion Model

1 code implementation 17 Jan 2024 Hao Yang, Jianxin Yuan, Shuai Yang, Linhe Xu, Shuo Yuan, Yifan Zeng

2) Prompt model is designed to generate individualized creatives for different user groups, which can further improve the diversity and quality.

Deep Ensemble Shape Calibration: Multi-Field Post-hoc Calibration in Online Advertising

1 code implementation 17 Jan 2024 Shuai Yang, Hao Yang, Zhuang Zou, Linhe Xu, Shuo Yuan, Yifan Zeng

These methods typically involve the training of calibrators using a validation set and subsequently applying these calibrators to correct the original estimated values during online inference.

Low-Rank Approximation for Sparse Attention in Multi-Modal LLMs

no code implementations CVPR 2024 Lin Song, Yukang Chen, Shuai Yang, Xiaohan Ding, Yixiao Ge, Ying-Cong Chen, Ying Shan

We empirically show that sparse attention not only reduces computational demands but also enhances model performance in both NLP and multi-modal tasks.

HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image

no code implementations 7 Dec 2023 Tong Wu, Zhibing Li, Shuai Yang, Pan Zhang, Xingang Pan, Jiaqi Wang, Dahua Lin, Ziwei Liu

Extensive experiments demonstrate the effectiveness of HyperDreamer in modeling region-aware materials with high-resolution textures and enabling user-friendly editing.

Semantic Segmentation

VideoBooth: Diffusion-based Video Generation with Image Prompts

no code implementations CVPR 2024 Yuming Jiang, Tianxing Wu, Shuai Yang, Chenyang Si, Dahua Lin, Yu Qiao, Chen Change Loy, Ziwei Liu

In this paper, we study the task of video generation with image prompts, which provide more accurate and direct content control beyond the text prompts.

Video Generation

Denoising Diffusion Step-aware Models

1 code implementation 5 Oct 2023 Shuai Yang, Yukang Chen, Luozhou Wang, Shu Liu, Yingcong Chen

Denoising Diffusion Probabilistic Models (DDPMs) have garnered popularity for data generation across various domains.

Denoising

DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields

1 code implementation 8 Sep 2023 Junzhe Zhang, Yushi Lan, Shuai Yang, Fangzhou Hong, Quan Wang, Chai Kiat Yeo, Ziwei Liu, Chen Change Loy

In this paper, we address the challenging problem of 3D toonification, which involves transferring the style of an artistic domain onto a target 3D face with stylized geometry and texture.

Decoder NeRF

Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation

1 code implementation ICCV 2023 Yuxin Jiang, Liming Jiang, Shuai Yang, Chen Change Loy

The challenges of this task lie in the complexity of the scenes, the unique features of anime style, and the lack of high-quality datasets to bridge the domain gap.

Image-to-Image Translation

GP-UNIT: Generative Prior for Versatile Unsupervised Image-to-Image Translation

1 code implementation 7 Jun 2023 Shuai Yang, Liming Jiang, Ziwei Liu, Chen Change Loy

In this paper, we introduce a novel versatile framework, Generative Prior-guided UNsupervised Image-to-image Translation (GP-UNIT), that improves the quality, applicability and controllability of the existing translation models.

Translation Unsupervised Image-To-Image Translation +1

Graph Exploration Matters: Improving both individual-level and system-level diversity in WeChat Feed Recommender

no code implementations 29 May 2023 Shuai Yang, Lixin Zhang, Feng Xia, Leyu Lin

Graph-based retrieval strategies are inevitably hijacked by heavy users and popular items, leading to the convergence of candidates for users and the lack of system-level diversity.

Diversity Recommendation Systems +2

Text2Performer: Text-Driven Human Video Generation

1 code implementation ICCV 2023 Yuming Jiang, Shuai Yang, Tong Liang Koh, Wayne Wu, Chen Change Loy, Ziwei Liu

In this work, we present Text2Performer to generate vivid human videos with articulated motions from texts.

Video Generation

Learning to Rank Normalized Entropy Curves with Differentiable Window Transformation

no code implementations 25 Jan 2023 Hanyang Liu, Shuai Yang, Feng Qi, Shuaiwen Wang

We also introduce a novel differentiable indexing method for the proposed adaptive curve transformation, which allows gradients with respect to the discrete indices to flow freely through the curve transformation layer, enabling the learned window sizes to be updated flexibly during training.

Learning-To-Rank Recommendation Systems

MGTAB: A Multi-Relational Graph-Based Twitter Account Detection Benchmark

1 code implementation 3 Jan 2023 Shuhao Shi, Kai Qiao, Jian Chen, Shuai Yang, Jie Yang, Baojie Song, Linyuan Wang, Bin Yan

However, in addition to low annotation quality, existing benchmarks generally have incomplete user relationships, suppressing graph-based account detection research.

Node Classification Stance Detection +1

DeformToon3D: Deformable Neural Radiance Fields for 3D Toonification

no code implementations ICCV 2023 Junzhe Zhang, Yushi Lan, Shuai Yang, Fangzhou Hong, Quan Wang, Chai Kiat Yeo, Ziwei Liu, Chen Change Loy

In this paper, we address the challenging problem of 3D toonification, which involves transferring the style of an artistic domain onto a target 3D face with stylized geometry and texture.

Decoder NeRF

Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion

no code implementations CVPR 2023 Yushi Lan, Xuyi Meng, Shuai Yang, Chen Change Loy, Bo Dai

In this paper, we study the challenging problem of 3D GAN inversion where a latent code is predicted given a single face image to faithfully recover its 3D shapes and detailed textures.

3D Face Reconstruction

VToonify: Controllable High-Resolution Portrait Video Style Transfer

1 code implementation 22 Sep 2022 Shuai Yang, Liming Jiang, Ziwei Liu, Chen Change Loy

Although a series of successful portrait image toonification models built upon the powerful StyleGAN have been proposed, these image-oriented methods have obvious limitations when applied to videos, such as the fixed frame size, the requirement of face alignment, missing non-facial details and temporal inconsistency.

Face Alignment Style Transfer +2

Text2Human: Text-Driven Controllable Human Image Generation

2 code implementations 31 May 2022 Yuming Jiang, Shuai Yang, Haonan Qiu, Wayne Wu, Chen Change Loy, Ziwei Liu

In this work, we present a text-driven controllable framework, Text2Human, for a high-quality and diverse human generation.

Diversity Human Parsing +2

Select and Calibrate the Low-confidence: Dual-Channel Consistency based Graph Convolutional Networks

no code implementations 8 May 2022 Shuhao Shi, Jian Chen, Kai Qiao, Shuai Yang, Linyuan Wang, Bin Yan

The Graph Convolutional Networks (GCNs) have achieved excellent results in node classification tasks, but the model's performance at low label rates is still unsatisfactory.

Node Classification

Unsupervised Image-to-Image Translation with Generative Prior

1 code implementation CVPR 2022 Shuai Yang, Liming Jiang, Ziwei Liu, Chen Change Loy

In this work, we present a novel framework, Generative Prior-guided UNsupervised Image-to-image Translation (GP-UNIT), to improve the overall quality and applicability of the translation algorithm.

Translation Unsupervised Image-To-Image Translation

Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer

1 code implementation CVPR 2022 Shuai Yang, Liming Jiang, Ziwei Liu, Chen Change Loy

Recent studies on StyleGAN show high performance on artistic portrait generation by transfer learning with limited data.

Style Transfer Transfer Learning +1

ShapeEditer: a StyleGAN Encoder for Face Swapping

no code implementations 26 Jun 2021 Shuai Yang, Kai Qiao

In this paper, we propose a novel encoder, called ShapeEditor, for high-resolution, realistic and high-fidelity face exchange.

Attribute Face Swapping

Boosting-GNN: Boosting Algorithm for Graph Networks on Imbalanced Node Classification

no code implementations 25 May 2021 S. Shi, Kai Qiao, Shuai Yang, L. Wang, J. Chen, Bin Yan

Traditional methods such as resampling, reweighting, and synthetic samples that deal with imbalanced datasets are no longer applicable in GNN.

Ensemble Learning Graph Neural Network +2

Towards Efficient Local Causal Structure Learning

no code implementations 28 Feb 2021 Shuai Yang, Hao Wang, Kui Yu, Fuyuan Cao, Xindong Wu

Local causal structure learning aims to discover and distinguish direct causes (parents) and direct effects (children) of a variable of interest from data.

Learning causal representations for robust domain adaptation

no code implementations 12 Nov 2020 Shuai Yang, Kui Yu, Fuyuan Cao, Lin Liu, Hao Wang, Jiuyong Li

In this paper, we study the cases where at the training phase the target domain data is unavailable and only well-labeled source domain data is available, called robust domain adaptation.

Domain Adaptation

Consistent Video Style Transfer via Relaxation and Regularization

1 code implementation 23 Sep 2020 Wenjing Wang, Shuai Yang, Jizheng Xu, Jiaying Liu

In this article, we address the problem by jointly considering the intrinsic properties of stylization and temporal consistency.

Style Transfer Video Style Transfer

BARS-CTR: Open Benchmarking for Click-Through Rate Prediction

6 code implementations 12 Sep 2020 Jieming Zhu, Jinyang Liu, Shuai Yang, Qi Zhang, Xiuqiang He

We have publicly released the benchmarking code, evaluation protocols, and hyper-parameter settings of our work to promote reproducible research in this field.

Benchmarking Click-Through Rate Prediction +2

New opportunities at the photon energy frontier

no code implementations 8 Sep 2020 Jaroslav Adam, Christine Aidala, Aaron Angerami, Benjamin Audurier, Carlos Bertulani, Christian Bierlich, Boris Blok, James Daniel Brandenburg, Stanley Brodsky, Aleksandr Bylinkin, Veronica Canoa Roman, Francesco Giovanni Celiberto, Jan Cepila, Grigorios Chachamis, Brian Cole, Guillermo Contreras, David d'Enterria, Adrian Dumitru, Arturo Fernández Téllez, Leonid Frankfurt, Maria Beatriz Gay Ducati, Frank Geurts, Gustavo Gil da Silveira, Francesco Giuli, Victor P. Goncalves, Iwona Grabowska-Bold, Vadim Guzey, Lucian Harland-Lang, Martin Hentschinski, Timothy J. Hobbs, Jamal Jalilian-Marian, Valery A. Khoze, Yongsun Kim, Spencer R. Klein, Simon Knapen, Mariola Kłusek-Gawenda, Michal Krelina, Evgeny Kryshen, Tuomas Lappi, Constantin Loizides, Agnieszka Luszczak, Magno Machado, Heikki Mäntysaari, Daniel Martins, Ronan McNulty, Michael Murray, Jan Nemchik, Jacquelyn Noronha-Hostler, Joakim Nystrand, Alessandro Papa, Bernard Pire, Mateusz Ploskon, Marius Przybycien, John P. Ralston, Patricia Rebello Teles, Christophe Royon, Björn Schenke, William Schmidke, Janet Seger, Anna Stasto, Peter Steinberg, Mark Strikman, Antoni Szczurek, Lech Szymanowski, Daniel Tapia Takaki, Ralf Ulrich, Orlando Villalobos Baillie, Ramona Vogt, Samuel Wallon, Michael Winn, Keping Xie, Zhangbu Xu, Shuai Yang, Mikhail Zhalov, Jian Zhou

Ultra-peripheral collisions (UPCs) involving heavy ions and protons are the energy frontier for photon-mediated interactions.

High Energy Physics - Phenomenology High Energy Physics - Experiment Nuclear Experiment

From Design Draft to Real Attire: Unaligned Fashion Image Translation

no code implementations 3 Aug 2020 Yu Han, Shuai Yang, Wenjing Wang, Jiaying Liu

Moreover, built upon the sampling network, we present design draft to real fashion item translation network (D2RNet), where two separate translation streams that focus on texture and shape, respectively, are combined tactfully to get both benefits.

Translation

Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach

no code implementations 9 Jan 2020 Yueyu Hu, Shuai Yang, Wenhan Yang, Ling-Yu Duan, Jiaying Liu

In this paper, we come up with a novel image coding framework by leveraging both the compressive and the generative models, to support machine vision and human perception tasks jointly.

Facial Landmark Detection Image Reconstruction

Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches

1 code implementation ECCV 2020 Shuai Yang, Zhangyang Wang, Jiaying Liu, Zongming Guo

We present a sketch refinement strategy, as inspired by the coarse-to-fine drawing process of the artists, which we show can help our model well adapt to casual and varied sketches without the need for real sketch training data.

Sparse-Dense Subspace Clustering

no code implementations 20 Oct 2019 Shuai Yang, Wenqi Zhu, Yuesheng Zhu

In the first stage, an affinity matrix is generated from data.

Clustering

Residual Encoder-Decoder Network for Deep Subspace Clustering

no code implementations 12 Oct 2019 Shuai Yang, Wenqi Zhu, Yuesheng Zhu

Subspace clustering aims to cluster unlabeled data that lies in a union of low-dimensional linear subspaces.

Clustering Decoder

Towards Efficient Local Causal Structure Learning

1 code implementation 3 Oct 2019 Shuai Yang, Hao Wang, Kui Yu, Fuyuan Cao, Xindong Wu

To tackle this issue, we propose a novel Efficient Local Causal Structure learning algorithm, named ELCS.

Causal Discovery

Semi-supervised representation learning via dual autoencoders for domain adaptation

1 code implementation 4 Aug 2019 Shuai Yang, Hao Wang, Yuhong Zhang, Pei-Pei Li, Yi Zhu, Xuegang Hu

Domain adaptation aims to exploit the knowledge in source domain to promote the learning tasks in target domain, which plays a critical role in real-world applications.

Denoising Representation Learning +1

TE141K: Artistic Text Benchmark for Text Effect Transfer

no code implementations 8 May 2019 Shuai Yang, Wenjing Wang, Jiaying Liu

To the best of our knowledge, this is the largest dataset for text effect transfer to date.

Style Transfer Text Effects Transfer

Controllable Artistic Text Style Transfer via Shape-Matching GAN

1 code implementation ICCV 2019 Shuai Yang, Zhangyang Wang, Zhaowen Wang, Ning Xu, Jiaying Liu, Zongming Guo

In this paper, we present the first text style transfer network that allows for real-time control of the crucial stylistic degree of the glyph through an adjustable parameter.

Style Transfer Text Style Transfer

Three-Stage Subspace Clustering Framework with Graph-Based Transformation and Optimization

no code implementations 2 May 2019 Shuai Yang, Wenqi Zhu, Yuesheng Zhu

The affinity matrix is obtained in the first stage, then it goes through the second stage, where the proposed GBTO is applied to generate a reconstructed affinity matrix with more authentic similarity between data points.

Clustering

Restricted Connection Orthogonal Matching Pursuit For Sparse Subspace Clustering

no code implementations 1 May 2019 Wenqi Zhu, Yuesheng Zhu, Li Zhong, Shuai Yang

In this paper, we propose a noise-robust algorithm, Restricted Connection Orthogonal Matching Pursuit for Sparse Subspace Clustering (RCOMP-SSC), to improve the clustering accuracy and maintain the low computational time by restricting the number of connections of each data point during the iteration of OMP.

Clustering

TET-GAN: Text Effects Transfer via Stylization and Destylization

no code implementations 16 Dec 2018 Shuai Yang, Jiaying Liu, Wenjing Wang, Zongming Guo

The key idea is to train our network to accomplish both the objective of style transfer and style removal, so that it can learn to disentangle and recombine the content and style features of text effects images.

One-Shot Learning Style Transfer +1

Context-Aware Text-Based Binary Image Stylization and Synthesis

no code implementations 9 Oct 2018 Shuai Yang, Jiaying Liu, Wenhan Yang, Zongming Guo

The stylization is then followed by a context-aware layout design algorithm, where cues for both seamlessness and aesthetics are employed to determine the optimal layout of the shape in the background.

Image Inpainting Image Stylization +2

Awesome Typography: Statistics-Based Text Effects Transfer

1 code implementation CVPR 2017 Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo

It allows our algorithm to produce artistic typography that fits for both local texture patterns and the global spatial distribution in the example.

Style Transfer Text Effects Transfer +1
