Search Results for author: Yilin Wang

Found 52 papers, 18 papers with code

UniHuman: A Unified Model for Editing Human Images in the Wild

no code implementations22 Dec 2023 Nannan Li, Qing Liu, Krishna Kumar Singh, Yilin Wang, Jianming Zhang, Bryan A. Plummer, Zhe Lin

In this paper, we propose UniHuman, a unified model that addresses multiple facets of human image editing in real-world settings.

TokenCompose: Grounding Diffusion with Token-level Supervision

1 code implementation6 Dec 2023 ZiRui Wang, Zhizhou Sha, Zheng Ding, Yilin Wang, Zhuowen Tu

We present TokenCompose, a Latent Diffusion Model for text-to-image generation that achieves enhanced consistency between user-specified text prompts and model-generated images.

Denoising Object +1

Dolfin: Diffusion Layout Transformers without Autoencoder

no code implementations25 Oct 2023 Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhizhou Sha, Zhuowen Tu

In this paper, we introduce a novel generative model, Diffusion Layout Transformers without Autoencoder (Dolfin), which significantly improves the modeling capability with reduced complexity compared to existing methods.

Scalable Optimal Margin Distribution Machine

2 code implementations8 May 2023 Yilin Wang, Nan Cao, Teng Zhang, Xuanhua Shi, Hai Jin

Optimal margin Distribution Machine (ODM) is a newly proposed statistical learning framework rooting in the novel margin theory, which demonstrates better generalization performance than the traditional large margin based counterparts.

LightPainter: Interactive Portrait Relighting with Freehand Scribble

no code implementations CVPR 2023 Yiqun Mei, He Zhang, Xuaner Zhang, Jianming Zhang, Zhixin Shu, Yilin Wang, Zijun Wei, Shi Yan, HyunJoon Jung, Vishal M. Patel

Recent portrait relighting methods have achieved realistic results of portrait lighting effects given a desired lighting representation such as an environment map.

MRET: Multi-resolution Transformer for Video Quality Assessment

no code implementations13 Mar 2023 Junjie Ke, Tianhao Zhang, Yilin Wang, Peyman Milanfar, Feng Yang

No-reference video quality assessment (NR-VQA) for user generated content (UGC) is crucial for understanding and improving visual experience.

Video Quality Assessment Video Recognition +1

IDM-Follower: A Model-Informed Deep Learning Method for Long-Sequence Car-Following Trajectory Prediction

no code implementations20 Oct 2022 Yilin Wang, Yiheng Feng

Model-based and learning-based methods are two major types of methodologies to model car following behaviors.

Trajectory Prediction

CONVIQT: Contrastive Video Quality Estimator

1 code implementation29 Jun 2022 Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

Perceptual video quality assessment (VQA) is an integral component of many streaming and video sharing platforms.

Self-Supervised Learning Video Quality Assessment +1

On the Role of Generalization in Transferability of Adversarial Examples

no code implementations18 Jun 2022 Yilin Wang, Farzan Farnia

We support our theoretical results by performing several numerical experiments showing the role of the substitute network's generalization in generating transferable adversarial examples.

Generalization Bounds

A Video Anomaly Detection Framework based on Appearance-Motion Semantics Representation Consistency

no code implementations8 Apr 2022 Xiangyu Huang, Caidan Zhao, Yilin Wang, Zhiqiang Wu

Firstly, we design a two-stream encoder to encode the appearance and motion information representations of normal samples and introduce constraints to further enhance the consistency of the feature semantics between appearance and motion information of normal samples so that abnormal samples with low consistency appearance and motion feature representation can be identified.

Anomaly Detection Optical Flow Estimation +1

Perceptual Quality Assessment of UGC Gaming Videos

no code implementations31 Mar 2022 Xiangxu Yu, Zhengzhong Tu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

In recent years, with the vigorous development of the video game industry, the proportion of gaming videos on major video websites like YouTube has dramatically increased.

Video Quality Assessment Visual Question Answering (VQA)

Subjective and Objective Analysis of Streamed Gaming Videos

no code implementations24 Mar 2022 Xiangxu Yu, Zhenqiang Ying, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

A number of studies have been directed towards understanding the perceptual characteristics of professionally generated gaming videos arising in gaming video streaming, online gaming, and cloud gaming.

Video Quality Assessment Visual Question Answering (VQA)

Interactive Portrait Harmonization

no code implementations15 Mar 2022 Jeya Maria Jose Valanarasu, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Jose Echevarria, Yinglan Ma, Zijun Wei, Kalyan Sunkavalli, Vishal M. Patel

To enable flexible interaction between user and harmonization, we introduce interactive harmonization, a new setting where the harmonization is performed with respect to a selected \emph{region} in the reference image instead of the entire background.

Image Harmonization

Lite Vision Transformer with Enhanced Self-Attention

1 code implementation CVPR 2022 Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zijun Wei, Zhe Lin, Alan Yuille

We propose Lite Vision Transformer (LVT), a novel light-weight transformer network with two enhanced self-attention mechanisms to improve the model performances for mobile deployment.

Panoptic Segmentation Segmentation

Distributed Optimal Margin Distribution Machine

no code implementations29 Sep 2021 Yilin Wang, Nan Cao, Teng Zhang, Hai Jin

Optimal margin Distribution Machine (ODM), a newly proposed statistical learning framework rooting in the novel margin theory, demonstrates better generalization performance than the traditional large margin based counterparts.

High Frame Rate Video Quality Assessment using VMAF and Entropic Differences

no code implementations27 Sep 2021 Pavan C Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

In this work we address the problem of frame rate dependent Video Quality Assessment (VQA) when the videos to be compared have different frame rate and compression factor.

Video Quality Assessment Visual Question Answering (VQA) +1

SSH: A Self-Supervised Framework for Image Harmonization

1 code implementation ICCV 2021 Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang

Image harmonization aims to improve the quality of image compositing by matching the "appearance" (\eg, color tone, brightness and contrast) between foreground and background images.

Benchmarking Data Augmentation +1

MUSIQ: Multi-scale Image Quality Transformer

2 code implementations ICCV 2021 Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, Feng Yang

To accommodate this, the input images are usually resized and cropped to a fixed shape, causing image quality degradation.

Image Quality Assessment

Rich Features for Perceptual Quality Assessment of UGC Videos

no code implementations CVPR 2021 Yilin Wang, Junjie Ke, Hossein Talebi, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli, Peyman Milanfar, Feng Yang

Besides the subjective ratings and content labels of the dataset, we also propose a DNN-based framework to thoroughly analyze importance of content, technical quality, and compression level in perceptual quality.

Video Quality Assessment

Making CNNs Interpretable by Building Dynamic Sequential Decision Forests with Top-down Hierarchy Learning

no code implementations5 Jun 2021 Yilin Wang, Shaozuo Yu, Xiaokang Yang, Wei Shen

In this paper, we propose a generic model transfer scheme to make Convlutional Neural Networks (CNNs) interpretable, while maintaining their high classification accuracy.

Classification

Multimodal Contrastive Training for Visual Representation Learning

no code implementations CVPR 2021 Xin Yuan, Zhe Lin, Jason Kuen, Jianming Zhang, Yilin Wang, Michael Maire, Ajinkya Kale, Baldo Faieta

We first train our model on COCO and evaluate the learned visual representations on various downstream tasks including image classification, object detection, and instance segmentation.

Cross-Modal Retrieval Image Classification +6

Classifying Video based on Automatic Content Detection Overview

no code implementations29 Mar 2021 Yilin Wang, Jiayi Ye

Video classification and analysis is always a popular and challenging field in computer vision.

Classification General Classification +3

Regression or Classification? New Methods to Evaluate No-Reference Picture and Video Quality Models

no code implementations30 Jan 2021 Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

Video and image quality assessment has long been projected as a regression problem, which requires predicting a continuous quality score given an input stimulus.

General Classification Image Quality Assessment +2

RAPIQUE: Rapid and Accurate Video Quality Prediction of User Generated Content

1 code implementation26 Jan 2021 Zhengzhong Tu, Xiangxu Yu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

However, these models are either incapable or inefficient for predicting the quality of complex and diverse UGC videos in practical applications.

Video Quality Assessment

Electronic Correlations and Absence of Superconductivity in the Collapsed Phase of LaFe$_2$As$_2$

no code implementations22 Dec 2020 Jianzhou Zhao, Yilin Wang, Xiaolong Feng, Shengyuan A. Yang

Our results indicate that the electronic structures of LaFe$_2$As$_2$ and CaFe$_2$As$_2$ are not too different, which further suggest that superconductivity might also be induced in the collapsed phase of LaFe$_2$As$_2$ under similar non-hydrostatic conditions as for CaFe$_2$As$_2$.

Strongly Correlated Electrons Superconductivity

Meticulous Object Segmentation

1 code implementation13 Dec 2020 Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zhe Lin, Alan Yuille

To evaluate segmentation quality near object boundaries, we propose the Meticulosity Quality (MQ) score considering both the mask coverage and boundary precision.

Image Segmentation Object +2

Mask Guided Matting via Progressive Refinement Network

1 code implementation CVPR 2021 Qihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan Yuille

We propose Mask Guided (MG) Matting, a robust matting framework that takes a general coarse mask as guidance.

Image Matting

The Loewner-Kufarev Energy and Foliations by Weil-Petersson Quasicircles

no code implementations10 Dec 2020 Fredrik Viklund, Yilin Wang

Moreover, if either of these two energies is finite they are equal up to a constant factor, and in this case, the foliation leaves are Weil-Petersson quasicircles.

Complex Variables Mathematical Physics Mathematical Physics Probability

An Overview Of 3D Object Detection

no code implementations29 Oct 2020 Yilin Wang, Jiayi Ye

Point cloud 3D object detection has recently received major attention and becomes an active research topic in 3D computer vision community.

3D Object Detection Object +2

ST-GREED: Space-Time Generalized Entropic Differences for Frame Rate Dependent Video Quality Prediction

1 code implementation26 Oct 2020 Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

We consider the problem of conducting frame rate dependent video quality assessment (VQA) on videos of diverse frame rates, including high frame rate (HFR) videos.

Video Quality Assessment Visual Question Answering (VQA)

A Feasible Level Proximal Point Method for Nonconvex Sparse Constrained Optimization

no code implementations NeurIPS 2020 Digvijay Boob, Qi Deng, Guanghui Lan, Yilin Wang

We also establish new convergence complexities to achieve an approximate KKT solution when the objective can be smooth/nonsmooth, deterministic/stochastic and convex/nonconvex with complexity that is on a par with gradient descent for unconstrained optimization problems in respective cases.

Adaptive Debanding Filter

1 code implementation22 Sep 2020 Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik

Banding artifacts, which manifest as staircase-like color bands on pictures or video frames, is a common distortion caused by compression of low-textured smooth regions.

Quantization

Shape Adaptor: A Learnable Resizing Module

1 code implementation ECCV 2020 Shikun Liu, Zhe Lin, Yilin Wang, Jianming Zhang, Federico Perazzi, Edward Johns

We present a novel resizing module for neural networks: shape adaptor, a drop-in enhancement built on top of traditional resizing layers, such as pooling, bilinear sampling, and strided convolution.

Image Classification Neural Architecture Search +1

Subjective and Objective Quality Assessment of High Frame Rate Videos

1 code implementation22 Jul 2020 Pavan C. Madhusudana, Xiangxu Yu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

We also conducted a holistic evaluation of existing state-of-the-art Full and No-Reference video quality algorithms, and statistically benchmarked their performance on the new database.

Vocal Bursts Intensity Prediction

Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation

no code implementations ECCV 2020 Kenan E. Ak, Ning Xu, Zhe Lin, Yilin Wang

To our best knowledge, the proposed method is first to enable adversarial learning in autoregressive models for image generation.

Image Generation

GIFnets: Differentiable GIF Encoding Framework

no code implementations CVPR 2020 Innfarn Yoo, Xiyang Luo, Yilin Wang, Feng Yang, Peyman Milanfar

DitherNet manipulates the input image to reduce color banding artifacts and provides an alternative to traditional dithering.

Capturing Video Frame Rate Variations via Entropic Differencing

no code implementations19 Jun 2020 Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

High frame rate videos are increasingly getting popular in recent years, driven by the strong requirements of the entertainment and streaming industries to provide high quality of experiences to consumers.

Video Quality Assessment

UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content

5 code implementations29 May 2020 Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

Recent years have witnessed an explosion of user-generated content (UGC) videos shared and streamed over the Internet, thanks to the evolution of affordable and reliable consumer capture devices, and the tremendous popularity of social media platforms.

Benchmarking feature selection +2

BBAND Index: A No-Reference Banding Artifact Predictor

no code implementations27 Feb 2020 Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik

Banding artifact, or false contouring, is a common video compression impairment that tends to appear on large flat regions in encoded videos.

Video Compression

YouTube UGC Dataset for Video Compression Research

1 code implementation13 Apr 2019 Yilin Wang, Sasi Inguva, Balu Adsumilli

However, traditional metrics used in compression and quality assessment, like BD-Rate and PSNR, are designed for pristine originals.

Multimedia Image and Video Processing

Multimodal Style Transfer via Graph Cuts

2 code implementations ICCV 2019 Yulun Zhang, Chen Fang, Yilin Wang, Zhaowen Wang, Zhe Lin, Yun Fu, Jimei Yang

An assumption widely used in recent neural style transfer methods is that image styles can be described by global statics of deep features like Gram or covariance matrices.

Style Transfer

Generalizing Graph Matching beyond Quadratic Assignment Model

no code implementations NeurIPS 2018 Tianshu Yu, Junchi Yan, Yilin Wang, Wei Liu, Baoxin Li

Graph matching has received persistent attention over decades, which can be formulated as a quadratic assignment problem (QAP).

Graph Matching

Improving Vision-based Self-positioning in Intelligent Transportation Systems via Integrated Lane and Vehicle Detection

no code implementations5 Apr 2017 Parag S. Chandakkar, Yilin Wang, Baoxin Li

In the framework, the number of lanes, the vehicle's position in those lanes and the presence of other vehicles are considered as parameters.

Computational Efficiency Density Estimation +1

PPP: Joint Pointwise and Pairwise Image Label Prediction

no code implementations CVPR 2016 Yilin Wang, Suhang Wang, Jiliang Tang, Huan Liu, Baoxin Li

However, pointwise labels in image classification and tag annotation are inherently related to the pairwise labels.

Attribute General Classification +2

Unsupervised Video Analysis Based on a Spatiotemporal Saliency Detector

no code implementations24 Mar 2015 Qiang Zhang, Yilin Wang, Baoxin Li

Recently, the spectrum analysis based visual saliency approach has attracted a lot of interest due to its simplicity and good performance, where the phase information of the image is used to construct the saliency map.

Anomaly Detection Foreground Segmentation +5

Cannot find the paper you are looking for? You can Submit a new open access paper.