1 code implementation • LREC 2022 • Yuru Jiang, Yang Xu, Yuhang Zhan, WeiKai He, Yilin Wang, Zixuan Xi, Meiyun Wang, Xinyu Li, Yu Li, Yanchao Yu
We describe a new freely available Chinese multi-party dialogue dataset for automatic extraction of dialogue-based character relationships.
no code implementations • 12 Apr 2025 • Yilin Wang, Peixuan Lei, Xuyang Wang, Liangliang Jiang, Liming Xuan, Wei Cheng, Honghua Zhao, Yuanxiang Li
To address these challenges, this paper presents a self-supervised learning-based foundation model that enables the transfer of diagnostic knowledge from mature aircraft (e. g., A320, A330) to newer ones (e. g., C919).
no code implementations • 11 Apr 2025 • Yilin Wang, Chuan Guo, Yuxuan Mu, Muhammad Gohar Javed, Xinxin Zuo, Juwei Lu, Hai Jiang, Li Cheng
In this work, we present MotionDreamer, a localized masked modeling paradigm designed to learn internal motion patterns from a given motion with arbitrary topology and duration.
no code implementations • 18 Mar 2025 • Yilin Wang
Autonomous driving systems require a deep understanding of human driving behaviors to achieve higher intelligence and safety. Despite advancements in deep learning, challenges such as long-tail distribution due to scarce samples and confusion from similar behaviors hinder effective driving behavior detection. Existing methods often fail to address sample confusion adequately, as datasets frequently contain ambiguous samples that obscure unique semantic information.
no code implementations • 12 Jan 2025 • Peng Zheng, Linzhi Huang, Yizhou Yu, Yi Chang, Yilin Wang, Rui Ma
However, the high computational cost of NeRF presents challenges for synthesizing high-resolution (HR) images.
1 code implementation • 7 Jan 2025 • Yuchun Fan, Yongyu Mu, Yilin Wang, Lei Huang, Junhao Ruan, Bei Li, Tong Xiao, ShuJian Huang, Xiaocheng Feng, Jingbo Zhu
Despite the significant improvements achieved by large language models (LLMs) in English reasoning tasks, these models continue to struggle with multilingual reasoning.
1 code implementation • 28 Dec 2024 • Nianli Peng, Yilin Wang
Building on previous works, We reformulate the infinite-agent stochastic control problem as a Markov Decision Process, where each representative agent interacts with the evolving mean field distribution.
no code implementations • 28 Dec 2024 • Gaoang Wang, Hang Wu, Yang Liao, Zhen Chen, Qing Zhou, Wenxing Wang, Yifei Liu, Yilin Wang, Meijing Wu, Ruiqi Xiang, Yuntao Yu, Xi Zhou, Feng Zhu, Zhonghua Liu, Tingjun Hou
Biotoxins, mainly produced by venomous animals, plants and microorganisms, exhibit high physiological activity and unique effects such as lowering blood pressure and analgesia.
no code implementations • 24 Dec 2024 • Wen Wen, Yilin Wang, Neil Birkbeck, Balu Adsumilli
The rise of short-form videos, characterized by diverse content, editing styles, and artifacts, poses substantial challenges for learning-based blind video quality assessment (BVQA) models.
no code implementations • 10 Dec 2024 • Xi Chen, Zhifei Zhang, He Zhang, Yuqian Zhou, Soo Ye Kim, Qing Liu, Yijun Li, Jianming Zhang, Nanxuan Zhao, Yilin Wang, Hui Ding, Zhe Lin, Hengshuang Zhao
We introduce UniReal, a unified framework designed to address various image generation and editing tasks.
no code implementations • 23 Nov 2024 • Hang Hua, Qing Liu, Lingzhi Zhang, Jing Shi, Zhifei Zhang, Yilin Wang, Jianming Zhang, Jiebo Luo
To support this endeavor, we introduce COMPOSITIONCAP, a new dataset for multi-grained region compositional image captioning, which introduces the task of compositional attribute-aware regional image captioning.
no code implementations • 10 Oct 2024 • Yilin Wang, Chuan Guo, Li Cheng, Hai Jiang
This motivates us to consider a novel task of \textit{Region Controllable Hand Grasp Generation (RegionGrasp)}, as follows: given as input a 3D object, together with its specific surface area selected as the intended contact region, to generate a diverse set of plausible hand grasps of the object, where the thumb finger tip touches the object surface on the contact region.
no code implementations • 26 Sep 2024 • Yilin Wang, Yifei Yu, Kong Sun, Peixuan Lei, Yuxuan Zhang, Enrico Zio, Aiguo Xia, Yuanxiang Li
Extensive experiments demonstrate that RmGPT significantly outperforms state-of-the-art algorithms, achieving near-perfect accuracy in diagnosis tasks and exceptionally low errors in prognosis tasks.
no code implementations • 5 Jul 2024 • Yuxuan Mu, Xinxin Zuo, Chuan Guo, Yilin Wang, Juwei Lu, Xiaofeng Wu, Songcen Xu, Peng Dai, Youliang Yan, Li Cheng
We present GSD, a diffusion model approach based on Gaussian Splatting (GS) representation for 3D object reconstruction from a single view.
no code implementations • 13 Jun 2024 • Jianing Yang, Harshine Visvanathan, Yilin Wang, Xinyi Hu, Matthew Gormley
Many tasks within NLP can be framed as sequential decision problems, ranging from sequence tagging to text generation.
no code implementations • 9 Jun 2024 • Yilin Wang, Haiyang Xu, Xiang Zhang, Zeyuan Chen, Zhizhou Sha, ZiRui Wang, Zhuowen Tu
We provide a two-way integration for the widely adopted ControlNet by integrating external condition generation algorithms into a single dense prediction method and incorporating its individually trained image generation processes into a single model.
no code implementations • 8 Jun 2024 • Yilin Wang, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli
We provided a comprehensive analysis of subjective quality scores for Short form SDR and HDR videos, and discuss the reliability of state-of-the-art UGC quality metrics and potential improvements.
no code implementations • 20 Apr 2024 • Xi Wang, Yichen Peng, Heng Fang, Yilin Wang, Haoran Xie, Xi Yang, Chuntao Li
Achieving this requires the effective decoupling of key attributes within the input image data to achieve representations accurately.
no code implementations • 8 Apr 2024 • Jing Gu, Nanxuan Zhao, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Yilin Wang, Xin Eric Wang
Compared with existing methods for personalized subject swapping, SwapAnything has three unique advantages: (1) precise control of arbitrary objects and parts rather than the main subject, (2) more faithful preservation of context pixels, (3) better adaptation of the personalized concept to the image.
1 code implementation • 26 Mar 2024 • Yilin Wang, Minghao Hu, Zhen Huang, Dongsheng Li, Dong Yang, Xicheng Lu
Previous methods for KGC re-ranking are mostly built on non-generative language models to obtain the probability of each candidate.
1 code implementation • CVPR 2024 • Haiyang Xu, Yu Lei, Zeyuan Chen, Xiang Zhang, Yue Zhao, Yilin Wang, Zhuowen Tu
We present Bayesian Diffusion Models (BDM), a prediction algorithm that performs effective Bayesian inference by tightly coupling the top-down (prior) information with the bottom-up (data-driven) procedure via joint diffusion processes.
1 code implementation • CVPR 2024 • Nannan Li, Qing Liu, Krishna Kumar Singh, Yilin Wang, Jianming Zhang, Bryan A. Plummer, Zhe Lin
In this paper, we propose UniHuman, a unified model that addresses multiple facets of human image editing in real-world settings.
1 code implementation • CVPR 2024 • ZiRui Wang, Zhizhou Sha, Zheng Ding, Yilin Wang, Zhuowen Tu
We present TokenCompose, a Latent Diffusion Model for text-to-image generation that achieves enhanced consistency between user-specified text prompts and model-generated images.
1 code implementation • 14 Nov 2023 • Yilin Wang, Xinyi Hu, Matthew R. Gormley
In this paper, we introduce the entanglement model, aiming to combine character and subword language models.
no code implementations • 25 Oct 2023 • Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhizhou Sha, Zhuowen Tu
In this paper, we introduce a novel generative model, Diffusion Layout Transformers without Autoencoder (Dolfin), which significantly improves the modeling capability with reduced complexity compared to existing methods.
no code implementations • 11 Oct 2023 • Zhengmeng Xu, Yujie Wang, Xiaotong Feng, Yilin Wang, Yanli Li, Hai Lin
We propose a time series forecasting method named Quantum Gramian Angular Field (QGAF).
2 code implementations • 8 May 2023 • Yilin Wang, Nan Cao, Teng Zhang, Xuanhua Shi, Hai Jin
Optimal margin Distribution Machine (ODM) is a newly proposed statistical learning framework rooting in the novel margin theory, which demonstrates better generalization performance than the traditional large margin based counterparts.
no code implementations • CVPR 2023 • Yiqun Mei, He Zhang, Xuaner Zhang, Jianming Zhang, Zhixin Shu, Yilin Wang, Zijun Wei, Shi Yan, HyunJoon Jung, Vishal M. Patel
Recent portrait relighting methods have achieved realistic results of portrait lighting effects given a desired lighting representation such as an environment map.
no code implementations • 13 Mar 2023 • Junjie Ke, Tianhao Zhang, Yilin Wang, Peyman Milanfar, Feng Yang
No-reference video quality assessment (NR-VQA) for user generated content (UGC) is crucial for understanding and improving visual experience.
no code implementations • 20 Oct 2022 • Yilin Wang, Yiheng Feng
Model-based and learning-based methods are two major types of methodologies to model car following behaviors.
1 code implementation • 29 Jun 2022 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
Perceptual video quality assessment (VQA) is an integral component of many streaming and video sharing platforms.
Ranked #1 on
Video Quality Assessment
on LIVE-ETRI
(using extra training data)
no code implementations • 18 Jun 2022 • Yilin Wang, Farzan Farnia
We support our theoretical results by performing several numerical experiments showing the role of the substitute network's generalization in generating transferable adversarial examples.
no code implementations • 21 May 2022 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We consider the problem of capturing distortions arising from changes in frame rate as part of Video Quality Assessment (VQA).
no code implementations • 8 Apr 2022 • Xiangyu Huang, Caidan Zhao, Yilin Wang, Zhiqiang Wu
Firstly, we design a two-stream encoder to encode the appearance and motion information representations of normal samples and introduce constraints to further enhance the consistency of the feature semantics between appearance and motion information of normal samples so that abnormal samples with low consistency appearance and motion feature representation can be identified.
Ranked #2 on
Anomaly Detection
on CUHK Avenue
no code implementations • 31 Mar 2022 • Xiangxu Yu, Zhengzhong Tu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
In recent years, with the vigorous development of the video game industry, the proportion of gaming videos on major video websites like YouTube has dramatically increased.
no code implementations • 24 Mar 2022 • Xiangxu Yu, Zhenqiang Ying, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
A number of studies have been directed towards understanding the perceptual characteristics of professionally generated gaming videos arising in gaming video streaming, online gaming, and cloud gaming.
no code implementations • 15 Mar 2022 • Jeya Maria Jose Valanarasu, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Jose Echevarria, Yinglan Ma, Zijun Wei, Kalyan Sunkavalli, Vishal M. Patel
To enable flexible interaction between user and harmonization, we introduce interactive harmonization, a new setting where the harmonization is performed with respect to a selected \emph{region} in the reference image instead of the entire background.
1 code implementation • CVPR 2022 • Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zijun Wei, Zhe Lin, Alan Yuille
We propose Lite Vision Transformer (LVT), a novel light-weight transformer network with two enhanced self-attention mechanisms to improve the model performances for mobile deployment.
2 code implementations • 25 Oct 2021 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We consider the problem of obtaining image quality representations in a self-supervised manner.
Ranked #2 on
Video Quality Assessment
on LIVE-ETRI
(using extra training data)
no code implementations • 29 Sep 2021 • Yilin Wang, Nan Cao, Teng Zhang, Hai Jin
Optimal margin Distribution Machine (ODM), a newly proposed statistical learning framework rooting in the novel margin theory, demonstrates better generalization performance than the traditional large margin based counterparts.
no code implementations • 27 Sep 2021 • Pavan C Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
In this work we address the problem of frame rate dependent Video Quality Assessment (VQA) when the videos to be compared have different frame rate and compression factor.
Ranked #2 on
Video Quality Assessment
on LIVE-YT-HFR
1 code implementation • ICCV 2021 • Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang
Image harmonization aims to improve the quality of image compositing by matching the "appearance" (\eg, color tone, brightness and contrast) between foreground and background images.
2 code implementations • ICCV 2021 • Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, Feng Yang
To accommodate this, the input images are usually resized and cropped to a fixed shape, causing image quality degradation.
Ranked #3 on
Image Quality Assessment
on MSU NR VQA Database
no code implementations • CVPR 2021 • Yilin Wang, Junjie Ke, Hossein Talebi, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli, Peyman Milanfar, Feng Yang
Besides the subjective ratings and content labels of the dataset, we also propose a DNN-based framework to thoroughly analyze importance of content, technical quality, and compression level in perceptual quality.
no code implementations • 5 Jun 2021 • Yilin Wang, Shaozuo Yu, Xiaokang Yang, Wei Shen
In this paper, we propose a generic model transfer scheme to make Convlutional Neural Networks (CNNs) interpretable, while maintaining their high classification accuracy.
no code implementations • CVPR 2021 • Xin Yuan, Zhe Lin, Jason Kuen, Jianming Zhang, Yilin Wang, Michael Maire, Ajinkya Kale, Baldo Faieta
We first train our model on COCO and evaluate the learned visual representations on various downstream tasks including image classification, object detection, and instance segmentation.
no code implementations • 29 Mar 2021 • Yilin Wang, Jiayi Ye
Video classification and analysis is always a popular and challenging field in computer vision.
no code implementations • 30 Jan 2021 • Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Video and image quality assessment has long been projected as a regression problem, which requires predicting a continuous quality score given an input stimulus.
1 code implementation • 26 Jan 2021 • Zhengzhong Tu, Xiangxu Yu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
However, these models are either incapable or inefficient for predicting the quality of complex and diverse UGC videos in practical applications.
Ranked #4 on
Video Quality Assessment
on LIVE Livestream
no code implementations • 22 Dec 2020 • Jianzhou Zhao, Yilin Wang, Xiaolong Feng, Shengyuan A. Yang
Our results indicate that the electronic structures of LaFe$_2$As$_2$ and CaFe$_2$As$_2$ are not too different, which further suggest that superconductivity might also be induced in the collapsed phase of LaFe$_2$As$_2$ under similar non-hydrostatic conditions as for CaFe$_2$As$_2$.
Strongly Correlated Electrons Superconductivity
1 code implementation • 13 Dec 2020 • Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zhe Lin, Alan Yuille
To evaluate segmentation quality near object boundaries, we propose the Meticulosity Quality (MQ) score considering both the mask coverage and boundary precision.
1 code implementation • CVPR 2021 • Qihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan Yuille
We propose Mask Guided (MG) Matting, a robust matting framework that takes a general coarse mask as guidance.
no code implementations • 10 Dec 2020 • Fredrik Viklund, Yilin Wang
Moreover, if either of these two energies is finite they are equal up to a constant factor, and in this case, the foliation leaves are Weil-Petersson quasicircles.
Complex Variables Mathematical Physics Mathematical Physics Probability
no code implementations • 29 Oct 2020 • Yilin Wang, Jiayi Ye
Point cloud 3D object detection has recently received major attention and becomes an active research topic in 3D computer vision community.
1 code implementation • 26 Oct 2020 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We consider the problem of conducting frame rate dependent video quality assessment (VQA) on videos of diverse frame rates, including high frame rate (HFR) videos.
Ranked #1 on
Video Quality Assessment
on LIVE-YT-HFR
no code implementations • NeurIPS 2020 • Digvijay Boob, Qi Deng, Guanghui Lan, Yilin Wang
We also establish new convergence complexities to achieve an approximate KKT solution when the objective can be smooth/nonsmooth, deterministic/stochastic and convex/nonconvex with complexity that is on a par with gradient descent for unconstrained optimization problems in respective cases.
1 code implementation • 22 Sep 2020 • Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik
Banding artifacts, which manifest as staircase-like color bands on pictures or video frames, is a common distortion caused by compression of low-textured smooth regions.
1 code implementation • ECCV 2020 • Shikun Liu, Zhe Lin, Yilin Wang, Jianming Zhang, Federico Perazzi, Edward Johns
We present a novel resizing module for neural networks: shape adaptor, a drop-in enhancement built on top of traditional resizing layers, such as pooling, bilinear sampling, and strided convolution.
1 code implementation • 22 Jul 2020 • Pavan C. Madhusudana, Xiangxu Yu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We also conducted a holistic evaluation of existing state-of-the-art Full and No-Reference video quality algorithms, and statistically benchmarked their performance on the new database.
no code implementations • ECCV 2020 • Kenan E. Ak, Ning Xu, Zhe Lin, Yilin Wang
To our best knowledge, the proposed method is first to enable adversarial learning in autoregressive models for image generation.
no code implementations • CVPR 2020 • Innfarn Yoo, Xiyang Luo, Yilin Wang, Feng Yang, Peyman Milanfar
DitherNet manipulates the input image to reduce color banding artifacts and provides an alternative to traditional dithering.
no code implementations • 19 Jun 2020 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
High frame rate videos are increasingly getting popular in recent years, driven by the strong requirements of the entertainment and streaming industries to provide high quality of experiences to consumers.
Ranked #3 on
Video Quality Assessment
on LIVE-YT-HFR
5 code implementations • 29 May 2020 • Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Recent years have witnessed an explosion of user-generated content (UGC) videos shared and streamed over the Internet, thanks to the evolution of affordable and reliable consumer capture devices, and the tremendous popularity of social media platforms.
Ranked #13 on
Video Quality Assessment
on LIVE-FB LSVQ
no code implementations • 27 Feb 2020 • Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik
Banding artifact, or false contouring, is a common video compression impairment that tends to appear on large flat regions in encoded videos.
1 code implementation • 13 Apr 2019 • Yilin Wang, Sasi Inguva, Balu Adsumilli
However, traditional metrics used in compression and quality assessment, like BD-Rate and PSNR, are designed for pristine originals.
Multimedia Image and Video Processing
2 code implementations • ICCV 2019 • Yulun Zhang, Chen Fang, Yilin Wang, Zhaowen Wang, Zhe Lin, Yun Fu, Jimei Yang
An assumption widely used in recent neural style transfer methods is that image styles can be described by global statics of deep features like Gram or covariance matrices.
no code implementations • NeurIPS 2018 • Tianshu Yu, Junchi Yan, Yilin Wang, Wei Liu, Baoxin Li
Graph matching has received persistent attention over decades, which can be formulated as a quadratic assignment problem (QAP).
no code implementations • 5 Apr 2017 • Parag S. Chandakkar, Yilin Wang, Baoxin Li
In the framework, the number of lanes, the vehicle's position in those lanes and the presence of other vehicles are considered as parameters.
no code implementations • 21 Jul 2016 • Yilin Wang, Suhang Wang, Jiliang Tang, Neil O'Hare, Yi Chang, Baoxin Li
Understanding human actions in wild videos is an important task with a broad range of applications.
no code implementations • CVPR 2016 • Yilin Wang, Suhang Wang, Jiliang Tang, Huan Liu, Baoxin Li
However, pointwise labels in image classification and tag annotation are inherently related to the pairwise labels.
no code implementations • 24 Mar 2015 • Qiang Zhang, Yilin Wang, Baoxin Li
Recently, the spectrum analysis based visual saliency approach has attracted a lot of interest due to its simplicity and good performance, where the phase information of the image is used to construct the saliency map.
no code implementations • CVPR 2014 • Yilin Wang, Ke Wang, Enrique Dunn, Jan-Michael Frahm
We develop a sequential optimal sampling framework for stereo disparity estimation by adapting the Sequential Probability Ratio Test (SPRT) model.