no code implementations • 5 Dec 2024 • Yuan Xue, Qi Zhang, Chuanmin Jia, Shiqi Wang
By jointly optimizing compression and LL tasks, the proposed LL-ICM not only enriches its encoding ability in generalizing to versatile LL tasks but also optimizes the processing ability of down-stream LL task models, achieving mutual adaptation for image codecs and LL task models.
no code implementations • 23 Nov 2024 • Yingwen Zhang, Meng Wang, Xihua Sheng, Peilin Chen, Junru Li, Li Zhang, Shiqi Wang
Lossy image compression networks aim to minimize the latent entropy of images while adhering to specific distortion constraints.
no code implementations • 21 Nov 2024 • Peilin Chen, Xiaohan Fang, Meng Wang, Shiqi Wang, Siwei Ma
The Human Visual System (HVS), with its intricate sophistication, is capable of achieving ultra-compact information compression for visual signals.
no code implementations • 19 Nov 2024 • Siyi Pan, Baoliang Chen, Danni Huang, Hanwei Zhu, Lingyu Zhu, Xiangjie Sui, Shiqi Wang
By applying these specific distortions to the query or test images, we ensure that the degraded images are recognized as poor quality while their semantics remain.
no code implementations • 19 Nov 2024 • Kecheng Chen, Pingping Zhang, Hui Liu, Jie Liu, Yibing Liu, Jiaxin Huang, Shiqi Wang, Hong Yan, Haoliang Li
We have recently witnessed that ``Intelligence" and `` Compression" are the two sides of the same coin, where the language large model (LLM) with unprecedented intelligence is a general-purpose lossless compressor for various data modalities.
no code implementations • 19 Oct 2024 • Bolin Chen, Yan Ye, Jie Chen, Ru-Ling Liao, Shanzhi Yin, Shiqi Wang, Kaifa Yang, Yue Li, Yiling Xu, Ye-kui Wang, Shiv Gehlot, Guan-Ming Su, Peng Yin, Sean McCarthy, Gary J. Sullivan
This paper proposes a Generative Face Video Compression (GFVC) approach using Supplemental Enhancement Information (SEI), where a series of compact spatial and temporal representations of a face video signal (i. e., 2D/3D key-points, facial semantics and compact features) can be coded using SEI message and inserted into the coded video bitstream.
no code implementations • 16 Oct 2024 • Kecheng Chen, Pingping Zhang, Tiexin Qin, Shiqi Wang, Hong Yan, Haoliang Li
Current test- or compression-time adaptation image compression (TTA-IC) approaches, which leverage both latent and decoder refinements as a two-step adaptation scheme, have potentially enhanced the rate-distortion (R-D) performance of learned image compression models on cross-domain compression tasks, \textit{e. g.,} from natural to screen content images.
no code implementations • 14 Oct 2024 • Shanzhi Yin, Bolin Chen, Shiqi Wang, Yan Ye
In this paper, we propose a novel Multi-granularity Temporal Trajectory Factorization framework for generative human video compression, which holds great potential for bandwidth-constrained human-centric video communication.
1 code implementation • 13 Oct 2024 • Shanzhi Yin, Zihan Zhang, Bolin Chen, Shiqi Wang, Yan Ye
This paper proposes to learn generative priors from the motion patterns instead of video contents for generative video compression.
1 code implementation • 11 Oct 2024 • Bolin Chen, Shanzhi Yin, Zihan Zhang, Jie Chen, Ru-Ling Liao, Lingyu Zhu, Shiqi Wang, Yan Ye
Recently, deep generative models have greatly advanced the progress of face video coding towards promising rate-distortion performance and diverse application functionalities.
no code implementations • 10 Oct 2024 • Jianxing Yu, Shiqi Wang, Han Yin, Zhenlong Sun, Ruobing Xie, Bo Zhang, Yanghui Rao
Considering these features are often mixed up with unknown biases, we then disentangle three kinds of latent factors from them, including the invariant factor that indicates intrinsic bait intention; the causal factor which reflects deceptive patterns in a certain scenario, and non-causal noise.
1 code implementation • 10 Oct 2024 • Qi Wang, Jindong Li, Shiqi Wang, Qianli Xing, Runliang Niu, He Kong, Rui Li, Guodong Long, Yi Chang, Chengqi Zhang
Large language models (LLMs) have not only revolutionized the field of natural language processing (NLP) but also have the potential to bring a paradigm shift in many other fields due to their remarkable abilities of language understanding, as well as impressive generalization capabilities and reasoning skills.
no code implementations • 4 Oct 2024 • Yifeng Ding, Hantian Ding, Shiqi Wang, Qing Sun, Varun Kumar, Zijian Wang
Moreover, model performance on FIM tasks deteriorates significantly without these unrealistic assumptions.
1 code implementation • 6 Sep 2024 • Hao Luo, Baoliang Chen, Lingyu Zhu, Peilin Chen, Shiqi Wang
Recent single image-based enhancement methods may not be able to provide consistently desirable restoration performance for all views due to the ignorance of potential feature correspondence among different views.
no code implementations • 26 Aug 2024 • Keyang Zhang, Chenqi Kong, Shiqi Wang, Anderson Rocha, Haoliang Li
Recent advances in AI-powered image editing tools have significantly lowered the barrier to image modification, raising pressing security concerns those related to spreading misinformation and disinformation on social platforms.
1 code implementation • 22 Aug 2024 • Lingyu Zhu, Wenhan Yang, Baoliang Chen, Hanwei Zhu, Zhangkai Ni, Qi Mao, Shiqi Wang
To address the above challenge, we propose the Unrolled Decomposed Unpaired Network (UDU-Net) for enhancing low-light videos by unrolling the optimization functions into a deep network to decompose the signal into spatial and temporal-related factors, which are updated iteratively.
no code implementations • 18 Aug 2024 • Shiqi Wang, Zhengze Zhang, Rui Zhao, Fei Tan, Cam Tu Nguyen
Experiments with 7B LLMs on the HH and TL;DR datasets substantiate the effectiveness of our method in both automatic metrics and human evaluation, thereby highlighting its potential for aligning LLMs with human intent and values
no code implementations • 16 Aug 2024 • Xihua Sheng, Li Li, Dong Liu, Shiqi Wang
In this paper, we introduce a bi-directional deep contextual video compression scheme tailored for B-frames, termed DCVC-B, to improve the compression performance of deep B-frame coding.
no code implementations • 15 Aug 2024 • Pingping Zhang, Jinlong Li, Meng Wang, Nicu Sebe, Sam Kwong, Shiqi Wang
Existing codecs are designed to eliminate intrinsic redundancies to create a compact representation for compression.
no code implementations • 24 Jul 2024 • Binzhe Li, Shurun Wang, Shiqi Wang, Yan Ye
In this paper, we pioneer to propose a variable bitrate image compression framework consisting of a pre-editing module and an end-to-end codec to achieve promising rate-accuracy performance for different LVLMs.
1 code implementation • 12 Jun 2024 • Juncheng Wu, Zhangkai Ni, Hanli Wang, Wenhan Yang, Yuyin Zhou, Shiqi Wang
In this paper, we present Deep Degradation Response (DDR), a method to quantify changes in image deep features under varying degradation conditions.
1 code implementation • 29 May 2024 • Zhangkai Ni, Yue Liu, Keyan Ding, Wenhan Yang, Hanli Wang, Shiqi Wang
To bridge these gaps, we propose integrating deep features from pre-trained visual models with a statistical analysis model into a Multi-scale Deep Feature Statistics (MDFS) model for achieving opinion-unaware BIQA (OU-BIQA), thereby eliminating the reliance on human rating data and significantly improving training efficiency.
1 code implementation • 29 May 2024 • Hanwei Zhu, HaoNing Wu, Yixuan Li, ZiCheng Zhang, Baoliang Chen, Lingyu Zhu, Yuming Fang, Guangtao Zhai, Weisi Lin, Shiqi Wang
Extensive experiments on nine IQA datasets validate that the Compare2Score effectively bridges text-defined comparative levels during training with converted single image quality score for inference, surpassing state-of-the-art IQA models across diverse scenarios.
no code implementations • 28 May 2024 • Nan Jiang, Xiaopeng Li, Shiqi Wang, Qiang Zhou, Soneya Binta Hossain, Baishakhi Ray, Varun Kumar, Xiaofei Ma, Anoop Deoras
We thus propose an automated pipeline to collect a high-quality dataset for code explanation and refinement by generating a number of explanations and refinement trajectories and filtering via execution verification.
no code implementations • 7 May 2024 • Peisong He, Leyao Zhu, Jiaxing Li, Shiqi Wang, Haoliang Li
The generative model has made significant advancements in the creation of realistic videos, which causes security issues.
no code implementations • 2 May 2024 • Chris Xing Tian, Yibing Liu, Haoliang Li, Ray C. C. Cheung, Shiqi Wang
However, FL also faces challenges such as high computational and communication costs regarding resource-constrained devices, and poor generalization performance due to the heterogeneity of data across edge clients and the presence of out-of-distribution data.
1 code implementation • 20 Apr 2024 • Yixuan Li, Xuelin Liu, Xiaoyang Wang, Bu Sung Lee, Shiqi Wang, Anderson Rocha, Weisi Lin
Meanwhile, large multimodal models (LMMs) have exhibited immense visual-text capabilities on various tasks, bringing the potential for explainable fake image detection.
no code implementations • 15 Apr 2024 • Xiangrui Liu, Xinju Wu, Pingping Zhang, Shiqi Wang, Zhu Li, Sam Kwong
Gaussian splatting, renowned for its exceptional rendering quality and efficiency, has emerged as a prominent technique in 3D scene representation.
1 code implementation • 12 Apr 2024 • Chenqi Kong, Anwei Luo, Peijun Bao, Yi Yu, Haoliang Li, Zengwei Zheng, Shiqi Wang, Alex C. Kot
Deepfakes have recently raised significant trust issues and security concerns among the public.
no code implementations • 11 Apr 2024 • Yuhao Zhang, Shiqi Wang, Haifeng Qian, Zijian Wang, Mingyue Shang, Linbo Liu, Sanjay Krishna Gouda, Baishakhi Ray, Murali Krishna Ramanathan, Xiaofei Ma, Anoop Deoras
Code generation models are not robust to small perturbations, which often lead to incorrect generations and significantly degrade the performance of these models.
1 code implementation • CVPR 2024 • Xiaoyun Zheng, Liwei Liao, Xufeng Li, Jianbo Jiao, Rongjie Wang, Feng Gao, Shiqi Wang, Ronggang Wang
To facilitate the development of these fields, in this paper, we present PKU-DyMVHumans, a versatile human-centric dataset for high-fidelity reconstruction and rendering of dynamic human scenarios from dense multi-view videos.
no code implementations • 13 Mar 2024 • Ben Athiwaratkun, Shiqi Wang, Mingyue Shang, Yuchen Tian, Zijian Wang, Sujan Kumar Gonugondla, Sanjay Krishna Gouda, Rob Kwiatowski, Ramesh Nallapati, Bing Xiang
Generative models, widely utilized in various applications, can often struggle with prompts corresponding to partial tokens.
no code implementations • 26 Feb 2024 • HaoNing Wu, Hanwei Zhu, ZiCheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Chunyi Li, Annan Wang, Wenxiu Sun, Qiong Yan, Xiaohong Liu, Guangtao Zhai, Shiqi Wang, Weisi Lin
Comparative settings (e. g. pairwise choice, listwise ranking) have been adopted by a wide range of subjective studies for image quality assessment (IQA), as it inherently standardizes the evaluation criteria across different observers and offer more clear-cut responses.
no code implementations • 26 Feb 2024 • Zetian Song, Wenhong Duan, Yuhuai Zhang, Shiqi Wang, Siwei Ma, Wen Gao
Representing the Neural Radiance Field (NeRF) with the explicit voxel grid (EVG) is a promising direction for improving NeRFs.
1 code implementation • 9 Feb 2024 • Runliang Niu, Jindong Li, Shiqi Wang, Yali Fu, Xiyu Hu, Xueyuan Leng, He Kong, Yi Chang, Qi Wang
Additionally, we construct the ScreenAgent Dataset, which collects screenshots and action sequences when completing a variety of daily computer tasks.
no code implementations • 3 Feb 2024 • Bolin Chen, Shanzhi Yin, Peilin Chen, Shiqi Wang, Yan Ye
Artificial Intelligence Generated Content (AIGC) is leading a new technical revolution for the acquisition of digital content and impelling the progress of visual compression towards competitive performance gains and diverse functionalities over traditional codecs.
1 code implementation • 2 Feb 2024 • Hanwei Zhu, Xiangjie Sui, Baoliang Chen, Xuelin Liu, Peilin Chen, Yuming Fang, Shiqi Wang
While abundant research has been conducted on improving high-level visual understanding and reasoning capabilities of large multimodal models~(LMMs), their visual quality assessment~(IQA) ability has been relatively under-explored.
no code implementations • 31 Jan 2024 • Gabriel Ryan, Siddhartha Jain, Mingyue Shang, Shiqi Wang, Xiaofei Ma, Murali Krishna Ramanathan, Baishakhi Ray
Recent works using large language models (LLMs) for test generation have focused on improving generation quality through optimizing the test generation context and correcting errors in model outputs, but use fixed prompting strategies that prompt the model to generate tests without additional guidance.
no code implementations • 16 Jan 2024 • Yixuan Li, Peilin Chen, Hanwei Zhu, Keyan Ding, Leida Li, Shiqi Wang
The perceptual quality is quantified by the variant Mahalanobis Distance between the inner and outer Shape-Texture Statistics (DSTS), wherein the inner and outer statistics respectively describe the quality fingerprints of the distorted image and natural images.
1 code implementation • CVPR 2024 • Fu-Zhao Ou, Chongyi Li, Shiqi Wang, Sam Kwong
Furthermore to alleviate the issue of the model placing excessive trust in inaccurate quality anchors we propose a confidence calibration method to correct the quality distribution by exploiting to the fullest extent of these objective quality factors characterized as the merged-factor distribution during training.
1 code implementation • 30 Dec 2023 • Shiqi Wang, Yeqin Zhang, Cam-Tu Nguyen
Hard negative sampling, which is commonly used to improve contrastive learning, can introduce more noise in training.
no code implementations • 25 Dec 2023 • Qi Mao, Chongyu Wang, Meng Wang, Shiqi Wang, Ruijie Chen, Libiao Jin, Siwei Ma
The accelerated proliferation of visual content and the rapid development of machine vision technologies bring significant challenges in delivering visual data on a gigantic scale, which shall be effectively represented to satisfy both human and machine requirements.
1 code implementation • 5 Nov 2023 • Bolin Chen, Jie Chen, Shiqi Wang, Yan Ye
Generative Face Video Coding (GFVC) techniques can exploit the compact representation of facial priors and the strong inference capability of deep generative models, achieving high-quality face video communication in ultra-low bandwidth scenarios.
no code implementations • 30 Sep 2023 • Chenqi Kong, Anwei Luo, Shiqi Wang, Haoliang Li, Anderson Rocha, Alex C. Kot
Digital image forensics plays a crucial role in image authentication and manipulation localization.
no code implementations • 24 Sep 2023 • Binzhe Li, Bolin Chen, Zhao Wang, Shiqi Wang, Yan Ye
In this letter, we envision a new metaverse communication paradigm for virtual avatar faces, and develop the semantic face compression with compact 3D facial descriptors.
1 code implementation • 7 Sep 2023 • Xiangjie Sui, Hanwei Zhu, Xuelin Liu, Yuming Fang, Shiqi Wang, Zhou Wang
To address these issues, we introduce a unique generative scanpath representation (GSR) for effective quality inference of 360$^\circ$ images, which aggregates varied perceptual experiences of multi-hypothesis users under a predefined viewing condition.
no code implementations • 17 Jul 2023 • Qi Mao, Tinghan Yang, Yinuo Zhang, Zijian Wang, Meng Wang, Shiqi Wang, Siwei Ma
Remarkably, even with the loss of up to $20\%$ of indices, the images can be effectively restored with minimal perceptual loss.
2 code implementations • 10 Jul 2023 • Chongming Gao, Kexin Huang, Jiawei Chen, Yuan Zhang, Biao Li, Peng Jiang, Shiqi Wang, Zhong Zhang, Xiangnan He
Through theoretical analyses, we find that the conservatism of existing methods fails in pursuing users' long-term satisfaction.
1 code implementation • IEEE Transactions on Multimedia 2023 • Dongjie Ye, Zhangkai Ni, Wenhan Yang, Hanli Wang, Shiqi Wang, Sam Kwong
Benefiting from the learned memory, more complex distributions of reference images in the entire dataset can be “remembered” to facilitate the adjustment of the testing samples more adaptively.
Ranked #2 on Low-Light Image Enhancement on LOL-v2
2 code implementations • 3 Jul 2023 • Jinhao Duan, Hao Cheng, Shiqi Wang, Alex Zavalny, Chenan Wang, Renjing Xu, Bhavya Kailkhura, Kaidi Xu
Large Language Models (LLMs) show promising results in language generation and instruction following but frequently "hallucinate", making their outputs less reliable.
no code implementations • 9 Jun 2023 • João Phillipe Cardenuto, Jing Yang, Rafael Padilha, Renjie Wan, Daniel Moreira, Haoliang Li, Shiqi Wang, Fernanda Andaló, Sébastien Marcel, Anderson Rocha
Synthetic realities are digital creations or augmentations that are contextually generated through the use of Artificial Intelligence (AI) methods, leveraging extensive amounts of data to construct new narratives or realities, regardless of the intent to deceive.
1 code implementation • 5 Jun 2023 • Yibing Liu, Chris Xing Tian, Haoliang Li, Lei Ma, Shiqi Wang
The out-of-distribution (OOD) problem generally arises when neural networks encounter data that significantly deviates from the training data distribution, i. e., in-distribution (InD).
no code implementations • 2 May 2023 • Xinju Wu, Pingping Zhang, Meng Wang, Peilin Chen, Shiqi Wang, Sam Kwong
The emergence of digital avatars has raised an exponential increase in the demand for human point clouds with realistic and intricate details.
1 code implementation • CVPR 2023 • Jiechong Song, Chong Mou, Shiqi Wang, Siwei Ma, Jian Zhang
And, PGCA block achieves an enhanced information interaction, which introduces the inertia force into the gradient descent step through a cross attention block.
1 code implementation • 14 Apr 2023 • Yixuan Li, Bolin Chen, Baoliang Chen, Meng Wang, Shiqi Wang, Weisi Lin
In this paper, we introduce the large-scale Compressed Face Video Quality Assessment (CFVQA) database, which is the first attempt to systematically understand the perceptual quality and diversified compression distortions in face videos.
no code implementations • 9 Mar 2023 • Xiaokai Wei, Sujan Gonugondla, Wasi Ahmad, Shiqi Wang, Baishakhi Ray, Haifeng Qian, Xiaopeng Li, Varun Kumar, Zijian Wang, Yuchen Tian, Qing Sun, Ben Athiwaratkun, Mingyue Shang, Murali Krishna Ramanathan, Parminder Bhatia, Bing Xiang
Such large models incur significant resource usage (in terms of memory, latency, and dollars) as well as carbon footprint.
no code implementations • 2 Mar 2023 • Chenqi Kong, Haoliang Li, Shiqi Wang
Nowadays, forgery faces pose pressing security concerns over fake news, fraud, impersonation, etc.
1 code implementation • 22 Feb 2023 • Baoliang Chen, Lingyu Zhu, Hanwei Zhu, Wenhan Yang, Linqi Song, Shiqi Wang
Subsequently, we propose an objective quality assessment measure that plays a critical role in bridging the gap between visual quality and enhancement.
1 code implementation • 20 Feb 2023 • Bolin Chen, Zhao Wang, Binzhe Li, Shurun Wang, Shiqi Wang, Yan Ye
In this paper, we propose a novel framework for Interactive Face Video Coding (IFVC), which allows humans to interact with the intrinsic visual representations instead of the signals.
2 code implementations • 2 Feb 2023 • Jinhao Duan, Fei Kong, Shiqi Wang, Xiaoshuang Shi, Kaidi Xu
In this paper, we investigate the vulnerability of diffusion models to Membership Inference Attacks (MIAs), a common privacy concern.
1 code implementation • 30 Jan 2023 • Chenqi Kong, Kexin Zheng, Yibing Liu, Shiqi Wang, Anderson Rocha, Haoliang Li
Face presentation attacks (FPA), also known as face spoofing, have brought increasing concerns to the public through various malicious applications, such as financial fraud and privacy leakage.
1 code implementation • ICCV 2023 • Fu-Zhao Ou, Baoliang Chen, Chongyi Li, Shiqi Wang, Sam Kwong
Furthermore, we design an easy-to-hard training scheduler based on the inter-domain uncertainty and intra-domain quality margin as well as the ranking-based domain adversarial network to enhance the effectiveness of transfer learning and further reduce the source risk in domain adaptation.
1 code implementation • CVPR 2023 • Xiangjie Sui, Yuming Fang, Hanwei Zhu, Shiqi Wang, Zhou Wang
Scanpath prediction for 360deg images aims to produce dynamic gaze behaviors based on the human visual perception mechanism.
2 code implementations • 20 Dec 2022 • Shiqi Wang, Zheng Li, Haifeng Qian, Chenghao Yang, Zijian Wang, Mingyue Shang, Varun Kumar, Samson Tan, Baishakhi Ray, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Dan Roth, Bing Xiang
Most existing works on robustness in text or code tasks have focused on classification, while robustness in generation tasks is an uncharted area and to date there is no comprehensive benchmark for robustness in code generation.
1 code implementation • 30 Nov 2022 • Jiaxing Li, Chenqi Kong, Shiqi Wang, Haoliang Li
The image recapture attack is an effective image manipulation method to erase certain forensic traces, and when targeting on personal document images, it poses a great threat to the security of e-commerce and other web applications.
no code implementations • 13 Nov 2022 • Yibing Liu, Chris Xing Tian, Haoliang Li, Shiqi Wang
Specifically, by treating feature elements as neuron activation states, we show that conventional alignment methods tend to deteriorate the diversity of learned invariant features, as they indiscriminately minimize all neuron activation differences.
1 code implementation • 9 Nov 2022 • Hanwei Zhu, Baoliang Chen, Lingyu Zhu, Shiqi Wang, Weisi Lin
ImageNet pre-trained deep neural networks (DNNs) show notable transferability for building effective image quality assessment (IQA) models.
2 code implementations • 26 Oct 2022 • Ben Athiwaratkun, Sanjay Krishna Gouda, Zijian Wang, Xiaopeng Li, Yuchen Tian, Ming Tan, Wasi Uddin Ahmad, Shiqi Wang, Qing Sun, Mingyue Shang, Sujan Kumar Gonugondla, Hantian Ding, Varun Kumar, Nathan Fulton, Arash Farahani, Siddhartha Jain, Robert Giaquinto, Haifeng Qian, Murali Krishna Ramanathan, Ramesh Nallapati, Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, Bing Xiang
Using these benchmarks, we are able to assess the performance of code generation models in a multi-lingual fashion, and discovered generalization ability of language models on out-of-domain languages, advantages of multi-lingual models over mono-lingual, the ability of few-shot prompting to teach the model new languages, and zero-shot translation abilities even on mono-lingual settings.
no code implementations • 29 Sep 2022 • Chenqi Kong, Shiqi Wang, Haoliang Li
With the rapid progress over the past five years, face authentication has become the most pervasive biometric recognition method.
1 code implementation • 21 Sep 2022 • Zhaopeng Feng, Keyang Zhang, Shuyue Jia, Baoliang Chen, Shiqi Wang
Deep learning based image quality assessment (IQA) models usually learn to predict image quality from a single dataset, leading the model to overfit specific scenes.
no code implementations • 13 Sep 2022 • Yu Tian, Zhangkai Ni, Baoliang Chen, Shurun Wang, Shiqi Wang, Hanli Wang, Sam Kwong
In particular, in order to maximum redundancy removal without impairment of robust identity information, we apply the encoder with multiple feature extraction and attention-based feature decomposition modules to progressively decompose face features into two uncorrelated components, i. e., identity and residual features, via self-supervised learning.
1 code implementation • 12 Sep 2022 • Baoliang Chen, Hanwei Zhu, Lingyu Zhu, Shiqi Wang, Sam Kwong
The underlying mechanism of the proposed approach is based upon the mild assumption that the SCIs, which are not physically acquired, still obey certain statistics that could be understood in a learning fashion.
3 code implementations • 7 Sep 2022 • Runmin Cong, Qi Qin, Chen Zhang, Qiuping Jiang, Shiqi Wang, Yao Zhao, Sam Kwong
In this paper, we focus on a new weakly-supervised SOD task under hybrid labels, where the supervision labels include a large number of coarse labels generated by the traditional unsupervised method and a small number of real labels.
Ranked #6 on RGB Salient Object Detection on PASCAL-S
1 code implementation • 6 Sep 2022 • Yue Liu, Zhangkai Ni, Shiqi Wang, Hanli Wang, Sam Kwong
In this paper, a novel and effective image quality assessment (IQA) algorithm based on frequency disparity for high dynamic range (HDR) images is proposed, termed as local-global frequency feature-based model (LGFM).
3 code implementations • 11 Aug 2022 • huan zhang, Shiqi Wang, Kaidi Xu, Linyi Li, Bo Li, Suman Jana, Cho-Jui Hsieh, J. Zico Kolter
Our generalized bound propagation method, GCP-CROWN, opens up the opportunity to apply general cutting plane methods for neural network verification while benefiting from the efficiency and GPU acceleration of bound propagation methods.
1 code implementation • 5 Aug 2022 • Xigran Liao, Baoliang Chen, Hanwei Zhu, Shiqi Wang, Mingliang Zhou, Sam Kwong
Existing deep learning-based full-reference IQA (FR-IQA) models usually predict the image quality in a deterministic way by explicitly comparing the features, gauging how severely distorted an image is by how far the corresponding feature lies from the space of the reference images.
no code implementations • 3 Jul 2022 • Zhangkai Ni, Wenhan Yang, Hanli Wang, Shiqi Wang, Lin Ma, Sam Kwong
Getting rid of the fundamental limitations in fitting to the paired training data, recent unsupervised low-light enhancement methods excel in adjusting illumination and contrast of images.
1 code implementation • 16 May 2022 • Tiexin Qin, Shiqi Wang, Haoliang Li
Domain generalization aims to improve the generalization capability of machine learning systems to out-of-distribution (OOD) data.
1 code implementation • 4 Apr 2022 • Chongming Gao, Shiqi Wang, Shijun Li, Jiawei Chen, Xiangnan He, Wenqiang Lei, Biao Li, Yuan Zhang, Peng Jiang
The basic idea is to first learn a causal user model on historical data to capture the overexposure effect of items on user satisfaction.
no code implementations • 20 Feb 2022 • Baoliang Chen, Lingyu Zhu, Hanwei Zhu, Wenhan Yang, Fangbo Lu, Shiqi Wang
In particular, we create a large-scale database for QUality assessment Of The Enhanced LOw-Light Image (QUOTE-LOL), which serves as the foundation in studying and developing objective quality assessment measures.
no code implementations • 20 Feb 2022 • Pingping Zhang, Xu Wang, Linwei Zhu, Yun Zhang, Shiqi Wang, Sam Kwong
In this paper, we propose a distortion-aware loop filtering model to improve the performance of intra coding for 360$^o$ videos projected via equirectangular projection (ERP) format.
no code implementations • 19 Feb 2022 • Shiqi Wang, Chongming Gao, Min Gao, Junliang Yu, Zongwei Wang, Hongzhi Yin
By providing users with opportunities to experience goods without charge, a free trial makes adopters know more about products and thus encourages their willingness to buy.
1 code implementation • 28 Jan 2022 • Yibing Liu, Haoliang Li, Yangyang Guo, Chenqi Kong, Jing Li, Shiqi Wang
Attention mechanisms are dominating the explainability of deep models.
no code implementations • 28 Jan 2022 • Yu Tian, Zhangkai Ni, Baoliang Chen, Shiqi Wang, Hanli Wang, Sam Kwong
However, little work has been dedicated to automatic quality assessment of such GAN-generated face images (GFIs), even less have been devoted to generalized and robust quality assessment of GFIs generated with unseen GAN model.
1 code implementation • 31 Dec 2021 • Dongjie Ye, Zhangkai Ni, Hanli Wang, Jian Zhang, Shiqi Wang, Sam Kwong
The proposed approach is an end-to-end compressive image sensing method, composed of adaptive sampling and recovery.
no code implementations • 16 Oct 2021 • Keyan Ding, Yi Liu, Xueyi Zou, Shiqi Wang, Kede Ma
The latest advances in full-reference image quality assessment (IQA) involve unifying structure and texture similarity based on deep representations.
Ranked #34 on Video Quality Assessment on MSU SR-QA Dataset
Full-Reference Image Quality Assessment Image Super-Resolution +1
no code implementations • 29 Sep 2021 • huan zhang, Shiqi Wang, Kaidi Xu, Yihan Wang, Suman Jana, Cho-Jui Hsieh, J Zico Kolter
In this work, we formulate an adversarial attack using a branch-and-bound (BaB) procedure on ReLU neural networks and search adversarial examples in the activation space corresponding to binary variables in a mixed integer programming (MIP) formulation.
1 code implementation • 11 Aug 2021 • Qiuping Jiang, Zhentao Liu, Shiqi Wang, Feng Shao, Weisi Lin
Instead of explicitly formulating and fusing different masking effects in a bottom-up way, the proposed JND estimation model dedicates to first predicting a critical perceptual lossless (CPL) counterpart of the original image and then calculating the difference map between the original image and the predicted CPL image as the JND map.
1 code implementation • 9 Aug 2021 • Baoliang Chen, Lingyu Zhu, Chenqi Kong, Hanwei Zhu, Shiqi Wang, Zhu Li
In this paper, we propose a no-reference (NR) image quality assessment (IQA) method via feature level pseudo-reference (PR) hallucination.
1 code implementation • 13 Jul 2021 • Chenqi Kong, Baoliang Chen, Haoliang Li, Shiqi Wang, Anderson Rocha, Sam Kwong
The technological advancements of deep learning have enabled sophisticated face manipulation schemes, raising severe trust issues and security concerns in modern society.
no code implementations • 1 Jul 2021 • Shurun Wang, Zhao Wang, Shiqi Wang, Yan Ye
In this paper, we show that the design and optimization of network architecture could be further improved for compression towards machine vision.
no code implementations • NeurIPS 2021 • Shiqi Wang, huan zhang, Kaidi Xu, Xue Lin, Suman Jana, Cho-Jui Hsieh, J Zico Kolter
We develop $\beta$-CROWN, a new bound propagation based method that can fully encode neuron split constraints in branch-and-bound (BaB) based complete verification via optimizable parameters $\beta$.
1 code implementation • 24 May 2021 • Yizheng Chen, Shiqi Wang, Yue Qin, Xiaojing Liao, Suman Jana, David Wagner
Since data distribution shift is very common in security applications, e. g., often observed for malware detection, local robustness cannot guarantee that the property holds for unseen inputs at the time of deploying the classifier.
no code implementations • 14 May 2021 • Chris Xing Tian, Haoliang Li, YuFei Wang, Shiqi Wang
However, due to the issue of limited dataset availability and the strict legal and ethical requirements for patient privacy protection, the broad applications of medical imaging classification driven by DNN with large-scale training data have been largely hindered.
5 code implementations • NeurIPS 2021 • Shiqi Wang, huan zhang, Kaidi Xu, Xue Lin, Suman Jana, Cho-Jui Hsieh, J. Zico Kolter
Compared to the typically tightest but very costly semidefinite programming (SDP) based incomplete verifiers, we obtain higher verified accuracy with three orders of magnitudes less verification time.
no code implementations • 27 Feb 2021 • Chris Xing Tian, Haoliang Li, Xiaofei Xie, Yang Liu, Shiqi Wang
More specifically, by treating the DNN as a program and each neuron as a functional point of the code, during the network training we aim to improve the generalization capability by maximizing the neuron coverage of DNN with the gradient similarity regularization between the original and augmented samples.
no code implementations • 25 Jan 2021 • Baoliang Chen, Wenhan Yang, Haoliang Li, Shiqi Wang, Sam Kwong
The first branch aims to learn the camera invariant spoofing features via feature level decomposition in the high frequency domain.
1 code implementation • 30 Dec 2020 • Zhangkai Ni, Wenhan Yang, Shiqi Wang, Lin Ma, Sam Kwong
In this paper, we present an unsupervised image enhancement generative adversarial network (UEGAN), which learns the corresponding image-to-image mapping from a set of images with desired characteristics in an unsupervised manner, rather than learning on a large number of paired images.
no code implementations • 30 Dec 2020 • Zhangkai Ni, Wenhan Yang, Shiqi Wang, Lin Ma, Sam Kwong
The key novelty of the proposed QAGAN lies in the injected QAM for the generator such that it learns domain-relevant quality attention directly from the two domains.
1 code implementation • 27 Dec 2020 • Baoliang Chen, Lingyu Zhu, Guo Li, Hongfei Fan, Shiqi Wang
In this work, we propose a no-reference video quality assessment method, aiming to achieve high-generalization capability in cross-content, -resolution and -frame rate quality prediction.
no code implementations • 14 Dec 2020 • Shiqi Wang, Kevin Eykholt, Taesung Lee, Jiyong Jang, Ian Molloy
On CIFAR10, a non-robust LeNet model has a 21. 63% error rate, while a model created using verifiable training and a L-infinity robustness criterion of 8/255, has an error rate of 57. 10%.
4 code implementations • ICLR 2021 • Kaidi Xu, huan zhang, Shiqi Wang, Yihan Wang, Suman Jana, Xue Lin, Cho-Jui Hsieh
Formal verification of neural networks (NNs) is a challenging and important problem.
2 code implementations • 10 Nov 2020 • Jianhui Chang, Zhenghui Zhao, Chuanmin Jia, Shiqi Wang, Lingbo Yang, Qi Mao, Jian Zhang, Siwei Ma
To this end, we propose a novel conceptual compression framework that encodes visual data into compact structure and texture representations, then decodes in a deep synthesis fashion, aiming to achieve better visual reconstruction quality, flexible content manipulation, and potential support for various vision tasks.
1 code implementation • NeurIPS 2020 • Haoliang Li, YuFei Wang, Renjie Wan, Shiqi Wang, Tie-Qiang Li, Alex C. Kot
Recently, we have witnessed great progress in the field of medical imaging classification by adopting deep neural networks.
no code implementations • 16 Sep 2020 • Rizhao Cai, Haoliang Li, Shiqi Wang, Changsheng chen, Alex ChiChung Kot
Inspired by the philosophy employed by human beings to determine whether a presented face example is genuine or not, i. e., to glance at the example globally first and then carefully observe the local regions to gain more discriminative information, for the face anti-spoofing problem, we propose a novel framework based on the Convolutional Neural Network (CNN) and the Recurrent Neural Network (RNN).
no code implementations • 15 Sep 2020 • Haoliang Li, Yufei Wang, Xiaofei Xie, Yang Liu, Shiqi Wang, Renjie Wan, Lap-Pui Chau, Alex C. Kot
In this paper, we propose a novel black-box backdoor attack technique on face recognition systems, which can be conducted without the knowledge of the targeted DNN model.
no code implementations • 19 Aug 2020 • Baoliang Chen, Haoliang Li, Hongfei Fan, Shiqi Wang
Here, we develop the first unsupervised domain adaptation based no reference quality assessment method for SCIs, leveraging rich subjective ratings of the natural images (NIs).
no code implementations • 13 Aug 2020 • Rongqun Lin, Linwei Zhu, Shiqi Wang, Sam Kwong
Compactly representing the visual signals is of fundamental importance in various image/video-centered applications.
no code implementations • 4 Jun 2020 • Bai Li, Shiqi Wang, Suman Jana, Lawrence Carin
Current neural-network-based classifiers are susceptible to adversarial examples.
1 code implementation • 20 May 2020 • Yuqing Liu, Shiqi Wang, Jian Zhang, Shanshe Wang, Siwei Ma, Wen Gao
A novel iterative super-resolution network (ISRN) is proposed on top of the iterative optimization.
1 code implementation • 4 May 2020 • Keyan Ding, Kede Ma, Shiqi Wang, Eero P. Simoncelli
The performance of objective image quality assessment (IQA) models has been evaluated primarily by comparing model predictions to human quality judgments.
no code implementations • 21 Apr 2020 • Shurun Wang, Shiqi Wang, Wenhan Yang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In particular, we study the feature and texture compression in a scalable coding framework, where the base layer serves as the deep learning feature and enhancement layer targets to perfectly reconstruct the texture.
2 code implementations • 16 Apr 2020 • Keyan Ding, Kede Ma, Shiqi Wang, Eero P. Simoncelli
Objective measures of image quality generally operate by comparing pixels of a "degraded" image to those of the original.
Ranked #31 on Video Quality Assessment on MSU SR-QA Dataset
1 code implementation • 6 Mar 2020 • Bai Li, Shiqi Wang, Yunhan Jia, Yantao Lu, Zhenyu Zhong, Lawrence Carin, Suman Jana
Recent research has proposed the lottery ticket hypothesis, suggesting that for a deep neural network, there exist trainable sub-networks performing equally or better than the original model with commensurate training steps.
4 code implementations • NeurIPS 2020 • Vikash Sehwag, Shiqi Wang, Prateek Mittal, Suman Jana
We demonstrate that our approach, titled HYDRA, achieves compressed networks with state-of-the-art benign and robust accuracy, simultaneously.
1 code implementation • NeurIPS 2020 • Jiahao Su, Shiqi Wang, Furong Huang
In this work, we propose to replace any traditional convolutional layer with an autoregressive moving-average (ARMA) layer, a novel module with an adjustable receptive field controlled by the learnable autoregressive coefficients.
no code implementations • 10 Feb 2020 • Shurun Wang, Wenhan Yang, Shiqi Wang
In this paper, we propose a novel end-to-end feature compression scheme by leveraging the representation and learning capability of deep neural networks, towards intelligent front-end equipped analysis with promising accuracy and efficiency.
no code implementations • 16 Dec 2019 • Wenhan Yang, Robby T. Tan, Shiqi Wang, Yuming Fang, Jiaying Liu
The goal of single-image deraining is to restore the rain-free background scenes of an image degraded by rain streaks and rain accumulation.
2 code implementations • 3 Dec 2019 • Yizheng Chen, Shiqi Wang, Weifan Jiang, Asaf Cidon, Suman Jana
There are various costs for attackers to manipulate the features of security classifiers.
1 code implementation • 31 Jul 2019 • Yihang Lou, Ling-Yu Duan, Yong Luo, Ziqian Chen, Tongliang Liu, Shiqi Wang, Wen Gao
The digital retina in smart cities is to select what the City Eye tells the City Brain, and convert the acquired visual data from front-end visual sensors to features in an intelligent sensing manner.
1 code implementation • 3 Jul 2019 • Keyan Ding, Kede Ma, Shiqi Wang
The goal of research in automatic image popularity assessment (IPA) is to develop computational models that can accurately predict the potential of a social image to go viral on the Internet.
no code implementations • 14 Jun 2019 • Vikash Sehwag, Shiqi Wang, Prateek Mittal, Suman Jana
In this work, we rigorously study the extension of network pruning strategies to preserve both benign accuracy and robustness of a network.
no code implementations • 5 Jun 2019 • Shiqi Wang, Yizheng Chen, Ahmed Abdou, Suman Jana
In this paper, we present interval attacks, a new technique to find adversarial examples to evaluate the robustness of neural networks.
no code implementations • 3 Jun 2019 • Junlong Gao, Xi Meng, Shiqi Wang, Xia Li, Shanshe Wang, Siwei Ma, Wen Gao
Existing captioning models often adopt the encoder-decoder architecture, where the decoder uses autoregressive decoding to generate captions, such that each token is generated sequentially given the preceding generated tokens.
no code implementations • 16 Apr 2019 • Minghao Yin, Xiu Li, Yongbing Zhang, Shiqi Wang
In this paper, we aim to understand Residual Network (ResNet) in a scientifically sound way by providing a bridge between ResNet and Feynman path integral.
no code implementations • CVPR 2019 • Junlong Gao, Shiqi Wang, Shanshe Wang, Siwei Ma, Wen Gao
Existing methods for image captioning are usually trained by cross entropy loss, which leads to exposure bias and the inconsistency between the optimizing function and evaluation metrics.
no code implementations • 7 Apr 2019 • Siwei Ma, Xinfeng Zhang, Chuanmin Jia, Zhenghui Zhao, Shiqi Wang, Shanshe Wang
Deep convolution neural network (CNN) which makes the neural network resurge in recent years and has achieved great success in both artificial intelligent and signal processing fields, also provides a novel and promising solution for image and video compression.
1 code implementation • 6 Apr 2019 • Yizheng Chen, Shiqi Wang, Dongdong She, Suman Jana
A practically useful malware classifier must be robust against evasion attacks.
no code implementations • 14 Mar 2019 • Shurun Wang, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In this paper, we propose a scalable image compression scheme, including the base layer for feature representation and enhancement layer for texture representation.
1 code implementation • 6 Nov 2018 • Shiqi Wang, Yizheng Chen, Ahmed Abdou, Suman Jana
Making neural networks robust against adversarial inputs has resulted in an arms race between new defenses and attacks.
2 code implementations • NeurIPS 2018 • Shiqi Wang, Kexin Pei, Justin Whitehouse, Junfeng Yang, Suman Jana
Our approach can check different safety properties and find concrete counterexamples for networks that are 10$\times$ larger than the ones supported by existing analysis techniques.
no code implementations • 17 Sep 2018 • Zhuo Chen, Weisi Lin, Shiqi Wang, Ling-Yu Duan, Alex C. Kot
The recent advances of hardware technology have made the intelligent analysis equipped at the front-end with deep learning more prevailing and practical.
no code implementations • CVPR 2018 • Haoliang Li, Sinno Jialin Pan, Shiqi Wang, Alex C. Kot
In this paper, we tackle the problem of domain generalization: how to learn a generalized feature representation for an âunseenâ target domain by taking the advantage of multiple seen source-domain data.
Ranked #58 on Domain Generalization on PACS
3 code implementations • 28 Apr 2018 • Shiqi Wang, Kexin Pei, Justin Whitehouse, Junfeng Yang, Suman Jana
In this paper, we present a new direction for formally checking security properties of DNNs without using SMT solvers.
no code implementations • 5 Dec 2017 • Ling-Yu Duan, Yihang Lou, Shiqi Wang, Wen Gao, Yong Rui
To practically facilitate deep neural network models in the large-scale video analysis, there are still unprecedented challenges for the large-scale video data management.
no code implementations • 25 Sep 2017 • Chuanmin Jia, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma
Deep learning has demonstrated tremendous break through in the area of image/video processing.
Multimedia
3 code implementations • 13 Aug 2017 • Zhuo Chen, Weisi Lin, Shiqi Wang, Long Xu, Leida Li
For many computer vision problems, the deep neural networks are trained and validated based on the assumption that the input images are pristine (i. e., artifact-free).
no code implementations • 26 Apr 2017 • Ling-Yu Duan, Vijay Chandrasekhar, Shiqi Wang, Yihang Lou, Jie Lin, Yan Bai, Tiejun Huang, Alex ChiChung Kot, Wen Gao
This paper provides an overview of the on-going compact descriptors for video analysis standard (CDVA) from the ISO/IEC moving pictures experts group (MPEG).
no code implementations • 1 Mar 2017 • Feng Gao, Yihang Lou, Yan Bai, Shiqi Wang, Tiejun Huang, Ling-Yu Duan
Object detection aims to identify instances of semantic objects of a certain class in images or videos.
no code implementations • 1 Mar 2017 • Yan Bai, Feng Gao, Yihang Lou, Shiqi Wang, Tiejun Huang, Ling-Yu Duan
In this paper, we propose to leverage intra-class variance in metric learning of triplet network to improve the performance of fine-grained recognition.
no code implementations • 17 Aug 2016 • Xiang Zhang, Jiarui Sun, Siwei Ma, Zhouchen Lin, Jian Zhang, Shiqi Wang, Wen Gao
Therefore, introducing an accurate rate-constraint in sparse coding and dictionary learning becomes meaningful, which has not been fully exploited in the context of sparse representation.