Search Results for author: Jiayu Yang

Found 29 papers, 13 papers with code

MLICv2: Enhanced Multi-Reference Entropy Modeling for Learned Image Compression

no code implementations 27 Apr 2025 Wei Jiang, Yongqi Zhai, Jiayu Yang, Feng Gao, Ronggang Wang

In this paper, we present MLICv2 and MLICv2$^+$, enhanced versions of the MLIC series, featuring improved transform techniques, entropy modeling, and instance adaptability.

Computational Efficiency Image Compression

Enhancing 3D Gaussian Splatting Compression via Spatial Condition-based Prediction

no code implementations 30 Mar 2025 Jingui Ma, Yang Hu, Luyang Tang, Jiayu Yang, Yongqi Zhai, Ronggang Wang

Specifically, we propose a spatial condition-based prediction module to utilize the grid-captured scene information for prediction, with a residual compensation strategy designed to learn the missing fine-grained information.

3DGS Novel View Synthesis +2

Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting

1 code implementation 21 Mar 2025 Jinbo Yan, Rui Peng, Zhiyan Wang, Luyang Tang, Jiayu Yang, Jie Liang, Jiahao Wu, Ronggang Wang

Building Free-Viewpoint Videos in a streaming manner offers the advantage of rapid responsiveness compared to offline training methods, greatly enhancing user experience.

CDI3D: Cross-guided Dense-view Interpolation for 3D Reconstruction

no code implementations 11 Mar 2025 Zhiyuan Wu, Xibin Song, Senbo Wang, Weizhe Liu, Jiayu Yang, Ziang Cheng, Shenzhou Chen, Taizhang Shang, Weixuan Sun, Shan Luo, Pan Ji

However, challenges remain as 2D diffusion models often struggle to produce dense images with strong multi-view consistency, and LRMs tend to amplify these inconsistencies during the 3D reconstruction process.

3D Generation 3D Object Reconstruction +2

Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture Generation

1 code implementation 20 Feb 2025 Jiayu Yang, Taizhang Shang, Weixuan Sun, Xibin Song, Ziang Cheng, Senbo Wang, Shenzhou Chen, Weizhe Liu, Hongdong Li, Pan Ji

This report presents a comprehensive framework for generating high-quality 3D shapes and textures from diverse input prompts, including single images, multi-view images, and text descriptions.

3D Shape Generation Texture Synthesis

Hybrid Local-Global Context Learning for Neural Video Compression

no code implementations 30 Nov 2024 Yongqi Zhai, Jiayu Yang, Wei Jiang, Chunhui Yang, Luyang Tang, Ronggang Wang

In this paper, we propose a hybrid context generation module, which combines the advantages of the above methods in an optimal way and achieves accurate compensation at a low bit cost.

Motion Compensation Motion Estimation +2

Learning New Concepts, Remembering the Old: A Novel Continual Learning

no code implementations 25 Nov 2024 Songning Lai, Mingqian Liao, Zhangyi Hu, Jiayu Yang, Wenshuo Chen, Yutao Yue

Concept Bottleneck Models (CBMs) enhance model interpretability by introducing human-understandable concepts within the architecture.

Continual Learning Incremental Learning

Maintaining Informative Coherence: Migrating Hallucinations in Large Language Models via Absorbing Markov Chains

no code implementations 27 Oct 2024 Jiemin Wu, Songning Lai, Ruiqiang Xiao, Tianlang Xue, Jiayu Yang, Yutao Yue

Large Language Models (LLMs) are powerful tools for text generation, translation, and summarization, but they often suffer from hallucinations: instances where they fail to maintain the fidelity and coherence of contextual information during decoding, sometimes overlooking critical details due to their sampling strategies and inherent biases from training data and fine-tuning discrepancies.

Text Generation TruthfulQA

CAT: Concept-level backdoor ATtacks for Concept Bottleneck Models

no code implementations 7 Oct 2024 Songning Lai, Jiayu Yang, Yu Huang, Lijie Hu, Tianlang Xue, Zhangyi Hu, Jiaxu Li, Haicheng Liao, Yutao Yue

Despite the transformative impact of deep learning across multiple domains, the inherent opacity of these models has driven the development of Explainable Artificial Intelligence (XAI).

Backdoor Attack Explainable artificial intelligence +1

TimeSieve: Extracting Temporal Dynamics through Information Bottlenecks

1 code implementation 7 Jun 2024 Ninghui Feng, Songning Lai, Jiayu Yang, Fobao Zhou, Zhenxiao Yin, Hang Zhao

Our results validate the effectiveness of our approach in addressing the key challenges in time series forecasting, paving the way for more reliable and efficient predictive models in practical applications.

Financial Analysis Time Series +1

UCVC: A Unified Contextual Video Compression Framework with Joint P-frame and B-frame Coding

no code implementations 2 Feb 2024 Jiayu Yang, Wei Jiang, Yongqi Zhai, Chunhui Yang, Ronggang Wang

This paper presents a learned video compression method in response to the video compression track of the 6th Challenge on Learned Image Compression (CLIC) at DCC 2024. Specifically, we propose a unified contextual video compression framework (UCVC) for joint P-frame and B-frame coding.

Image Compression Video Compression

ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion

1 code implementation CVPR 2024 Jiayu Yang, Ziang Cheng, Yunfei Duan, Pan Ji, Hongdong Li

Given a single image of a 3D object, this paper proposes a novel method (named ConsistNet) that is able to generate multiple images of the same object, as if they were captured from different viewpoints, while the 3D (multi-view) consistencies among those generated images are effectively exploited.

Depth Estimation Depth Prediction +2

A Comprehensive Review of Community Detection in Graphs

no code implementations 21 Sep 2023 Jiakang Li, Songning Lai, Zhihao Shuai, Yuan Tan, Yifan Jia, Mianyang Yu, Zichen Song, Xiaokang Peng, Ziyang Xu, Yongxin Ni, Haifeng Qiu, Jiayu Yang, Yutong Liu, Yonggang Lu

This review article delves into community detection in graphs, offering a thorough exposition of various community detection methods from the perspectives of modularity-based methods, spectral clustering, probabilistic modelling, and deep learning.

Community Detection Sociology

Stereo Matching in Time: 100+ FPS Video Stereo Matching for Extended Reality

no code implementations 8 Sep 2023 Ziang Cheng, Jiayu Yang, Hongdong Li

One of the major difficulties is the lack of high-quality indoor video stereo training datasets captured by head-mounted VR/AR glasses.

Mixed Reality Stereo Matching

MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression

1 code implementation 28 Jul 2023 Wei Jiang, Jiayu Yang, Yongqi Zhai, Feng Gao, Ronggang Wang

Additionally, to capture global contexts, we propose a linear-complexity, attention-based method for capturing global correlations that leverages the decomposition of the softmax operation.

Image Compression

LLIC: Large Receptive Field Transform Coding with Adaptive Weights for Learned Image Compression

no code implementations 19 Apr 2023 Wei Jiang, Peirong Ning, Jiayu Yang, Yongqi Zhai, Feng Gao, Ronggang Wang

To tackle this issue, we propose Large Receptive Field Transform Coding with Adaptive Weights for Learned Image Compression (LLIC).

Image Compression

Non-parametric Depth Distribution Modelling based Depth Inference for Multi-view Stereo

1 code implementation CVPR 2022 Jiayu Yang, Jose M. Alvarez, Miaomiao Liu

Boundary pixels usually follow a multi-modal distribution as they represent different depths; therefore, the assumption results in an erroneous depth prediction at the coarser level of the cost volume pyramid that cannot be corrected in the refinement levels, leading to wrong depth predictions.

Depth Estimation Depth Prediction

Self-supervised Learning of Depth Inference for Multi-view Stereo

1 code implementation CVPR 2021 Jiayu Yang, Jose M. Alvarez, Miaomiao Liu

Here, we propose a self-supervised learning framework for multi-view stereo that exploits pseudo labels from the input data.

Depth Estimation Image Reconstruction +1

Super-Resolving Compressed Video in Coding Chain

no code implementations 26 Mar 2021 Dewang Hou, Yang Zhao, Yuyao Ye, Jiayu Yang, Jian Zhang, Ronggang Wang

Scaling and lossy coding are widely used in video transmission and storage.

Decoder
