Search Results for author: Tianyu Huang

Found 23 papers, 11 papers with code

Large Language Models Enhanced Hyperbolic Space Recommender Systems

no code implementations8 Apr 2025 Wentao Cheng, Zhida Qin, Zexue Wu, Pengzhan Zhou, Tianyu Huang

Large Language Models (LLMs) have attracted significant attention in recommender systems for their excellent world knowledge capabilities.

Contrastive Learning Recommendation Systems +2

Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video

no code implementations4 Apr 2025 Jiaxin Guo, Wenzhen Dong, Tianyu Huang, Hao Ding, Ziyi Wang, Haomin Kuang, Qi Dou, Yun-hui Liu

The core contribution of our method is expanding the capability of the recent pairwise reconstruction model to long-term incremental dynamic reconstruction by an uncertainty-aware dual memory mechanism.

Camera Pose Estimation Depth Estimation +3

Align3R: Aligned Monocular Depth Estimation for Dynamic Videos

no code implementations4 Dec 2024 Jiahao Lu, Tianyu Huang, Peng Li, Zhiyang Dou, Cheng Lin, Zhiming Cui, Zhen Dong, Sai-Kit Yeung, Wenping Wang, YuAn Liu

Recent developments in monocular depth estimation methods enable high-quality depth estimation of single-view images but fail to estimate consistent video depth across different frames.

Monocular Depth Estimation

Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments

no code implementations18 Sep 2024 Lei Cheng, Junpeng Hu, Haodong Yan, Mariia Gladkova, Tianyu Huang, Yun-hui Liu, Daniel Cremers, Haoang Li

Photometric bundle adjustment (PBA) is widely used in estimating the camera pose and 3D geometry by assuming a Lambertian world.

3D geometry

2DGH: 2D Gaussian-Hermite Splatting for High-quality Rendering and Better Geometry Reconstruction

no code implementations30 Aug 2024 Ruihan Yu, Tianyu Huang, Jingwang Ling, Feng Xu

2D Gaussian Splatting has recently emerged as a significant method in 3D reconstruction, enabling novel view synthesis and geometry reconstruction simultaneously.

3D Reconstruction Novel View Synthesis

DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors

1 code implementation3 Jun 2024 Tianyu Huang, Haoze Zhang, Yihan Zeng, Zhilu Zhang, Hui Li, WangMeng Zuo, Rynson W. H. Lau

In this work, to combine the strengths and complementing shortcomings of the above two solutions, we propose to learn the physical properties of a material field with video diffusion priors, and then utilize a physics-based Material-Point-Method (MPM) simulator to generate 4D content with realistic motions.

Efficient and Robust Point Cloud Registration via Heuristics-guided Parameter Search

1 code implementation9 Apr 2024 Tianyu Huang, Haoang Li, Liangzu Peng, Yinlong Liu, Yun-hui Liu

Our strategy largely reduces the search space and can guarantee accuracy with only a few inlier samples, therefore enjoying an excellent trade-off between efficiency and robustness.

Point Cloud Registration

Scalable 3D Registration via Truncated Entry-wise Absolute Residuals

1 code implementation CVPR 2024 Tianyu Huang, Liangzu Peng, René Vidal, Yun-hui Liu

Given an input set of $3$D point pairs, the goal of outlier-robust $3$D registration is to compute some rotation and translation that align as many point pairs as possible.

A Comprehensive Survey on 3D Content Generation

1 code implementation2 Feb 2024 Jian Liu, Xiaoshui Huang, Tianyu Huang, Lu Chen, Yuenan Hou, Shixiang Tang, Ziwei Liu, Wanli Ouyang, WangMeng Zuo, Junjun Jiang, Xianming Liu

Recent years have witnessed remarkable advances in artificial intelligence generated content(AIGC), with diverse input modalities, e. g., text, image, video, audio and 3D.

Survey

FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with Pre-trained Vision-Language Models

2 code implementations28 Dec 2023 Wan Xu, Tianyu Huang, Tianyu Qu, Guanglei Yang, Yiwen Guo, WangMeng Zuo

To address the above challenges, we introduce the FILP-3D framework with two novel components: the Redundant Feature Eliminator (RFE) for feature space misalignment and the Spatial Noise Compensator (SNC) for significant noise.

class-incremental learning Dimensionality Reduction +3

DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior

1 code implementation CVPR 2024 Tianyu Huang, Yihan Zeng, Zhilu Zhang, Wan Xu, Hang Xu, Songcen Xu, Rynson W. H. Lau, WangMeng Zuo

The priors are then regarded as input conditions to maintain reasonable geometries, in which conditional LoRA and weighted score are further proposed to optimize detailed textures.

3D Generation NeRF +1

UniM$^2$AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving

1 code implementation21 Aug 2023 Jian Zou, Tianyu Huang, Guanglei Yang, Zhenhua Guo, Tao Luo, Chun-Mei Feng, WangMeng Zuo

First, it projects the features from both modalities into a cohesive 3D volume space to intricately marry the bird's eye view (BEV) with the height dimension.

3D Object Detection Autonomous Driving +1

Learning Accurate 3D Shape Based on Stereo Polarimetric Imaging

no code implementations CVPR 2023 Tianyu Huang, Haoang Li, Kejing He, Congying Sui, Bin Li, Yun-hui Liu

As to the orthographic projection problem, we propose a novel Viewing Direction-aided Positional Encoding (VDPE) strategy.

DDIT: Semantic Scene Completion via Deformable Deep Implicit Templates

no code implementations ICCV 2023 Haoang Li, Jinhu Dong, Binghui Wen, Ming Gao, Tianyu Huang, Yun-hui Liu, Daniel Cremers

It abstracts the shape prior of a category, and thus can provide constraints on the overall shape of an instance.

CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training

1 code implementation ICCV 2023 Tianyu Huang, Bowen Dong, Yunhan Yang, Xiaoshui Huang, Rynson W. H. Lau, Wanli Ouyang, WangMeng Zuo

To address this issue, we propose CLIP2Point, an image-depth pre-training method by contrastive learning to transfer CLIP to the 3D domain, and adapt it to point cloud classification.

Contrastive Learning Few-Shot Learning +5

AIMusicGuru: Music Assisted Human Pose Correction

no code implementations24 Mar 2022 Snehesh Shrestha, Cornelia Fermüller, Tianyu Huang, Pyone Thant Win, Adam Zukerman, Chethan M. Parameshwara, Yiannis Aloimonos

Pose Estimation techniques rely on visual cues available through observations represented in the form of pixels.

Pose Estimation

Intelligent multiscale simulation based on process-guided composite database

no code implementations20 Mar 2020 Zeliang Liu, Haoyan Wei, Tianyu Huang, C. T. Wu

In the paper, we present an integrated data-driven modeling framework based on process modeling, material homogenization, mechanistic machine learning, and concurrent multiscale simulation.

BIG-bench Machine Learning Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.