Search Results for author: Han Huang

Found 36 papers, 11 papers with code

Definition Modelling for Appropriate Specificity

no code implementations EMNLP 2021 Han Huang, Tomoyuki Kajiwara, Yuki Arase

Definition generation techniques aim to generate a definition of a target word or phrase given a context.

Definition Modelling Re-Ranking +1

JADE: Corpus for Japanese Definition Modelling

no code implementations LREC 2022 Han Huang, Tomoyuki Kajiwara, Yuki Arase

This study investigated and released the JADE, a corpus for Japanese definition modelling, which is a technique that automatically generates definitions of a given target word and phrase.

Definition Modelling

KEBench: A Benchmark on Knowledge Editing for Large Vision-Language Models

no code implementations12 Mar 2024 Han Huang, Haitian Zhong, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

We conducted experiments of different editing methods on five LVLMs, and thoroughly analyze how these methods impact the models.

knowledge editing

Reconstructing the Geometry of Random Geometric Graphs

no code implementations14 Feb 2024 Han Huang, Pakawut Jiradilok, Elchanan Mossel

Random geometric graphs are random graph models defined on metric spaces.

Unsupervised Solution Operator Learning for Mean-Field Games via Sampling-Invariant Parametrizations

no code implementations27 Jan 2024 Han Huang, Rongjie Lai

To ensure the proposed parametrization is well-suited for operator learning, we introduce and prove the notion of sampling invariance for our model, establishing its convergence to a continuous operator in the sampling limit.

Operator learning

NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input Views

no code implementations21 Dec 2023 Han Huang, Yulun Wu, Junsheng Zhou, Ge Gao, Ming Gu, Yu-Shen Liu

To achieve this, we train a neural network to learn a global implicit field from the on-surface points obtained from SfM and then leverage it as a coarse geometric constraint.

Surface Reconstruction valid

LLMR: Real-time Prompting of Interactive Worlds using Large Language Models

no code implementations21 Sep 2023 Fernanda De La Torre, Cathy Mengying Fang, Han Huang, Andrzej Banburski-Fahey, Judith Amores Fernandez, Jaron Lanier

We present Large Language Model for Mixed Reality (LLMR), a framework for the real-time creation and modification of interactive Mixed Reality experiences using LLMs.

Language Modelling Large Language Model +4

MUVF-YOLOX: A Multi-modal Ultrasound Video Fusion Network for Renal Tumor Diagnosis

1 code implementation15 Jul 2023 Junyu Li, Han Huang, Dong Ni, Wufeng Xue, Dongmei Zhu, Jun Cheng

In addition, we design an object-level temporal aggregation (OTA) module that can automatically filter low-quality features and efficiently integrate temporal information from multiple frames to improve the accuracy of tumor diagnosis.

Video Classification

Learning Joint 2D & 3D Diffusion Models for Complete Molecule Generation

2 code implementations21 May 2023 Han Huang, Leilei Sun, Bowen Du, Weifeng Lv

To capture the correlation between molecular graphs and geometries in the diffusion process, we develop a Diffusion Graph Transformer to parameterize the data prediction model that recovers the original data from noisy data.

3D Molecule Generation Drug Discovery +2

LatentAvatar: Learning Latent Expression Code for Expressive Neural Head Avatar

no code implementations2 May 2023 Yuelang Xu, Hongwen Zhang, Lizhen Wang, Xiaochen Zhao, Han Huang, GuoJun Qi, Yebin Liu

Existing approaches to animatable NeRF-based head avatars are either built upon face templates or use the expression coefficients of templates as the driving signal.

CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition

no code implementations CVPR 2023 Hongwen Zhang, Siyou Lin, Ruizhi Shao, Yuxiang Zhang, Zerong Zheng, Han Huang, Yandong Guo, Yebin Liu

In this way, the clothing deformations are disentangled such that the pose-dependent wrinkles can be better learned and applied to unseen poses.

PriSTI: A Conditional Diffusion Framework for Spatiotemporal Imputation

1 code implementation20 Feb 2023 Mingzhe Liu, Han Huang, Hao Feng, Leilei Sun, Bowen Du, Yanjie Fu

Our proposed framework provides a conditional feature extraction module first to extract the coarse yet effective spatiotemporal dependencies from conditional information as the global context prior.

Imputation Noise Estimation

Conditional Diffusion Based on Discrete Graph Structures for Molecular Graph Generation

1 code implementation1 Jan 2023 Han Huang, Leilei Sun, Bowen Du, Weifeng Lv

To accomplish these goals, we propose a novel Conditional Diffusion model based on discrete Graph Structures (CDGS) for molecular graph generation.

Drug Discovery Graph Generation +2

Neural Reconstruction of Relightable Human Model from Monocular Video

no code implementations ICCV 2023 Wenzhang Sun, Yunlong Che, Han Huang, Yandong Guo

In this paper, we introduce a novel self-supervised framework that takes a monocular video of a moving human as input and generates a 3D neural representation capable of being rendered with novel poses under arbitrary lighting conditions.

Privileged Prior Information Distillation for Image Matting

no code implementations25 Nov 2022 Cheng Lyu, Jiake Xie, Bo Xu, Cheng Lu, Han Huang, Xin Huang, Ming Wu, Chuang Zhang, Yong Tang

Performance of trimap-free image matting methods is limited when trying to decouple the deterministic and undetermined regions, especially in the scenes where foregrounds are semantically ambiguous, chromaless, or high transmittance.

Image Matting

UnconFuse: Avatar Reconstruction from Unconstrained Images

no code implementations18 Nov 2022 Han Huang, Liliang Chen, Xihao Wang

The report proposes an effective solution about 3D human body reconstruction from multiple unconstrained frames for ECCV 2022 WCPA Challenge: From Face, Body and Fashion to 3D Virtual avatars I (track1: Multi-View Based 3D Human Body Reconstruction).

CrossHuman: Learning Cross-Guidance from Multi-Frame Images for Human Reconstruction

no code implementations20 Jul 2022 Liliang Chen, Jiaqi Li, Han Huang, Yandong Guo

We propose CrossHuman, a novel method that learns cross-guidance from parametric human model and multi-frame RGB images to achieve high-quality 3D human reconstruction.

3D Human Reconstruction

Personalized Diagnostic Tool for Thyroid Cancer Classification using Multi-view Ultrasound

no code implementations1 Jul 2022 Han Huang, Yijie Dong, Xiaohong Jia, Jianqiao Zhou, Dong Ni, Jun Cheng, Ruobing Huang

Furthermore, finding an optimal way to integrate multi-view information also relies on the experience of clinicians and adds further difficulty to accurate diagnosis.

Decision Making

Bridging Mean-Field Games and Normalizing Flows with Trajectory Regularization

no code implementations30 Jun 2022 Han Huang, Jiajia Yu, Jie Chen, Rongjie Lai

In this work, we unravel the connections between MFGs and NFs by contextualizing the training of an NF as solving the MFG.

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations11 May 2022 Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.

Image Super-Resolution

Situational Perception Guided Image Matting

no code implementations20 Apr 2022 Bo Xu, Jiake Xie, Han Huang, Ziwen Li, Cheng Lu, Yong Tang, Yandong Guo

In this paper, we propose a Situational Perception Guided Image Matting (SPG-IM) method that mitigates subjective bias of matting annotations and captures sufficient situational perception information for better global saliency distilled from the visual-to-textual task.

Image Matting Object

Structured Local Radiance Fields for Human Avatar Modeling

no code implementations CVPR 2022 Zerong Zheng, Han Huang, Tao Yu, Hongwen Zhang, Yandong Guo, Yebin Liu

These local radiance fields not only leverage the flexibility of implicit representation in shape and appearance modeling, but also factorize cloth deformations into skeleton motions, node residual translations and the dynamic detail variations inside each individual radiance field.

Semantic Distillation Guided Salient Object Detection

no code implementations8 Mar 2022 Bo Xu, Guanze Liu, Han Huang, Cheng Lu, Yandong Guo

Most existing CNN-based salient object detection methods can identify local segmentation details like hair and animal fur, but often misinterpret the real saliency due to the lack of global contextual information caused by the subjectiveness of the SOD task and the locality of convolution layers.

Image Captioning Object +3

Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation

no code implementations22 Oct 2021 Ziwen Li, Bo Xu, Han Huang, Cheng Lu, Yandong Guo

In this paper, we propose a new framework Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation (DTS-VIBE), to generate 3D human pose and mesh from RGB videos.

3D Human Pose Estimation Optical Flow Estimation

Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction

1 code implementation ICCV 2021 Bo Xu, Han Huang, Cheng Lu, Ziwen Li, Yandong Guo

In this paper, we propose a Virtual Multi-modality Foreground Matting (VMFM) method to learn human-object interactive foreground (human and objects interacted with him or her) from a raw RGB image.

Human-Object Interaction Detection Image Matting

Incorporating Surprisingly Popular Algorithm and Euclidean Distance-based Adaptive Topology into PSO

1 code implementation25 Aug 2021 Xuan Wu, Jizong Han, Di Wang, Pengyue Gao, Quanlong Cui, Liang Chen, Yanchun Liang, Han Huang, Heow Pueh Lee, Chunyan Miao, You Zhou, Chunguo Wu

While many Particle Swarm Optimization (PSO) algorithms only use fitness to assess the performance of particles, in this work, we adopt Surprisingly Popular Algorithm (SPA) as a complementary metric in addition to fitness.

Single Particle Analysis

Differentiable Neural Architecture Search for Extremely Lightweight Image Super-Resolution

1 code implementation9 May 2021 Han Huang, Li Shen, Chaoyang He, Weisheng Dong, Wei Liu

Specifically, the cell-level search space is designed based on an information distillation mechanism, focusing on the combinations of lightweight operations and aiming to build a more lightweight and accurate SR structure.

Image Super-Resolution Neural Architecture Search +2

Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild

no code implementations7 Apr 2020 Zhecan Wang, Jian Zhao, Cheng Lu, Han Huang, Fan Yang, Lianji Li, Yandong Guo

To better demonstrate the advantage of our methods, we further propose a new benchmark dataset with the most rich distribution of head-gaze combination reflecting real-world scenarios.

Gaze Estimation

An Improved Epsilon Constraint-handling Method in MOEA/D for CMOPs with Large Infeasible Regions

no code implementations27 Jul 2017 Zhun Fan, Wenji Li, Xinye Cai, Han Huang, Yi Fang, Yugen You, Jiajie Mo, Caimin Wei, Erik Goodman

In order to evaluate the performance of MOEA/D-IEpsilon, a new set of CMOPs with two and three objectives is designed, having large infeasible regions (relative to the feasible regions), and they are called LIR-CMOPs.

Cannot find the paper you are looking for? You can Submit a new open access paper.