Search Results for author: Zeyi Huang

Found 25 papers, 14 papers with code

Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts

1 code implementation20 Apr 2025 Zhenkui Yang, Zeyi Huang, Ge Wang, Han Ding, Tony Xiao Han, Fei Wang

Wireless signal-based human sensing technologies, such as WiFi, millimeter-wave (mmWave) radar, and Radio Frequency Identification (RFID), enable the detection and interpretation of human presence, posture, and activities, thereby providing critical support for applications in public security, healthcare, and smart environments.

Action Recognition Temporal Action Localization

HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting

1 code implementation25 Mar 2025 Xinpeng Liu, Zeyi Huang, Fumio Okura, Yasuyuki Matsushita

Novel view synthesis has demonstrated impressive progress recently, with 3D Gaussian splatting (3DGS) offering efficient training time and photorealistic real-time rendering.

3DGS Novel View Synthesis +1

IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents

no code implementations25 Feb 2025 Eric Xue, Zeyi Huang, Yuyang Ji, Haohan Wang

These findings establish Iterative Refinement as an effective new strategy for LLM-driven ML automation and position IMPROVE as an accessible solution for building high-quality computer vision models without requiring ML expertise.

Attribute Large Language Model

Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs

no code implementations8 Jan 2025 Zeyi Huang, Yuyang Ji, Xiaofang Wang, Nikhil Mehta, Tong Xiao, DongHyun Lee, Sigmund Vanvalkenburgh, Shengxin Zha, Bolin Lai, Licheng Yu, Ning Zhang, Yong Jae Lee, Miao Liu

Long-form video understanding with Large Vision Language Models is challenged by the need to analyze temporally dispersed yet spatially concentrated key moments within limited context windows.

EgoSchema Object Tracking +1

PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion

no code implementations27 Dec 2023 Guansong Lu, Yuanfan Guo, Jianhua Han, Minzhe Niu, Yihan Zeng, Songcen Xu, Zeyi Huang, Zhao Zhong, Wei zhang, Hang Xu

Current large-scale diffusion models represent a giant leap forward in conditional image synthesis, capable of interpreting diverse cues like text, human poses, and edges.

Computational Efficiency Denoising +1

A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance

1 code implementation ICCV 2023 Zeyi Huang, Andy Zhou, Zijian Lin, Mu Cai, Haohan Wang, Yong Jae Lee

Domain generalization studies the problem of training a model with samples from several domains (or distributions) and then testing the model with samples from a new, unseen domain.

Domain Generalization Knowledge Distillation +3

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

no code implementations9 Jun 2023 Mu Cai, Zeyi Huang, Yuheng Li, Utkarsh Ojha, Haohan Wang, Yong Jae Lee

To study what the LLM can do with this XML-based textual description of images, we test the LLM on three broad computer vision tasks: (i) visual reasoning and question answering, (ii) image classification under distribution shift, few-shot learning, and (iii) generating new images using visual prompting.

Few-Shot Learning Image Classification +6

Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation

1 code implementation4 Jun 2022 Haohan Wang, Zeyi Huang, Xindi Wu, Eric P. Xing

Finally, we test this simple technique we identify (worst-case data augmentation with squared l2 norm alignment regularization) and show that the benefits of this method outrun those of the specially designed methods.

Data Augmentation

The Two Dimensions of Worst-case Training and the Integrated Effect for Out-of-domain Generalization

1 code implementation9 Apr 2022 Zeyi Huang, Haohan Wang, Dong Huang, Yong Jae Lee, Eric P. Xing

Training with an emphasis on "hard-to-learn" components of the data has been proven as an effective method to improve the generalization of machine learning models, especially in the settings where robustness (e. g., generalization across distributions) is valued.

BIG-bench Machine Learning Domain Generalization

The Two Dimensions of Worst-Case Training and Their Integrated Effect for Out-of-Domain Generalization

no code implementations CVPR 2022 Zeyi Huang, Haohan Wang, Dong Huang, Yong Jae Lee, Eric P. Xing

Training with an emphasis on "hard-to-learn" components of the data has been proven as an effective method to improve the generalization of machine learning models, especially in the settings where robustness (e. g., generalization across distributions) is valued.

BIG-bench Machine Learning Domain Generalization

On the Integration of Self-Attention and Convolution

2 code implementations CVPR 2022 Xuran Pan, Chunjiang Ge, Rui Lu, Shiji Song, Guanfu Chen, Zeyi Huang, Gao Huang

In this paper, we show that there exists a strong underlying relation between them, in the sense that the bulk of computations of these two paradigms are in fact done with the same operation.

Representation Learning

Toward Learning Human-aligned Cross-domain Robust Models by Countering Misaligned Features

no code implementations5 Nov 2021 Haohan Wang, Zeyi Huang, HANLIN ZHANG, Yong Jae Lee, Eric Xing

Machine learning has demonstrated remarkable prediction accuracy over i. i. d data, but the accuracy often drops when tested with data from another distribution.

BIG-bench Machine Learning

Not All Images are Worth 16x16 Words: Dynamic Transformers for Efficient Image Recognition

2 code implementations NeurIPS 2021 Yulin Wang, Rui Huang, Shiji Song, Zeyi Huang, Gao Huang

Inspired by this phenomenon, we propose a Dynamic Transformer to automatically configure a proper number of tokens for each input image.

Ranked #28 on Image Classification on CIFAR-100 (using extra training data)

All Computational Efficiency +1

On the Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations

no code implementations1 Jan 2021 Haohan Wang, Zeyi Huang, Xindi Wu, Eric Xing

Data augmentation is one of the most popular techniques for improving the robustness of neural networks.

Data Augmentation

Learning Robust Models by Countering Spurious Correlations

no code implementations1 Jan 2021 Haohan Wang, Zeyi Huang, Eric Xing

In this paper, we formally study the generalization error bound for this setup with the knowledge of how the spurious features are associated with the label.

Domain Adaptation

Self-Challenging Improves Cross-Domain Generalization

8 code implementations ECCV 2020 Zeyi Huang, Haohan Wang, Eric P. Xing, Dong Huang

We introduce a simple training heuristic, Representation Self-Challenging (RSC), that significantly improves the generalization of CNN to the out-of-domain data.

Domain Generalization Image Classification

Multiple Anchor Learning for Visual Object Detection

3 code implementations CVPR 2020 Wei Ke, Tianliang Zhang, Zeyi Huang, Qixiang Ye, Jianzhuang Liu, Dong Huang

In this paper, we propose a Multiple Instance Learning (MIL) approach that selects anchors and jointly optimizes the two modules of a CNN-based object detector.

General Classification Multiple Instance Learning +3

High Frequency Component Helps Explain the Generalization of Convolutional Neural Networks

1 code implementation28 May 2019 Haohan Wang, Xindi Wu, Zeyi Huang, Eric P. Xing

We investigate the relationship between the frequency spectrum of image data and the generalization behavior of convolutional neural networks (CNN).

Adversarial Attack Vocal Bursts Intensity Prediction

Improving Object Detection with Inverted Attention

1 code implementation28 Mar 2019 Zeyi Huang, Wei Ke, Dong Huang

Our approach (1) operates along both the spatial and channels dimensions of the feature maps; (2) requires no extra training on hard samples, no extra network parameters for attention estimation, and no testing overheads.

Object object-detection +1

Large Margin Object Tracking with Circulant Feature Maps

no code implementations CVPR 2017 Mengmeng Wang, Yong liu, Zeyi Huang

Structured output support vector machine (SVM) based tracking algorithms have shown favorable performance recently.

Object Object Tracking

Cannot find the paper you are looking for? You can Submit a new open access paper.