Search Results for author: Zeyi Huang

Found 19 papers, 13 papers with code

PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion

no code implementations • 27 Dec 2023 • Guansong Lu, Yuanfan Guo, Jianhua Han, Minzhe Niu, Yihan Zeng, Songcen Xu, Zeyi Huang, Zhao Zhong, Wei zhang, Hang Xu

Current large-scale diffusion models represent a giant leap forward in conditional image synthesis, capable of interpreting diverse cues like text, human poses, and edges.

Computational Efficiency Denoising +1

Paper
Add Code

A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance

1 code implementation • ICCV 2023 • Zeyi Huang, Andy Zhou, Zijian Lin, Mu Cai, Haohan Wang, Yong Jae Lee

Domain generalization studies the problem of training a model with samples from several domains (or distributions) and then testing the model with samples from a new, unseen domain.

Ranked #15 on Domain Generalization on PACS

Domain Generalization Knowledge Distillation +2

Paper
Code

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

no code implementations • 9 Jun 2023 • Mu Cai, Zeyi Huang, Yuheng Li, Haohan Wang, Yong Jae Lee

By leveraging the XML-based textual descriptions of SVG representations instead of raster images, we aim to bridge the gap between the visual and textual modalities, allowing LLMs to directly understand and manipulate images without the need for parameterized visual components.

Image Classification In-Context Learning +2

Paper
Add Code

Expeditious Saliency-guided Mix-up through Random Gradient Thresholding

1 code implementation • 9 Dec 2022 • Minh-Long Luu, Zeyi Huang, Eric P. Xing, Yong Jae Lee, Haohan Wang

Mix-up training approaches have proven to be effective in improving the generalization ability of Deep Neural Networks.

Ranked #1 on Classifier calibration on CIFAR-100

Classifier calibration Image Classification +1

Paper
Code

Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation

1 code implementation • 4 Jun 2022 • Haohan Wang, Zeyi Huang, Xindi Wu, Eric P. Xing

Finally, we test this simple technique we identify (worst-case data augmentation with squared l2 norm alignment regularization) and show that the benefits of this method outrun those of the specially designed methods.

Data Augmentation

Paper
Code

The Two Dimensions of Worst-case Training and the Integrated Effect for Out-of-domain Generalization

1 code implementation • 9 Apr 2022 • Zeyi Huang, Haohan Wang, Dong Huang, Yong Jae Lee, Eric P. Xing

Training with an emphasis on "hard-to-learn" components of the data has been proven as an effective method to improve the generalization of machine learning models, especially in the settings where robustness (e. g., generalization across distributions) is valued.

BIG-bench Machine Learning Domain Generalization

Paper
Code

The Two Dimensions of Worst-Case Training and Their Integrated Effect for Out-of-Domain Generalization

no code implementations • CVPR 2022 • Zeyi Huang, Haohan Wang, Dong Huang, Yong Jae Lee, Eric P. Xing

BIG-bench Machine Learning Domain Generalization

Paper
Add Code

On the Integration of Self-Attention and Convolution

2 code implementations • CVPR 2022 • Xuran Pan, Chunjiang Ge, Rui Lu, Shiji Song, Guanfu Chen, Zeyi Huang, Gao Huang

In this paper, we show that there exists a strong underlying relation between them, in the sense that the bulk of computations of these two paradigms are in fact done with the same operation.

Representation Learning

360

Paper
Code

Toward Learning Human-aligned Cross-domain Robust Models by Countering Misaligned Features

1 code implementation • 5 Nov 2021 • Haohan Wang, Zeyi Huang, HANLIN ZHANG, Yong Jae Lee, Eric Xing

Machine learning has demonstrated remarkable prediction accuracy over i. i. d data, but the accuracy often drops when tested with data from another distribution.

BIG-bench Machine Learning

Paper
Code

Not All Images are Worth 16x16 Words: Dynamic Transformers for Efficient Image Recognition

2 code implementations • NeurIPS 2021 • Yulin Wang, Rui Huang, Shiji Song, Zeyi Huang, Gao Huang

Inspired by this phenomenon, we propose a Dynamic Transformer to automatically configure a proper number of tokens for each input image.

Ranked #29 on Image Classification on CIFAR-100 (using extra training data)

Computational Efficiency Image Classification

241

Paper
Code

On the Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations

no code implementations • 1 Jan 2021 • Haohan Wang, Zeyi Huang, Xindi Wu, Eric Xing

Data augmentation is one of the most popular techniques for improving the robustness of neural networks.

Data Augmentation

Paper
Add Code

Learning Robust Models by Countering Spurious Correlations

no code implementations • 1 Jan 2021 • Haohan Wang, Zeyi Huang, Eric Xing

In this paper, we formally study the generalization error bound for this setup with the knowledge of how the spurious features are associated with the label.

Domain Adaptation

Paper
Add Code

Squared $\ell_2$ Norm as Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations

1 code implementation • 25 Nov 2020 • Haohan Wang, Zeyi Huang, Xindi Wu, Eric P. Xing

Data augmentation is one of the most popular techniques for improving the robustness of neural networks.

Data Augmentation

Paper
Code

Comprehensive Attention Self-Distillation for Weakly-Supervised Object Detection

1 code implementation • NeurIPS 2020 • Zeyi Huang, Yang Zou, Vijayakumar Bhagavatula, Dong Huang

Moreover, the image-level category labels do not enforce consistent object detection across different transformations of the same images.

Ranked #1 on Weakly Supervised Object Detection on MSCOCO

Object object-detection +1

Paper
Code

Self-Challenging Improves Cross-Domain Generalization

8 code implementations • ECCV 2020 • Zeyi Huang, Haohan Wang, Eric P. Xing, Dong Huang

We introduce a simple training heuristic, Representation Self-Challenging (RSC), that significantly improves the generalization of CNN to the out-of-domain data.

Ranked #27 on Domain Generalization on PACS

Domain Generalization Image Classification

1,328

Paper
Code

Multiple Anchor Learning for Visual Object Detection

3 code implementations • CVPR 2020 • Wei Ke, Tianliang Zhang, Zeyi Huang, Qixiang Ye, Jianzhuang Liu, Dong Huang

In this paper, we propose a Multiple Instance Learning (MIL) approach that selects anchors and jointly optimizes the two modules of a CNN-based object detector.

Ranked #116 on Object Detection on COCO test-dev

General Classification Multiple Instance Learning +3

Paper
Code

High Frequency Component Helps Explain the Generalization of Convolutional Neural Networks

1 code implementation • 28 May 2019 • Haohan Wang, Xindi Wu, Zeyi Huang, Eric P. Xing

We investigate the relationship between the frequency spectrum of image data and the generalization behavior of convolutional neural networks (CNN).

Adversarial Attack Vocal Bursts Intensity Prediction

233

Paper
Code

Improving Object Detection with Inverted Attention

1 code implementation • 28 Mar 2019 • Zeyi Huang, Wei Ke, Dong Huang

Our approach (1) operates along both the spatial and channels dimensions of the feature maps; (2) requires no extra training on hard samples, no extra network parameters for attention estimation, and no testing overheads.

Object object-detection +1

Paper
Code

Large Margin Object Tracking with Circulant Feature Maps

no code implementations • CVPR 2017 • Mengmeng Wang, Yong liu, Zeyi Huang

Structured output support vector machine (SVM) based tracking algorithms have shown favorable performance recently.

Object Object Tracking

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.