Search Results for author: Long Lian

Found 12 papers, 8 papers with code

Rethinking Patch Dependence for Masked Autoencoders

1 code implementation • 25 Jan 2024 • Letian Fu, Long Lian, Renhao Wang, Baifeng Shi, Xudong Wang, Adam Yala, Trevor Darrell, Alexei A. Efros, Ken Goldberg

In this work, we re-examine inter-patch dependencies in the decoding mechanism of masked autoencoders (MAE).

Instance Segmentation • Representation Learning • +1

Unsupervised Universal Image Segmentation

1 code implementation • 28 Dec 2023 • Dantong Niu, Xudong Wang, Xinyang Han, Long Lian, Roei Herzig, Trevor Darrell

Several unsupervised image segmentation approaches have been proposed which eliminate the need for dense manually-annotated segmentation masks; current models separately handle either semantic segmentation (e.g., STEGO) or class-agnostic instance segmentation (e.g., CutLER), but not both (i.e., panoptic segmentation).

Ranked #1 on Unsupervised Panoptic Segmentation on COCO val2017

Image Segmentation • Instance Segmentation • +7

Self-correcting LLM-controlled Diffusion Models

no code implementations • 27 Nov 2023 • Tsung-Han Wu, Long Lian, Joseph E. Gonzalez, Boyi Li, Trevor Darrell

Steered by an LLM controller, SLD turns text-to-image generation into an iterative closed-loop process, ensuring correctness in the resulting image.

Attribute • Text-to-Image Generation

LLM-grounded Video Diffusion Models

no code implementations • 29 Sep 2023 • Long Lian, Baifeng Shi, Adam Yala, Trevor Darrell, Boyi Li

We show that LLMs are able to understand complex spatiotemporal dynamics from text alone and generate layouts that align closely with both the prompts and the object motion patterns typically observed in the real world.

Language Modelling • Large Language Model • +1

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models

1 code implementation • 23 May 2023 • Long Lian, Boyi Li, Adam Yala, Trevor Darrell

Our method significantly outperforms the base diffusion model and several strong baselines in accurately generating images according to prompts that require various capabilities, doubling the generation accuracy across four tasks on average.

Common Sense Reasoning • Language Modelling • +2

Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping

1 code implementation • CVPR 2023 • Long Lian, Zhirong Wu, Stella X. Yu

The Gestalt law of common fate, i.e., what moves at the same speed belongs together, has inspired unsupervised object discovery based on motion segmentation.

Ranked #1 on Unsupervised Object Segmentation on FBMS-59

Motion Segmentation • Object • +7

Q-Diffusion: Quantizing Diffusion Models

1 code implementation • ICCV 2023 • Xiuyu Li, Yijiang Liu, Long Lian, Huanrui Yang, Zhen Dong, Daniel Kang, Shanghang Zhang, Kurt Keutzer

We propose a novel PTQ method specifically tailored towards the unique multi-timestep pipeline and model architecture of the diffusion models, which compresses the noise estimation network to accelerate the generation process.

Image Generation • Noise Estimation • +1

Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy

no code implementations • 17 Dec 2022 • Long Lian, Zhirong Wu, Stella X. Yu

Previous methods in unsupervised video object segmentation (UVOS) have demonstrated the effectiveness of motion as either input or supervision for segmentation.

Misconceptions • Object • +5

Debiased Learning from Naturally Imbalanced Pseudo-Labels

1 code implementation • CVPR 2022 • Xudong Wang, Zhirong Wu, Long Lian, Stella X. Yu

Our key insight is that pseudo-labels are naturally imbalanced due to intrinsic data similarity, even when a model is trained on balanced source data and evaluated on balanced target data.

Ranked #1 on Few-Shot Image Classification on ImageNet - 0-Shot (using extra training data)

counterfactual • Counterfactual Reasoning • +4

Unsupervised Selective Labeling for More Effective Semi-Supervised Learning

1 code implementation • 6 Oct 2021 • Xudong Wang, Long Lian, Stella X. Yu

Intuitively, no matter what the downstream task is, instances to be labeled must be representative and diverse: The former would facilitate label propagation to unlabeled data, whereas the latter would ensure coverage of the entire dataset.

Ranked #2 on Semi-Supervised Image Classification (Cold Start) on CIFAR-10, 100 Labels

Active Learning • Semi-Supervised Image Classification (Cold Start)

Unsupervised Visual Attention and Invariance for Reinforcement Learning

no code implementations • CVPR 2021 • Xudong Wang, Long Lian, Stella X. Yu

Existing methods focus on training an RL policy that is universal to changing visual domains, whereas we focus on extracting visual foreground that is universal, feeding clean invariant vision to the RL policy learner.

Domain Generalization • Keypoint Detection • +2

Long-tailed Recognition by Routing Diverse Distribution-Aware Experts

2 code implementations • ICLR 2021 • Xudong Wang, Long Lian, Zhongqi Miao, Ziwei Liu, Stella X. Yu

We take a dynamic view of the training data and provide a principled model bias and variance analysis as the training data fluctuates: Existing long-tail classifiers invariably increase the model variance and the head-tail model bias gap remains large, due to more and larger confusion with hard negatives for the tail.

Ranked #22 on Long-tail Learning on iNaturalist 2018

Image Classification • imbalanced classification • +1
