Search Results for author: Songwei Ge

Found 20 papers, 7 papers with code

On the Content Bias in Fréchet Video Distance

no code implementations • 18 Apr 2024 • Songwei Ge, Aniruddha Mahapatra, Gaurav Parmar, Jun-Yan Zhu, Jia-Bin Huang

We show that FVD with features extracted from the recent large-scale self-supervised video models is less biased toward image quality.

Video Generation

Paper
Add Code

Grounded Text-to-Image Synthesis with Attention Refocusing

no code implementations • 8 Jun 2023 • Quynh Phung, Songwei Ge, Jia-Bin Huang

Driven by the scalable diffusion models trained on large-scale datasets, text-to-image synthesis methods have shown compelling results.

Image Generation

Paper
Add Code

Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models

no code implementations • ICCV 2023 • Songwei Ge, Seungjun Nah, Guilin Liu, Tyler Poon, Andrew Tao, Bryan Catanzaro, David Jacobs, Jia-Bin Huang, Ming-Yu Liu, Yogesh Balaji

Despite tremendous progress in generating high-quality images using diffusion models, synthesizing a sequence of animated frames that are both photorealistic and temporally coherent is still in its infancy.

Ranked #8 on Text-to-Video Generation on UCF-101

Image Generation Text-to-Video Generation +1

Paper
Add Code

Expressive Text-to-Image Generation with Rich Text

no code implementations • ICCV 2023 • Songwei Ge, Taesung Park, Jun-Yan Zhu, Jia-Bin Huang

For each region, we enforce its text attributes by creating region-specific detailed prompts and applying region-specific guidance, and maintain its fidelity against plain-text generation through region-based injections.

Text Generation Text-to-Image Generation

Paper
Add Code

Text-driven Visual Synthesis with Latent Diffusion Prior

no code implementations • 16 Feb 2023 • Ting-Hsuan Liao, Songwei Ge, Yiran Xu, Yao-Chih Lee, Badour AlBahar, Jia-Bin Huang

There has been tremendous progress in large-scale text-to-image synthesis driven by diffusion models enabling versatile downstream applications such as 3D object synthesis from texts, image editing, and customized generation.

Image Generation Text to 3D

Paper
Add Code

Hyperbolic Contrastive Learning for Visual Representations beyond Objects

1 code implementation • CVPR 2023 • Songwei Ge, Shlok Mishra, Simon Kornblith, Chun-Liang Li, David Jacobs

To exploit such a structure, we propose a contrastive learning framework where a Euclidean loss is used to learn object representations and a hyperbolic loss is used to encourage representations of scenes to lie close to representations of their constituent objects in a hyperbolic space.

Contrastive Learning Image Classification +5

Paper
Code

MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration

2 code implementations • 17 Apr 2022 • Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh

Altogether, MUGEN can help progress research in many tasks in multimodal understanding and generation.

Navigate Retrieval +4

Paper
Code

Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer

1 code implementation • 7 Apr 2022 • Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh

Videos are created to express emotion, exchange information, and share experiences.

Ranked #15 on Video Generation on UCF-101

Video Generation

237

Paper
Code

Robust Contrastive Learning Using Negative Samples with Diminished Semantics

1 code implementation • NeurIPS 2021 • Songwei Ge, Shlok Mishra, Haohan Wang, Chun-Liang Li, David Jacobs

We also show that model bias favors texture and shape features differently under different test settings.

Contrastive Learning Data Augmentation +1

Paper
Code

Visual Conceptual Blending with Large-scale Language and Vision Models

no code implementations • 27 Jun 2021 • Songwei Ge, Devi Parikh

We ask the question: to what extent can recent large-scale language and image generation models blend visual concepts?

Image Generation Language Modelling +2

Paper
Add Code

Shift Invariance Can Reduce Adversarial Robustness

1 code implementation • NeurIPS 2021 • Songwei Ge, Vasu Singla, Ronen Basri, David Jacobs

Using this, we prove that shift invariance in neural networks produces adversarial examples for the simple case of two classes, each consisting of a single image with a black or white dot on a gray background.

Adversarial Robustness

Paper
Code

Creative Sketch Generation

1 code implementation • ICLR 2021 • Songwei Ge, Vedanuj Goswami, C. Lawrence Zitnick, Devi Parikh

Sketching or doodling is a popular creative activity that people engage in.

Generative Adversarial Network

103

Paper
Code

Smooth Kernels Improve Adversarial Robustness and Perceptually-Aligned Gradients

no code implementations • ICLR 2020 • Haohan Wang, Xindi Wu, Songwei Ge, Zachary C. Lipton, Eric P. Xing

Recent research has shown that CNNs are often overly sensitive to high-frequency textural patterns.

Adversarial Robustness

Paper
Add Code

Learned Interpolation for 3D Generation

no code implementations • 8 Dec 2019 • Austin Dill, Songwei Ge, Eunsu Kang, Chun-Liang Li, Barnabas Poczos

The typical approach for incorporating this creative process is to interpolate in a learned latent space so as to avoid the problem of generating unrealistic instances by exploiting the model's learned structure.

3D Generation

Paper
Add Code

Getting Topology and Point Cloud Generation to Mesh

no code implementations • 8 Dec 2019 • Austin Dill, Chun-Liang Li, Songwei Ge, Eunsu Kang

In this work, we explore the idea that effective generative models for point clouds under the autoencoding framework must acknowledge the relationship between a continuous surface, a discretized mesh, and a set of points sampled from the surface.

Point Cloud Generation