Search Results for author: Songwei Ge

Found 25 papers, 10 papers with code

Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors

no code implementations12 Dec 2024 Yue Feng, Vaibhav Sanjay, Spencer Lutz, Badour AlBahar, Songwei Ge, Jia-Bin Huang

Automatically generating multiview illusions is a compelling challenge, where a single piece of visual content offers distinct interpretations from different viewing perspectives.

Rethinking Score Distillation as a Bridge Between Image Distributions

no code implementations13 Jun 2024 David McAllister, Songwei Ge, Jia-Bin Huang, David W. Jacobs, Alexei A. Efros, Aleksander Holynski, Angjoo Kanazawa

We compare our method to existing approaches for score distillation sampling and show that it can produce high-frequency details with realistic colors.

NeRF

Coherent Zero-Shot Visual Instruction Generation

no code implementations6 Jun 2024 Quynh Phung, Songwei Ge, Jia-Bin Huang

Despite the advances in text-to-image synthesis, particularly with diffusion models, generating visual instructions that require consistent representation and smooth state transitions of objects across sequential steps remains a formidable challenge.

Image Generation Reading Comprehension

On the Content Bias in Fréchet Video Distance

1 code implementation18 Apr 2024 Songwei Ge, Aniruddha Mahapatra, Gaurav Parmar, Jun-Yan Zhu, Jia-Bin Huang

We show that FVD with features extracted from the recent large-scale self-supervised video models is less biased toward image quality.

Video Generation

On the Content Bias in Frechet Video Distance

no code implementations CVPR 2024 Songwei Ge, Aniruddha Mahapatra, Gaurav Parmar, Jun-Yan Zhu, Jia-Bin Huang

Frechet Video Distance (FVD) a prominent metric for evaluating video generation models is known to conflict with human perception occasionally.

Video Generation

Grounded Text-to-Image Synthesis with Attention Refocusing

no code implementations CVPR 2024 Quynh Phung, Songwei Ge, Jia-Bin Huang

Driven by the scalable diffusion models trained on large-scale datasets, text-to-image synthesis methods have shown compelling results.

Image Generation

Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models

no code implementations ICCV 2023 Songwei Ge, Seungjun Nah, Guilin Liu, Tyler Poon, Andrew Tao, Bryan Catanzaro, David Jacobs, Jia-Bin Huang, Ming-Yu Liu, Yogesh Balaji

Despite tremendous progress in generating high-quality images using diffusion models, synthesizing a sequence of animated frames that are both photorealistic and temporally coherent is still in its infancy.

Image Generation Text-to-Video Generation +1

Expressive Text-to-Image Generation with Rich Text

1 code implementation ICCV 2023 Songwei Ge, Taesung Park, Jun-Yan Zhu, Jia-Bin Huang

For each region, we enforce its text attributes by creating region-specific detailed prompts and applying region-specific guidance, and maintain its fidelity against plain-text generation through region-based injections.

Text Generation Text-to-Image Generation

Text-driven Visual Synthesis with Latent Diffusion Prior

no code implementations16 Feb 2023 Ting-Hsuan Liao, Songwei Ge, Yiran Xu, Yao-Chih Lee, Badour AlBahar, Jia-Bin Huang

There has been tremendous progress in large-scale text-to-image synthesis driven by diffusion models enabling versatile downstream applications such as 3D object synthesis from texts, image editing, and customized generation.

Decoder Image Generation +1

Hyperbolic Contrastive Learning for Visual Representations beyond Objects

1 code implementation CVPR 2023 Songwei Ge, Shlok Mishra, Simon Kornblith, Chun-Liang Li, David Jacobs

To exploit such a structure, we propose a contrastive learning framework where a Euclidean loss is used to learn object representations and a hyperbolic loss is used to encourage representations of scenes to lie close to representations of their constituent objects in a hyperbolic space.

Contrastive Learning Image Classification +5

Visual Conceptual Blending with Large-scale Language and Vision Models

no code implementations27 Jun 2021 Songwei Ge, Devi Parikh

We ask the question: to what extent can recent large-scale language and image generation models blend visual concepts?

Image Generation Language Modeling +3

Shift Invariance Can Reduce Adversarial Robustness

1 code implementation NeurIPS 2021 Songwei Ge, Vasu Singla, Ronen Basri, David Jacobs

Using this, we prove that shift invariance in neural networks produces adversarial examples for the simple case of two classes, each consisting of a single image with a black or white dot on a gray background.

Adversarial Robustness

Learned Interpolation for 3D Generation

no code implementations8 Dec 2019 Austin Dill, Songwei Ge, Eunsu Kang, Chun-Liang Li, Barnabas Poczos

The typical approach for incorporating this creative process is to interpolate in a learned latent space so as to avoid the problem of generating unrealistic instances by exploiting the model's learned structure.

3D Generation

Getting Topology and Point Cloud Generation to Mesh

no code implementations8 Dec 2019 Austin Dill, Chun-Liang Li, Songwei Ge, Eunsu Kang

In this work, we explore the idea that effective generative models for point clouds under the autoencoding framework must acknowledge the relationship between a continuous surface, a discretized mesh, and a set of points sampled from the surface.

Point Cloud Generation

From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories

no code implementations20 Aug 2019 Songwei Ge, Curtis Xuan, Ruihua Song, Chao Zou, Wei Liu, Jin Zhou

In this paper, we address the problem of automatically adding sound effects to radio stories with a retrieval-based model.

Retrieval TAG +1

Developing Creative AI to Generate Sculptural Objects

no code implementations20 Aug 2019 Songwei Ge, Austin Dill, Eunsu Kang, Chun-Liang Li, Lingyao Zhang, Manzil Zaheer, Barnabas Poczos

We explore the intersection of human and machine creativity by generating sculptural objects through machine learning.

Clustering Generating 3D Point Clouds

Personalizing Search Results Using Hierarchical RNN with Query-aware Attention

no code implementations20 Aug 2019 Songwei Ge, Zhicheng Dou, Zhengbao Jiang, Jian-Yun Nie, Ji-Rong Wen

Our analysis reveals that the attention model is able to attribute higher weights to more related past sessions after fine training.

Attribute

Cannot find the paper you are looking for? You can Submit a new open access paper.