Search Results for author: Jie An

Found 21 papers, 8 papers with code

Bring Metric Functions into Diffusion Models

no code implementations • 4 Jan 2024 • Jie An, Zhengyuan Yang, JianFeng Wang, Linjie Li, Zicheng Liu, Lijuan Wang, Jiebo Luo

The first module, similar to a standard DDPM, learns to predict the added noise and is unaffected by the metric function.

Denoising

Paper
Add Code

Video Understanding with Large Language Models: A Survey

1 code implementation • 29 Dec 2023 • Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, JianGuo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu

With the burgeoning growth of online video platforms and the escalating volume of video content, the demand for proficient video understanding tools has intensified markedly.

Video Understanding

650

Paper
Code

OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation

no code implementations • 11 Oct 2023 • Jie An, Zhengyuan Yang, Linjie Li, JianFeng Wang, Kevin Lin, Zicheng Liu, Lijuan Wang, Jiebo Luo

We hope our proposed framework, benchmark, and LMM evaluation could help establish the intriguing interleaved image-text generation task.

Question Answering Text Generation

Paper
Add Code

Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation

1 code implementation • 14 Aug 2023 • Alexander Martin, Haitian Zheng, Jie An, Jiebo Luo

In this work, we use text-guided latent diffusion models for zero-shot image-to-image translation (I2I) across large domain gaps (longI2I), where large amounts of new visual features and new geometry need to be generated to enter the target domain.

Image-to-Image Translation

Paper
Code

Domain-Scalable Unpaired Image Translation via Latent Space Anchoring

1 code implementation • 26 Jun 2023 • Siyu Huang, Jie An, Donglai Wei, Zudi Lin, Jiebo Luo, Hanspeter Pfister

However, given a UNIT model trained on certain domains, it is difficult for current methods to incorporate new domains because they often need to train the full model on both existing and new domains.

Image-to-Image Translation Translation

Paper
Code

Online Causation Monitoring of Signal Temporal Logic

1 code implementation • 28 May 2023 • Zhenya Zhang, Jie An, Paolo Arcaini, Ichiro Hasuo

The classic STL monitoring is performed by computing a robustness interval that specifies, at each instant, how far the monitored signals are from violating and satisfying the specification.

Paper
Code

Learning to Evaluate the Artness of AI-generated Images

no code implementations • 8 May 2023 • Junyu Chen, Jie An, Hanjia Lyu, Jiebo Luo

Assessing the artness of AI-generated images continues to be a challenge within the realm of image generation.

Image Generation

Paper
Add Code

Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation

no code implementations • 17 Apr 2023 • Jie An, Songyang Zhang, Harry Yang, Sonal Gupta, Jia-Bin Huang, Jiebo Luo, Xi Yin

In contrast, we propose a parameter-free temporal shift module that can leverage the spatial U-Net as is for video generation.

Super-Resolution Text-to-Image Generation +2

Paper
Add Code

QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity

1 code implementation • CVPR 2023 • Siyu Huang, Jie An, Donglai Wei, Jiebo Luo, Hanspeter Pfister

The mechanism of existing style transfer algorithms is by minimizing a hybrid loss function to push the generated image toward high similarities in both content and style.

Quantization Style Transfer +1

Paper
Code

Improving Visual-textual Sentiment Analysis by Fusing Expert Features

no code implementations • 23 Nov 2022 • Junyu Chen, Jie An, Hanjia Lyu, Jiebo Luo

Visual-textual sentiment analysis aims to predict sentiment with the input of a pair of image and text.

Sentiment Analysis

Paper
Add Code

Make-A-Video: Text-to-Video Generation without Text-Video Data

2 code implementations • 29 Sep 2022 • Uriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv Taigman

We propose Make-A-Video -- an approach for directly translating the tremendous recent progress in Text-to-Image (T2I) generation to Text-to-Video (T2V).

Ranked #3 on Text-to-Video Generation on MSR-VTT (CLIP-FID metric)

Image Generation Super-Resolution +2

1,839

Paper
Code

Time-Frequency Mask Aware Bi-directional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation

no code implementations • 9 Feb 2022 • Jie Chen, Chang Liu, Jiawu Xie, Jie An, Nan Huang

In particular, this method breaks through the limitations of the existing methods, not only achieves good results in multivariate separation, but also effectively separates signals when mixed with 40dB Gaussian noise signals.

Temporal Sequences

Paper
Add Code

ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows

1 code implementation • CVPR 2021 • Jie An, Siyu Huang, Yibing Song, Dejing Dou, Wei Liu, Jiebo Luo

The forward inference projects input images into deep features, while the backward inference remaps deep features back to input images in a lossless and unbiased way.

Style Transfer

148

Paper
Code

Global Image Sentiment Transfer

no code implementations • 22 Jun 2020 • Jie An, Tianlang Chen, Songyang Zhang, Jiebo Luo

This work proposes a novel framework consisting of a reference image retrieval step and a global sentiment transfer step to transfer sentiments of images according to a given sentiment tag.

Image Retrieval Retrieval +3

Paper
Add Code

Real-time Universal Style Transfer on High-resolution Images via Zero-channel Pruning

no code implementations • 16 Jun 2020 • Jie An, Tao Li, Hao-Zhi Huang, Li Shen, Xuan Wang, Yongyi Tang, Jinwen Ma, Wei Liu, Jiebo Luo

Extracting effective deep features to represent content and style information is the key to universal style transfer.

Style Transfer

Paper
Add Code

Ultrafast Photorealistic Style Transfer via Neural Architecture Search

no code implementations • 5 Dec 2019 • Jie An, Haoyi Xiong, Jun Huan, Jiebo Luo

Our method consists of a construction step (C-step) to build a photorealistic stylization network and a pruning step (P-step) for acceleration.

Network Pruning Neural Architecture Search +1

Paper
Add Code

Learning One-Clock Timed Automata

1 code implementation • 23 Oct 2019 • Jie An, Mingshuai Chen, Bohua Zhan, Naijun Zhan, Miaomiao Zhang

We present an algorithm for active learning of deterministic timed automata with a single clock.

Formal Languages and Automata Theory

Paper
Code

Fast Universal Style Transfer for Artistic and Photorealistic Rendering

no code implementations • 6 Jul 2019 • Jie An, Haoyi Xiong, Jiebo Luo, Jun Huan, Jinwen Ma

Given a pair of images as the source of content and the reference of style, existing solutions usually first train an auto-encoder (AE) to reconstruct the image using deep features and then embeds pre-defined style transfer modules into the AE reconstruction procedure to transfer the style of the reconstructed image through modifying the deep features.

Style Transfer

Paper
Add Code

StyleNAS: An Empirical Study of Neural Architecture Search to Uncover Surprisingly Fast End-to-End Universal Style Transfer Networks

no code implementations • 6 Jun 2019 • Jie An, Haoyi Xiong, Jinwen Ma, Jiebo Luo, Jun Huan

Finally compared to existing universal style transfer networks for photorealistic rendering such as PhotoWCT that stacks multiple well-trained auto-encoders and WCT transforms in a non-end-to-end manner, the architectures designed by StyleNAS produce better style-transferred images with details preserving, using a tiny number of operators/parameters, and enjoying around 500x inference time speed-up.

Image Classification Neural Architecture Search +4

Paper
Add Code

NIL: Learning Nonlinear Interpolants

no code implementations • 28 May 2019 • Mingshuai Chen, Jian Wang, Jie An, Bohua Zhan, Deepak Kapur, Naijun Zhan

Nonlinear interpolants have been shown useful for the verification of programs and hybrid systems in contexts of theorem proving, model checking, abstract interpretation, etc.

Automated Theorem Proving

Paper
Add Code

Pyramid Attention Network for Semantic Segmentation

no code implementations • 25 May 2018 • Hanchao Li, Pengfei Xiong, Jie An, Lingxue Wang

A Pyramid Attention Network(PAN) is proposed to exploit the impact of global contextual information in semantic segmentation.

Segmentation Semantic Segmentation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.