Search Results for author: Lingzhi Zhang

Found 16 papers, 7 papers with code

FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity

no code implementations 23 Nov 2024 Hang Hua, Qing Liu, Lingzhi Zhang, Jing Shi, Zhifei Zhang, Yilin Wang, Jianming Zhang, Jiebo Luo

To support this endeavor, we introduce COMPOSITIONCAP, a new dataset for multi-grained region compositional image captioning, which introduces the task of compositional attribute-aware regional image captioning.

Attribute, Cross-Modal Retrieval +4

Detecting Human Artifacts from Text-to-Image Models

1 code implementation 21 Nov 2024 Kaihong Wang, Lingzhi Zhang, Jianming Zhang

In this study, we address this challenge by curating Human Artifact Dataset (HAD), the first large-scale dataset specifically designed to identify and localize human artifacts.

Anatomy, Artifact Detection +1

Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models

no code implementations 23 May 2024 Katherine Xu, Lingzhi Zhang, Jianbo Shi

In this work, we conduct a large-scale scientific study into the impact of random seeds during diffusion inference.

Image Generation

Amodal Completion via Progressive Mixed Context Diffusion

no code implementations CVPR 2024 Katherine Xu, Lingzhi Zhang, Jianbo Shi

We propose to sidestep many of the difficulties of existing approaches, which typically involve a two-step process of predicting amodal masks and then generating pixels.

Inpainting at Modern Camera Resolution by Guided PatchMatch with Auto-Curation

no code implementations 6 Aug 2022 Lingzhi Zhang, Connelly Barnes, Kevin Wampler, Sohrab Amirghodsi, Eli Shechtman, Zhe Lin, Jianbo Shi

Recently, deep models have established SOTA performance for low-resolution image inpainting, but they lack fidelity at resolutions associated with modern cameras such as 4K or more, and for large holes.

4k, Image Inpainting

Perceptual Artifacts Localization for Inpainting

1 code implementation 5 Aug 2022 Lingzhi Zhang, Yuqian Zhou, Connelly Barnes, Sohrab Amirghodsi, Zhe Lin, Eli Shechtman, Jianbo Shi

Inspired by this workflow, we propose a new learning task of automatic segmentation of inpainting perceptual artifacts, and apply the model for inpainting model evaluation and iterative refinement.

Image Inpainting

Multimodal Image Outpainting With Regularized Normalized Diversification

1 code implementation 25 Oct 2019 Lingzhi Zhang, Jiancong Wang, Jianbo Shi

In this paper, we study the problem of generating a set of realistic and diverse backgrounds when given only a small foreground region.

Image Outpainting

Deep Image Blending

2 code implementations 25 Oct 2019 Lingzhi Zhang, Tarmily Wen, Jianbo Shi

In addition, we jointly optimize the proposed Poisson blending loss as well as the style and content loss computed from a deep network, and reconstruct the blending region by iteratively updating the pixels using the L-BFGS solver.

Object

Neural Embedding for Physical Manipulations

no code implementations 13 Jul 2019 Lingzhi Zhang, Andong Cao, Rui Li, Jianbo Shi

In common real-world robotic operations, action and state spaces can be vast and sometimes unknown, and observations are often relatively sparse.
