Search Results for author: Tiezheng Ge

Found 37 papers, 10 papers with code

Enhancing Prompt Following with Visual Control Through Training-Free Mask-Guided Diffusion

no code implementations23 Apr 2024 Hongyu Chen, Yiqi Gao, Min Zhou, Peng Wang, Xubin Li, Tiezheng Ge, Bo Zheng

Meanwhile, a network, dubbed as Masked ControlNet, is designed to utilize these object masks for object generation in the misaligned visual control region.

Accelerating Image Generation with Sub-path Linear Approximation Model

no code implementations22 Apr 2024 Chen Xu, Tianhui Song, Weixin Feng, Xubin Li, Tiezheng Ge, Bo Zheng, LiMin Wang

Diffusion models have significantly advanced the state of the art in image, audio, and video generation tasks.

RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance

no code implementations22 Apr 2024 Chengrui Wang, PengFei Liu, Min Zhou, Ming Zeng, Xubin Li, Tiezheng Ge, Bo Zheng

The style guidance is a hand image, e. g., the malformed hand itself, and is employed to furnish the style reference for hand refining.

Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation

no code implementations5 Mar 2024 Weijie Li, Litong Gong, Yiran Zhu, Fanda Fan, Biao Wang, Tiezheng Ge, Bo Zheng

The experimental results demonstrate the effectiveness of our approach in improving the fidelity of generated videos.

Denoising Image Animation +1

AtomoVideo: High Fidelity Image-to-Video Generation

no code implementations4 Mar 2024 Litong Gong, Yiran Zhu, Weijie Li, Xiaoyang Kang, Biao Wang, Tiezheng Ge, Bo Zheng

Recently, video generation has achieved significant rapid development based on superior text-to-image generation techniques.

Image to Video Generation Text-to-Image Generation

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

no code implementations22 Feb 2024 Ge Bai, Jie Liu, Xingyuan Bu, Yancheng He, Jiaheng Liu, Zhanhui Zhou, Zhuoran Lin, Wenbo Su, Tiezheng Ge, Bo Zheng, Wanli Ouyang

By conducting a detailed analysis of real multi-turn dialogue data, we construct a three-tier hierarchical ability taxonomy comprising 4208 turns across 1388 multi-turn dialogues in 13 distinct tasks.

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models

1 code implementation22 Feb 2024 Yanan Wu, Jie Liu, Xingyuan Bu, Jiaheng Liu, Zhanhui Zhou, Yuanxing Zhang, Chenchen Zhang, Zhiqi Bai, Haibin Chen, Tiezheng Ge, Wanli Ouyang, Wenbo Su, Bo Zheng

This paper introduces ConceptMath, a bilingual (English and Chinese), fine-grained benchmark that evaluates concept-wise mathematical reasoning of Large Language Models (LLMs).

Math Mathematical Reasoning

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

no code implementations13 Jan 2024 Jiaheng Liu, Zhiqi Bai, Yuanxing Zhang, Chenchen Zhang, Yu Zhang, Ge Zhang, Jiakai Wang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng

Typically, training LLMs with long context sizes is computationally expensive, requiring extensive training hours and GPU resources.

4k Position

Hierarchical Masked 3D Diffusion Model for Video Outpainting

no code implementations5 Sep 2023 Fanda Fan, Chaoxu Guo, Litong Gong, Biao Wang, Tiezheng Ge, Yuning Jiang, Chunjie Luo, Jianfeng Zhan

Our pipeline benefits from bidirectional learning of the mask modeling and thus can employ a hybrid strategy of infilling and interpolation when generating sparse frames.

Image Outpainting

Deep Task-specific Bottom Representation Network for Multi-Task Recommendation

no code implementations11 Aug 2023 Qi Liu, Zhilong Zhou, Gangwei Jiang, Tiezheng Ge, Defu Lian

In this paper, we focus on the bottom representation learning of MTL in RS and propose the Deep Task-specific Bottom Representation Network (DTRN) to alleviate the negative transfer problem.

Multi-Task Learning Recommendation Systems +1

AutoPoster: A Highly Automatic and Content-aware Design System for Advertising Poster Generation

no code implementations2 Aug 2023 Jinpeng Lin, Min Zhou, Ye Ma, Yifan Gao, Chenxi Fei, Yangjian Chen, Zhang Yu, Tiezheng Ge

Meanwhile, to our knowledge, we propose the first poster generation dataset that includes visual attribute annotations for over 76k posters.

Attribute

Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences

no code implementations31 Jul 2023 Dingyi Yang, Hongyu Chen, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin

To address these limitations, we explore the problem of Few-Shot Stylized Visual Captioning, which aims to generate captions in any desired style, using only a few examples as guidance during inference, without requiring further training.

Image Captioning Language Modelling

Edit As You Wish: Video Description Editing with Multi-grained Commands

no code implementations15 May 2023 Linli Yao, Yuanmeng Zhang, Ziheng Wang, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin

In this paper, we propose a novel Video Description Editing (VDEdit) task to automatically revise an existing video description guided by flexible user requests.

Attribute Position +3

Unsupervised Domain Adaption with Pixel-level Discriminator for Image-aware Layout Generation

no code implementations CVPR 2023 Chenchen Xu, Min Zhou, Tiezheng Ge, Yuning Jiang, Weiwei Xu

This paper focuses on using the GAN-based model conditioned on image contents to generate advertising poster graphic layouts, which requires an advertising poster layout dataset with paired product images and graphic layouts.

Domain Adaptation

Video Object of Interest Segmentation

no code implementations6 Dec 2022 Siyuan Zhou, Chunru Zhan, Biao Wang, Tiezheng Ge, Yuning Jiang, Li Niu

Given a video and a target image of interest, our objective is to simultaneously segment and track all objects in the video that are relevant to the target image.

Object Segmentation +3

Motion Transformer for Unsupervised Image Animation

1 code implementation28 Sep 2022 Jiale Tao, Biao Wang, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan

Image animation aims to animate a source image by using motion learned from a driving video.

Image Animation

Geometry Aligned Variational Transformer for Image-conditioned Layout Generation

no code implementations2 Sep 2022 Yunning Cao, Ye Ma, Min Zhou, Chuanbin Liu, Hongtao Xie, Tiezheng Ge, Yuning Jiang

First, self-attention mechanism is adopted to model the contextual relationship within layout elements, while cross-attention mechanism is used to fuse the visual information of conditional images.

Layout Design Object Localization

Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information

no code implementations7 May 2022 Zhipeng Zhang, Xinglin Hou, Kai Niu, Zhongzhen Huang, Tiezheng Ge, Yuning Jiang, Qi Wu, Peng Wang

Therefore, we present a dataset, E-MMAD (e-commercial multimodal multi-structured advertisement copywriting), which requires, and supports much more detailed information in text generation.

Text Generation Video Captioning

Dual-Level Decoupled Transformer for Video Captioning

no code implementations6 May 2022 Yiqi Gao, Xinglin Hou, Wei Suo, Mengyang Sun, Tiezheng Ge, Yuning Jiang, Peng Wang

As for the latter, \textbf{\textit{"couple"}} means treating the generation of visual semantic and syntax-related words equally.

Descriptive Sentence +1

Composition-aware Graphic Layout GAN for Visual-textual Presentation Designs

no code implementations30 Apr 2022 Min Zhou, Chenchen Xu, Ye Ma, Tiezheng Ge, Yuning Jiang, Weiwei Xu

Through both quantitative and qualitative evaluations, we demonstrate that the proposed model can synthesize high-quality graphic layouts according to image compositions.

CapOnImage: Context-driven Dense-Captioning on Image

no code implementations27 Apr 2022 Yiqi Gao, Xinglin Hou, Yuanmeng Zhang, Tiezheng Ge, Yuning Jiang, Peng Wang

Existing image captioning systems are dedicated to generating narrative captions for images, which are spatially detached from the image in presentation.

Dense Captioning Image Captioning

Self-Supervised Text Erasing with Controllable Image Synthesis

no code implementations27 Apr 2022 Gangwei Jiang, Shiyao Wang, Tiezheng Ge, Yuning Jiang, Ying WEI, Defu Lian

The synthetic training images with erasure ground-truth are then fed to train a coarse-to-fine erasing network.

Image Generation

Estimation of Reliable Proposal Quality for Temporal Action Detection

1 code implementation25 Apr 2022 Junshan Hu, Chaoxu Guo, Liansheng Zhuang, Biao Wang, Tiezheng Ge, Yuning Jiang, Houqiang Li

For the region perspective, we introduce Region Evaluate Module (REM) which uses a new and efficient sampling method for proposal feature representation containing more contextual information compared with point feature to refine category score and proposal boundary.

Action Detection

Structure-Aware Motion Transfer with Deformable Anchor Model

1 code implementation CVPR 2022 Jiale Tao, Biao Wang, Borun Xu, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan

Specifically, inspired by the known deformable part model (DPM), our DAM introduces two types of anchors or keypoints: i) a number of motion anchors that capture both appearance and motion information from the source image and driving video; ii) a latent root anchor, which is linked to the motion anchors to facilitate better learning of the representations of the object structure information.

Move As You Like: Image Animation in E-Commerce Scenario

1 code implementation19 Dec 2021 Borun Xu, Biao Wang, Jiale Tao, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan

Creative image animations are attractive in e-commerce applications, where motion transfer is one of the import ways to generate animations from static images.

Image Animation

Boosting Image Outpainting with Semantic Layout Prediction

no code implementations18 Oct 2021 Ye Ma, Jin Ma, Min Zhou, Quan Chen, Tiezheng Ge, Yuning Jiang, Tong Lin

Secondly, another GAN model is trained to synthesize real images based on the extended semantic layouts.

Image Outpainting Semantic Segmentation

Efficient Optimal Selection for Composited Advertising Creatives with Tree Structure

1 code implementation2 Mar 2021 Jin Chen, Tiezheng Ge, Gangwei Jiang, Zhiqiang Zhang, Defu Lian, Kai Zheng

Based on the tree structure, Thompson sampling is adapted with dynamic programming, leading to efficient exploration for potential ad creatives with the largest CTR.

Efficient Exploration Thompson Sampling

Automated Creative Optimization for E-Commerce Advertising

1 code implementation28 Feb 2021 Jin Chen, Ju Xu, Gangwei Jiang, Tiezheng Ge, Zhiqiang Zhang, Defu Lian, Kai Zheng

However, interactions between creative elements may be more complex than the inner product, and the FM-estimated CTR may be of high variance due to limited feedback.

AutoML Click-Through Rate Prediction +2

Semantic Human Matting

2 code implementations5 Sep 2018 Quan Chen, Tiezheng Ge, Yanyu Xu, Zhiqiang Zhang, Xinxin Yang, Kun Gai

SHM is the first algorithm that learns to jointly fit both semantic information and high quality details with deep networks.

Image Matting

Product Sparse Coding

no code implementations CVPR 2014 Tiezheng Ge, Kaiming He, Jian Sun

In this paper, we study a special case of sparse coding in which the codebook is a Cartesian product of two subcodebooks.

General Classification Image Classification +2

Optimized Product Quantization for Approximate Nearest Neighbor Search

no code implementations CVPR 2013 Tiezheng Ge, Kaiming He, Qifa Ke, Jian Sun

Product quantization is an effective vector quantization approach to compactly encode high-dimensional vectors for fast approximate nearest neighbor (ANN) search.

Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.