Search Results for author: Tiezheng Ge

Found 37 papers, 10 papers with code

Enhancing Prompt Following with Visual Control Through Training-Free Mask-Guided Diffusion

no code implementations • 23 Apr 2024 • Hongyu Chen, Yiqi Gao, Min Zhou, Peng Wang, Xubin Li, Tiezheng Ge, Bo Zheng

Meanwhile, a network, dubbed Masked ControlNet, is designed to utilize these object masks for object generation in the misaligned visual control region.

Accelerating Image Generation with Sub-path Linear Approximation Model

no code implementations • 22 Apr 2024 • Chen Xu, Tianhui Song, Weixin Feng, Xubin Li, Tiezheng Ge, Bo Zheng, LiMin Wang

Diffusion models have significantly advanced the state of the art in image, audio, and video generation tasks.

RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance

no code implementations • 22 Apr 2024 • Chengrui Wang, PengFei Liu, Min Zhou, Ming Zeng, Xubin Li, Tiezheng Ge, Bo Zheng

The style guidance is a hand image, e.g., the malformed hand itself, and is employed to furnish the style reference for hand refining.

Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation

no code implementations • 5 Mar 2024 • Weijie Li, Litong Gong, Yiran Zhu, Fanda Fan, Biao Wang, Tiezheng Ge, Bo Zheng

The experimental results demonstrate the effectiveness of our approach in improving the fidelity of generated videos.

Denoising Image Animation +1

AtomoVideo: High Fidelity Image-to-Video Generation

no code implementations • 4 Mar 2024 • Litong Gong, Yiran Zhu, Weijie Li, Xiaoyang Kang, Biao Wang, Tiezheng Ge, Bo Zheng

Recently, video generation has developed rapidly, building on superior text-to-image generation techniques.

Image to Video Generation Text-to-Image Generation

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

no code implementations • 22 Feb 2024 • Ge Bai, Jie Liu, Xingyuan Bu, Yancheng He, Jiaheng Liu, Zhanhui Zhou, Zhuoran Lin, Wenbo Su, Tiezheng Ge, Bo Zheng, Wanli Ouyang

By conducting a detailed analysis of real multi-turn dialogue data, we construct a three-tier hierarchical ability taxonomy comprising 4208 turns across 1388 multi-turn dialogues in 13 distinct tasks.

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models

1 code implementation • 22 Feb 2024 • Yanan Wu, Jie Liu, Xingyuan Bu, Jiaheng Liu, Zhanhui Zhou, Yuanxing Zhang, Chenchen Zhang, Zhiqi Bai, Haibin Chen, Tiezheng Ge, Wanli Ouyang, Wenbo Su, Bo Zheng

This paper introduces ConceptMath, a bilingual (English and Chinese), fine-grained benchmark that evaluates concept-wise mathematical reasoning of Large Language Models (LLMs).

Math Mathematical Reasoning

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

no code implementations • 13 Jan 2024 • Jiaheng Liu, Zhiqi Bai, Yuanxing Zhang, Chenchen Zhang, Yu Zhang, Ge Zhang, Jiakai Wang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng

Typically, training LLMs with long context sizes is computationally expensive, requiring extensive training hours and GPU resources.

4k Position

Hierarchical Masked 3D Diffusion Model for Video Outpainting

no code implementations • 5 Sep 2023 • Fanda Fan, Chaoxu Guo, Litong Gong, Biao Wang, Tiezheng Ge, Yuning Jiang, Chunjie Luo, Jianfeng Zhan

Our pipeline benefits from bidirectional learning of the mask modeling and thus can employ a hybrid strategy of infilling and interpolation when generating sparse frames.

Image Outpainting

Deep Task-specific Bottom Representation Network for Multi-Task Recommendation

no code implementations • 11 Aug 2023 • Qi Liu, Zhilong Zhou, Gangwei Jiang, Tiezheng Ge, Defu Lian

In this paper, we focus on the bottom representation learning of MTL in RS and propose the Deep Task-specific Bottom Representation Network (DTRN) to alleviate the negative transfer problem.

Multi-Task Learning Recommendation Systems +1

TextPainter: Multimodal Text Image Generation with Visual-harmony and Text-comprehension for Poster Design

no code implementations • 9 Aug 2023 • Yifan Gao, Jinpeng Lin, Min Zhou, Chuanbin Liu, Hongtao Xie, Tiezheng Ge, Yuning Jiang

Specifically, TextPainter takes the global-local background image as a hint of style and guides the text image generation with visual harmony.

Image Generation Language Modelling +2

AutoPoster: A Highly Automatic and Content-aware Design System for Advertising Poster Generation

no code implementations • 2 Aug 2023 • Jinpeng Lin, Min Zhou, Ye Ma, Yifan Gao, Chenxi Fei, Yangjian Chen, Zhang Yu, Tiezheng Ge

Meanwhile, to our knowledge, we propose the first poster generation dataset that includes visual attribute annotations for over 76k posters.

Attribute

Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences

no code implementations • 31 Jul 2023 • Dingyi Yang, Hongyu Chen, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin

To address these limitations, we explore the problem of Few-Shot Stylized Visual Captioning, which aims to generate captions in any desired style, using only a few examples as guidance during inference, without requiring further training.

Image Captioning Language Modelling

Edit As You Wish: Video Description Editing with Multi-grained Commands

no code implementations • 15 May 2023 • Linli Yao, Yuanmeng Zhang, Ziheng Wang, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin

In this paper, we propose a novel Video Description Editing (VDEdit) task to automatically revise an existing video description guided by flexible user requests.

Attribute Position +3

Unsupervised Domain Adaption with Pixel-level Discriminator for Image-aware Layout Generation

no code implementations • CVPR 2023 • Chenchen Xu, Min Zhou, Tiezheng Ge, Yuning Jiang, Weiwei Xu

This paper focuses on using the GAN-based model conditioned on image contents to generate advertising poster graphic layouts, which requires an advertising poster layout dataset with paired product images and graphic layouts.

Domain Adaptation

CF-Font: Content Fusion for Few-shot Font Generation

1 code implementation • CVPR 2023 • Chi Wang, Min Zhou, Tiezheng Ge, Yuning Jiang, Hujun Bao, Weiwei Xu

Content and style disentanglement is an effective way to achieve few-shot font generation.

Disentanglement Font Generation

Video Object of Interest Segmentation

no code implementations • 6 Dec 2022 • Siyuan Zhou, Chunru Zhan, Biao Wang, Tiezheng Ge, Yuning Jiang, Li Niu

Given a video and a target image of interest, our objective is to simultaneously segment and track all objects in the video that are relevant to the target image.

Object Segmentation +3

Motion and Appearance Adaptation for Cross-Domain Motion Transfer

no code implementations • 29 Sep 2022 • Borun Xu, Biao Wang, Jinhong Deng, Jiale Tao, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan

Motion transfer aims to transfer the motion of a driving video to a source image.

Object

Motion Transformer for Unsupervised Image Animation

1 code implementation • 28 Sep 2022 • Jiale Tao, Biao Wang, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan

Image animation aims to animate a source image by using motion learned from a driving video.

Image Animation

Geometry Aligned Variational Transformer for Image-conditioned Layout Generation

no code implementations • 2 Sep 2022 • Yunning Cao, Ye Ma, Min Zhou, Chuanbin Liu, Hongtao Xie, Tiezheng Ge, Yuning Jiang

First, a self-attention mechanism is adopted to model the contextual relationship within layout elements, while a cross-attention mechanism is used to fuse the visual information of conditional images.

Layout Design Object Localization

Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information

no code implementations • 7 May 2022 • Zhipeng Zhang, Xinglin Hou, Kai Niu, Zhongzhen Huang, Tiezheng Ge, Yuning Jiang, Qi Wu, Peng Wang

Therefore, we present a dataset, E-MMAD (e-commercial multimodal multi-structured advertisement copywriting), which requires and supports much more detailed information in text generation.

Text Generation Video Captioning

Dual-Level Decoupled Transformer for Video Captioning

no code implementations • 6 May 2022 • Yiqi Gao, Xinglin Hou, Wei Suo, Mengyang Sun, Tiezheng Ge, Yuning Jiang, Peng Wang

As for the latter, "couple" means treating the generation of visual semantic and syntax-related words equally.

Descriptive Sentence +1

Composition-aware Graphic Layout GAN for Visual-textual Presentation Designs

no code implementations • 30 Apr 2022 • Min Zhou, Chenchen Xu, Ye Ma, Tiezheng Ge, Yuning Jiang, Weiwei Xu

Through both quantitative and qualitative evaluations, we demonstrate that the proposed model can synthesize high-quality graphic layouts according to image compositions.

CapOnImage: Context-driven Dense-Captioning on Image

no code implementations • 27 Apr 2022 • Yiqi Gao, Xinglin Hou, Yuanmeng Zhang, Tiezheng Ge, Yuning Jiang, Peng Wang

Existing image captioning systems are dedicated to generating narrative captions for images, which are spatially detached from the image in presentation.

Dense Captioning Image Captioning

Self-Supervised Text Erasing with Controllable Image Synthesis

no code implementations • 27 Apr 2022 • Gangwei Jiang, Shiyao Wang, Tiezheng Ge, Yuning Jiang, Ying WEI, Defu Lian

The synthetic training images with erasure ground-truth are then fed to train a coarse-to-fine erasing network.

Image Generation

Estimation of Reliable Proposal Quality for Temporal Action Detection

1 code implementation • 25 Apr 2022 • Junshan Hu, Chaoxu Guo, Liansheng Zhuang, Biao Wang, Tiezheng Ge, Yuning Jiang, Houqiang Li

From the region perspective, we introduce the Region Evaluate Module (REM), which uses a new and efficient sampling method to build proposal feature representations that contain more contextual information than point features, in order to refine the category score and proposal boundary.

Action Detection

Structure-Aware Motion Transfer with Deformable Anchor Model

1 code implementation • CVPR 2022 • Jiale Tao, Biao Wang, Borun Xu, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan

Specifically, inspired by the known deformable part model (DPM), our DAM introduces two types of anchors or keypoints: i) a number of motion anchors that capture both appearance and motion information from the source image and driving video; ii) a latent root anchor, which is linked to the motion anchors to facilitate better learning of the representations of the object structure information.

Learning Pixel-Level Distinctions for Video Highlight Detection

no code implementations • CVPR 2022 • Fanyue Wei, Biao Wang, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan

To this end, we propose to learn pixel-level distinctions to improve video highlight detection.

Highlight Detection

Move As You Like: Image Animation in E-Commerce Scenario

1 code implementation • 19 Dec 2021 • Borun Xu, Biao Wang, Jiale Tao, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan

Creative image animations are attractive in e-commerce applications, where motion transfer is one of the important ways to generate animations from static images.

Image Animation

Boosting Image Outpainting with Semantic Layout Prediction

no code implementations • 18 Oct 2021 • Ye Ma, Jin Ma, Min Zhou, Quan Chen, Tiezheng Ge, Yuning Jiang, Tong Lin

Secondly, another GAN model is trained to synthesize real images based on the extended semantic layouts.

Image Outpainting Semantic Segmentation

Efficient Optimal Selection for Composited Advertising Creatives with Tree Structure

1 code implementation • 2 Mar 2021 • Jin Chen, Tiezheng Ge, Gangwei Jiang, Zhiqiang Zhang, Defu Lian, Kai Zheng

Based on the tree structure, Thompson sampling is adapted with dynamic programming, leading to efficient exploration for potential ad creatives with the largest CTR.

Efficient Exploration Thompson Sampling

Automated Creative Optimization for E-Commerce Advertising

1 code implementation • 28 Feb 2021 • Jin Chen, Ju Xu, Gangwei Jiang, Tiezheng Ge, Zhiqiang Zhang, Defu Lian, Kai Zheng

However, interactions between creative elements may be more complex than the inner product, and the FM-estimated CTR may be of high variance due to limited feedback.

AutoML Click-Through Rate Prediction +2

A Hybrid Bandit Model with Visual Priors for Creative Ranking in Display Advertising

1 code implementation • 8 Feb 2021 • Shiyao Wang, Qi Liu, Tiezheng Ge, Defu Lian, Zhiqiang Zhang

Creatives play a very important role in e-commerce for exhibiting products.

Recommendation Systems

Semantic Human Matting

2 code implementations • 5 Sep 2018 • Quan Chen, Tiezheng Ge, Yanyu Xu, Zhiqiang Zhang, Xinxin Yang, Kun Gai

SHM is the first algorithm that learns to jointly fit both semantic information and high quality details with deep networks.

Ranked #5 on Image Matting on AIM-500

Image Matting

Image Matters: Visually modeling user behaviors using Advanced Model Server

no code implementations • 17 Nov 2017 • Tiezheng Ge, Liqin Zhao, Guorui Zhou, Keyu Chen, Shuying Liu, Huimin Yi, Zelin Hu, Bochao Liu, Peng Sun, Haoyu Liu, Pengtao Yi, Sui Huang, Zhiqiang Zhang, Xiaoqiang Zhu, Yu Zhang, Kun Gai

So we propose to model user preference jointly with user behavior ID features and behavior images.

Click-Through Rate Prediction

Product Sparse Coding

no code implementations • CVPR 2014 • Tiezheng Ge, Kaiming He, Jian Sun

In this paper, we study a special case of sparse coding in which the codebook is a Cartesian product of two subcodebooks.

General Classification Image Classification +2

Optimized Product Quantization for Approximate Nearest Neighbor Search

no code implementations • CVPR 2013 • Tiezheng Ge, Kaiming He, Qifa Ke, Jian Sun

Product quantization is an effective vector quantization approach to compactly encode high-dimensional vectors for fast approximate nearest neighbor (ANN) search.

Quantization
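
For readers unfamiliar with the technique named in this entry, the following is a minimal sketch of plain product quantization. It is not the optimized variant the paper proposes. The codebooks are hand-picked for illustration (in practice they would typically be learned, for example with k-means), and all function and variable names here are hypothetical.

```python
from typing import List, Sequence

Vector = Sequence[float]
Codebook = List[List[float]]  # one sub-codebook: a list of centroids


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def split(vec: Vector, num_subspaces: int) -> List[List[float]]:
    """Cut a vector into num_subspaces contiguous, equal-length chunks."""
    if len(vec) % num_subspaces != 0:
        raise ValueError("vector length must be divisible by num_subspaces")
    step = len(vec) // num_subspaces
    return [list(vec[i:i + step]) for i in range(0, len(vec), step)]


def encode(vec: Vector, codebooks: List[Codebook]) -> List[int]:
    """Replace each chunk by the index of its nearest centroid."""
    chunks = split(vec, len(codebooks))
    return [
        min(range(len(cb)), key=lambda k: sq_dist(chunk, cb[k]))
        for chunk, cb in zip(chunks, codebooks)
    ]


def decode(code: Sequence[int], codebooks: List[Codebook]) -> List[float]:
    """Rebuild an approximate vector by concatenating the chosen centroids."""
    out: List[float] = []
    for idx, cb in zip(code, codebooks):
        out.extend(cb[idx])
    return out


def asymmetric_distance(query: Vector, code: Sequence[int],
                        codebooks: List[Codebook]) -> float:
    """Approximate squared distance from a raw query to an encoded vector.

    One lookup table per subspace holds the query-to-centroid distances,
    so each encoded vector costs only one table lookup per subspace.
    """
    chunks = split(query, len(codebooks))
    tables = [[sq_dist(chunk, c) for c in cb]
              for chunk, cb in zip(chunks, codebooks)]
    return sum(table[idx] for table, idx in zip(tables, code))


# Illustrative (assumed) codebooks: 2 subspaces, 2 centroids each.
codebooks = [
    [[0.0, 0.0], [1.0, 1.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]
code = encode([0.9, 1.2, 0.1, 0.8], codebooks)
print(code, decode(code, codebooks))
```

With these two sub-codebooks, the four-dimensional vector is stored as two small indices, and the effective codebook is the Cartesian product of the sub-codebooks (four combined centroids from only four stored sub-centroids).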
