Search Results for author: Sixiao Zheng

Found 10 papers, 5 papers with code

ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning

no code implementations CVPR 2025 Zhenyang Liu, Yikai Wang, Sixiao Zheng, Tongying Pan, Longfei Liang, Yanwei Fu, xiangyang xue

By incorporating 2D segmentation masks from the SAM and multi-view CLIP embeddings, ReasonGrounder selects Gaussian groups based on object scale, enabling accurate localization through both explicit and implicit language understanding, even in novel, occluded views.

3D visual grounding Feature Splatting +1

VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation

no code implementations11 Feb 2025 Sixiao Zheng, Zimian Peng, Yanpeng Zhou, Yi Zhu, Hang Xu, Xiangru Huang, Yanwei Fu

Recent image-to-video generation methods have demonstrated success in enabling control over one or two visual elements, such as camera motion or object motion.

Image to Video Generation Object +1

ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context

1 code implementation13 Jul 2024 Sixiao Zheng, Yanwei Fu

Visual storytelling involves generating a sequence of coherent frames from a textual storyline while maintaining consistency in characters and scenes.

Story Continuation Story Visualization +2

Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT

no code implementations24 Feb 2024 Sixiao Zheng, Jingyang Huo, Yu Wang, Yanwei Fu

We propose an Intelligent Director framework, utilizing LENS to generate descriptions for images and video frames and combining ChatGPT to generate coherent captions while recommending appropriate music names.

Retrieval Style Transfer

Vision Transformers: From Semantic Segmentation to Dense Prediction

3 code implementations19 Jul 2022 Li Zhang, Jiachen Lu, Sixiao Zheng, Xinxuan Zhao, Xiatian Zhu, Yanwei Fu, Tao Xiang, Jianfeng Feng, Philip H. S. Torr

Extensive experiments show that our methods achieve appealing performance on a variety of dense prediction tasks (e. g., object detection and instance segmentation and semantic segmentation) as well as image classification.

image-classification Image Classification +7

Clustering by the Probability Distributions from Extreme Value Theory

1 code implementation20 Feb 2022 Sixiao Zheng, Ke Fan, Yanxi Hou, Jianfeng Feng, Yanwei Fu

In contrast, the GPD fits the distribution of distance to the centroid exceeding a sufficiently large threshold, leading to a more stable performance of GPD k-means.

Clustering

NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection

1 code implementation4 Jun 2021 Zekun Luo, Zheng Fang, Sixiao Zheng, Yabiao Wang, Yanwei Fu

Non-Maximum Suppression (NMS) is essential for object detection and affects the evaluation results by incorporating False Positives (FP) and False Negatives (FN), especially in crowd occlusion scenes.

object-detection Object Detection +1

Incrementally Zero-Shot Detection by an Extreme Value Analyzer

no code implementations23 Mar 2021 Sixiao Zheng, Yanwei Fu, Yanxi Hou

However, zero-shot learning models assume that all seen classes should be known beforehand, while incremental learning models cannot recognize unseen classes.

class-incremental learning Class Incremental Learning +4

Extreme Value k-means Clustering

no code implementations25 Sep 2019 Sixiao Zheng, Yanxi Hou, Yanwei Fu, Jianfeng Feng

We thus propose a novel algorithm called Extreme Value k-means (EV k-means), including GEV k-means and GPD k-means.

Clustering Computational Efficiency

Cannot find the paper you are looking for? You can Submit a new open access paper.