Search Results for author: XiaoLong Jiang

Found 17 papers, 12 papers with code

Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective

1 code implementation13 Aug 2024 Ouxiang Li, Jiayin Cai, Yanbin Hao, XiaoLong Jiang, Yao Hu, Fuli Feng

In this paper, we re-examine the SID problem and identify two prevalent biases in current training paradigms, i. e., weakened artifact features and overfitted artifact features.

Image Generation Synthetic Image Detection

A Sanity Check for AI-generated Image Detection

1 code implementation27 Jun 2024 Shilin Yan, Ouxiang Li, Jiayin Cai, Yanbin Hao, XiaoLong Jiang, Yao Hu, Weidi Xie

This effectively enables the model to discern AI-generated images based on semantics or contextual information; Secondly, we select the highest frequency patches and the lowest frequency patches in the image, and compute the low-level patchwise features, aiming to detect AI-generated images by low-level artifacts, for example, noise pattern, anti-aliasing, etc.

Mining Open Semantics from CLIP: A Relation Transition Perspective for Few-Shot Learning

no code implementations17 Jun 2024 Cilin Yan, Haochen Wang, XiaoLong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves

Specifically, we adopt a transformer module which takes the visual feature as "Query", the text features of the anchors as "Key" and the similarity matrix between the text features of anchor and target classes as "Value".

Few-Shot Learning Relation

Controllable Mind Visual Diffusion Model

1 code implementation17 May 2023 Bohan Zeng, Shanglin Li, Xuhui Liu, Sicheng Gao, XiaoLong Jiang, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang

Brain signal visualization has emerged as an active research area, serving as a critical interface between the human visual system and computer vision models.

Attribute Image Generation

PiClick: Picking the desired mask from multiple candidates in click-based interactive segmentation

1 code implementation23 Apr 2023 Cilin Yan, Haochen Wang, Jie Liu, XiaoLong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves

Click-based interactive segmentation aims to generate target masks via human clicking, which facilitates efficient pixel-level annotation and image editing.

Interactive Segmentation Segmentation

MVP-SEG: Multi-View Prompt Learning for Open-Vocabulary Semantic Segmentation

no code implementations14 Apr 2023 Jie Guo, Qimeng Wang, Yan Gao, XiaoLong Jiang, Xu Tang, Yao Hu, Baochang Zhang

CLIP (Contrastive Language-Image Pretraining) is well-developed for open-vocabulary zero-shot image-level recognition, while its applications in pixel-level tasks are less investigated, where most efforts directly adopt CLIP features without deliberative adaptations.

GPR Open Vocabulary Semantic Segmentation +4

Towards Open-Vocabulary Video Instance Segmentation

1 code implementation ICCV 2023 Haochen Wang, Cilin Yan, Shuai Wang, XiaoLong Jiang, Xu Tang, Yao Hu, Weidi Xie, Efstratios Gavves

Video Instance Segmentation (VIS) aims at segmenting and categorizing objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categories in real-world videos.

Instance Segmentation Segmentation +3

Continuous Remote Sensing Image Super-Resolution based on Context Interaction in Implicit Function Space

1 code implementation16 Feb 2023 Keyan Chen, Wenyuan Li, Sen Lei, Jianqi Chen, XiaoLong Jiang, Zhengxia Zou, Zhenwei Shi

Despite its fruitful applications in remote sensing, image super-resolution is troublesome to train and deploy as it handles different resolution magnifications with separate models.

Image Super-Resolution

OvarNet: Towards Open-vocabulary Object Attribute Recognition

1 code implementation CVPR 2023 Keyan Chen, XiaoLong Jiang, Yao Hu, Xu Tang, Yan Gao, Jianqi Chen, Weidi Xie

In this paper, we consider the problem of simultaneously detecting objects and inferring their visual attributes in an image, even for those with no manual annotations provided at the training stage, resembling an open-vocabulary scenario.

 Ranked #1 on Open Vocabulary Attribute Detection on OVAD benchmark (using extra training data)

Attribute Knowledge Distillation +5

SwiftNet: Real-time Video Object Segmentation

1 code implementation CVPR 2021 Haochen Wang, XiaoLong Jiang, Haibing Ren, Yao Hu, Song Bai

In this work we present SwiftNet for real-time semisupervised video object segmentation (one-shot VOS), which reports 77. 8% J &F and 70 FPS on DAVIS 2017 validation dataset, leading all present solutions in overall accuracy and speed performance.

Object Segmentation +3

Horizontal-to-Vertical Video Conversion

1 code implementation11 Jan 2021 Tun Zhu, Daoxin Zhang, Yao Hu, Tianran Wang, XiaoLong Jiang, Jianke Zhu, Jiawei Li

Alongside the prevalence of mobile videos, the general public leans towards consuming vertical videos on hand-held devices.

Boundary Detection Multi-Object Tracking

Graph Neural Based End-to-end Data Association Framework for Online Multiple-Object Tracking

1 code implementation11 Jul 2019 Xiaolong Jiang, Peizhao Li, Yanjing Li, Xian-Tong Zhen

In this work, we present an end-to-end framework to settle data association in online Multiple-Object Tracking (MOT).

Graph Neural Network Multiple Object Tracking

Crowd Counting and Density Estimation by Trellis Encoder-Decoder Networks

no code implementations CVPR 2019 Xiaolong Jiang, Zehao Xiao, Baochang Zhang, Xiantong Zhen, Xianbin Cao, David Doermann, Ling Shao

In this paper, we propose a trellis encoder-decoder network (TEDnet) for crowd counting, which focuses on generating high-quality density estimation maps.

Crowd Counting Decoder +1

Crowd Counting and Density Estimation by Trellis Encoder-Decoder Network

no code implementations3 Mar 2019 Xiaolong Jiang, Zehao Xiao, Baochang Zhang, Xian-Tong Zhen, Xian-Bin Cao, David Doermann, Ling Shao

In this paper, we propose a trellis encoder-decoder network (TEDnet) for crowd counting, which focuses on generating high-quality density estimation maps.

Crowd Counting Decoder +1

Model-free Tracking with Deep Appearance and Motion Features Integration

no code implementations16 Dec 2018 Xiaolong Jiang, Peizhao Li, Xian-Tong Zhen, Xian-Bin Cao

To overcome the object-centric information scarcity, both appearance and motion features are deeply integrated by the proposed AMNet, which is an end-to-end offline trained two-stream network.

Motion Detection Object +1

Cannot find the paper you are looking for? You can Submit a new open access paper.