Search Results for author: Jie Guo

Found 48 papers, 7 papers with code

Fused Text Segmentation Networks for Multi-oriented Scene Text Detection

no code implementations • 11 Sep 2017 • Yuchen Dai, Zheng Huang, Yuting Gao, Youxuan Xu, Kai Chen, Jie Guo, Weidong Qiu

In this paper, we introduce a novel end-end framework for multi-oriented scene text detection from an instance-aware semantic segmentation perspective.

Ranked #12 on Scene Text Detection on MSRA-TD500

Multi-Oriented Scene Text Detection object-detection +6

Paper
Add Code

FPAN: Fine-grained and Progressive Attention Localization Network for Data Retrieval

no code implementations • 5 Apr 2018 • Sijia Chen, Bin Song, Jie Guo, Xiaojiang Du, Mohsen Guizani

The Localization of the target object for data retrieval is a key issue in the Intelligent and Connected Transportation Systems (ICTS).

Multi-Task Learning Object +3

Paper
Add Code

Stochastic Channel Decorrelation Network and Its Application to Visual Tracking

no code implementations • 3 Jul 2018 • Jie Guo, Tingfa Xu, Shenwang Jiang, Ziyi Shen

Deep convolutional neural networks (CNNs) have dominated many computer vision domains because of their great power to extract good features automatically.

Visual Tracking

Paper
Add Code

Single Image Highlight Removal with a Sparse and Low-Rank Reflection Model

1 code implementation • ECCV 2018 • Jie Guo, Zuojian Zhou, Li-Min Wang

We propose a sparse and low-rank reflection model for specular highlight detection and removal using a single input image.

Highlight Detection highlight removal

Paper
Code

Super-Resolution of Brain MRI Images using Overcomplete Dictionaries and Nonlocal Similarity

no code implementations • 13 Feb 2019 • Yinghua Li, Bin Song, Jie Guo, Xiaojiang Du, Mohsen Guizani

The sparsity and self-similarity of the image blocks are taken as the constraints.

Compressive Sensing Image Super-Resolution

Paper
Add Code

Interest-Related Item Similarity Model Based on Multimodal Data for Top-N Recommendation

no code implementations • 13 Feb 2019 • Junmei Lv, Bin Song, Jie Guo, Xiaojiang Du, Mohsen Guizani

Specifically, the Multimodal IRIS model consists of three modules, i. e., multimodal feature learning module, the Interest-Related Network (IRN) module and item similarity recommendation module.

Recommendation Systems

Paper
Add Code

Self-Selective Correlation Ship Tracking Method for Smart Ocean System

no code implementations • 26 Feb 2019 • Xu Kang, Bin Song, Jie Guo, Xiaojiang Du, Mohsen Guizani

In recent years, with the development of the marine industry, navigation environment becomes more complicated.

Management regression

Paper
Add Code

Learning Actor Relation Graphs for Group Activity Recognition

2 code implementations • CVPR 2019 • Jianchao Wu, Li-Min Wang, Li Wang, Jie Guo, Gangshan Wu

To this end, we propose to build a flexible and efficient Actor Relation Graph (ARG) to simultaneously capture the appearance and position relation between actors.

Ranked #3 on Group Activity Recognition on Collective Activity

Action Recognition Group Activity Recognition +1

169

Paper
Code

Hybrid Models for Open Set Recognition

no code implementations • ECCV 2020 • Hongjie Zhang, Ang Li, Jie Guo, Yanwen Guo

We propose the OpenHybrid framework, which is composed of an encoder to encode the input data into a joint embedding space, a classifier to classify samples to inlier classes, and a flow-based density estimator to detect whether a sample belongs to the unknown category.

Ranked #6 on Out-of-Distribution Detection on CIFAR-10 vs CIFAR-100

Open Set Learning Out-of-Distribution Detection

Paper
Add Code

FA-GANs: Facial Attractiveness Enhancement with Generative Adversarial Networks on Frontal Faces

no code implementations • 17 May 2020 • Jingwu He, Chuan Wang, Yang Zhang, Jie Guo, Yanwen Guo

To the best of our knowledge, we are the first to enhance the facial attractiveness with GANs in both geometry and appearance aspects.

Paper
Add Code

Image Stitching Based on Planar Region Consensus

no code implementations • 6 Jul 2020 • Aocheng Li, Jie Guo, Yanwen Guo

We specifically design a new module to make fully use of existing semantic segmentation networks to accommodate planar segmentation.

Image Stitching Segmentation +1

Paper
Add Code

Partially Observable Online Change Detection via Smooth-Sparse Decomposition

no code implementations • 22 Sep 2020 • Jie Guo, Hao Yan, Chen Zhang, Steven Hoi

We consider online change detection of high dimensional data streams with sparse changes, where only a subset of data streams can be observed at each sensing time point due to limited sensing capacities.

Bayesian Inference Change Detection +1

Paper
Add Code

Hierarchical Disentangled Representation Learning for Outdoor Illumination Estimation and Editing

no code implementations • ICCV 2021 • Piaopiao Yu, Jie Guo, Fan Huang, Cheng Zhou, Hongwei Che, Xiao Ling, Yanwen Guo

However, naively compressing an outdoor panorama into a low-dimensional latent vector, as existing models have done, causes two major problems.

Representation Learning

Paper
Add Code

Rendering Discrete Participating Media with Geometrical Optics Approximation

no code implementations • 24 Feb 2021 • Jie Guo, Bingyang Hu, Yanjun Chen, Yuanqi Li, Yanwen Guo, Ling-Qi Yan

We consider the scattering of light in participating media composed of sparsely and randomly distributed discrete particles.

Graphics Optics

Paper
Add Code

A Survey on Natural Language Video Localization

no code implementations • 1 Apr 2021 • Xinfang Liu, Xiushan Nie, Zhifang Tan, Jie Guo, Yilong Yin

Natural language video localization (NLVL), which aims to locate a target moment from a video that semantically corresponds to a text query, is a novel and challenging task.

Paper
Add Code

Probabilistic Ranking-Aware Ensembles for Enhanced Object Detections

no code implementations • 7 May 2021 • Mingyuan Mao, Baochang Zhang, David Doermann, Jie Guo, Shumin Han, Yuan Feng, Xiaodi Wang, Errui Ding

This leads to a new problem of confidence discrepancy for the detector ensembles.

Ensemble Learning Object +2

Paper
Add Code

GLAVNet: Global-Local Audio-Visual Cues for Fine-Grained Material Recognition

no code implementations • CVPR 2021 • Fengmin Shi, Jie Guo, Haonan Zhang, Shan Yang, Xiying Wang, Yanwen Guo

We demonstrate that local geometry has a greater impact on the sound than the global geometry and offers more cues in material recognition.

Material Recognition

Paper
Add Code

A Transductive Maximum Margin Classifier for Few-Shot Learning

no code implementations • 26 Jul 2021 • Fei Pan, Chunlei Xu, Jie Guo, Yanwen Guo

We introduce a transductive maximum margin classifier for few-shot learning (FS-TMMC).

Few-Shot Learning

Paper
Add Code

Temporal Alignment Prediction for Few-Shot Video Classification

no code implementations • 26 Jul 2021 • Fei Pan, Chunlei Xu, Jie Guo, Yanwen Guo

In order to obtain the similarity of a pair of videos, we predict the alignment scores between all pairs of temporal positions in the two videos with the temporal alignment prediction function.

Classification Video Classification

Paper
Add Code

Scale Invariant Domain Generalization Image Recapture Detection

no code implementations • 7 Oct 2021 • Jinian Luo, Jie Guo, Weidong Qiu, Zheng Huang, Hong Hui

However, most of them ignored the domain generalization scenario and scale variances, with an inferior performance on domain shift situations, and normally were exacerbated by intra-domain and inter-domain scale variances.

Domain Generalization Face Identification

Paper
Add Code

SVBRDF Recovery From a Single Image With Highlights using a Pretrained Generative Adversarial Network

no code implementations • 29 Oct 2021 • Tao Wen, Beibei Wang, Lei Zhang, Jie Guo, Nicolas Holzschuch

For efficiency, we train the network in two stages: reusing a trained model to initialize the SVBRDFs and fine-tune it based on the input image.

Generative Adversarial Network

Paper
Add Code

Task-Oriented Image Transmission for Scene Classification in Unmanned Aerial Systems

no code implementations • 21 Dec 2021 • Xu Kang, Bin Song, Jie Guo, Zhijin Qin, F. Richard Yu

The vigorous developments of Internet of Things make it possible to extend its computing and storage capabilities to computing tasks in the aerial system with collaboration of cloud and edge, especially for artificial intelligence (AI) tasks based on deep learning (DL).

Classification Edge-computing +1

Paper
Add Code

ISNet: Shape Matters for Infrared Small Target Detection

1 code implementation • CVPR 2022 • Mingjin Zhang, Rui Zhang, Yuxiang Yang, Haichen Bai, Jing Zhang, Jie Guo

TOAA block calculates the low-level information with attention mechanism in both row and column directions and fuses it with the high-level information to capture the shape characteristic of targets and suppress noises.

Management

107

Paper
Code

GLPanoDepth: Global-to-Local Panoramic Depth Estimation

1 code implementation • 6 Feb 2022 • Jiayang Bai, Shuichang Lai, Haoyu Qin, Jie Guo, Yanwen Guo

In this paper, we propose a learning-based method for predicting dense depth values of a scene from a monocular omnidirectional image.

Ranked #7 on Depth Estimation on Stanford2D3D Panoramic

Depth Estimation

Paper
Code

Deep Graph Learning for Spatially-Varying Indoor Lighting Prediction

no code implementations • 13 Feb 2022 • Jiayang Bai, Jie Guo, Chenchen Wan, Zhenyu Chen, Zhen He, Shan Yang, Piaopiao Yu, Yan Zhang, Yanwen Guo

At its core is a new lighting model (dubbed DSGLight) based on depth-augmented Spherical Gaussians (SG) and a Graph Convolutional Network (GCN) that infers the new lighting representation from a single LDR image of limited field-of-view.

Graph Learning Lighting Estimation

Paper
Add Code

Deep Point Cloud Simplification for High-quality Surface Reconstruction

no code implementations • 17 Mar 2022 • Yuanqi Li, Jianwei Guo, Xinran Yang, Shun Liu, Jie Guo, Xiaopeng Zhang, Yanwen Guo

In this paper, we propose a novel point cloud simplification network (PCS-Net) dedicated to high-quality surface mesh reconstruction while maintaining geometric fidelity.

Scene Understanding Surface Reconstruction +1

Paper
Add Code

Completing Partial Point Clouds with Outliers by Collaborative Completion and Segmentation

no code implementations • 18 Mar 2022 • Changfeng Ma, Yang Yang, Jie Guo, Chongjun Wang, Yanwen Guo

We propose in this paper an end-to-end network, named CS-Net, to complete the point clouds contaminated by noises or containing outliers.

Point Cloud Completion Segmentation

Paper
Add Code

TENET: Transformer Encoding Network for Effective Temporal Flow on Motion Prediction

no code implementations • 30 Jun 2022 • Yuting Wang, Hangning Zhou, Zhigang Zhang, Chen Feng, Huadong Lin, Chaofei Gao, Yizhi Tang, Zhenting Zhao, Shiyu Zhang, Jie Guo, Xuefeng Wang, Ziyao Xu, Chi Zhang

This technical report presents an effective method for motion prediction in autonomous driving.

Ranked #12 on Motion Forecasting on Argoverse CVPR 2020

Motion Forecasting motion prediction +1

Paper
Add Code

Improving Performance of Automatic Keyword Extraction (AKE) Methods Using PoS-Tagging and Enhanced Semantic-Awareness

no code implementations • 9 Nov 2022 • Enes Altuncu, Jason R. C. Nurse, Yang Xu, Jie Guo, Shujun Li

Automatic keyword extraction (AKE) has gained more importance with the increasing amount of digital textual data that modern computing systems process.

Information Retrieval Keyword Extraction +3

Paper
Add Code

HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval

no code implementations • 16 Dec 2022 • Jie Guo, Meiting Wang, Yan Zhou, Bin Song, Yuhao Chi, Wei Fan, Jianglong Chang

Then, a multi-granularity shared space is established with a designed Multi-granularity Feature Aggregation and Rearrangement (MFAR) module, which enhances the semantic corresponding relations between the local and global information, and obtains more accurate feature representations for the image and text modalities.

Retrieval Sentence +1

Paper
Add Code

Symmetric Shape-Preserving Autoencoder for Unsupervised Real Scene Point Cloud Completion

no code implementations • CVPR 2023 • Changfeng Ma, Yinuo Chen, Pengxiao Guo, Jie Guo, Chongjun Wang, Yanwen Guo

Extensive experiments and comparisons demonstrate our superiority and generalization and show that our method achieves state-of-the-art performance on unsupervised completion of real scene objects.

Point Cloud Completion

Paper
Add Code

UHDNeRF: Ultra-High-Definition Neural Radiance Fields

no code implementations • ICCV 2023 • Quewei Li, Feichao Li, Jie Guo, Yanwen Guo

We propose UHDNeRF, a new framework for novel view synthesis on the challenging ultra-high-resolution (e. g., 4K) real-world scenes.

4k Novel View Synthesis

Paper
Add Code

Self-NeRF: A Self-Training Pipeline for Few-Shot Neural Radiance Fields

no code implementations • 10 Mar 2023 • Jiayang Bai, Letian Huang, Wen Gong, Jie Guo, Yanwen Guo

Recently, Neural Radiance Fields (NeRF) have emerged as a potent method for synthesizing novel views from a dense set of images.

Paper
Add Code

Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction

no code implementations • 18 Mar 2023 • Jiayang Bai, Zhen He, Shan Yang, Jie Guo, Zhenyu Chen, Yan Zhang, Yanwen Guo

Recent methods mostly rely on convolutional neural networks (CNNs) to fill the missing contents in the warped panorama.

HDR Reconstruction

Paper
Add Code

MVP-SEG: Multi-View Prompt Learning for Open-Vocabulary Semantic Segmentation

no code implementations • 14 Apr 2023 • Jie Guo, Qimeng Wang, Yan Gao, XiaoLong Jiang, Xu Tang, Yao Hu, Baochang Zhang

CLIP (Contrastive Language-Image Pretraining) is well-developed for open-vocabulary zero-shot image-level recognition, while its applications in pixel-level tasks are less investigated, where most efforts directly adopt CLIP features without deliberative adaptations.

GPR Open Vocabulary Semantic Segmentation +3

Paper
Add Code

Attention-guided Multi-step Fusion: A Hierarchical Fusion Network for Multimodal Recommendation

no code implementations • 24 Apr 2023 • Yan Zhou, Jie Guo, Hao Sun, Bin Song, Fei Richard Yu

The main idea of multimodal recommendation is the rational utilization of the item's multimodal information to improve the recommendation performance.

Contrastive Learning Multimodal Recommendation

Paper
Add Code

ESSAformer: Efficient Transformer for Hyperspectral Image Super-resolution

1 code implementation • ICCV 2023 • Mingjin Zhang, Chi Zhang, Qiming Zhang, Jie Guo, Xinbo Gao, Jing Zhang

Single hyperspectral image super-resolution (single-HSI-SR) aims to restore a high-resolution hyperspectral image from a low-resolution observation.

Hyperspectral Image Super-Resolution Image Super-Resolution

Paper
Code

Manifold Path Guiding for Importance Sampling Specular Chains

no code implementations • 24 Sep 2023 • Zhimin Fan, Pengpei Hong, Jie Guo, Changqing Zou, Yanwen Guo, Ling-Qi Yan

We verify that importance sampling the seed chain in the continuous space reaches the goal of importance sampling the discrete admissible specular chain.

Paper
Add Code

Support or Refute: Analyzing the Stance of Evidence to Detect Out-of-Context Mis- and Disinformation

no code implementations • 3 Nov 2023 • Xin Yuan, Jie Guo, Weidong Qiu, Zheng Huang, Shujun Li

Mis- and disinformation online have become a major societal problem as major sources of online harms of different kinds.

Paper
Add Code

Exploring Multi-Modal Control in Music-Driven Dance Generation

no code implementations • 1 Jan 2024 • Ronghui Li, Yuqin Dai, Yachao Zhang, Jun Li, Jian Yang, Jie Guo, Xiu Li

Existing music-driven 3D dance generation methods mainly concentrate on high-quality dance generation, but lack sufficient control during the generation process.

Paper
Add Code

Distributed Task-Oriented Communication Networks with Multimodal Semantic Relay and Edge Intelligence

no code implementations • 18 Jan 2024 • Jie Guo, Hao Chen, Bin Song, Yuhao Chi, Chau Yuen, Fei Richard Yu, Geoffrey Ye Li, Dusit Niyato

In this article, we present a novel framework, named distributed task-oriented communication networks (DTCN), based on recent advances in multimodal semantic transmission and edge intelligence.

Paper
Add Code

On the Error Analysis of 3D Gaussian Splatting and an Optimal Projection Strategy

no code implementations • 1 Feb 2024 • Letian Huang, Jiayang Bai, Jie Guo, Yuanqi Li, Yanwen Guo

This paper addresses the projection error function of 3D Gaussian Splatting, commencing with the residual error from the first-order Taylor expansion of the projection function.

Neural Rendering

Paper
Add Code

360-GS: Layout-guided Panoramic Gaussian Splatting For Indoor Roaming

no code implementations • 1 Feb 2024 • Jiayang Bai, Letian Huang, Jie Guo, Wen Gong, Yuanqi Li, Yanwen Guo

This technique typically takes perspective images as input and optimizes a set of 3D elliptical Gaussians by splatting them onto the image planes, resulting in 2D Gaussians.

Novel View Synthesis

Paper
Add Code

Semantic Human Mesh Reconstruction with Textures

no code implementations • 5 Mar 2024 • Xiaoyu Zhan, Jianxin Yang, Yuanqi Li, Jie Guo, Yanwen Guo, Wenping Wang

SHERT applies semantic- and normal-based sampling between the detailed surface (e. g. mesh and SDF) and the corresponding SMPL-X model to obtain a partially sampled semantic mesh and then generates the complete semantic mesh by our specifically designed self-supervised completion and refinement networks.

Paper
Add Code

Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives

1 code implementation • 15 Mar 2024 • Ronghui Li, Yuxiang Zhang, Yachao Zhang, Hongwen Zhang, Jie Guo, Yan Zhang, Yebin Liu, Xiu Li

In contrast, the second-stage is the local diffusion, which parallelly generates detailed motion sequences under the guidance of the dance primitives and choreographic rules.

Ranked #1 on Motion Synthesis on FineDance

Motion Synthesis

Paper
Code

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

1 code implementation • 17 Apr 2024 • Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei LI, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Fangyuan Kong, Haotian Fan, Yifang Xu, Haoran Xu, Mengduo Yang, Jie zhou, Jiaze Li, Shijie Wen, Mai Xu, Da Li, Shunyu Yao, Jiazhi Du, WangMeng Zuo, Zhibo Li, Shuai He, Anlong Ming, Huiyuan Fu, Huadong Ma, Yong Wu, Fie Xue, Guozhi Zhao, Lina Du, Jie Guo, Yu Zhang, huimin zheng, JunHao Chen, Yue Liu, Dulan Zhou, Kele Xu, Qisheng Xu, Tao Sun, Zhixiang Ding, Yuhang Hu

This paper reviews the NTIRE 2024 Challenge on Shortform UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions are submitted and evaluated on the collected dataset KVQ from popular short-form video platform, i. e., Kuaishou/Kwai Platform.

valid Video Quality Assessment +1

Paper
Code

Deep Surface Normal Estimation on the 2-Sphere with Confidence Guided Semantic Attention

no code implementations • ECCV 2020 • Quewei Li, Jie Guo, Yang Fei, Qinyu Tang, Wenxiu Sun, Jin Zeng, Yanwen Guo

We propose a deep convolutional neural network (CNN) to estimate surface normal from a single color image accompanied with a low-quality depth channel.

Surface Normal Estimation

Paper
Add Code

Attention-Based Graph Neural Network with Global Context Awareness for Document Understanding

no code implementations • CCL 2020 • Yuan Hua, Zheng Huang, Jie Guo, Weidong Qiu

Information extraction from documents such as receipts or invoices is a fundamental and crucial step for office automation.

document understanding graph construction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.