Search Results for author: Jie Guo

Found 48 papers, 7 papers with code

Fused Text Segmentation Networks for Multi-oriented Scene Text Detection

no code implementations 11 Sep 2017 Yuchen Dai, Zheng Huang, Yuting Gao, Youxuan Xu, Kai Chen, Jie Guo, Weidong Qiu

In this paper, we introduce a novel end-to-end framework for multi-oriented scene text detection from an instance-aware semantic segmentation perspective.

Multi-Oriented Scene Text Detection object-detection +6

FPAN: Fine-grained and Progressive Attention Localization Network for Data Retrieval

no code implementations 5 Apr 2018 Sijia Chen, Bin Song, Jie Guo, Xiaojiang Du, Mohsen Guizani

Localizing the target object for data retrieval is a key issue in Intelligent and Connected Transportation Systems (ICTS).

Multi-Task Learning Object +3

Stochastic Channel Decorrelation Network and Its Application to Visual Tracking

no code implementations 3 Jul 2018 Jie Guo, Tingfa Xu, Shenwang Jiang, Ziyi Shen

Deep convolutional neural networks (CNNs) have dominated many computer vision domains because of their great power to extract good features automatically.

Visual Tracking

Single Image Highlight Removal with a Sparse and Low-Rank Reflection Model

1 code implementation ECCV 2018 Jie Guo, Zuojian Zhou, Li-Min Wang

We propose a sparse and low-rank reflection model for specular highlight detection and removal using a single input image.

Highlight Detection highlight removal

Interest-Related Item Similarity Model Based on Multimodal Data for Top-N Recommendation

no code implementations 13 Feb 2019 Junmei Lv, Bin Song, Jie Guo, Xiaojiang Du, Mohsen Guizani

Specifically, the Multimodal IRIS model consists of three modules, i.e., a multimodal feature learning module, the Interest-Related Network (IRN) module, and an item similarity recommendation module.

Recommendation Systems

Self-Selective Correlation Ship Tracking Method for Smart Ocean System

no code implementations 26 Feb 2019 Xu Kang, Bin Song, Jie Guo, Xiaojiang Du, Mohsen Guizani

In recent years, with the development of the marine industry, the navigation environment has become more complicated.

Management regression

Learning Actor Relation Graphs for Group Activity Recognition

2 code implementations CVPR 2019 Jianchao Wu, Li-Min Wang, Li Wang, Jie Guo, Gangshan Wu

To this end, we propose to build a flexible and efficient Actor Relation Graph (ARG) to simultaneously capture the appearance and position relation between actors.

Action Recognition Group Activity Recognition +1

Hybrid Models for Open Set Recognition

no code implementations ECCV 2020 Hongjie Zhang, Ang Li, Jie Guo, Yanwen Guo

We propose the OpenHybrid framework, which is composed of an encoder to encode the input data into a joint embedding space, a classifier to classify samples to inlier classes, and a flow-based density estimator to detect whether a sample belongs to the unknown category.

Open Set Learning Out-of-Distribution Detection

FA-GANs: Facial Attractiveness Enhancement with Generative Adversarial Networks on Frontal Faces

no code implementations 17 May 2020 Jingwu He, Chuan Wang, Yang Zhang, Jie Guo, Yanwen Guo

To the best of our knowledge, we are the first to enhance the facial attractiveness with GANs in both geometry and appearance aspects.

Image Stitching Based on Planar Region Consensus

no code implementations 6 Jul 2020 Aocheng Li, Jie Guo, Yanwen Guo

We specifically design a new module to make full use of existing semantic segmentation networks to accommodate planar segmentation.

Image Stitching Segmentation +1

Partially Observable Online Change Detection via Smooth-Sparse Decomposition

no code implementations 22 Sep 2020 Jie Guo, Hao Yan, Chen Zhang, Steven Hoi

We consider online change detection of high dimensional data streams with sparse changes, where only a subset of data streams can be observed at each sensing time point due to limited sensing capacities.

Bayesian Inference Change Detection +1

Hierarchical Disentangled Representation Learning for Outdoor Illumination Estimation and Editing

no code implementations ICCV 2021 Piaopiao Yu, Jie Guo, Fan Huang, Cheng Zhou, Hongwei Che, Xiao Ling, Yanwen Guo

However, naively compressing an outdoor panorama into a low-dimensional latent vector, as existing models have done, causes two major problems.

Representation Learning

Rendering Discrete Participating Media with Geometrical Optics Approximation

no code implementations 24 Feb 2021 Jie Guo, Bingyang Hu, Yanjun Chen, Yuanqi Li, Yanwen Guo, Ling-Qi Yan

We consider the scattering of light in participating media composed of sparsely and randomly distributed discrete particles.

Graphics Optics

A Survey on Natural Language Video Localization

no code implementations 1 Apr 2021 Xinfang Liu, Xiushan Nie, Zhifang Tan, Jie Guo, Yilong Yin

Natural language video localization (NLVL), which aims to locate a target moment from a video that semantically corresponds to a text query, is a novel and challenging task.

GLAVNet: Global-Local Audio-Visual Cues for Fine-Grained Material Recognition

no code implementations CVPR 2021 Fengmin Shi, Jie Guo, Haonan Zhang, Shan Yang, Xiying Wang, Yanwen Guo

We demonstrate that local geometry has a greater impact on the sound than the global geometry and offers more cues in material recognition.

Material Recognition

A Transductive Maximum Margin Classifier for Few-Shot Learning

no code implementations 26 Jul 2021 Fei Pan, Chunlei Xu, Jie Guo, Yanwen Guo

We introduce a transductive maximum margin classifier for few-shot learning (FS-TMMC).

Few-Shot Learning

Temporal Alignment Prediction for Few-Shot Video Classification

no code implementations 26 Jul 2021 Fei Pan, Chunlei Xu, Jie Guo, Yanwen Guo

In order to obtain the similarity of a pair of videos, we predict the alignment scores between all pairs of temporal positions in the two videos with the temporal alignment prediction function.

Classification Video Classification

Scale Invariant Domain Generalization Image Recapture Detection

no code implementations 7 Oct 2021 Jinian Luo, Jie Guo, Weidong Qiu, Zheng Huang, Hong Hui

However, most of them ignore the domain generalization scenario and scale variances, so they perform poorly under domain shift, a weakness that is typically exacerbated by intra-domain and inter-domain scale variances.

Domain Generalization Face Identification

SVBRDF Recovery From a Single Image With Highlights using a Pretrained Generative Adversarial Network

no code implementations 29 Oct 2021 Tao Wen, Beibei Wang, Lei Zhang, Jie Guo, Nicolas Holzschuch

For efficiency, we train the network in two stages: reusing a trained model to initialize the SVBRDFs and then fine-tuning it based on the input image.

Generative Adversarial Network

Task-Oriented Image Transmission for Scene Classification in Unmanned Aerial Systems

no code implementations 21 Dec 2021 Xu Kang, Bin Song, Jie Guo, Zhijin Qin, F. Richard Yu

The vigorous development of the Internet of Things makes it possible to extend its computing and storage capabilities to computing tasks in aerial systems through the collaboration of cloud and edge, especially for artificial intelligence (AI) tasks based on deep learning (DL).

Classification Edge-computing +1

ISNet: Shape Matters for Infrared Small Target Detection

1 code implementation CVPR 2022 Mingjin Zhang, Rui Zhang, Yuxiang Yang, Haichen Bai, Jing Zhang, Jie Guo

The TOAA block computes the low-level information with an attention mechanism in both row and column directions and fuses it with the high-level information to capture the shape characteristics of targets and suppress noise.

Management

GLPanoDepth: Global-to-Local Panoramic Depth Estimation

1 code implementation 6 Feb 2022 Jiayang Bai, Shuichang Lai, Haoyu Qin, Jie Guo, Yanwen Guo

In this paper, we propose a learning-based method for predicting dense depth values of a scene from a monocular omnidirectional image.

Depth Estimation

Deep Graph Learning for Spatially-Varying Indoor Lighting Prediction

no code implementations 13 Feb 2022 Jiayang Bai, Jie Guo, Chenchen Wan, Zhenyu Chen, Zhen He, Shan Yang, Piaopiao Yu, Yan Zhang, Yanwen Guo

At its core is a new lighting model (dubbed DSGLight) based on depth-augmented Spherical Gaussians (SG) and a Graph Convolutional Network (GCN) that infers the new lighting representation from a single LDR image of limited field-of-view.

Graph Learning Lighting Estimation

Deep Point Cloud Simplification for High-quality Surface Reconstruction

no code implementations 17 Mar 2022 Yuanqi Li, Jianwei Guo, Xinran Yang, Shun Liu, Jie Guo, Xiaopeng Zhang, Yanwen Guo

In this paper, we propose a novel point cloud simplification network (PCS-Net) dedicated to high-quality surface mesh reconstruction while maintaining geometric fidelity.

Scene Understanding Surface Reconstruction +1

Completing Partial Point Clouds with Outliers by Collaborative Completion and Segmentation

no code implementations 18 Mar 2022 Changfeng Ma, Yang Yang, Jie Guo, Chongjun Wang, Yanwen Guo

In this paper, we propose an end-to-end network, named CS-Net, to complete point clouds that are contaminated by noise or contain outliers.

Point Cloud Completion Segmentation

Improving Performance of Automatic Keyword Extraction (AKE) Methods Using PoS-Tagging and Enhanced Semantic-Awareness

no code implementations 9 Nov 2022 Enes Altuncu, Jason R. C. Nurse, Yang Xu, Jie Guo, Shujun Li

Automatic keyword extraction (AKE) has gained more importance with the increasing amount of digital textual data that modern computing systems process.

Information Retrieval Keyword Extraction +3

HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval

no code implementations 16 Dec 2022 Jie Guo, Meiting Wang, Yan Zhou, Bin Song, Yuhao Chi, Wei Fan, Jianglong Chang

Then, a multi-granularity shared space is established with a designed Multi-granularity Feature Aggregation and Rearrangement (MFAR) module, which enhances the semantic corresponding relations between the local and global information, and obtains more accurate feature representations for the image and text modalities.

Retrieval Sentence +1

Symmetric Shape-Preserving Autoencoder for Unsupervised Real Scene Point Cloud Completion

no code implementations CVPR 2023 Changfeng Ma, Yinuo Chen, Pengxiao Guo, Jie Guo, Chongjun Wang, Yanwen Guo

Extensive experiments and comparisons demonstrate our superiority and generalization and show that our method achieves state-of-the-art performance on unsupervised completion of real scene objects.

Point Cloud Completion

UHDNeRF: Ultra-High-Definition Neural Radiance Fields

no code implementations ICCV 2023 Quewei Li, Feichao Li, Jie Guo, Yanwen Guo

We propose UHDNeRF, a new framework for novel view synthesis on challenging ultra-high-resolution (e.g., 4K) real-world scenes.

4k Novel View Synthesis

Self-NeRF: A Self-Training Pipeline for Few-Shot Neural Radiance Fields

no code implementations 10 Mar 2023 Jiayang Bai, Letian Huang, Wen Gong, Jie Guo, Yanwen Guo

Recently, Neural Radiance Fields (NeRF) have emerged as a potent method for synthesizing novel views from a dense set of images.

Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction

no code implementations 18 Mar 2023 Jiayang Bai, Zhen He, Shan Yang, Jie Guo, Zhenyu Chen, Yan Zhang, Yanwen Guo

Recent methods mostly rely on convolutional neural networks (CNNs) to fill the missing contents in the warped panorama.

HDR Reconstruction

MVP-SEG: Multi-View Prompt Learning for Open-Vocabulary Semantic Segmentation

no code implementations 14 Apr 2023 Jie Guo, Qimeng Wang, Yan Gao, XiaoLong Jiang, Xu Tang, Yao Hu, Baochang Zhang

CLIP (Contrastive Language-Image Pretraining) is well-developed for open-vocabulary zero-shot image-level recognition, while its applications in pixel-level tasks are less investigated, where most efforts directly adopt CLIP features without deliberative adaptations.

GPR Open Vocabulary Semantic Segmentation +3

Attention-guided Multi-step Fusion: A Hierarchical Fusion Network for Multimodal Recommendation

no code implementations 24 Apr 2023 Yan Zhou, Jie Guo, Hao Sun, Bin Song, Fei Richard Yu

The main idea of multimodal recommendation is the rational utilization of the item's multimodal information to improve the recommendation performance.

Contrastive Learning Multimodal Recommendation

ESSAformer: Efficient Transformer for Hyperspectral Image Super-resolution

1 code implementation ICCV 2023 Mingjin Zhang, Chi Zhang, Qiming Zhang, Jie Guo, Xinbo Gao, Jing Zhang

Single hyperspectral image super-resolution (single-HSI-SR) aims to restore a high-resolution hyperspectral image from a low-resolution observation.

Hyperspectral Image Super-Resolution Image Super-Resolution

Manifold Path Guiding for Importance Sampling Specular Chains

no code implementations 24 Sep 2023 Zhimin Fan, Pengpei Hong, Jie Guo, Changqing Zou, Yanwen Guo, Ling-Qi Yan

We verify that importance sampling the seed chain in the continuous space reaches the goal of importance sampling the discrete admissible specular chain.

Support or Refute: Analyzing the Stance of Evidence to Detect Out-of-Context Mis- and Disinformation

no code implementations 3 Nov 2023 Xin Yuan, Jie Guo, Weidong Qiu, Zheng Huang, Shujun Li

Mis- and disinformation online have become a major societal problem as major sources of online harms of different kinds.

Exploring Multi-Modal Control in Music-Driven Dance Generation

no code implementations 1 Jan 2024 Ronghui Li, Yuqin Dai, Yachao Zhang, Jun Li, Jian Yang, Jie Guo, Xiu Li

Existing music-driven 3D dance generation methods mainly concentrate on high-quality dance generation, but lack sufficient control during the generation process.

Distributed Task-Oriented Communication Networks with Multimodal Semantic Relay and Edge Intelligence

no code implementations 18 Jan 2024 Jie Guo, Hao Chen, Bin Song, Yuhao Chi, Chau Yuen, Fei Richard Yu, Geoffrey Ye Li, Dusit Niyato

In this article, we present a novel framework, named distributed task-oriented communication networks (DTCN), based on recent advances in multimodal semantic transmission and edge intelligence.

On the Error Analysis of 3D Gaussian Splatting and an Optimal Projection Strategy

no code implementations 1 Feb 2024 Letian Huang, Jiayang Bai, Jie Guo, Yuanqi Li, Yanwen Guo

This paper addresses the projection error function of 3D Gaussian Splatting, commencing with the residual error from the first-order Taylor expansion of the projection function.

Neural Rendering

360-GS: Layout-guided Panoramic Gaussian Splatting For Indoor Roaming

no code implementations 1 Feb 2024 Jiayang Bai, Letian Huang, Jie Guo, Wen Gong, Yuanqi Li, Yanwen Guo

This technique typically takes perspective images as input and optimizes a set of 3D elliptical Gaussians by splatting them onto the image planes, resulting in 2D Gaussians.

Novel View Synthesis

Semantic Human Mesh Reconstruction with Textures

no code implementations 5 Mar 2024 Xiaoyu Zhan, Jianxin Yang, Yuanqi Li, Jie Guo, Yanwen Guo, Wenping Wang

SHERT applies semantic- and normal-based sampling between the detailed surface (e.g., mesh and SDF) and the corresponding SMPL-X model to obtain a partially sampled semantic mesh, and then generates the complete semantic mesh with our specifically designed self-supervised completion and refinement networks.

Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives

1 code implementation 15 Mar 2024 Ronghui Li, Yuxiang Zhang, Yachao Zhang, Hongwen Zhang, Jie Guo, Yan Zhang, Yebin Liu, Xiu Li

In contrast, the second stage is the local diffusion, which generates detailed motion sequences in parallel under the guidance of the dance primitives and choreographic rules.

Motion Synthesis

Deep Surface Normal Estimation on the 2-Sphere with Confidence Guided Semantic Attention

no code implementations ECCV 2020 Quewei Li, Jie Guo, Yang Fei, Qinyu Tang, Wenxiu Sun, Jin Zeng, Yanwen Guo

We propose a deep convolutional neural network (CNN) to estimate surface normals from a single color image accompanied by a low-quality depth channel.

Surface Normal Estimation
