Search Results for author: Feng Gao

Found 115 papers, 42 papers with code

Global and Local Attention-Based Transformer for Hyperspectral Image Change Detection

1 code implementation21 Nov 2024 Ziyi Wang, Feng Gao, Junyu Dong, Qian Du

The global attention component employs global attention on downsampled feature maps to capture low-frequency information, while the local attention component focuses on high-frequency details using non-overlapping window-based local attention.

Change Detection

Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback

1 code implementation20 Nov 2024 Feng Gao, Chao Yu, Yu Wang, Yi Wu

In this paper, we propose a novel framework, Neural Internal Model Control, which integrates model-based control with RL-based control to enhance robustness.

ARM: Appearance Reconstruction Model for Relightable 3D Generation

no code implementations16 Nov 2024 Xiang Feng, Chang Yu, Zoubin Bi, Yintong Shang, Feng Gao, Hongzhi Wu, Kun Zhou, Chenfanfu Jiang, Yin Yang

Recent image-to-3D reconstruction models have greatly advanced geometry generation, but they still struggle to faithfully generate realistic appearance.

3D Generation 3D Reconstruction +1

A novel and efficient parameter estimation of the Lognormal-Rician turbulence model based on k-Nearest Neighbor and data generation method

no code implementations3 Sep 2024 Maoke Miao, Xinyu Zhang, Bo Liu, Rui Yin, Jiantao Yuan, Feng Gao, Xiao-Yu Chen

The Kolmogorov-Smirnov (KS) goodness-of-fit statistical tools are employed to investigate the validity of $k$NN approximation under different channel conditions and it is shown that the choice of $k$ plays a significant role in the approximation accuracy.

MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification

1 code implementation26 Aug 2024 Feng Gao, Xuepeng Jin, Xiaowei Zhou, Junyu Dong, Qian Du

Moreover, to alleviate the heterogeneous gap between HSI and LiDAR/SAR data, we design Fus-Mamba block for multi-source feature fusion.

Image Classification Mamba +1

Hierarchical Attention and Parallel Filter Fusion Network for Multi-Source Data Classification

no code implementations22 Aug 2024 Han Luo, Feng Gao, Junyu Dong, Lin Qi

To solve the problem, we propose a hierarchical attention and parallel filter fusion network for multi-source data classification.

Classification

Hardware Architecture Design of Model-Based Image Reconstruction Towards Palm-size Photoacoustic Tomography

no code implementations12 Aug 2024 Yuwei Zheng, Zijian Gao, Yuting Shen, Jiadong Zhang, Daohuai Jiang, Fengyu Liu, Feng Gao, Fei Gao

We conducted experiments by FPGA implementation of the algorithm, using both phantoms and in vivo human finger data to verify the feasibility of the proposed method.

Image Reconstruction

Exploring Cross-Domain Few-Shot Classification via Frequency-Aware Prompting

1 code implementation24 Jun 2024 Tiange Zhang, Qing Cai, Feng Gao, Lin Qi, Junyu Dong

However, most existing methods pay more attention to learning domain-adaptive inductive bias (meta-knowledge) through feature-wise manipulation or task diversity improvement while neglecting the phenomenon that deep networks tend to rely more on high-frequency cues to make the classification decision, which thus degenerates the robustness of learned inductive bias since high-frequency information is vulnerable and easy to be disturbed by noisy information.

cross-domain few-shot learning Inductive Bias

Boosting Spatial-Spectral Masked Auto-Encoder Through Mining Redundant Spectra for HSI-SAR/LiDAR Classification

no code implementations3 Jun 2024 Junyan Lin, Xuepeng Jin, Feng Gao, Junyu Dong, Hui Yu

Although recent masked image modeling (MIM)-based HSI-LiDAR/SAR classification methods have gradually recognized the importance of the spectral information, they have not adequately addressed the redundancy among different spectra, resulting in information leakage during the pretraining stage.

Arctic Sea Ice Image Super-Resolution Based on Multi-Scale Convolution and Dual-Gating Mechanism

no code implementations3 Jun 2024 Zhaomin Fang, Wankun Chen, Feng Gao, Yanhai Gan, Junyu Dong, Yang Zhou

Arctic Sea Ice Concentration (SIC) is the ratio of ice-covered area to the total sea area of the Arctic Ocean, which is a key indicator for maritime activities.

Image Super-Resolution

Sparse Focus Network for Multi-Source Remote Sensing Data Classification

no code implementations3 Jun 2024 Xuepeng Jin, Junyan Lin, Feng Gao, Lin Qi, Yang Zhou

Multi-source remote sensing data classification has emerged as a prominent research topic with the advancement of various sensors.

Classification

Dual-Stream Attention Network for Hyperspectral Image Unmixing

no code implementations3 Jun 2024 Yufang Wang, Wenmin Wu, Lin Qi, Feng Gao

Therefore, we adopt a "many to one" strategy to estimate the abundance of the central pixel.

Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching

2 code implementations29 May 2024 Yasi Zhang, Peiyu Yu, Yaxuan Zhu, Yingshan Chang, Feng Gao, Ying Nian Wu, Oscar Leong

We validate our approach for various linear inverse problems, such as super-resolution, deblurring, inpainting, and compressed sensing, and demonstrate that we can outperform other methods based on flow matching.

Deblurring Image Generation +1

SSLChange: A Self-supervised Change Detection Framework Based on Domain Adaptation

1 code implementation28 May 2024 Yitao Zhao, Turgay Celik, Nanqing Liu, Feng Gao, Heng-Chao Li

In conventional remote sensing change detection (RS CD) procedures, extensive manual labeling for bi-temporal images is first required to maintain the performance of subsequent fully supervised training.

Change Detection Contrastive Learning +2

Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication

no code implementations28 May 2024 Yunuo Chen, Tianyi Xie, Zeshun Zong, Xuan Li, Feng Gao, Yin Yang, Ying Nian Wu, Chenfanfu Jiang

Existing diffusion-based text-to-3D generation methods primarily focus on producing visually realistic shapes and appearances, often neglecting the physical constraints necessary for downstream tasks.

3D Generation Friction +1

GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details

1 code implementation20 May 2024 Boqian Li, Xuan Li, Ying Jiang, Tianyi Xie, Feng Gao, Huamin Wang, Yin Yang, Chenfanfu Jiang

In this paper, we propose GarmentDreamer, a novel method that leverages 3D Gaussian Splatting (GS) as guidance to generate wearable, simulation-ready 3D garment meshes from text prompts.

3D Geometry Prediction Text to 3D +1

Modeling Output-Level Task Relatedness in Multi-Task Learning with Feedback Mechanism

no code implementations1 Apr 2024 Xiangming Xi, Feng Gao, Jun Xu, Fangtai Guo, Tianlei Jin

Multi-task learning (MTL) is a paradigm that simultaneously learns multiple tasks by sharing information at different levels, enhancing the performance of each individual task.

Multi-Task Learning Spoken Language Understanding

Multi-Objective Trajectory Planning with Dual-Encoder

no code implementations26 Mar 2024 Beibei Zhang, Tian Xiang, Chentao Mao, Yuhua Zheng, Shuai Li, Haoyi Niu, Xiangming Xi, Wenyuan Bai, Feng Gao

In this paper, we propose a two-stage approach to accelerate time-jerk optimal trajectory planning.

Trajectory Planning

Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation

1 code implementation25 Mar 2024 Yingshan Chang, Yasi Zhang, Zhiyuan Fang, YingNian Wu, Yonatan Bisk, Feng Gao

We hypothesize that the underlying phenomenological coverage has not been proportionally scaled up, leading to a skew of the presented phenomenon which harms generalization.

Relational Reasoning Text-to-Image Generation

PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling

1 code implementation CVPR 2024 Xiaoyun Zheng, Liwei Liao, Xufeng Li, Jianbo Jiao, Rongjie Wang, Feng Gao, Shiqi Wang, Ronggang Wang

To facilitate the development of these fields, in this paper, we present PKU-DyMVHumans, a versatile human-centric dataset for high-fidelity reconstruction and rendering of dynamic human scenarios from dense multi-view videos.

Novel View Synthesis

Annotation-Efficient Polyp Segmentation via Active Learning

no code implementations21 Mar 2024 Duojun Huang, Xinyu Xiong, De-Jun Fan, Feng Gao, Xiao-Jian Wu, Guanbin Li

To minimize annotation costs, we propose a deep active learning framework for annotation-efficient polyp segmentation.

Active Learning Segmentation

Hybrid Convolutional and Attention Network for Hyperspectral Image Denoising

1 code implementation15 Mar 2024 Shuai Hu, Feng Gao, Xiaowei Zhou, Junyu Dong, Qian Du

To enhance the modeling of both global and local features, we have devised a convolution and attention fusion module aimed at capturing long-range dependencies and neighborhood spectral correlations.

Hyperspectral Image Denoising Image Denoising

Vision-Language Navigation with Embodied Intelligence: A Survey

no code implementations22 Feb 2024 Peng Gao, Peng Wang, Feng Gao, Fei Wang, Ruyue Yuan

As a long-term vision in the field of artificial intelligence, the core goal of embodied intelligence is to improve the perception, understanding, and interaction capabilities of agents and the environment.

Survey Vision-Language Navigation

VR-GS: A Physical Dynamics-Aware Interactive Gaussian Splatting System in Virtual Reality

no code implementations30 Jan 2024 Ying Jiang, Chang Yu, Tianyi Xie, Xuan Li, Yutao Feng, Huamin Wang, Minchen Li, Henry Lau, Feng Gao, Yin Yang, Chenfanfu Jiang

As consumer Virtual Reality (VR) and Mixed Reality (MR) technologies gain momentum, there's a growing focus on the development of engagements with 3D virtual content.

Mixed Reality Semantic Segmentation

Moirai: Towards Optimal Placement for Distributed Inference on Heterogeneous Devices

1 code implementation7 Dec 2023 Beibei Zhang, Hongwei Zhu, Feng Gao, Zhihui Yang, Sean Xiaoyang Wang

This paper presents Moirai that better exploits runtime inter-operator fusion in a model to render a coarsened computation graph, reducing the search space while maintaining the inter-operator optimization provided by inference backends.

Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty

1 code implementation2 Dec 2023 Cheng-Fu Yang, Haoyang Xu, Te-Lin Wu, Xiaofeng Gao, Kai-Wei Chang, Feng Gao

In this paper, we aim to tackle this problem with a unified framework consisting of an end-to-end trainable method and a planning algorithm.

Denoising Vision-Language Navigation

Particle density and critical point for studying site percolation by finite size scaling

no code implementations20 Nov 2023 Dian Xu, Shanshan Wang, Feng Gao, Wei Li, Jianmin Shen

It is generally believed that the latent variables of unsupervised learning can capture the information related to phase transitions, which is usually achieved through the so-called order parameter.

OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control

1 code implementation22 Sep 2023 Botian Xu, Feng Gao, Chao Yu, Ruize Zhang, Yi Wu, Yu Wang

In this work, we introduce OmniDrones, an efficient and flexible platform tailored for reinforcement learning in drone control, built on Nvidia's Omniverse Isaac Sim.

reinforcement-learning

Convolution and Attention Mixer for Synthetic Aperture Radar Image Change Detection

1 code implementation21 Sep 2023 Haopeng Zhang, Zijing Lin, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

In this letter, we explore Transformer-like architecture for SAR change detection to incorporate global attention.

Change Detection Inductive Bias

Efficient option pricing with unary-based photonic computing chip and generative adversarial learning

no code implementations8 Aug 2023 HUI ZHANG, Lingxiao Wan, Sergi Ramos-Calderer, Yuancheng Zhan, Wai-Keong Mok, Hong Cai, Feng Gao, Xianshu Luo, Guo-Qiang Lo, Leong Chuan Kwek, José Ignacio Latorre, Ai Qun Liu

In the modern financial industry system, the structure of products has become more and more complex, and the bottleneck constraint of classical computing power has already restricted the development of the financial industry.

Generative Adversarial Network

Enhancing Cell Proliferation and Migration by MIR-Carbonyl Vibrational Coupling: Insights from Transcriptome Profiling

no code implementations3 Aug 2023 Xingkun Niu, Feng Gao, Shaojie Hou, Shihao Liu, Xinmin Zhao, Jun Guo, Liping Wang, Feng Zhang

Cell proliferation and migration highly relate to normal tissue self-healing, therefore it is highly significant for artificial controlling.

MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression

1 code implementation28 Jul 2023 Wei Jiang, Jiayu Yang, Yongqi Zhai, Feng Gao, Ronggang Wang

Additionally, to capture global contexts, we propose the linear complexity attention-based global correlations capturing by leveraging the decomposition of the softmax operation.

Image Compression

Gradient-based adaptive wavelet de-noising method for photoacoustic imaging in vivo

no code implementations25 Jul 2023 Xinke Li, Peng Ge, Yuting Shen, Feng Gao, Fei Gao

The proposed de-noising method provides potential to improve the SNR of PA signal under single-shot low-power laser illumination for biomedical applications in vivo.

Denoising

Human Motion Generation: A Survey

no code implementations20 Jul 2023 Wentao Zhu, Xiaoxuan Ma, Dongwoo Ro, Hai Ci, Jinlu Zhang, Jiaxin Shi, Feng Gao, Qi Tian, Yizhou Wang

In this survey, we present a comprehensive literature review of human motion generation, which, to the best of our knowledge, is the first of its kind in this field.

Motion Generation Survey

Masked Path Modeling for Vision-and-Language Navigation

no code implementations23 May 2023 Zi-Yi Dou, Feng Gao, Nanyun Peng

In this paper, we introduce a masked path modeling (MPM) objective, which pretrains an agent using self-collected data for downstream navigation tasks.

Action Generation Navigate +1

Selecting Learnable Training Samples is All DETRs Need in Crowded Pedestrian Detection

no code implementations18 May 2023 Feng Gao, Jiaxu Leng, Gan Ji, Xinbo Gao

However, in crowded pedestrian detection, the performance of DETRs is still unsatisfactory due to the inappropriate sample selection method which results in more false positives.

object-detection Object Detection +1

SAWU-Net: Spatial Attention Weighted Unmixing Network for Hyperspectral Images

no code implementations22 Apr 2023 Lin Qi, Xuewen Qin, Feng Gao, Junyu Dong, Xinbo Gao

To this end, we put forward a spatial attention weighted unmixing network, dubbed as SAWU-Net, which learns a spatial attention network and a weighted unmixing network in an end-to-end manner for better spatial feature exploitation.

Hyperspectral Unmixing

Physical Knowledge Enhanced Deep Neural Network for Sea Surface Temperature Prediction

no code implementations19 Apr 2023 Yuxin Meng, Feng Gao, Eric Rigall, Ran Dong, Junyu Dong, Qian Du

To this end, we introduce a method for SST prediction that transfers physical knowledge from historical observations to numerical models.

Earth Observation Generative Adversarial Network

Multi-scale Adaptive Fusion Network for Hyperspectral Image Denoising

1 code implementation19 Apr 2023 Haodong Pan, Feng Gao, Junyu Dong, Qian Du

Two key components contribute to improving the hyperspectral image denoising: A progressively multiscale information aggregation network and a co-attention fusion module.

Hyperspectral Image Denoising Image Denoising

LLIC: Large Receptive Field Transform Coding with Adaptive Weights for Learned Image Compression

no code implementations19 Apr 2023 Wei Jiang, Peirong Ning, Jiayu Yang, Yongqi Zhai, Feng Gao, Ronggang Wang

To tackle this issue, we propose Large Receptive Field Transform Coding with Adaptive Weights for Learned Image Compression (LLIC).

Image Compression

Structure Embedded Nucleus Classification for Histopathology Images

no code implementations22 Feb 2023 Wei Lou, Xiang Wan, Guanbin Li, Xiaoying Lou, Chenghang Li, Feng Gao, Haofeng Li

Next, we convert a histopathology image into a graph structure with nuclei as nodes, and build a graph neural network to embed the spatial distribution of nuclei into their representations.

Classification Graph Neural Network +2

Differentiable Arbitrating in Zero-sum Markov Games

no code implementations20 Feb 2023 Jing Wang, Meichen Song, Feng Gao, Boyi Liu, Zhaoran Wang, Yi Wu

We initiate the study of how to perturb the reward in a zero-sum Markov game with two players to induce a desirable Nash equilibrium, namely arbitrating.

Multi-agent Reinforcement Learning reinforcement-learning +1

Artificial intelligence for diagnosing and predicting survival of patients with renal cell carcinoma: Retrospective multi-center study

no code implementations12 Jan 2023 Siteng Chen, Xiyue Wang, Jun Zhang, Liren Jiang, Ning Zhang, Feng Gao, Wei Yang, Jinxi Xiang, Sen yang, Junhua Zheng, Xiao Han

The OSrisk for the prediction of 5-year survival status achieved AUC of 0. 784 (0. 746-0. 819) in the TCGA cohort, which was further verified in the independent General cohort and the CPTAC cohort, with AUC of 0. 774 (0. 723-0. 820) and 0. 702 (0. 632-0. 765), respectively.

whole slide images

Lesion-aware Dynamic Kernel for Polyp Segmentation

1 code implementation12 Jan 2023 Ruifei Zhang, Peiwen Lai, Xiang Wan, De-Jun Fan, Feng Gao, Xiao-Jian Wu, Guanbin Li

Automatic and accurate polyp segmentation plays an essential role in early colorectal cancer diagnosis.

Decoder Segmentation

Nearest Neighbor-Based Contrastive Learning for Hyperspectral and LiDAR Data Classification

1 code implementation9 Jan 2023 Meng Wang, Feng Gao, Junyu Dong, Heng-Chao Li, Qian Du

It is commonly nontrivial to build a robust self-supervised learning model for multisource data classification, due to the fact that the semantic similarities of neighborhood regions are not exploited in existing contrastive learning framework.

Classification Contrastive Learning +2

GIVL: Improving Geographical Inclusivity of Vision-Language Models with Pre-Training Methods

no code implementations CVPR 2023 Da Yin, Feng Gao, Govind Thattai, Michael Johnston, Kai-Wei Chang

A key goal for the advancement of AI is to develop technologies that serve the needs not just of one group but of all communities regardless of their geographical region.

CL-MVSNet: Unsupervised Multi-View Stereo with Dual-Level Contrastive Learning

1 code implementation ICCV 2023 Kaiqiang Xiong, Rui Peng, Zhe Zhang, Tianxing Feng, Jianbo Jiao, Feng Gao, Ronggang Wang

On the one hand, we present an image-level contrastive branch to guide the model to acquire more context awareness, thus leading to more complete depth estimation in indistinguishable regions.

3D geometry Contrastive Learning +1

TPA-Net: Generate A Dataset for Text to Physics-based Animation

no code implementations25 Nov 2022 Yuxing Qiu, Feng Gao, Minchen Li, Govind Thattai, Yin Yang, Chenfanfu Jiang

Recent breakthroughs in Vision-Language (V&L) joint research have achieved remarkable results in various text-driven tasks.

Physical Simulations

Adaptive De-noising of Photoacoustic Signal and Image based on Modified Kalman Filter

no code implementations18 Nov 2022 Tianqu Hu, Zihao Huang, Peng Ge, Feng Gao, Fei Gao

As a burgeoning medical imaging method based on hybrid fusion of light and ultrasound, photoacoustic imaging (PAI) has demonstrated high potential in various biomedical applications recently, especially in revealing the functional and molecular information to improve diagnostic accuracy.

Towards Reasoning-Aware Explainable VQA

no code implementations9 Nov 2022 Rakesh Vaideeswaran, Feng Gao, Abhinav Mathur, Govind Thattai

Our method generates human-readable textual explanations while maintaining SOTA VQA accuracy on the GQA-REX (77. 49%) and VQA-E (71. 48%) datasets.

Decoder Explanation Generation +3

Is MultiWOZ a Solved Task? An Interactive TOD Evaluation Framework with User Simulator

1 code implementation26 Oct 2022 Qinyuan Cheng, Linyang Li, Guofeng Quan, Feng Gao, Xiaofeng Mou, Xipeng Qiu

Besides, we introduce a sentence-level and a session-level score to measure the sentence fluency and session coherence in the interactive evaluation.

Sentence

Learning from Students: Online Contrastive Distillation Network for General Continual Learning

1 code implementation Conference 2022 Jin Li, Zhong Ji, Gang Wang, Qiang Wang, Feng Gao

The goal of General Continual Learning (GCL) is to preserve learned knowledge and learn new knowledge with constant memory from an infinite data stream where task boundaries are blurry.

Continual Learning

Full-Resolution Network and Dual-Threshold Iteration for Retinal Vessel and Coronary Angiograph Segmentation

1 code implementation JBHI 2022 Wentao Liu,Huihua Yang, Tong Tian, Zhiwei Cao, Xipeng Pan, Weijin Xu, Yang Jin, Feng Gao

The results demonstrate that FR-UNet outperforms state-of-the-art methods by achieving the highest Sen, AUC, F1, and IOU on most of the above-mentioned datasets with fewer parameters, and that DTI enhances vessel connectivity while greatly improving sensitivity.

Retinal Vessel Segmentation Segmentation

SSCU-Net: Spatial-Spectral Collaborative Unmixing Network for Hyperspectral Images

no code implementations12 Mar 2022 Lin Qi, Feng Gao, Junyu Dong, Xinbo Gao, Qian Du

Important findings on the use of spatial and spectral information in the autoencoder framework are discussed.

Hyperspectral Unmixing

Cross-level Contrastive Learning and Consistency Constraint for Semi-supervised Medical Image Segmentation

1 code implementation8 Feb 2022 Xinkai Zhao, Chaowei Fang, De-Jun Fan, Xutao Lin, Feng Gao, Guanbin Li

Semi-supervised learning (SSL), which aims at leveraging a few labeled images and a large number of unlabeled images for network training, is beneficial for relieving the burden of data annotation in medical image segmentation.

Contrastive Learning Image Segmentation +5

Change Detection from Synthetic Aperture Radar Images via Graph-Based Knowledge Supplement Network

1 code implementation22 Jan 2022 Junjie Wang, Feng Gao, Junyu Dong, Shan Zhang, Qian Du

Synthetic aperture radar (SAR) image change detection is a vital yet challenging task in the field of remote sensing image analysis.

Change Detection Feature Correlation

SAR Image Change Detection Based on Multiscale Capsule Network

1 code implementation22 Jan 2022 Yunhao Gao, Feng Gao, Junyu Dong, Heng-Chao Li

On the one hand, the multiscale capsule module is employed to exploit the spatial relationship of features.

Change Detection

Adaptive DropBlock Enhanced Generative Adversarial Networks for Hyperspectral Image Classification

1 code implementation22 Jan 2022 Junjie Wang, Feng Gao, Junyu Dong, Qian Du

Second, an adaptive DropBlock (AdapDrop) is proposed as a regularization method employed in the generator and discriminator to alleviate the mode collapse issue.

Classification Hyperspectral Image Classification

A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question Answering

no code implementations14 Jan 2022 Feng Gao, Qing Ping, Govind Thattai, Aishwarya Reganti, Ying Nian Wu, Prem Natarajan

Outside-knowledge visual question answering (OK-VQA) requires the agent to comprehend the image, make use of relevant knowledge from the entire web, and digest all the information to answer the question.

Generative Question Answering Image to text +3

Meta Convolutional Neural Networks for Single Domain Generalization

no code implementations CVPR 2022 Chaoqun Wan, Xu Shen, Yonggang Zhang, Zhiheng Yin, Xinmei Tian, Feng Gao, Jianqiang Huang, Xian-Sheng Hua

Taking meta features as reference, we propose compositional operations to eliminate irrelevant features of local convolutional features by an addressing process and then to reformulate the convolutional feature maps as a composition of related meta features.

Photo to Rest Generalization

Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge Visual Question Answering

no code implementations CVPR 2022 Feng Gao, Qing Ping, Govind Thattai, Aishwarya Reganti, Ying Nian Wu, Prem Natarajan

Most previous works address the problem by first fusing the image and question in the multi-modal space, which is inflexible for further fusion with a vast amount of external knowledge.

Generative Question Answering Image to text +3

Physics-Guided Generative Adversarial Networks for Sea Subsurface Temperature Prediction

1 code implementation4 Nov 2021 Yuxin Meng, Eric Rigall, Xueen Chen, Feng Gao, Junyu Dong, Sheng Chen

Physical modeling methods can offer the potential for extrapolation beyond observational conditions, while data-driven methods are flexible in adapting to data and are capable of detecting unexpected patterns.

Generative Adversarial Network

Synthetic Aperture Radar Image Change Detection via Siamese Adaptive Fusion Network

1 code implementation18 Oct 2021 Yunhao Gao, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

Moreover, a correlation layer is designed to further explore the correlation between multitemporal images.

Change Detection

ClueReader: Heterogeneous Graph Attention Network for Multi-hop Machine Reading Comprehension

no code implementations2 Jul 2021 Peng Gao, Feng Gao, Peng Wang, Jian-Cheng Ni, Fei Wang, Hamido Fujita

Multi-hop machine reading comprehension is a challenging task in natural language processing as it requires more reasoning ability across multiple documents.

Graph Attention Machine Reading Comprehension

SAR Image Change Detection Based on Multiscale Capsule Network

1 code implementation13 Jun 2021 Yunhao Gao, Feng Gao, Junyu Dong, Heng-Chao Li

On the one hand, the capsule module is employed to exploit the spatial relationship of features.

Change Detection

Learning the Precise Feature for Cluster Assignment

1 code implementation11 Jun 2021 Yanhai Gan, Xinghui Dong, Huiyu Zhou, Feng Gao, Junyu Dong

Based on this, we propose a general-purpose deep clustering framework which radically integrates representation learning and clustering into a single pipeline for the first time.

Clustering Deep Clustering +4

Towards Efficient Full 8-bit Integer DNN Online Training on Resource-limited Devices without Batch Normalization

no code implementations27 May 2021 Yukuan Yang, Xiaowei Chi, Lei Deng, Tianyi Yan, Feng Gao, Guoqi Li

In summary, the EOQ framework is specially designed for reducing the high cost of convolution and BN in network training, demonstrating a broad application prospect of online training in resource-limited devices.

Model Compression Quantization

Photoacoustic-monitored laser treatment for tattoo removal: a feasibility study

no code implementations26 May 2021 Yiyun Wang, Daohuai Jiang, Hengrong Lan, Feng Gao, Fei Gao

Skin blemishes and diseases have attracted increasing research interest in recent decades, due to their growing frequency of occurrence and the severity of related diseases.

Change Detection in Synthetic Aperture Radar Images Using a Dual-Domain Network

1 code implementation14 Apr 2021 Xiaofan Qu, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

In addition, we further propose a multi-region convolution module, which emphasizes the central region of each patch.

Change Detection

Deep Transformers for Fast Small Intestine Grounding in Capsule Endoscope Video

no code implementations7 Apr 2021 Xinkai Zhao, Chaowei Fang, Feng Gao, De-Jun Fan, Xutao Lin, Guanbin Li

In this paper, we propose a deep model to ground shooting range of small intestine from a capsule endoscope video which has duration of tens of hours.

Hyperspectral and LiDAR data classification based on linear self-attention

no code implementations6 Apr 2021 Min Feng, Feng Gao, Jian Fang, Junyu Dong

An efficient linear self-attention fusion model is proposed in this paper for the task of hyperspectral image (HSI) and LiDAR data joint classification.

Classification General Classification

Change Detection from SAR Images Based on Deformable Residual Convolutional Neural Networks

no code implementations6 Apr 2021 Junjie Wang, Feng Gao, Junyu Dong

Convolutional neural networks (CNN) have made great progress for synthetic aperture radar (SAR) images change detection.

Change Detection

Disentangled Non-Local Network for Hyperspectral and LiDAR Data Classification

no code implementations6 Apr 2021 Wenxia Liu, Feng Gao, Junyu Dong

As the ground objects become increasingly complex, the classification results obtained by single source remote sensing data can hardly meet the application requirements.

Classification General Classification

Experimentally Validated Hopping-Transport Model for Energetically Disordered Organic Semiconductors

no code implementations5 Mar 2021 Tanvi Upreti, Yuming Wang, Huotian Zhang, Dorothea Scheunemann, Feng Gao, Martijn Kemerink

Charge transport in disordered organic semiconductors occurs by hopping of charge carriers between localized sites that are randomly distributed in a strongly energy dependent density of states.

Materials Science

The dynamic energy balance in earthquakes expressed by fault surface morphology

no code implementations18 Jan 2021 Xin Wang, Juan Liu, Feng Gao, Zhizhen Zhang

The fault surface morphology is the direct result of the microscopic processes near the crack tip or on the frictional interface.

Geophysics

Superposed Wave (s-Wave): Accelerating Photoacoustic Simulation

no code implementations24 Dec 2020 Jiadong Zhang, Tengbo Lyu, Changchun Yang, Yimeng Yang, Shanshan Guo, Feng Gao, Fei Gao

However, it is still in a developing stage, and a lot of experiments have to be performed in a simulation setting.

Medical Physics

Attentional Separation-and-Aggregation Network for Self-supervised Depth-Pose Learning in Dynamic Scenes

no code implementations18 Nov 2020 Feng Gao, Jincheng Yu, Hao Shen, Yu Wang, Huazhong Yang

Learning depth and ego-motion from unlabeled videos via self-supervision from epipolar projection can improve the robustness and accuracy of the 3D perception and localization of vision-based robots.

Generalized Inverse Planning: Learning Lifted non-Markovian Utility for Generalizable Task Representation

no code implementations12 Nov 2020 Sirui Xie, Feng Gao, Song-Chun Zhu

Seeing that the proposed generalization problem has not been widely studied yet, we carefully define an evaluation protocol, with which we illustrate the effectiveness of MEIP on two proof-of-concept domains and one challenging task: learning to fold from demonstrations.

Conceptualized Representation Learning for Chinese Biomedical Text Mining

no code implementations25 Aug 2020 Ningyu Zhang, Qianghuai Jia, Kangping Yin, Liang Dong, Feng Gao, Nengwei Hua

In this paper, we investigate how the recently introduced pre-trained language model BERT can be adapted for Chinese biomedical corpora and propose a novel conceptualized representation learning approach.

Language Modelling Representation Learning

Deep learning for photoacoustic imaging: a survey

1 code implementation10 Aug 2020 Changchun Yang, Hengrong Lan, Feng Gao, Fei Gao

In this review, we performed an overview of some new developments and challenges in the application of machine learning to medical image analysis, with a special focus on deep learning in photoacoustic imaging.

BIG-bench Machine Learning Deep Learning +4

Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense

no code implementations20 Apr 2020 Yixin Zhu, Tao Gao, Lifeng Fan, Siyuan Huang, Mark Edmonds, Hangxin Liu, Feng Gao, Chi Zhang, Siyuan Qi, Ying Nian Wu, Joshua B. Tenenbaum, Song-Chun Zhu

We demonstrate the power of this perspective to develop cognitive AI systems with humanlike common sense by showing how to observe and apply FPICU with little training data to solve a wide range of challenging tasks, including tool use, planning, utility inference, and social learning.

Common Sense Reasoning Small Data Image Classification

When Do Drivers Concentrate? Attention-based Driver Behavior Modeling With Deep Reinforcement Learning

no code implementations26 Feb 2020 Xingbo Fu, Feng Gao, Jiang Wu

In this paper, we propose an actor-critic method - Attention-based Twin Delayed Deep Deterministic policy gradient (ATD3) algorithm to approximate a driver' s action according to observations and measure the driver' s attention allocation for consecutive time steps in car-following model.

Deep Reinforcement Learning reinforcement-learning +1

Deep Learning Enabled Real-Time Photoacoustic Tomography System via Single Data Acquisition Channel

no code implementations21 Jan 2020 Hengrong Lan, Daohuai Jiang, Feng Gao, Fei Gao

In this work, we develop a novel PACT system to provide real-time imaging, which is achieved by a 120-elements ultrasound array only using a single data acquisition (DAQ) channel.

Learning Perceptual Inference by Contrasting

1 code implementation NeurIPS 2019 Chi Zhang, Baoxiong Jia, Feng Gao, Yixin Zhu, Hongjing Lu, Song-Chun Zhu

"Thinking in pictures," [1] i. e., spatial-temporal reasoning, effortless and instantaneous for humans, is believed to be a significant ability to perform logical induction and a crucial factor in the intellectual history of technology development.

Understanding Graph Neural Networks with Generalized Geometric Scattering Transforms

1 code implementation14 Nov 2019 Michael Perlmutter, Alexander Tong, Feng Gao, Guy Wolf, Matthew Hirn

As a result, the proposed construction unifies and extends known theoretical results for many of the existing graph scattering architectures.

Spatiotemporal Attention Networks for Wind Power Forecasting

1 code implementation14 Sep 2019 Xingbo Fu, Feng Gao, Jiang Wu, Xinyu Wei, Fangwei Duan

This model captures spatial correlations among wind farms and temporal dependencies of wind power time series.

Time Series Time Series Analysis

Geometric Wavelet Scattering Networks on Compact Riemannian Manifolds

no code implementations24 May 2019 Michael Perlmutter, Feng Gao, Guy Wolf, Matthew Hirn

The Euclidean scattering transform was introduced nearly a decade ago to improve the mathematical understanding of convolutional neural networks.

Translation

Graph Classification with Geometric Scattering

no code implementations ICLR 2019 Feng Gao, Guy Wolf, Matthew Hirn

Furthermore, ConvNets inspired recent advances in geometric deep learning, which aim to generalize these networks to graph data by applying notions from graph signal processing to learn deep graph filter cascades.

General Classification Graph Classification +1

RAVEN: A Dataset for Relational and Analogical Visual rEasoNing

no code implementations CVPR 2019 Chi Zhang, Feng Gao, Baoxiong Jia, Yixin Zhu, Song-Chun Zhu

In this work, we propose a new dataset, built in the context of Raven's Progressive Matrices (RPM) and aimed at lifting machine intelligence by associating vision with structural, relational, and analogical reasoning in a hierarchical representation.

Object Recognition Question Answering +2

Attention-driven Tree-structured Convolutional LSTM for High Dimensional Data Understanding

no code implementations29 Jan 2019 Bin Kong, Xin Wang, Junjie Bai, Yi Lu, Feng Gao, Kunlin Cao, Qi Song, Shaoting Zhang, Siwei Lyu, Youbing Yin

In order to address these limitations, we present tree-structured ConvLSTM models for tree-structured image analysis tasks which can be trained end-to-end.

Vocal Bursts Intensity Prediction

Residual Attention based Network for Hand Bone Age Assessment

no code implementations21 Dec 2018 Eric Wu, Bin Kong, Xin Wang, Junjie Bai, Yi Lu, Feng Gao, Shaoting Zhang, Kunlin Cao, Qi Song, Siwei Lyu, Youbing Yin

The hierarchical attention components of the residual attention subnet force our network to focus on the key components of the X-ray images and generate the final predictions as well as the associated visual supports, which is similar to the assessment procedure of clinicians.

Hand Segmentation

Geometric Scattering for Graph Data Analysis

no code implementations ICLR 2019 Feng Gao, Guy Wolf, Matthew Hirn

We explore the generalization of scattering transforms from traditional (e. g., image or audio) signals to graph data, analogous to the generalization of ConvNets in geometric deep learning, and the utility of extracted graph features in graph data analysis.

General Classification Graph Classification +1

Incorporating Intra-Class Variance to Fine-Grained Visual Recognition

no code implementations1 Mar 2017 Yan Bai, Feng Gao, Yihang Lou, Shiqi Wang, Tiejun Huang, Ling-Yu Duan

In this paper, we propose to leverage intra-class variance in metric learning of triplet network to improve the performance of fine-grained recognition.

Fine-Grained Visual Recognition Metric Learning +2

Improving Object Detection with Region Similarity Learning

no code implementations1 Mar 2017 Feng Gao, Yihang Lou, Yan Bai, Shiqi Wang, Tiejun Huang, Ling-Yu Duan

Object detection aims to identify instances of semantic objects of a certain class in images or videos.

Multi-Task Learning Object +3

Cannot find the paper you are looking for? You can Submit a new open access paper.