Search Results for author: Shang Gao

Found 37 papers, 15 papers with code

BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA

1 code implementation4 Mar 2025 Zhengyang Ji, Shang Gao, Li Liu, Yifan Jia, Yutao Yue

Biomedical visual question answering (VQA) has been widely studied and has demonstrated significant application value and potential in fields such as assistive medical diagnosis.

Medical Diagnosis Question Answering +1

Text-promptable Propagation for Referring Medical Image Sequence Segmentation

no code implementations16 Feb 2025 Runtian Yuan, Jilan Xu, Mohan Chen, Qingqiu Li, Yuejie Zhang, Rui Feng, Tao Zhang, Shang Gao

We develop a strong baseline model, Text-Promptable Propagation (TPP), designed to exploit the intrinsic relationships among sequential images and their associated textual descriptions.

Interactive Segmentation Segmentation

Hyperbolic Chamfer Distance for Point Cloud Completion and Beyond

no code implementations23 Dec 2024 Fangzhou Lin, Songlin Hou, Haotian Liu, Shang Gao, Kazunori D Yamada, Haichong K. Zhang, Ziming Zhang

In divergence from the existing literature, which largely concentrates on resolving such concerns in the realm of Euclidean space, we put forth a notably uncomplicated yet potent metric specifically designed for point cloud completion tasks: {Hyperbolic Chamfer Distance (HyperCD)}.

Image Reconstruction Point Cloud Completion

SPADE: Spectroscopic Photoacoustic Denoising using an Analytical and Data-free Enhancement Framework

no code implementations16 Dec 2024 Fangzhou Lin, Shang Gao, Yichuan Tang, Xihan Ma, Ryo Murakami, Ziming Zhang, John D. Obayemi, Winston W. Soboyejo, Haichong K. Zhang

This framework integrates a data-free learning-based method with an efficient BM3D-based analytical approach while preserves spectral linearity, providing noise reduction and ensuring that functional information is maintained.

Denoising

GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning

1 code implementation20 Oct 2024 Haiwen Diao, Ying Zhang, Shang Gao, Jiawen Zhu, Long Chen, Huchuan Lu

Cross-modal metric learning is a prominent research topic that bridges the semantic heterogeneity between vision and language.

Image Retrieval Image-text Retrieval +4

Measuring the Groundedness of Legal Question-Answering Systems

no code implementations11 Oct 2024 Dietrich Trautmann, Natalia Ostapuk, Quentin Grail, Adrian Alan Pol, Guglielmo Bonifazi, Shang Gao, Martin Gajek

In summary, this study demonstrates the potential of various detection methods to improve the trustworthiness of generative AI in legal settings.

Natural Language Inference Question Answering

A Little Confidence Goes a Long Way

no code implementations20 Aug 2024 John Scoville, Shang Gao, Devanshu Agrawal, Javed Qadrud-Din

We introduce a group of related methods for binary classification tasks using probes of the hidden state activations in large language models (LLMs).

Binary Classification

Self-Attention-Based Contextual Modulation Improves Neural System Identification

no code implementations12 Jun 2024 Isaac Lin, Tianye Wang, Shang Gao, Shiming Tang, Tai Sing Lee

We find that self-attention can replace posterior spatial-integration convolutions when learned incrementally, and is further enhanced in the presence of a fully connected readout layer, suggesting that the two context mechanisms are complementary.

Incremental Learning

GRAMMAR: Grounded and Modular Methodology for Assessment of Closed-Domain Retrieval-Augmented Language Model

1 code implementation30 Apr 2024 Xinzhe Li, Ming Liu, Shang Gao

Retrieval-Augmented Generation (RAG) systems are widely used across various industries for querying closed-domain and in-house knowledge bases.

Language Modeling Language Modelling +2

Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching

1 code implementation28 Apr 2024 Haiwen Diao, Ying Zhang, Shang Gao, Xiang Ruan, Huchuan Lu

Specifically, we propose a brand-new Deep Boosting Learning (DBL) algorithm, where an anchor branch is first trained to provide insights into the data properties, with a target branch gaining more advanced knowledge to develop optimal features and distance metrics.

Contrastive Learning Image-text matching +2

Can't Remember Details in Long Documents? You Need Some R&R

1 code implementation8 Mar 2024 Devanshu Agrawal, Shang Gao, Martin Gajek

Long-context large language models (LLMs) hold promise for tasks such as question-answering (QA) over long documents, but they tend to miss important information in the middle of context documents (arXiv:2307. 03172v3).

Question Answering

Part Representation Learning with Teacher-Student Decoder for Occluded Person Re-identification

1 code implementation15 Dec 2023 Shang Gao, Chenyang Yu, Pingping Zhang, Huchuan Lu

In addition, existing occluded person ReID benchmarks utilize occluded samples as queries, which will amplify the role of alleviating occlusion interference and underestimate the impact of the feature absence issue.

Decoder Human Parsing +3

Learning to Learn for Few-shot Continual Active Learning

no code implementations7 Nov 2023 Stella Ho, Ming Liu, Shang Gao, Longxiang Gao

Recent advances in continual learning are mostly confined to a supervised learning setting, especially in NLP domain.

Active Learning Continual Learning +3

Enhanced Knowledge Injection for Radiology Report Generation

no code implementations1 Nov 2023 Qingqiu Li, Jilan Xu, Runtian Yuan, Mohan Chen, Yuejie Zhang, Rui Feng, Xiaobo Zhang, Shang Gao

Automatic generation of radiology reports holds crucial clinical value, as it can alleviate substantial workload on radiologists and remind less experienced ones of potential anomalies.

Image Captioning Retrieval

Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data

1 code implementation2 Jul 2023 Xinzhe Li, Ming Liu, Shang Gao

This paper addresses the ethical concerns arising from the use of unauthorized public data in deep learning models and proposes a novel solution.

Question Answering text-classification +1

A Survey on Out-of-Distribution Evaluation of Neural NLP Models

no code implementations27 Jun 2023 Xinzhe Li, Ming Liu, Shang Gao, Wray Buntine

Adversarial robustness, domain generalization and dataset biases are three active lines of research contributing to out-of-distribution (OOD) evaluation on neural NLP models.

Adversarial Robustness Domain Generalization +1

Can Pretrained Language Models Derive Correct Semantics from Corrupt Subwords under Noise?

1 code implementation27 Jun 2023 Xinzhe Li, Ming Liu, Shang Gao

For Pretrained Language Models (PLMs), their susceptibility to noise has recently been linked to subword segmentation.

Segmentation

Track Anything: Segment Anything Meets Videos

1 code implementation24 Apr 2023 Jinyu Yang, Mingqi Gao, Zhe Li, Shang Gao, Fangjing Wang, Feng Zheng

Therefore, in this report, we propose Track Anything Model (TAM), which achieves high-performance interactive tracking and segmentation in videos.

Image Segmentation Segmentation +2

Anomaly Detection of UAV State Data Based on Single-class Triangular Global Alignment Kernel Extreme Learning Machine

no code implementations18 Feb 2023 Feisha Hu, Qi Wang, Haijian Shao, Shang Gao, Hualong Yu

To improve the performance of OCKELM, we choose a Triangular Global Alignment Kernel (TGAK) instead of an RBF Kernel and introduce the Fast Independent Component Analysis (FastICA) algorithm to reconstruct UAV data.

Anomaly Detection

Resource-Efficient RGBD Aerial Tracking

1 code implementation CVPR 2023 Jinyu Yang, Shang Gao, Zhe Li, Feng Zheng, Aleš Leonardis

However, current research on aerial perception has mainly focused on limited categories, such as pedestrian or vehicle, and most scenes are captured in urban environments from a birds-eye view.

Object Tracking

DGD-cGAN: A Dual Generator for Image Dewatering and Restoration

1 code implementation18 Nov 2022 Salma Gonzalez-Sabbagh, Antonio Robles-Kelly, Shang Gao

Our Dual Generator Dewatering cGAN (DGD-cGAN) removes the haze and colour cast induced by the water column and restores the true colours of underwater scenes whereby the effects of various attenuation and scattering phenomena that occur in underwater images are tackled by the two generators.

Generative Adversarial Network Image Enhancement

Learning Dual-Fused Modality-Aware Representations for RGBD Tracking

no code implementations6 Nov 2022 Shang Gao, Jinyu Yang, Zhe Li, Feng Zheng, Aleš Leonardis, Jingkuan Song

However, some existing RGBD trackers use the two modalities separately and thus some particularly useful shared information between them is ignored.

Object Tracking

Graph Classification via Discriminative Edge Feature Learning

no code implementations5 Oct 2022 Yang Yi, Xuequan Lu, Shang Gao, Antonio Robles-Kelly, Yuejie Zhang

Three new graph datasets are constructed based on ModelNet40, ModelNet10 and ShapeNet Part datasets.

Graph Classification

CREAM: Weakly Supervised Object Localization via Class RE-Activation Mapping

1 code implementation CVPR 2022 Jilan Xu, Junlin Hou, Yuejie Zhang, Rui Feng, Rui-Wei Zhao, Tao Zhang, Xuequan Lu, Shang Gao

In this paper, we empirically prove that this problem is associated with the mixup of the activation values between less discriminative foreground regions and the background.

Clustering Object +2

Fault Diagnosis of Discrete-Event Systems under Non-Deterministic Observations with Output Fairness

no code implementations6 Apr 2022 Weijie Dong, Shang Gao, Xiang Yin, ShaoYuan Li

Non-deterministic observation is a general observation model that includes the case of intermittent loss of observations.

Fairness Fault Diagnosis

Towards Uniform Point Distribution in Feature-preserving Point Cloud Filtering

no code implementations5 Jan 2022 Shuaijun Chen, Jinxi Wang, Wei Pan, Shang Gao, Meili Wang, Xuequan Lu

As a popular representation of 3D data, point cloud may contain noise and need to be filtered before use.

Automated Security Assessment for the Internet of Things

no code implementations9 Sep 2021 Xuanyu Duan, Mengmeng Ge, Triet H. M. Le, Faheem Ullah, Shang Gao, Xuequan Lu, M. Ali Babar

This security model automatically assesses the security of the IoT network by capturing potential attack paths.

3D Face Recognition: A Survey

no code implementations25 Aug 2021 Yaping Jing, Xuequan Lu, Shang Gao

Face recognition is one of the most studied research topics in the community.

Face Recognition Survey

Hierarchical excitations from correlated spin tetrahedra on the breathing pyrochlore lattice

no code implementations29 Jan 2021 Shang Gao, Andrew F. May, Mao-Hua Du, Joseph A. M. Paddison, Hasitha Suriya Arachchige, Ganesh Pokharel, Clarina dela Cruz, Qiang Zhang, Georg Ehlers, David S. Parker, David G. Mandrus, Matthew B. Stone, Andrew D. Christianson

The hierarchy of the coupling strengths in a physical system often engenders an effective model at low energies where the decoupled high-energy modes are integrated out.

Strongly Correlated Electrons

Integration of Domain Knowledge using Medical Knowledge Graph Deep Learning for Cancer Phenotyping

no code implementations5 Jan 2021 Mohammed Alawad, Shang Gao, Mayanka Chandra Shekar, S. M. Shamimul Hasan, J. Blair Christian, Xiao-Cheng Wu, Eric B. Durbin, Jennifer Doherty, Antoinette Stroup, Linda Coyle, Lynne Penberthy, Georgia Tourassi

Word embeddings that effectively capture the meaning and context of the word that they represent can significantly improve the performance of downstream DL models for various NLP tasks.

Word Embeddings

A catastrophic charge density wave in BaFe$_2$Al$_9$

no code implementations1 Jan 2021 William R. Meier, Bryan C. Chakoumakos, Satoshi Okamoto, Michael A. McGuire, Raphaël P. Hermann, German D. Samolyuk, Shang Gao, Qiang Zhang, Matthew B. Stone, Andrew D. Christianson, Brian C. Sales

Single crystal x-ray diffraction reveals super-lattice peaks in the low-temperature phase signaling the development of a CDW lattice modulation.

Strongly Correlated Electrons Materials Science

Pose-guided Visible Part Matching for Occluded Person ReID

1 code implementation CVPR 2020 Shang Gao, Jingya Wang, Huchuan Lu, Zimo Liu

Occluded person re-identification is a challenging task as the appearance varies substantially with various obstacles, especially in the crowd scenario.

Graph Matching Occluded Person Re-Identification

Hierarchical Convolutional Attention Networks for Text Classification

no code implementations WS 2018 Shang Gao, Arvind Ramanathan, Georgia Tourassi

Recent work in machine translation has demonstrated that self-attention mechanisms can be used in place of recurrent neural networks to increase training speed without sacrificing model accuracy.

Document Classification General Classification +4

Cannot find the paper you are looking for? You can Submit a new open access paper.