Search Results for author: Cong Yang

Found 21 papers, 15 papers with code

Comparison of Feature Learning Methods for Metadata Extraction from PDF Scholarly Documents

no code implementations9 Jan 2025 Zeyd Boukhers, Cong Yang

The availability of metadata for scientific documents is pivotal in propelling scientific knowledge forward and for adhering to the FAIR principles (i. e. Findability, Accessibility, Interoperability, and Reusability) of research findings.

Large Language Model in Medical Informatics: Direct Classification and Enhanced Text Representations for Automatic ICD Coding

no code implementations11 Nov 2024 Zeyd Boukhers, AmeerAli Khan, Qusai Ramadan, Cong Yang

Addressing the complexity of accurately classifying International Classification of Diseases (ICD) codes from medical discharge summaries is challenging due to the intricate nature of medical documentation.

Classification Code Classification +3

Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning

1 code implementation19 Sep 2024 Cong Yang, Zuchao Li, Hongzan Jiao, Zhi Gao, Lefei Zhang

This framework aims to fully leverage the intrinsic knowledge of large language models through visual instructions and enhance the effectiveness and accuracy of change features using pixel-level change detection tasks.

Change Detection Decoder +3

CAMAv2: A Vision-Centric Approach for Static Map Element Annotation

1 code implementation31 Jul 2024 Shiyuan Chen, Jiaxin Zhang, Ruohong Mei, Yingfeng Cai, Haoran Yin, Tao Chen, Wei Sui, Cong Yang

Compared with the original nuScenes static map element, our CAMAv2 annotations achieve lower reprojection errors (e. g., 4. 96 vs. 8. 03 pixels).

MGIMM: Multi-Granularity Instruction Multimodal Model for Attribute-Guided Remote Sensing Image Detailed Description

1 code implementation7 Jun 2024 Cong Yang, Zuchao Li, Lefei Zhang

Then, with the multimodal model aligned on region-attribute, guided by multi-grain visual features, MGIMM fully perceives both region-level and global image information, utilizing large language models for comprehensive descriptions of remote sensing images.

Attribute

Falcon 7b for Software Mention Detection in Scholarly Documents

no code implementations14 May 2024 AmeerAli Khan, Qusai Ramadan, Cong Yang, Zeyd Boukhers

This paper aims to tackle the challenge posed by the increasing integration of software tools in research across various disciplines by investigating the application of Falcon-7b for the detection and classification of software mentions within scholarly texts.

VRSO: Visual-Centric Reconstruction for Static Object Annotation

1 code implementation22 Mar 2024 Chenyao Yu, Yingfeng Cai, Jiaxin Zhang, Hui Kong, Wei Sui, Cong Yang

As a part of the perception results of intelligent driving systems, static object detection (SOD) in 3D space provides crucial cues for driving environment understanding.

Object object-detection +1

Gyroscope-Assisted Motion Deblurring Network

1 code implementation10 Feb 2024 Simin Luan, Cong Yang, Zeyd Boukhers, Xue Qin, Dongfeng Cheng, Wei Sui, Zhijun Li

Yet, their practical usage in real-world deblurring, especially motion blur, remains limited due to the lack of pixel-aligned training triplets (background, blurred image, and blur heat map) and restricted information inherent in blurred images.

Deblurring Image Restoration +1

Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning

1 code implementation2 Dec 2023 Cong Yang, Zuchao Li, Lefei Zhang

To efficiently align the image-text, we propose a novel two-stage vision-language pre-training-based approach to bootstrap interactive image-text alignment for remote sensing image captioning, called BITA, which relies on the design of a lightweight interactive Fourier Transformer to better align remote sensing image-text features.

Causal Language Modeling Contrastive Learning +5

Skeleton Ground Truth Extraction: Methodology, Annotation Tool and Benchmarks

1 code implementation10 Oct 2023 Cong Yang, Bipin Indurkhya, John See, Bo Gao, Yan Ke, Zeyd Boukhers, Zhenyu Yang, Marcin Grzegorzek

However, most existing shape and image datasets suffer from the lack of skeleton GT and inconsistency of GT standards.

Augmented Box Replay: Overcoming Foreground Shift for Incremental Object Detection

1 code implementation ICCV 2023 Liu Yuyang, Cong Yang, Goswami Dipam, Liu Xialei, Joost Van de Weijer

Foreground shift only occurs when replaying images of previous tasks and refers to the fact that their background might contain foreground objects of the current task.

Incremental Learning object-detection +1

Towards Accurate Ground Plane Normal Estimation from Ego-Motion

1 code implementation8 Dec 2022 Jiaxin Zhang, Wei Sui, Qian Zhang, Tao Chen, Cong Yang

In this paper, we introduce a novel approach for ground plane normal estimation of wheeled vehicles.

3D Object Detection Autonomous Driving +3

Handling Data Heterogeneity in Federated Learning via Knowledge Distillation and Fusion

1 code implementation23 Jul 2022 Xu Zhou, Xinyu Lei, Cong Yang, Yichun Shi, Xiao Zhang, Jingwen Shi

The key idea in FedKF is to let the server return the global knowledge to be fused with the local knowledge in each training round so that the local model can be regularized towards the global optima.

Data-free Knowledge Distillation Fairness +2

Beyond Trading Data: The Hidden Influence of Public Awareness and Interest on Cryptocurrency Volatility

no code implementations12 Feb 2022 Zeyd Boukhers, Azeddine Bouabdallah, Cong Yang, Jan Jürjens

This study examines the various independent factors that affect the volatility of the Bitcoin-Dollar exchange rate.

Decision Making

CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization

no code implementations ECCV 2020 Yuxi Li, Weiyao Lin, John See, Ning Xu, Shugong Xu, Ke Yan, Cong Yang

Most current pipelines for spatio-temporal action localization connect frame-wise or clip-wise detection results to generate action proposals, where only local information is exploited and the efficiency is hindered by dense per-frame localization.

Action Detection Spatio-Temporal Action Localization +1

PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments

1 code implementation ECCV 2020 Zhiming Chen, Kean Chen, Weiyao Lin, John See, Hui Yu, Yan Ke, Cong Yang

The experimental results show that PIoU loss can dramatically improve the performance of OBB detectors, particularly on objects with high aspect ratios and complex backgrounds.

object-detection Object Detection In Aerial Images +2

User-Curated Image Collections: Modeling and Recommendation

no code implementations18 Sep 2015 Li Yuncheng, Cong Yang, Mei Tao, Luo Jiebo

We then consider image collection recommendation as a dynamic similarity measurement problem in response to user's clicked image set, and employ a metric learner to measure the similarity between the image collection and the clicked image set.

Image Retrieval Recommendation Systems +1

Cannot find the paper you are looking for? You can Submit a new open access paper.