Search Results for author: Bin Luo

Found 97 papers, 37 papers with code

Tracking Anything in High Quality

1 code implementation26 Jul 2023 Jiawen Zhu, Zhenyu Chen, Zeqi Hao, Shijie Chang, Lu Zhang, Dong Wang, Huchuan Lu, Bin Luo, Jun-Yan He, Jin-Peng Lan, Hanyuan Chen, Chenyang Li

To further improve the quality of tracking masks, a pretrained MR model is employed to refine the tracking results.

Object Semantic Segmentation +3

Pedestrian Attribute Recognition: A Survey

1 code implementation22 Jan 2019 Xiao Wang, Shaofei Zheng, Rui Yang, Aihua Zheng, Zhe Chen, Jin Tang, Bin Luo

We also review some popular network architectures which have been widely applied in the deep learning community.

Attribute Multi-Label Learning +2

Tiny Object Tracking: A Large-scale Dataset and A Baseline

1 code implementation11 Feb 2022 Yabin Zhu, Chenglong Li, Yao Liu, Xiao Wang, Jin Tang, Bin Luo, Zhixiang Huang

Tiny objects, frequently appearing in practical applications, have weak appearance and features, and receive increasing interests in meany vision tasks, such as object detection and segmentation.

Attribute Knowledge Distillation +4

Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection

1 code implementation CVPR 2023 Hui Lv, Zhongqi Yue, Qianru Sun, Bin Luo, Zhen Cui, Hanwang Zhang

At each MIL training iteration, we use the current detector to divide the samples into two groups with different context biases: the most confident abnormal/normal snippets and the rest ambiguous ones.

Anomaly Detection Multiple Instance Learning +1

LasHeR: A Large-scale High-diversity Benchmark for RGBT Tracking

1 code implementation27 Apr 2021 Chenglong Li, Wanlin Xue, Yaqing Jia, Zhichen Qu, Bin Luo, Jin Tang, Dengdi Sun

RGBT tracking receives a surge of interest in the computer vision community, but this research field lacks a large-scale and high-diversity benchmark dataset, which is essential for both the training of deep RGBT trackers and the comprehensive evaluation of RGBT tracking methods.

Rgb-T Tracking Vocal Bursts Intensity Prediction

Long-term Frame-Event Visual Tracking: Benchmark Dataset and Baseline

3 code implementations9 Mar 2024 Xiao Wang, Ju Huang, Shiao Wang, Chuanming Tang, Bo Jiang, Yonghong Tian, Jin Tang, Bin Luo

Current event-/frame-event based trackers undergo evaluation on short-term tracking datasets, however, the tracking of real-world scenarios involves long-term tracking, and the performance of existing tracking algorithms in these scenarios remains unclear.

Object Tracking Rgb-T Tracking

VcT: Visual change Transformer for Remote Sensing Image Change Detection

1 code implementation17 Oct 2023 Bo Jiang, Zitian Wang, Xixi Wang, Ziyan Zhang, Lan Chen, Xiao Wang, Bin Luo

Then, each pixel of feature map is regarded as a graph node and the graph neural network is proposed to model the structured information for coarse change map prediction.

Change Detection Representation Learning

Overcoming Topology Agnosticism: Enhancing Skeleton-Based Action Recognition through Redefined Skeletal Topology Awareness

1 code implementation19 May 2023 Yuxuan Zhou, Zhi-Qi Cheng, Jun-Yan He, Bin Luo, Yifeng Geng, Xuansong Xie

As a remedy, we propose a threefold strategy: (1) We forge an innovative pathway that encodes bone connectivity by harnessing the power of graph distances.

Action Recognition Skeleton Based Action Recognition

Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model

1 code implementation26 Mar 2024 Runmin Dong, Shuai Yuan, Bin Luo, Mengxuan Chen, Jinxiao Zhang, Lixian Zhang, Weijia Li, Juepeng Zheng, Haohuan Fu

Specifically, we inject the priors into the denoising model to improve the utilization of reference information in unchanged areas and regulate the reconstruction of semantically relevant content in changed areas.

Denoising Reference-based Super-Resolution

Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models

1 code implementation30 Nov 2023 Dong Li, Jiandong Jin, Yuhao Zhang, Yanlin Zhong, Yaoyang Wu, Lan Chen, Xiao Wang, Bin Luo

Current methods typically employ backbone networks to individually extract the features of RGB frames and event streams, and subsequently fuse these features for pattern recognition.

Language Modelling Prompt Engineering

A Unified RGB-T Saliency Detection Benchmark: Dataset, Baselines, Analysis and A Novel Approach

1 code implementation11 Jan 2017 Chenglong Li, Guizhao Wang, Yunpeng Ma, Aihua Zheng, Bin Luo, Jin Tang

In particular, we introduce a weight for each modality to describe the reliability, and integrate them into the graph-based manifold ranking algorithm to achieve adaptive fusion of different source data.

Saliency Detection

Semantic-Aware Dual Contrastive Learning for Multi-label Image Classification

1 code implementation19 Jul 2023 Leilei Ma, Dengdi Sun, Lei Wang, Haifeng Zhao, Bin Luo

Specifically, we leverage semantic-aware representation learning to extract category-related local discriminative features and construct category prototypes.

Contrastive Learning Multi-Label Image Classification +2

DCPT: Darkness Clue-Prompted Tracking in Nighttime UAVs

1 code implementation19 Sep 2023 Jiawen Zhu, Huayi Tang, Zhi-Qi Cheng, Jun-Yan He, Bin Luo, Shihao Qiu, Shengming Li, Huchuan Lu

To address this, we propose a novel architecture called Darkness Clue-Prompted Tracking (DCPT) that achieves robust UAV tracking at night by efficiently learning to generate darkness clue prompts.

Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search

1 code implementation19 May 2022 Xiao Wang, Zhe Chen, Bo Jiang, Jin Tang, Bin Luo, DaCheng Tao

To track the target in a video, current visual trackers usually adopt greedy search for target object localization in each frame, that is, the candidate region with the maximum response score will be selected as the tracking result of each frame.

Decision Making Image Captioning +5

\emph{cm}SalGAN: RGB-D Salient Object Detection with Cross-View Generative Adversarial Networks

1 code implementation21 Dec 2019 Bo Jiang, Zitai Zhou, Xiao Wang, Jin Tang, Bin Luo

Fusing complementary information of RGB and depth has been demonstrated to be effective for image salient object detection which is known as RGB-D salient object detection problem.

Edge Detection Generative Adversarial Network +6

Tracking by Joint Local and Global Search: A Target-aware Attention based Approach

1 code implementation9 Jun 2021 Xiao Wang, Jin Tang, Bin Luo, YaoWei Wang, Yonghong Tian, Feng Wu

In this paper, we propose a novel and general target-aware attention mechanism (termed TANet) and integrate it with tracking-by-detection framework to conduct joint local and global search for robust tracking.

Object Object Tracking

Dynamic Attention guided Multi-Trajectory Analysis for Single Object Tracking

1 code implementation30 Mar 2021 Xiao Wang, Zhe Chen, Jin Tang, Bin Luo, YaoWei Wang, Yonghong Tian, Feng Wu

In this paper, we propose to introduce more dynamics by devising a dynamic attention-guided multi-trajectory tracking strategy.

Object Tracking

LongShortNet: Exploring Temporal and Semantic Features Fusion in Streaming Perception

2 code implementations27 Oct 2022 Chenyang Li, Zhi-Qi Cheng, Jun-Yan He, Pengyu Li, Bin Luo, Hanyuan Chen, Yifeng Geng, Jin-Peng Lan, Xuansong Xie

Streaming perception is a critical task in autonomous driving that requires balancing the latency and accuracy of the autopilot system.

Autonomous Driving

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

1 code implementation30 Mar 2023 Jun-Yan He, Zhi-Qi Cheng, Chenyang Li, Wangmeng Xiang, Binghui Chen, Bin Luo, Yifeng Geng, Xuansong Xie

Real-time perception, or streaming perception, is a crucial aspect of autonomous driving that has yet to be thoroughly explored in existing research.

Autonomous Driving

Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer

1 code implementation13 Dec 2021 Yunyun huang, Xiaoyu Shen, Chuanyi Li, Jidong Ge, Bin Luo

Given the fact of a case, Legal Judgment Prediction (LJP) involves a series of sub-tasks such as predicting violated law articles, charges and term of penalty.

An Extractive-and-Abstractive Framework for Source Code Summarization

1 code implementation15 Jun 2022 Weisong Sun, Chunrong Fang, Yuchen Chen, Quanjun Zhang, Guanhong Tao, Tingxu Han, Yifei Ge, Yudu You, Bin Luo

The extractive module in the framework performs a task of extractive code summarization, which takes in the code snippet and predicts important statements containing key factual details.

Code Summarization Machine Translation +2

Communication Resources Constrained Hierarchical Federated Learning for End-to-End Autonomous Driving

1 code implementation28 Jun 2023 Wei-Bin Kou, Shuai Wang, Guangxu Zhu, Bin Luo, Yingxian Chen, Derrick Wing Kwan Ng, Yik-Chung Wu

While federated learning (FL) improves the generalization of end-to-end autonomous driving by model aggregation, the conventional single-hop FL (SFL) suffers from slow convergence rate due to long-range communications among vehicles and cloud server.

Autonomous Driving Federated Learning

A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Software Engineering Tasks

1 code implementation25 Dec 2023 Wentao Zou, Qi Li, Jidong Ge, Chuanyi Li, Xiaoyu Shen, LiGuo Huang, Bin Luo

We hope that our findings can provide a deeper understanding of PEFT methods on various PTMs and SE downstream tasks.

Modality-missing RGBT Tracking: Invertible Prompt Learning and High-quality Benchmarks

1 code implementation25 Dec 2023 Andong Lu, jiacong Zhao, Chenglong Li, Jin Tang, Bin Luo

To address this challenge, we propose a novel invertible prompt learning approach, which integrates the content-preserving prompts into a well-trained tracking model to adapt to various modality-missing scenarios, for robust RGBT tracking.

Temporal Coherent and Graph Optimized Manifold Ranking for Visual Tracking

no code implementations17 Apr 2018 Bo Jiang, Doudou Lin, Bin Luo, Jin Tang

To address this problem, we propose a novel unified temporal coherence and graph optimized ranking model for weighted patch representation in visual tracking problem.

Graph Ranking Visual Tracking

Multiple Images Recovery Using a Single Affine Transformation

no code implementations23 May 2017 Bo Jiang, Chris Ding, Bin Luo

One approach to deal with noise image data is to use data recovery techniques which aim to recover the true uncorrupted signals from the observed noise images.

Describe and Attend to Track: Learning Natural Language guided Structural Representation and Visual Attention for Object Tracking

no code implementations25 Nov 2018 Xiao Wang, Chenglong Li, Rui Yang, Tianzhu Zhang, Jin Tang, Bin Luo

To refine the states of the target and re-track the target when it is back to view from heavy occlusion and out of view, we elaborately design a novel subnetwork to learn the target-driven visual attentions from the guidance of both visual and natural language cues.

Object Tracking

FANet: Quality-Aware Feature Aggregation Network for Robust RGB-T Tracking

no code implementations24 Nov 2018 Yabin Zhu, Chenglong Li, Bin Luo, Jin Tang

This paper investigates how to perform robust visual tracking in adverse and challenging conditions using complementary visual and thermal infrared data (RGBT tracking).

Rgb-T Tracking

Quality-Aware Multimodal Saliency Detection via Deep Reinforcement Learning

no code implementations27 Nov 2018 Xiao Wang, Tao Sun, Rui Yang, Chenglong Li, Bin Luo, Jin Tang

In this paper, we propose an efficient quality-aware deep neural network to model the weight of data from each domain using deep reinforcement learning (DRL).

Decision Making object-detection +5

Graph Matching via Multiplicative Update Algorithm

no code implementations NeurIPS 2017 Bo Jiang, Jin Tang, Chris Ding, Yihong Gong, Bin Luo

As a fundamental problem in computer vision, graph matching problem can usually be formulated as a Quadratic Programming (QP) problem with doubly stochastic and discrete (integer) constraints.

Graph Matching

SINT++: Robust Visual Tracking via Adversarial Positive Instance Generation

no code implementations CVPR 2018 Xiao Wang, Chenglong Li, Bin Luo, Jin Tang

Based on the generated hard positive samples, we train a Siamese network for visual tracking and our experiments validate the effectiveness of the introduced algorithm.

Object Visual Tracking

Multiple Graph Adversarial Learning

no code implementations22 Jan 2019 Bo Jiang, Ziyan Zhang, Jin Tang, Bin Luo

In this paper, we propose a novel Multiple Graph Adversarial Learning (MGAL) framework for multi-graph representation and learning.

Binary Constraint Preserving Graph Matching

no code implementations CVPR 2017 Bo Jiang, Jin Tang, Chris Ding, Bin Luo

There are three main contributions of the proposed method: (1) we propose a new graph matching relaxation model, called Binary Constraint Preserving Graph Matching (BPGM), which aims to incorporate the discrete binary mapping constraints more in graph matching relaxation.

Graph Matching

Robust Graph Data Learning via Latent Graph Convolutional Representation

no code implementations26 Apr 2019 Bo Jiang, Ziyan Zhang, Bin Luo

Given an input graph $\textbf{A}$, LatGCR aims to generate a flexible latent graph $\widetilde{\textbf{A}}$ for graph convolutional representation which obviously enhances the representation capacity and also performs robustly w. r. t graph structural attacks and noises.

Graph Learning Node Classification +1

Improved Hard Example Mining by Discovering Attribute-based Hard Person Identity

no code implementations6 May 2019 Xiao Wang, Ziliang Chen, Rui Yang, Bin Luo, Jin Tang

In this paper, we propose Hard Person Identity Mining (HPIM) that attempts to refine the hard example mining to improve the exploration efficacy in person re-identification.

Attribute Metric Learning +1

Attributes Guided Feature Learning for Vehicle Re-identification

no code implementations22 May 2019 Hongchao Li, Xianmin Lin, Aihua Zheng, Chenglong Li, Bin Luo, Ran He, Amir Hussain

In particular, our network is end-to-end trained and contains three subnetworks of deep features embedded by the corresponding attributes (i. e., camera view, vehicle type and vehicle color).

Generative Adversarial Network Vehicle Re-Identification

PH-GCN: Person Re-identification with Part-based Hierarchical Graph Convolutional Network

no code implementations20 Jul 2019 Bo Jiang, Xixi Wang, Bin Luo

Given a person image, PH-GCN first constructs a hierarchical graph to represent the pairwise relationships among different parts.

Person Re-Identification

Dense Feature Aggregation and Pruning for RGBT Tracking

no code implementations24 Jul 2019 Yabin Zhu, Chenglong Li, Bin Luo, Jin Tang, Xiao Wang

In different modalities, we propose to prune the densely aggregated features of all modalities in a collaborative way.

Edge-guided Non-local Fully Convolutional Network for Salient Object Detection

no code implementations7 Aug 2019 Zhengzheng Tu, Yan Ma, Chenglong Li, Jin Tang, Bin Luo

To maintain the clear edge structure of salient objects, we propose a novel Edge-guided Non-local FCN (ENFNet) to perform edge guided feature learning for accurate salient object detection.

object-detection RGB Salient Object Detection +1

Semi-supervised Learning with Adaptive Neighborhood Graph Propagation Network

no code implementations14 Aug 2019 Bo Jiang, Leiling Wang, Jin Tang, Bin Luo

In this paper, we first re-interpret graph convolution operation in GCNs as a composition of feature propagation and (non-linear) transformation.

graph construction

Residual Objectness for Imbalance Reduction

no code implementations24 Aug 2019 Joya Chen, Dong Liu, Bin Luo, Xuezheng Peng, Tong Xu, Enhong Chen

For a long time, object detectors have suffered from extreme imbalance between foregrounds and backgrounds.

Context-Aware Graph Attention Networks

no code implementations4 Sep 2019 Bo Jiang, Leiling Wang, Jin Tang, Bin Luo

In particular, CaGAT conducts context-aware learning on both node feature representation and edge (weight) representation simultaneously and cooperatively in a unified manner which can boost their respective performance in network training.

Graph Attention

GmCN: Graph Mask Convolutional Network

no code implementations4 Sep 2019 Bo Jiang, Beibei Wang, Jin Tang, Bin Luo

Graph Convolutional Networks (GCNs) have shown very powerful for graph data representation and learning tasks.

Graph Learning

Stereo-based Multi-motion Visual Odometry for Mobile Robots

no code implementations15 Oct 2019 Qing Zhao, Bin Luo, Yun Zhang

In this letter, a stereo-based multi-motion visual odometry method is proposed to acquire the poses of the robot and other moving objects.

Motion Segmentation Visual Odometry

GLMNet: Graph Learning-Matching Networks for Feature Matching

no code implementations18 Nov 2019 Bo Jiang, Pengfei Sun, Jin Tang, Bin Luo

However, the matching graphs we feed to existing graph convolutional matching networks are generally fixed and independent of graph matching, which thus are not guaranteed to be optimal for the graph matching task.

Graph Learning Graph Matching +1

DymSLAM:4D Dynamic Scene Reconstruction Based on Geometrical Motion Segmentation

no code implementations10 Mar 2020 Chenjie Wang, Bin Luo, Yun Zhang, Qing Zhao, Lu Yin, Wei Wang, Xin Su, Yajun Wang, Chengyuan Li

The only input of DymSLAM is stereo video, and its output includes a dense map of the static environment, 3D model of the moving objects and the trajectories of the camera and the moving objects.

Motion Segmentation

M$^5$L: Multi-Modal Multi-Margin Metric Learning for RGBT Tracking

no code implementations17 Mar 2020 Zhengzheng Tu, Chun Lin, Chenglong Li, Jin Tang, Bin Luo

Classifying the confusing samples in the course of RGBT tracking is a quite challenging problem, which hasn't got satisfied solution.

Metric Learning

Can Synthetic Data Improve Object Detection Results for Remote Sensing Images?

no code implementations9 Jun 2020 Weixing Liu, Jun Liu, Bin Luo

Deep learning approaches require enough training samples to perform well, but it is a challenge to collect enough real training data and label them manually.

object-detection Object Detection

U2-ONet: A Two-level Nested Octave U-structure with Multiscale Attention Mechanism for Moving Instances Segmentation

no code implementations26 Jul 2020 Chenjie Wang, Chengyuan Li, Bin Luo

Most scenes in practical applications are dynamic scenes containing moving objects, so segmenting accurately moving objects is crucial for many computer vision applications.

RGBT Tracking via Multi-Adapter Network with Hierarchical Divergence Loss

no code implementations14 Nov 2020 Andong Lu, Chenglong Li, Yuqing Yan, Jin Tang, Bin Luo

In specific, we use the modified VGG-M as the generality adapter to extract the modality-shared target representations. To extract the modality-specific features while reducing the computational complexity, we design a modality adapter, which adds a small block to the generality adapter in each layer and each modality in a parallel manner.

Representation Learning Rgb-T Tracking

Identifying Exaggerated Language

no code implementations EMNLP 2020 Li Kong, Chuanyi Li, Jidong Ge, Bin Luo, Vincent Ng

While hyperbole is one of the most prevalent rhetorical devices, it is arguably one of the least studied devices in the figurative language processing community.

Sentence

Object Detection based on OcSaFPN in Aerial Images with Noise

no code implementations18 Dec 2020 Chengyuan Li, Jun Liu, Hailong Hong, Wenju Mao, Chenjie Wang, Chudi Hu, Xin Su, Bin Luo

On the basis of this, a novel octave convolution-based semantic attention feature pyramid network (OcSaFPN) is proposed to get higher accuracy in object detection with noise.

Attribute Denoising +2

PICA: A Pixel Correlation-based Attentional Black-box Adversarial Attack

no code implementations19 Jan 2021 Jie Wang, Zhaoxia Yin, Jin Tang, Jing Jiang, Bin Luo

The studies on black-box adversarial attacks have become increasingly prevalent due to the intractable acquisition of the structural knowledge of deep neural networks (DNNs).

Adversarial Attack

RiWNet: A moving object instance segmentation Network being Robust in adverse Weather conditions

no code implementations4 Sep 2021 Chenjie Wang, Chengyuan Li, Bin Luo, Wei Wang, Jun Liu

Then we extend SOLOV2 to capture temporal information in video to learn motion information, and propose a moving object instance segmentation network with RiWFPN called RiWNet.

Instance Segmentation Segmentation +1

Robust Graph Data Learning with Latent Graph Convolutional Representation

no code implementations29 Sep 2021 Bo Jiang, Ziyan Zhang, Bin Luo

Given an input graph $\textbf{A}$, LatGCR aims to generate a flexible latent graph $\tilde{\textbf{A}}$ for graph convolutional representation which obviously enhances the representation capacity and also performs robustly w. r. t graph structural attacks and noises.

Graph Learning

Don’t Miss the Potential Customers! Retrieving Similar Ads to Improve User Targeting

no code implementations Findings (EMNLP) 2021 Yi Feng, Ting Wang, Chuanyi Li, Vincent Ng, Jidong Ge, Bin Luo, Yucheng Hu, Xiaopeng Zhang

User targeting is an essential task in the modern advertising industry: given a package of ads for a particular category of products (e. g., green tea), identify the online users to whom the ad package should be targeted.

Neural Program Repair: Systems, Challenges and Solutions

no code implementations22 Feb 2022 Wenkang Zhong, Chuanyi Li, Jidong Ge, Bin Luo

Automated Program Repair (APR) aims to automatically fix bugs in the source code.

Program Repair

Universal adversarial perturbation for remote sensing images

no code implementations22 Feb 2022 Qingyu Wang, Guorui Feng, Zhaoxia Yin, Bin Luo

Firstly, the former is used to generate the UAP, which can learn the distribution of perturbations better, and then the latter is used to find the sensitive regions concerned by the RSI classification model.

Classification Object Recognition

Unified GCNs: Towards Connecting GCNs with CNNs

no code implementations26 Apr 2022 Ziyan Zhang, Bo Jiang, Bin Luo

Graph Convolutional Networks (GCNs) have been widely demonstrated their powerful ability in graph data representation and learning.

Vibration-Based Bridge Health Monitoring using Telecommunication Cables

no code implementations10 May 2022 Jingxiao Liu, Siyuan Yuan, Bin Luo, Biondo Biondi, Hae Young Noh

Bridge Health Monitoring (BHM) enables early damage detection of bridges and is thus critical for avoiding more severe damages that might result in major financial and human losses.

Deep Learning Meets Software Engineering: A Survey on Pre-Trained Models of Source Code

no code implementations24 May 2022 Changan Niu, Chuanyi Li, Bin Luo, Vincent Ng

In particular, the development and use of pre-trained models of source code has enabled state-of-the-art results to be achieved on a wide variety of SE tasks.

GAMnet: Robust Feature Matching via Graph Adversarial-Matching Network

no code implementations MM 2021 Bo Jiang, Pengfei Sun, Ziyan Zhang, Jin Tang, Bin Luo

Also, GAMnet exploits sparse GM optimization as correspondence solver which is differentiable and can also incorporate discrete one-to-one matching constraints approximately in natural in the final matching prediction.

Ranked #7 on Graph Matching on PASCAL VOC (matching accuracy metric)

Graph Matching

ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding

no code implementations18 Sep 2022 Wenjin Wang, Zhengjie Huang, Bin Luo, Qianglong Chen, Qiming Peng, Yinxu Pan, Weichong Yin, Shikun Feng, Yu Sun, dianhai yu, Yin Zhang

At first, a document graph is proposed to model complex relationships among multi-grained multimodal elements, in which salient visual regions are detected by a cluster-based method.

Common Sense Reasoning document understanding +1

Cost-effective photonic super-resolution millimeter-wave joint radar-communication system using self-coherent detection

no code implementations9 Oct 2022 Wenlin Bai, Peixuan Li, Xihua Zou, Ningyuan Zhong, Wei Pan, Lianshan Yan, Bin Luo

Then the self-coherent detection, as a simple and low-cost means, is accordingly facilitated for both de-chirping of MMW radar and frequency down-conversion reception of MMW communication, which circumvents the costly high-speed mixers along with MMW local oscillators and more significantly achieves the real-time decomposition of radar and communication information.

Joint Radar-Communication Super-Resolution

Data Dimension Reduction makes ML Algorithms efficient

no code implementations17 Nov 2022 Wisal Khan, Muhammad Turab, Waqas Ahmad, Syed Hasnat Ahmad, Kelash Kumar, Bin Luo

Similarly, in AE based DDR, we compare unsupervised learning algorithm accuracy and time before and after AE representation learning.

Dimensionality Reduction Representation Learning

Rethinking Batch Sample Relationships for Data Representation: A Batch-Graph Transformer based Approach

no code implementations19 Nov 2022 Xixi Wang, Bo Jiang, Xiao Wang, Bin Luo

(1) It employs a flexible graph model, termed Batch Graph to jointly encode the visual and semantic relationships of samples within each mini-batch.

Metric Learning

Adversarial Example Defense via Perturbation Grading Strategy

no code implementations16 Dec 2022 Shaowei Zhu, Wanli Lyu, Bin Li, Zhaoxia Yin, Bin Luo

In addition, the proposed method does not modify any task model, which can be used as a preprocessing module, which significantly reduces the deployment cost in practical applications.

CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models

no code implementations8 Feb 2023 Changan Niu, Chuanyi Li, Vincent Ng, Bin Luo

Despite the recent advances showing that a model pre-trained on large-scale source code data is able to gain appreciable generalization capability, it still requires a sizeable amount of data on the target task for fine-tuning.

Benchmarking Few-Shot Learning +1

Judicial Intelligent Assistant System: Extracting Events from Divorce Cases to Detect Disputes for the Judge

no code implementations23 Mar 2023 Yuan Zhang, Chuanyi Li, Yu Sheng, Jidong Ge, Bin Luo

It is a difficult but necessary task to extract the key information for the cases from these textual materials and to clarify the dispute focus of related parties.

Adversarial Examples Detection with Enhanced Image Difference Features based on Local Histogram Equalization

no code implementations8 May 2023 Zhaoxia Yin, Shaowei Zhu, Hang Su, Jianteng Peng, Wanli Lyu, Bin Luo

However, numerous studies have proven that previous methods create detection or defense against certain attacks, which renders the method ineffective in the face of the latest unknown attack methods.

Feature Compression

Sparse-Input Neural Network using Group Concave Regularization

1 code implementation1 Jul 2023 Bin Luo, Susan Halabi

To overcome this limitation, we propose a framework of sparse-input neural networks using group concave regularization for feature selection in both low-dimensional and high-dimensional settings.

feature selection

Unified-modal Salient Object Detection via Adaptive Prompt Learning

no code implementations28 Nov 2023 Kunpeng Wang, Chenglong Li, Zhengzheng Tu, Bin Luo

Existing single-modal and multi-modal salient object detection (SOD) methods focus on designing specific architectures tailored for their respective tasks.

object-detection Object Detection +1

Robust Transductive Few-shot Learning via Joint Message Passing and Prototype-based Soft-label Propagation

no code implementations28 Nov 2023 Jiahui Wang, Qin Xu, Bo Jiang, Bin Luo

Label propagation methods try to propagate the labels of support samples on the constructed graph encoding the relationships between both support and query samples.

Few-Shot Learning

Nighttime Person Re-Identification via Collaborative Enhancement Network with Multi-domain Learning

no code implementations25 Dec 2023 Andong Lu, Tianrui Zha, Chenglong Li, Jin Tang, XiaoFeng Wang, Bin Luo

To perform effective collaborative modeling between image relighting and person ReID tasks, we integrate the multilevel feature interactions in CENet.

Image Relighting Person Re-Identification

WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope

no code implementations3 Jan 2024 Jun-Yan He, Zhi-Qi Cheng, Chenyang Li, Jingdong Sun, Wangmeng Xiang, Yusen Hu, Xianhui Lin, Xiaoyang Kang, Zengke Jin, Bin Luo, Yifeng Geng, Xuansong Xie, Jingren Zhou

This paper introduces the WordArt Designer API, a novel framework for user-driven artistic typography synthesis utilizing Large Language Models (LLMs) on ModelScope.

Transformer RGBT Tracking with Spatio-Temporal Multimodal Tokens

no code implementations3 Jan 2024 Dengdi Sun, Yajie Pan, Andong Lu, Chenglong Li, Bin Luo

We introduce independent dynamic template tokens to interact with the search region, embedding temporal information to address appearance changes, while also retaining the involvement of the initial static template tokens in the joint feature extraction process to ensure the preservation of the original reliable target appearance information that prevent deviations from the target appearance caused by traditional temporal updates.

Rgb-T Tracking Template Matching

Unifying Graph Contrastive Learning via Graph Message Augmentation

no code implementations8 Jan 2024 Ziyan Zhang, Bo Jiang, Jin Tang, Bin Luo

Based on the proposed GMA, we then propose a unified graph contrastive learning, termed Graph Message Contrastive Learning (GMCL), that employs attribution-guided universal GMA for graph contrastive learning.

Contrastive Learning Data Augmentation +2

Source-free Domain Adaptive Object Detection in Remote Sensing Images

no code implementations31 Jan 2024 Weixing Liu, Jun Liu, Xin Su, Han Nie, Bin Luo

To address this challenge, we propose a practical source-free object detection (SFOD) setting for RS images, which aims to perform target domain adaptation using only the source pre-trained model.

Domain Adaptation object-detection +1

Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception

no code implementations5 Mar 2024 Junwen He, Yifan Wang, Lijun Wang, Huchuan Lu, Jun-Yan He, Jin-Peng Lan, Bin Luo, Xuansong Xie

Multimodal Large Language Model (MLLMs) leverages Large Language Models as a cognitive framework for diverse visual-language tasks.

Language Modelling Large Language Model +2

Cannot find the paper you are looking for? You can Submit a new open access paper.