no code implementations • 13 Apr 2025 • Xiang Hu, Pingping Zhang, Yuhao Wang, Bin Yan, Huchuan Lu
Furthermore, we propose the View-Refine Decoder (VRD) to obtain additional controllable conditions to generate missing cross-view features.
1 code implementation • 15 Feb 2025 • Zirui Song, Bin Yan, YuHan Liu, Miao Fang, Mingzhe Li, Rui Yan, Xiuying Chen
Large Language Models (LLMs) have demonstrated remarkable success in various tasks such as natural language understanding, text summarization, and machine translation.
2 code implementations • 5 Dec 2024 • Jian Han, Jinlai Liu, Yi Jiang, Bin Yan, Yuqi Zhang, Zehuan Yuan, Bingyue Peng, Xiaobing Liu
We present Infinity, a Bitwise Visual AutoRegressive Modeling capable of generating high-resolution, photorealistic images following language instruction.
no code implementations • 5 Dec 2024 • Bin Yan, Martin Sundermeyer, David Joseph Tan, Huchuan Lu, Federico Tombari
In this paper, we address the challenge of performing open-vocabulary video instance segmentation (OV-VIS) in real-time.
no code implementations • 2 Oct 2024 • Dongyang Li, Linyuan Wang, Guangwei Xiong, Bin Yan, Dekui Ma, Jinxian Peng
With the development and application of deep learning in signal detection tasks, the vulnerability of neural networks to adversarial attacks has also become a security threat to signal detection networks.
no code implementations • 14 Aug 2024 • Xinrui Zhang, Ailong Cai, Shaoyu Wang, Linyuan Wang, Zhizhong Zheng, Lei LI, Bin Yan
However, these methods have limited perception ability in the diverse morphologies of different metal implants with artifacts, which may generate spurious anatomical structures and exhibit inferior generalization capability.
no code implementations • 9 May 2024 • Chen Chen, Kai Qiao, Jie Yang, Jian Chen, Bin Yan
In this model, the teacher-guided MIM pretraining model is introduced into PCB CT image element segmentation for the first time, and a multi-scale local visual field extraction (MVE) module is proposed to reduce redundancy by focusing on local visual fields.
no code implementations • 8 Jan 2024 • Shuxiao Ma, Linyuan Wang, Senbao Hou, Bin Yan
Next, we use the contrast loss function to minimize the distance between the image embedding features and the text embedding features to complete the alignment operation of the stimulus image and text information.
2 code implementations • 25 Dec 2023 • Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo
We evaluate our unified models on various benchmarks.
no code implementations • 29 Aug 2023 • Shuxiao Ma, Linyuan Wang, Bin Yan
A convolutional network then maps from this multimodal feature space to voxel space, constructing the multimodal visual information encoding network model.
no code implementations • ICCV 2023 • Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo
Open-world instance segmentation is a rising task, which aims to segment all objects in the image by learning from a limited number of base-category objects.
2 code implementations • 1 Aug 2023 • Mingzhan Yang, Guangxin Han, Bin Yan, Wenhua Zhang, Jinqing Qi, Huchuan Lu, Dong Wang
Also, our method shows strong generalization for diverse trackers and scenarios in a plug-and-play and training-free manner.
Ranked #6 on
Object Tracking
on QuadTrack
no code implementations • 1 Aug 2023 • Ruoxi Qin, Linyuan Wang, Xuehui Du, Xingyuan Chen, Bin Yan
The deep neural network has attained significant efficiency in image recognition.
no code implementations • 5 Jul 2023 • Shuhao Shi, Kai Qiao, Zhengyan Wang, Jie Yang, Baojie Song, Jian Chen, Bin Yan
Recently, more and more GNN-based methods have been proposed for bot detection.
1 code implementation • 14 Apr 2023 • Shuhao Shi, Kai Qiao, Jie Yang, Baojie Song, Jian Chen, Bin Yan
This paper proposes a Random Forest boosted Graph Neural Network for social bot detection, called RF-GNN, which employs graph neural networks (GNNs) as the base classifiers to construct a random forest, effectively combining the advantages of ensemble learning and GNNs to improve the accuracy and robustness of the model.
1 code implementation • CVPR 2023 • Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu
All instance perception tasks aim at finding certain objects specified by some queries such as category names, language expressions, and target annotations, but this complete field has been split into multiple independent subtasks.
Described Object Detection
Generalized Referring Expression Comprehension
+15
1 code implementation • 14 Feb 2023 • Shuhao Shi, Kai Qiao, Jie Yang, Baojie Song, Jian Chen, Bin Yan
The proposed framework is evaluated using three real-world bot detection benchmark datasets, and it consistently exhibits superiority over the baselines.
1 code implementation • 3 Jan 2023 • Shuhao Shi, Kai Qiao, Jian Chen, Shuai Yang, Jie Yang, Baojie Song, Linyuan Wang, Bin Yan
However, in addition to low annotation quality, existing benchmarks generally have incomplete user relationships, suppressing graph-based account detection research.
Ranked #1 on
Stance Detection
on MGTAB
no code implementations • ICCV 2023 • Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo
In this work, we end the current fragmented situation and propose UniRef to unify the three reference-based object segmentation tasks with a single architecture.
no code implementations • 6 Oct 2022 • Qi Peng, Wenlin Liu, Ruoxi Qin, Libin Hou, Bin Yan, Linyuan Wang
Adversarial attacks are considered the intrinsic vulnerability of CNNs.
1 code implementation • 14 Jul 2022 • Bin Yan, Yi Jiang, Peize Sun, Dong Wang, Zehuan Yuan, Ping Luo, Huchuan Lu
We present a unified method, termed Unicorn, that can simultaneously solve four tracking problems (SOT, MOT, VOS, MOTS) with a single network using the same model parameters.
Multi-Object Tracking
Multi-Object Tracking and Segmentation
+4
no code implementations • 8 May 2022 • Shuhao Shi, Jian Chen, Kai Qiao, Shuai Yang, Linyuan Wang, Bin Yan
The Graph Convolutional Networks (GCNs) have achieved excellent results in node classification tasks, but the model's performance at low label rates is still unsatisfactory.
1 code implementation • 25 Mar 2022 • Xin Chen, Bin Yan, Jiawen Zhu, Huchuan Lu, Xiang Ruan, Dong Wang
First, we present a transformer tracking (named TransT) method based on the Siamese-like feature extraction backbone, the designed attention-based fusion mechanism, and the classification and regression head.
no code implementations • AAAI Workshop AdvML 2022 • Qi Peng, Ruoxi Qin, Wenlin Liu, Libin Hou, Bin Yan, Linyuan Wang
Recent advances in adversarial attacks uncover the intrinsic vulnerability of modern deep neural networks (DNNs).
no code implementations • AAAI Workshop AdvML 2022 • Ruoxi Qin, Linyuan Wang, Xuehui Du, Bin Yan, Xingyuan Chen
A new constraints norm is proposed in model training based on these criteria to isolate adversarial transferability without any prior knowledge of adversarial samples.
no code implementations • 29 Sep 2021 • Shuhao Shi, Pengfei Xie, Xu Luo, Kai Qiao, Linyuan Wang, Jian Chen, Bin Yan
AMC-GNN generates two graph views by data augmentation and compares different layers' output embeddings of Graph Neural Network encoders to obtain feature representations, which could be used for downstream tasks.
no code implementations • 3 Jun 2021 • Pengfei Xie, Linyuan Wang, Ruoxi Qin, Kai Qiao, Shuhao Shi, Guoen Hu, Bin Yan
In this paper, we propose a new gradient iteration framework, which redefines the relationship between the above three.
no code implementations • 25 May 2021 • S. Shi, Kai Qiao, Shuai Yang, L. Wang, J. Chen, Bin Yan
Traditional methods such as resampling, reweighting, and synthetic samples that deal with imbalanced datasets are no longer applicable in GNN.
no code implementations • 6 May 2021 • Ruoxi Qin, Linyuan Wang, Xingyuan Chen, Xuehui Du, Bin Yan
The defense strategies are particularly passive in these processes, and enhancing initiative of such strategies can be an effective way to get out of this arms race.
1 code implementation • CVPR 2021 • Bin Yan, Houwen Peng, Kan Wu, Dong Wang, Jianlong Fu, Huchuan Lu
Object tracking has achieved significant progress over the past few years.
Ranked #27 on
Video Object Tracking
on NT-VOT211
1 code implementation • ICCV 2021 • Bin Yan, Houwen Peng, Jianlong Fu, Dong Wang, Huchuan Lu
In this paper, we present a new tracking architecture with an encoder-decoder transformer as the key component.
Ranked #6 on
Visual Object Tracking
on AVisT
1 code implementation • CVPR 2021 • Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu
The correlation operation is a simple fusion manner to consider the similarity between the template and the search region.
Ranked #5 on
Visual Tracking
on TNL2K
no code implementations • 11 Feb 2021 • Vincenzo Cirigliano, Kaori Fuyuto, Christopher Lee, Emanuele Mereghetti, Bin Yan
We present a comprehensive analysis of the potential sensitivity of the Electron-Ion Collider (EIC) to charged lepton flavor violation (CLFV) in the channel $ep\to \tau X$, within the model-independent framework of the Standard Model Effective Field Theory (SMEFT).
High Energy Physics - Phenomenology High Energy Physics - Experiment Nuclear Experiment Nuclear Theory
no code implementations • 15 Jan 2021 • Bin Yan, C. -P. Yuan
We demonstrate that the $Zh$ data collected at the 13 TeV LHC can already resolve the apparent degeneracy of the anomalous $Zb\bar{b}$ couplings implied by the LEP precision electroweak measurements, with a strong dependence on the observed distribution of the $Z$ boson transverse momentum.
High Energy Physics - Phenomenology High Energy Physics - Experiment
no code implementations • 30 Dec 2020 • Francesco Caravelli, Bin Yan, Luis Pedro Garcia-Pintos, Alioscia Hamma
We study the role of coherence in closed and open quantum batteries.
Quantum Physics Other Condensed Matter
no code implementations • 14 Dec 2020 • Bin Yan, Vladimir Y. Chernyak, Wojciech H. Zurek, Nikolai A. Sinitsyn
We explore nonadiabatic quantum phase transitions in an Ising spin chain with a linearly time-dependent transverse field and two different spins per unit cell.
Statistical Mechanics General Relativity and Quantum Cosmology High Energy Physics - Theory Quantum Physics
1 code implementation • CVPR 2021 • Bin Yan, Xinyu Zhang, Dong Wang, Huchuan Lu, Xiaoyun Yang
Many recent trackers adopt the multiple-stage tracking strategy to improve the quality of bounding box estimation.
Ranked #18 on
Semi-Supervised Video Object Segmentation
on VOT2020
Semi-Supervised Video Object Segmentation
Visual Object Tracking
1 code implementation • 4 Jul 2020 • Bin Yan, Dong Wang, Huchuan Lu, Xiaoyun Yang
In recent years, the multiple-stage strategy has become a popular trend for visual tracking.
no code implementations • 26 Mar 2020 • Kai Qiao, Chi Zhang, Jian Chen, Linyuan Wang, Li Tong, Bin Yan
Except for deep network structure, the task or corresponding big dataset is also important for deep network models, but neglected by previous studies.
1 code implementation • CVPR 2020 • Bin Yan, Dong Wang, Huchuan Lu, Xiaoyun Yang
An effective and efficient perturbation generator is trained with a carefully designed adversarial loss, which can simultaneously cool hot regions where the target exists on the heatmaps and force the predicted bounding box to shrink, making the tracked target invisible to trackers.
no code implementations • 13 Mar 2020 • Kai Qiao, Jian Chen, Linyuan Wang, Chi Zhang, Li Tong, Bin Yan
In this study, we proposed a new GAN-based Bayesian visual reconstruction method (GAN-BVRM) that includes a classifier to decode categories from fMRI data, a pre-trained conditional generator to generate natural images of specified categories, and a set of encoding models and evaluator to evaluate generated images.
no code implementations • 1 Feb 2020 • Zifei Zhang, Kai Qiao, Lingyun Jiang, Linyuan Wang, Bin Yan
To alleviate the tradeoff between the attack success rate and image fidelity, we propose a method named AdvJND, adding visual model coefficients, just noticeable difference coefficients, in the constraint of a distortion function when generating adversarial examples.
1 code implementation • ICCV 2019 • Bin Yan, Haojie Zhao, Dong Wang, Huchuan Lu, Xiaoyun Yang
In this work, we present a novel robust and real-time long-term tracking framework based on the proposed skimming and perusal modules.
1 code implementation • 27 Jul 2019 • Kai Qiao, Chi Zhang, Jian Chen, Linyuan Wang, Li Tong, Bin Yan
Recently, visual encoding based on functional magnetic resonance imaging (fMRI) have realized many achievements with the rapid development of deep network computation.
no code implementations • 12 Apr 2019 • Lingyun Jiang, Kai Qiao, Ruoxi Qin, Linyuan Wang, Jian Chen, Haibing Bu, Bin Yan
In image classification of deep learning, adversarial examples where inputs intended to add small magnitude perturbations may mislead deep neural networks (DNNs) to incorrect results, which means DNNs are vulnerable to them.
no code implementations • 19 Mar 2019 • Kai Qiao, Jian Chen, Linyuan Wang, Chi Zhang, Lei Zeng, Li Tong, Bin Yan
Despite the hierarchically similar representations of deep network and human vision, visual information flows from primary visual cortices to high visual cortices and vice versa based on the bottom-up and top-down manners, respectively.
Neurons and Cognition
no code implementations • 10 Mar 2019 • Ziheng Li, Wenkun Zhang, Linyuan Wang, Ailong Cai, Ningning Liang, Bin Yan, Lei LI
Limited-angle computed tomography (CT) image reconstruction is a challenging reconstruction problem in the fields of CT. With the development of deep learning, the generative adversarial network (GAN) perform well in image restoration by approximating the distribution of training sample data.
Medical Physics
no code implementations • 23 Feb 2019 • Chi Zhang, Kai Qiao, Linyuan Wang, Li Tong, Guoen Hu, Ruyuan Zhang, Bin Yan
In this framework, we employ the transfer learning technique to incorporate a pre-trained DNN (i. e., AlexNet) and train a nonlinear mapping from visual features to brain activity.
no code implementations • 22 Dec 2018 • Chi Zhang, Xiaohan Duan, Linyuan Wang, Yongli Li, Bin Yan, Guoen Hu, Ruyuan Zhang, Li Tong
Furthermore, we show that voxel-encoding models trained on regular images can successfully generalize to the neural responses to AI images but not AN images.
no code implementations • 16 Jan 2018 • Chi Zhang, Kai Qiao, Linyuan Wang, Li Tong, Ying Zeng, Bin Yan
Without semantic prior information, we present a novel method to reconstruct nature images from fMRI signals of human visual cortex based on the computation model of convolutional neural network (CNN).
no code implementations • 2 Jan 2018 • Kai Qiao, Chi Zhang, Linyuan Wang, Bin Yan, Jian Chen, Lei Zeng, Li Tong
We firstly employed the CapsNet to train the nonlinear mapping from image stimuli to high-level capsule features, and from high-level capsule features to image stimuli again in an end-to-end manner.
no code implementations • 29 Jul 2016 • Hanming Zhang, Liang Li, Kai Qiao, Linyuan Wang, Bin Yan, Lei LI, Guoen Hu
The qualitative and quantitative evaluations of experimental results indicate that the proposed method show a stable and prospective performance on artifacts reduction and detail recovery for limited angle tomography.