no code implementations • 17 Oct 2024 • Jiacong Zhou, Xianyun Wang, Jun Yu
DRPO leverages NDCG, a widely used LTR metric, to optimize the ranking of responses within lists based on preference data, thereby enhancing ranking accuracies.
1 code implementation • 12 Oct 2024 • Ting Yu, Kunhao Fu, Jian Zhang, Qingming Huang, Jun Yu
Long-term Video Question Answering (VideoQA) is a challenging vision-and-language bridging task focusing on semantic understanding of untrimmed long-term videos and diverse free-form questions, simultaneously emphasizing comprehensive cross-modal reasoning to yield precise answers.
no code implementations • 12 Oct 2024 • Ting Yu, Kunhao Fu, Shuhui Wang, Qingming Huang, Jun Yu
Video Question Answering (VideoQA) represents a crucial intersection between video understanding and language processing, requiring both discriminative unimodal comprehension and sophisticated cross-modal interaction for accurate inference.
no code implementations • 9 Oct 2024 • Sanxi Li, Jun Yu, Mingsheng Zhang
Search prominence may have a detrimental impact on a firm's profits in the presence of costly product returns.
no code implementations • 8 Oct 2024 • Xuetao Li, Fang Gao, Jun Yu, Shaodong Li, Feng Shuang
Embodied AI represents a paradigm in AI research where artificial agents are situated within and interact with physical or virtual environments.
no code implementations • 8 Oct 2024 • Jun Yu, Yifan Zhang, Badrinadh Aila, Vinod Namboodiri
We benchmark on two main aspects: 1) positioning system and 2) exploration support, prioritizing training scalability and real-time inference, to validate the prospect of image-based solution towards indoor navigation.
no code implementations • 8 Oct 2024 • Fang Gao, Xuetao Li, Jiabao Wang, Shengheng Ma, Jun Yu
With the development of steel materials, metallographic analysis has become increasingly important.
no code implementations • 4 Oct 2024 • Yiqun Sun, Qiang Huang, Yixuan Tang, Anthony K. H. Tung, Jun Yu
Semantic text embedding is essential to many tasks in Natural Language Processing (NLP).
no code implementations • 8 Sep 2024 • Jun Yu, Wenjian Wang
In addition to the model, we present a comprehensive dataset specifically curated for surface defect detection in recycled and recirculated books.
1 code implementation • 29 Aug 2024 • Ye Yu, Fengxin Chen, Jun Yu, Zhen Kan
Meanwhile, we use a mean-teacher-assisted Gaussian process learning strategy to establish a connection between the latent and pseudo-latent vectors obtained from the labeled and unlabeled data.
1 code implementation • 28 Jul 2024 • Buyu Liu, Kai Wang, Yansong Liu, Jun Bao, Tingting Han, Jun Yu
Unlike prior methods that neglect layout consistency, lack the ability to handle detailed text prompts, or are incapable of generalizing to unseen view points, MVPbev simultaneously generates cross-view consistent images of different perspective views with a two-stage design, allowing object-level control and novel view generation at test-time.
no code implementations • CVPR 2024 • Zhenzhong Kuang, Xiaochen Yang, Yingjie Shen, Chao Hu, Jun Yu
On the other, we anonymize the visual clues (i. e. appearance and geometry structure) by distracting the extrinsic identity attention.
1 code implementation • 20 Jun 2024 • Mengcheng Lan, Min Meng, Jun Yu, Jigang Wu
As such, the key issues of WS-PDA are: 1) how to sufficiently discover the knowledge from the noisy labeled source domain and the unlabeled target domain, and 2) how to successfully adapt the knowledge across domains.
no code implementations • 9 Jun 2024 • Jun Yu, Yunxiang Zhang, Fengzhao Sun, Leilei Wang, Renjie Lu
In this report, we present our solution for the semantic segmentation in adverse weather, in UG2+ Challenge at CVPR 2024.
1 code implementation • 8 Jun 2024 • Rui Zhong, Yang Cao, Jun Yu, Masaharu Munetomo
Motivated by the potential of large language models (LLMs) as optimizers for solving combinatorial optimization problems, this paper proposes a novel LLM-assisted optimizer (LLMO) to address adversarial robustness neural architecture search (ARNAS), a specific application of combinatorial optimization.
no code implementations • 28 May 2024 • Haoxiang Shi, xulong Zhang, Ning Cheng, Yong Zhang, Jun Yu, Jing Xiao, Jianzong Wang
Previous ERC methods relied on simple connections for cross-modal fusion and ignored the information differences between modalities, resulting in the model being unable to focus on modality-specific emotional information.
no code implementations • 26 May 2024 • Yawen Zou, Chunzhi Gu, Jun Yu, Shangce Gao, Chao Zhang
Black-Box unsupervised domain adaptation (BBUDA) learns knowledge only with the prediction of target data from the source model without access to the source data and source model, which attempts to alleviate concerns about the privacy and security of data.
1 code implementation • 20 May 2024 • Zhenwei Shao, Zhou Yu, Jun Yu, Xuecheng Ouyang, Lihao Zheng, Zhenbiao Gai, Mingyang Wang, Jiajun Ding
By harnessing the capabilities of large language models (LLMs), recent large multimodal models (LMMs) have shown remarkable versatility in open-world multimodal understanding.
Ranked #62 on Visual Question Answering on MM-Vet
no code implementations • 3 May 2024 • H. Peter Boswijk, Jun Yu, Yang Zu
A real-time date-stamping strategy based on the devolatized sample is proposed for the origination and conclusion dates of the explosive regime.
1 code implementation • 29 Apr 2024 • Jun Yu, Yutong Dai, Xiaokang Liu, Jin Huang, Yishan Shen, Ke Zhang, Rong Zhou, Eashan Adhikarla, Wenxuan Ye, Yixin Liu, Zhaoming Kong, Kai Zhang, Yilong Yin, Vinod Namboodiri, Brian D. Davison, Jason H. Moore, Yong Chen
Overall, we hope this survey provides the research community with a comprehensive overview of the advancements in MTL from its inception in 1997 to the present in 2023.
no code implementations • 26 Apr 2024 • Shun Maeda, Chunzhi Gu, Jun Yu, Shogo Tokai, Shangce Gao, Chao Zhang
We introduce the task of human action anomaly detection (HAAD), which aims to identify anomalous motions in an unsupervised manner given only the pre-determined normal category of training action samples.
1 code implementation • 25 Mar 2024 • Rui Zhong, Yuefeng Xu, Chao Zhang, Jun Yu
In this paper, we borrow the large language model (LLM) ChatGPT-3. 5 to automatically and quickly design a new metaheuristic algorithm (MA) with only a small amount of input.
1 code implementation • 20 Mar 2024 • Jingyi Wang, Xiaobo Xia, Long Lan, Xinghao Wu, Jun Yu, Wenjing Yang, Bo Han, Tongliang Liu
Given data with noisy labels, over-parameterized deep networks suffer overfitting mislabeled data, resulting in poor generalization.
no code implementations • 20 Mar 2024 • Jun Yu, Zerui Zhang, Zhihong Wei, Gongpeng Zhao, Zhongpeng Cai, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu
Leveraging the synergy of both audio data and visual data is essential for understanding human emotions and behaviors, especially in in-the-wild setting.
no code implementations • 19 Mar 2024 • Jun Yu, Gongpeng Zhao, Yongqi Wang, Zhihong Wei, Yang Zheng, Zerui Zhang, Zhongpeng Cai, Guochen Xie, Jichao Zhu, Wangyuan Zhu
This paper presents our approach for the VA (Valence-Arousal) estimation task in the ABAW6 competition.
no code implementations • 19 Mar 2024 • Jun Yu, Jichao Zhu, Wangyuan Zhu
Compound Expression Recognition (CER) plays a crucial role in interpersonal interactions.
no code implementations • 18 Mar 2024 • Jun Yu, Wangyuan Zhu, Jichao Zhu
In this paper, we present the solution to the Emotional Mimicry Intensity (EMI) Estimation challenge, which is part of 6th Affective Behavior Analysis in-the-wild (ABAW) Competition. The EMI Estimation challenge task aims to evaluate the emotional intensity of seed videos by assessing them from a set of predefined emotion categories (i. e., "Admiration", "Amusement", "Determination", "Empathic Pain", "Excitement" and "Joy").
no code implementations • 18 Mar 2024 • Jun Yu, Zhihong Wei, Zhongpeng Cai, Gongpeng Zhao, Zerui Zhang, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu
Facial Expression Recognition (FER) plays a crucial role in computer vision and finds extensive applications across various fields.
Facial Expression Recognition Facial Expression Recognition (FER)
1 code implementation • 15 Mar 2024 • Rui Zhong, Yuefeng Xu, Chao Zhang, Jun Yu
This paper introduces a novel metaheuristic algorithm, known as the efficient multiplayer battle game optimizer (EMBGO), specifically designed for addressing complex numerical optimization tasks.
no code implementations • 12 Mar 2024 • Ting Yu, Xiaojun Lin, Shuhui Wang, Weiguo Sheng, Qingming Huang, Jun Yu
Three-Dimensional (3D) dense captioning is an emerging vision-language bridging task that aims to generate multiple detailed and accurate descriptions for 3D scenes.
1 code implementation • 11 Mar 2024 • Guobao Xiao, Jun Yu, Jiayi Ma, Deng-Ping Fan, Ling Shao
The principle of LSC is to preserve the latent semantic consensus in both data points and model hypotheses.
2 code implementations • 4 Mar 2024 • Yuhao Wu, Jiangchao Yao, Xiaobo Xia, Jun Yu, Ruxin Wang, Bo Han, Tongliang Liu
Despite the success of the carefully-annotated benchmarks, the effectiveness of existing graph neural networks (GNNs) can be considerably impaired in practice when the real-world graph data is noisily labeled.
no code implementations • 20 Jan 2024 • Cong Lei, Yuxuan Du, Peng Mi, Jun Yu, Tongliang Liu
Quantum kernels hold great promise for offering computational advantages over classical learners, with the effectiveness of these kernels closely tied to the design of the quantum feature map.
no code implementations • 16 Jan 2024 • Bingyuan Zhang, xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao, Jianzong Wang
In recent years, the field of talking faces generation has attracted considerable attention, with certain methods adept at generating virtual faces that convincingly imitate human expressions.
1 code implementation • 7 Jan 2024 • Xiangyang Miao, Guobao Xiao, Shiping Wang, Jun Yu
In our approach, we design a distinctive self-attention block to capture global context and parallel process it with the established local context learning module, which enables us to simultaneously capture both local and global consensuses.
no code implementations • 31 Dec 2023 • Yuefeng Xu, Rui Zhong, Chao Zhang, Jun Yu
Various popular multiplayer battle royale games share a lot of common elements.
1 code implementation • 26 Dec 2023 • Junwen Guo, Guobao Xiao, Shiping Wang, Jun Yu
To further apply the recalibrated graph contexts to the global domain, we propose the Graph Context Guidance Transformer.
no code implementations • 19 Dec 2023 • Xiang Feng, Yongbo He, YuBo Wang, Chengkai Wang, Zhenzhong Kuang, Jiajun Ding, Feiwei Qin, Jun Yu, Jianping Fan
This framework aims to guide the NeRF model to synthesize high-resolution novel views via single-scene internal learning rather than requiring any external high-resolution training data.
no code implementations • 3 Dec 2023 • Eashan Adhikarla, Kai Zhang, Jun Yu, Lichao Sun, John Nicholson, Brian D. Davison
As a result, it raises concerns about the overall robustness of the machine learning techniques for computer vision applications that are deployed publicly for consumers.
no code implementations • 30 Nov 2023 • Jianjian Qin, Chunzhi Gu, Jun Yu, Chao Zhang
To fully exploit saliency guidance, on each map, we select a pixel pair from the cluster with the highest centroid saliency to form a patch pair.
no code implementations • 7 Nov 2023 • Jianjian Qin, Chunzhi Gu, Jun Yu, Chao Zhang
We present PD-REAL, a novel large-scale dataset for unsupervised anomaly detection (AD) in the 3D domain.
no code implementations • 25 Oct 2023 • Zhuo Huang, Muyang Li, Li Shen, Jun Yu, Chen Gong, Bo Han, Tongliang Liu
By fully exploring both variant and invariant parameters, our EVIL can effectively identify a robust subnetwork to improve OOD generalization.
1 code implementation • NeurIPS 2023 • Zhuo Huang, Li Shen, Jun Yu, Bo Han, Tongliang Liu
Therefore, the label guidance on labeled data is hard to be propagated to unlabeled data.
1 code implementation • 10 Oct 2023 • Zhenying Fang, Jun Yu, Richang Hong
Furthermore, the reliable classification module (RCM) predicts reliable global action categories to reduce false positives.
no code implementations • 1 Oct 2023 • Chaojian Yu, Xiaolong Shi, Jun Yu, Bo Han, Tongliang Liu
Given that the only difference between adversarial and natural training lies in the inclusion of adversarial perturbations, we further hypothesize that adversarial perturbations degrade the generalization of features in natural data and verify this hypothesis through extensive experiments.
no code implementations • 14 Sep 2023 • Yu Ding, Jun Yu, Chunzhi Gu, Shangce Gao, Chao Zhang
Recently, a novel mathematical ANN model, known as the dendritic neuron model (DNM), has been proposed to address nonlinear problems by more accurately reflecting the structure of real neurons.
1 code implementation • 2 Sep 2023 • Xiaobo Xia, Pengqian Lu, Chen Gong, Bo Han, Jun Yu, Tongliang Liu
However, such a procedure is arguably debatable from two folds: (a) it does not consider the bad influence of noisy labels in selected small-loss examples; (b) it does not make good use of the discarded large-loss examples, which may be clean or have meaningful information for generalization.
1 code implementation • 27 Jul 2023 • Lingdong Kong, Yaru Niu, Shaoyuan Xie, Hanjiang Hu, Lai Xing Ng, Benoit R. Cottereau, Liangjun Zhang, Hesheng Wang, Wei Tsang Ooi, Ruijie Zhu, Ziyang Song, Li Liu, Tianzhu Zhang, Jun Yu, Mohan Jing, Pengwei Li, Xiaohua Qi, Cheng Jin, Yingfeng Chen, Jie Hou, Jie Zhang, Zhen Kan, Qiang Ling, Liang Peng, Minglei Li, Di Xu, Changpeng Yang, Yuanqi Yao, Gang Wu, Jian Kuai, Xianming Liu, Junjun Jiang, Jiamian Huang, Baojun Li, Jiale Chen, Shuang Zhang, Sun Ao, Zhenyu Li, Runze Chen, Haiyong Luo, Fang Zhao, Jingze Yu
In this paper, we summarize the winning solutions from the RoboDepth Challenge -- an academic competition designed to facilitate and advance robust OoD depth estimation.
1 code implementation • 21 Jul 2023 • Fang Gao, Xuetao Li, Jun Yu, Feng Shaung
The advent of Chat-GPT has led to a surge of interest in Embodied AI.
no code implementations • 11 Jul 2023 • Hui Kang, Sheng Liu, Huaxi Huang, Jun Yu, Bo Han, Dadong Wang, Tongliang Liu
In recent years, research on learning with noisy labels has focused on devising novel algorithms that can achieve robustness to noisy training labels while generalizing to clean data.
no code implementations • 12 Jun 2023 • Yuhao Wu, Xiaobo Xia, Jun Yu, Bo Han, Gang Niu, Masashi Sugiyama, Tongliang Liu
Training a classifier exploiting a huge amount of supervised data is expensive or even prohibited in a situation, where the labeling cost is high.
1 code implementation • 11 Jun 2023 • Mengyu Li, Jun Yu, Tao Li, Cheng Meng
Sinkhorn algorithm has been used pervasively to approximate the solution to optimal transport (OT) and unbalanced optimal transport (UOT) problems.
no code implementations • 9 Jun 2023 • Zepeng Liu, Zhicheng Yang, Mingye Zhu, Andy Wong, Yibing Wei, Mei Han, Jun Yu, Jui-Hsin Lai
Image dehazing is a meaningful low-level computer vision task and can be applied to a variety of contexts.
1 code implementation • 26 May 2023 • Kai Zhang, Rong Zhou, Eashan Adhikarla, Zhiling Yan, Yixin Liu, Jun Yu, Zhengliang Liu, Xun Chen, Brian D. Davison, Hui Ren, Jing Huang, Chen Chen, Yuyin Zhou, Sunyang Fu, Wei Liu, Tianming Liu, Xiang Li, Yong Chen, Lifang He, James Zou, Quanzheng Li, Hongfang Liu, Lichao Sun
Traditional biomedical artificial intelligence (AI) models, designed for specific tasks or modalities, often exhibit limited flexibility in real-world deployment and struggle to utilize holistic information.
Ranked #1 on Text Summarization on MeQSum
no code implementations • 13 May 2023 • Ke Zhang, Yan Yang, Jun Yu, Hanliang Jiang, Jianping Fan, Qingming Huang, Weidong Han
To address this limitation, we propose a unified Med-VLP framework based on Multi-task Paired Masking with Alignment (MPMA) to integrate the cross-modal alignment task into the joint image-text reconstruction framework to achieve more comprehensive cross-modal interaction, while a Global and Local Alignment (GLA) module is designed to assist self-supervised paradigm in obtaining semantic representations with rich domain knowledge.
1 code implementation • CVPR 2023 • Zhou Yu, Lixiang Zheng, Zhou Zhao, Fei Wu, Jianping Fan, Kui Ren, Jun Yu
A recent benchmark AGQA poses a promising paradigm to generate QA pairs automatically from pre-annotated scene graphs, enabling it to measure diverse reasoning abilities with granular control.
1 code implementation • 18 Apr 2023 • Zhaoming Kong, Fangxi Deng, Haomin Zhuang, Jun Yu, Lifang He, Xiaowei Yang
In this paper, to investigate the applicability of existing denoising techniques, we compare a variety of denoising methods on both synthetic and real-world datasets for different applications.
no code implementations • 14 Apr 2023 • Jaime Spencer, C. Stella Qian, Michaela Trescakova, Chris Russell, Simon Hadfield, Erich W. Graf, Wendy J. Adams, Andrew J. Schofield, James Elder, Richard Bowden, Ali Anwar, Hao Chen, Xiaozhi Chen, Kai Cheng, Yuchao Dai, Huynh Thai Hoa, Sadat Hossain, Jianmian Huang, Mohan Jing, Bo Li, Chao Li, Baojun Li, Zhiwen Liu, Stefano Mattoccia, Siegfried Mercelis, Myungwoo Nam, Matteo Poggi, Xiaohua Qi, Jiahui Ren, Yang Tang, Fabio Tosi, Linh Trinh, S. M. Nadim Uddin, Khan Muhammad Umair, Kaixuan Wang, YuFei Wang, Yixing Wang, Mochu Xiang, Guangkai Xu, Wei Yin, Jun Yu, Qi Zhang, Chaoqiang Zhao
This paper discusses the results for the second edition of the Monocular Depth Estimation Challenge (MDEC).
no code implementations • 8 Apr 2023 • Jun Yu, Shenshen Du, Guochen Xie, Renjie Lu, Pengwei Li, Zhongpeng Cai, Keda Lu
Synthetic Aperture Radar (SAR) to electro-optical (EO) image translation is a fundamental task in remote sensing that can enrich the dataset by fusing information from different sources.
4 code implementations • CVPR 2023 • Zhuo Huang, Miaoxi Zhu, Xiaobo Xia, Li Shen, Jun Yu, Chen Gong, Bo Han, Bo Du, Tongliang Liu
Experimentally, we simulate photon-limited corruptions using CIFAR10/100 and ImageNet30 datasets and show that SharpDRO exhibits a strong generalization ability against severe corruptions and exceeds well-known baseline methods with large performance gains.
no code implementations • 16 Mar 2023 • Jun Yu, Jichao Zhu, Wangyuan Zhu, Zhongpeng Cai, Guochen Xie, Renda Li, Gongpeng Zhao
Emotional Reaction Intensity(ERI) estimation is an important task in multimodal scenarios, and has fundamental applications in medicine, safe driving and other fields.
no code implementations • 15 Mar 2023 • Jun Yu, Renda Li, Zhongpeng Cai, Gongpeng Zhao, Guochen Xie, Jichao Zhu, Wangyuan Zhu
Human affective behavior analysis plays a vital role in human-computer interaction (HCI) systems.
no code implementations • 15 Mar 2023 • Jun Yu, Zhongpeng Cai, Renda Li, Gongpeng Zhao, Guochen Xie, Jichao Zhu, Wangyuan Zhu
Facial Expression Recognition (FER) is an important task in computer vision and has wide applications in human-computer interaction, intelligent security, emotion analysis, and other fields.
1 code implementation • CVPR 2023 • Zhou Yu, Xuecheng Ouyang, Zhenwei Shao, Meng Wang, Jun Yu
Knowledge-based visual question answering (VQA) requires external knowledge beyond the image to answer the question.
Ranked #3 on Visual Question Answering (VQA) on A-OKVQA
2 code implementations • journal 2023 • Zhaoqing Wang, Ziyu Chen, Yaqian Li, Yandong Guo, Jun Yu, Mingming Gong, Tongliang Liu
To address this problem, we propose a mosaic representation learning framework (MosRep), consisting of a new data augmentation strategy that enriches the backgrounds of each small crop and improves the quality of visual representations.
no code implementations • CVPR 2024 • Buyu Liu, BaoJun, Jianping Fan, Xi Peng, Kui Ren, Jun Yu
More desired attacks, to this end, should be able to fool defenses with such consistency checks.
no code implementations • 18 Feb 2023 • Ce Zhou, Qian Li, Chen Li, Jun Yu, Yixin Liu, Guangjing Wang, Kai Zhang, Cheng Ji, Qiben Yan, Lifang He, Hao Peng, JianXin Li, Jia Wu, Ziwei Liu, Pengtao Xie, Caiming Xiong, Jian Pei, Philip S. Yu, Lichao Sun
This study provides a comprehensive review of recent research advancements, challenges, and opportunities for PFMs in text, image, graph, as well as other data modalities.
no code implementations • journal 2023 • Zhuo Huang, Xiaobo Xia, Li Shen, Jun Yu, Chen Gong, Bo Han, Tongliang Liu
Robust generalization aims to deal with the most challenging data distributions which are rarely presented in training set and contain severe noise corruptions.
no code implementations • 5 Feb 2023 • Zijian Zhang, Zhou Zhao, Jun Yu, Qi Tian
In this paper, we propose a novel and flexible conditional diffusion model by introducing conditions into the forward process.
no code implementations • ICCV 2023 • Xiaobo Xia, Bo Han, Yibing Zhan, Jun Yu, Mingming Gong, Chen Gong, Tongliang Liu
As selected data have high discrepancies in probabilities, the divergence of two networks can be maintained by training on such data.
3 code implementations • ICCV 2023 • Yijie Lin, Mouxing Yang, Jun Yu, Peng Hu, Changqing Zhang, Xi Peng
In this paper, we study a novel and widely existing problem in graph matching (GM), namely, Bi-level Noisy Correspondence (BNC), which refers to node-level noisy correspondence (NNC) and edge-level noisy correspondence (ENC).
Ranked #1 on Graph Matching on Willow Object Class
no code implementations • 31 Oct 2022 • Jianjian Qin, Chunzhi Gu, Jun Yu, Chao Zhang
Moreover, our method only requires very few normal samples to train the student network due to the teacher-student distillation mechanism.
1 code implementation • MM '22: Proceedings of the 30th ACM International Conference on Multimedia 2022 • Jun Yu, Zhongpeng Cai, Zepeng Liu, Guochen Xie, Peng He
The purpose of micro expression (ME) and macro expression (MaE) spotting task is to locate the onset and offset frames of MaE and ME clips.
3 code implementations • MM '22: Proceedings of the 30th ACM International Conference on Multimedia 2022 • Jun Yu, Guochen Xie, Zhongpeng Cai, Peng He, Fang Gao, Qiang Ling
We (Team: USTC-IAT-United) also compare our method with other competitors' in MEGC2022, and the expert evaluation results show that our method performs best, which verifies the effectiveness of our method.
no code implementations • 4 Oct 2022 • Chaojian Yu, Dawei Zhou, Li Shen, Jun Yu, Bo Han, Mingming Gong, Nannan Wang, Tongliang Liu
Firstly, applying a pre-specified perturbation budget on networks of various model capacities will yield divergent degree of robustness disparity between natural and robust accuracies, which deviates from robust network's desideratum.
1 code implementation • Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops 2022 • Jun Yu, Liwen Zhang, Shenshen Du, Hao Chang, Keda Lu, Zhong Zhang, Ye Yu, Lei Wang, Qiang Ling
To overcome these difficulties, this paper first select fewer but suitable data augmentation methods to improve the accuracy of the supervised model based on the labeled training set, which is suitable for the characteristics of hyperspectral images.
1 code implementation • 23 Sep 2022 • Jun Yu, Zhaoming Kong, Liang Zhan, Li Shen, Lifang He
The assessment of Alzheimer's Disease (AD) and Mild Cognitive Impairment (MCI) associated with brain changes remains a challenging task.
1 code implementation • Conference and Labs of the Evaluation Forum 2022 • Jun Yu, Hao Chang, Keda Lu, Guochen Xie, Liwen Zhang, Zhongpeng Cai, Shenshen Du, Zhihong Wei, Zepeng Liu, Fang Gao, Feng Shuang
This motivates us to explore the impact of different methods and components in fine-grained classification on FungiCLEF 2022.
no code implementations • 21 Aug 2022 • Jun Yu, Shunqing Zhang, Jiayun Sun, Shugong Xu, Shan Cao
Multi-stream carrier aggregation is a key technology to expand bandwidth and improve the throughput of the fifth-generation wireless communication systems.
2 code implementations • Machine Learning 2022 • Hao Chang, Guochen Xie, Jun Yu, Qiang Ling, Fang Gao, Ye Yu
Semi-supervised Fine-Grained Recognition is a challenging task due to the difficulty of data imbalance, high inter-class similarity and domain mismatch.
no code implementations • 4 Jul 2022 • Chunzhi Gu, Jun Yu, Chao Zhang
Specifically, the inductive bias imposed by the extra CVAE path encourages two latent variables in two paths to respectively govern separate representations for each partial-body motion.
1 code implementation • 17 Jun 2022 • Chaojian Yu, Bo Han, Li Shen, Jun Yu, Chen Gong, Mingming Gong, Tongliang Liu
Here, we explore the causes of robust overfitting by comparing the data distribution of \emph{non-overfit} (weak adversary) and \emph{overfitted} (strong adversary) adversarial training, and observe that the distribution of the adversarial data generated by weak adversary mainly contain small-loss data.
no code implementations • 31 May 2022 • Jingyi Zhang, Cheng Meng, Jun Yu, Mengrui Zhang, Wenxuan Zhong, Ping Ma
Theoretically, we show the selected subsample can be used for efficient density estimation by deriving the convergence rate for the proposed subsample kernel density estimator.
1 code implementation • 30 May 2022 • Tao Li, Cheng Meng, Hongteng Xu, Jun Yu
Distribution comparison plays a central role in many machine learning tasks like data classification and generative modeling.
1 code implementation • 26 May 2022 • Mengyu Li, Jun Yu, Hongteng Xu, Cheng Meng
As a valid metric of metric-measure spaces, Gromov-Wasserstein (GW) distance has shown the potential for matching problems of structured data like point clouds and graphs.
2 code implementations • 4 May 2022 • Jun Yu, Hao Chang, Keda Lu, Liwen Zhang, Shenshen Du, Zhong Zhang
Multi-modal aerial view object classification (MAVOC) in Automatic target recognition (ATR), although an important and challenging problem, has been under studied.
no code implementations • 28 Mar 2022 • Jun Yu, Zhongpeng Cai, Peng He, Guocheng Xie, Qiang Ling
Moreover, we introduce the multi-fold ensemble method to train and ensemble several models with the same architecture but different data distributions to enhance the performance of our solution.
1 code implementation • 24 Mar 2022 • Zhou Yu, Zitian Jin, Jun Yu, Mingliang Xu, Hongbo Wang, Jianping Fan
Recent advances in Transformer architectures [1] have brought remarkable improvements to visual question answering (VQA).
no code implementations • 15 Feb 2022 • Yibing Zhan, Zhi Chen, Jun Yu, Baosheng Yu, DaCheng Tao, Yong Luo
As a result, HLN significantly improves the performance of scene graph generation by integrating and reasoning from object interactions, relationship interactions, and transitive inference of hyper-relationships.
no code implementations • 30 Jan 2022 • Yexiong Lin, Yu Yao, Yuxuan Du, Jun Yu, Bo Han, Mingming Gong, Tongliang Liu
Algorithms which minimize the averaged loss have been widely designed for dealing with noisy labels.
1 code implementation • CVPR 2022 • Wenwen Pan, Haonan Shi, Zhou Zhao, Jieming Zhu, Xiuqiang He, Zhigeng Pan, Lianli Gao, Jun Yu, Fei Wu, Qi Tian
Audio-Guided video semantic segmentation is a challenging problem in visual analysis and editing, which automatically separates foreground objects from background in a video sequence according to the referring audio expressions.
no code implementations • CVPR 2022 • Jun Bao, Buyu Liu, Jun Yu
This paper aims to address the single image gaze target detection problem.
1 code implementation • Association for the Advancement of Artificial Intelligence 2021 • Jun Yu, Hao Chang, Keda Lu
It’s more efficient to look for ways improving the data based a fixed neural network architecture.
no code implementations • 21 Nov 2021 • Jun Yu, Zhaoming Kong, Aditya Kendre, Hao Peng, Carl Yang, Lichao Sun, Alex Leow, Lifang He
This paper presents a novel graph-based kernel learning approach for connectome analysis.
no code implementations • 7 Oct 2021 • Xiaopeng Li, Jiang Wu, Zhanbo Xu, Kun Liu, Jun Yu, Xiaohong Guan
This paper focuses on the uncertainty set prediction of the aggregated generation of geographically distributed wind farms.
no code implementations • 29 Sep 2021 • Xiaobo Xia, Bo Han, Yibing Zhan, Jun Yu, Mingming Gong, Chen Gong, Tongliang Liu
The sample selection approach is popular in learning with noisy labels, which tends to select potentially clean data out of noisy data for robust training.
1 code implementation • 16 Aug 2021 • Yuhao Cui, Zhou Yu, Chunqi Wang, Zhongzhou Zhao, Ji Zhang, Meng Wang, Jun Yu
Nevertheless, most existing VLP approaches have not fully utilized the intrinsic knowledge within the image-text pairs, which limits the effectiveness of the learned alignments and further restricts the performance of their models.
no code implementations • 14 Jul 2021 • Hao Chang, Guochen Xie, Jun Yu, Qiang Ling
Semi-supervised Fine-Grained Recognition is a challenge task due to the difficulty of data imbalance, high inter-class similarity and domain mismatch.
no code implementations • 10 Jul 2021 • Fang Gao, Jiabao Wang, Jun Yu, Yaoxiong Wang, Feng Shuang
It consists of a dense residual network structure, an adaptive weight channel attention (AWCA) module, a patch second non-local (PSNL) module and a soft label generation method.
1 code implementation • 27 Jun 2021 • Jun Bao, Buyu Liu, Jun Yu
We propose a novel method on refining cross-person gaze prediction task with eye/face images only by explicitly modelling the person-specific differences.
no code implementations • 10 Jun 2021 • Dawei Zhou, Nannan Wang, Xinbo Gao, Bo Han, Jun Yu, Xiaoyu Wang, Tongliang Liu
However, pre-processing methods may suffer from the robustness degradation effect, in which the defense reduces rather than improving the adversarial robustness of a target model in a white-box setting.
no code implementations • 1 Jun 2021 • Xiaobo Xia, Tongliang Liu, Bo Han, Mingming Gong, Jun Yu, Gang Niu, Masashi Sugiyama
Lots of approaches, e. g., loss correction and label correction, cannot handle such open-set noisy labels well, since they need training data and test data to share the same label space, which does not hold for learning with open-set noisy labels.
no code implementations • NeurIPS 2021 • Xiaobo Xia, Tongliang Liu, Bo Han, Mingming Gong, Jun Yu, Gang Niu, Masashi Sugiyama
In this way, we also give large-loss but less selected data a try; then, we can better distinguish between the cases (a) and (b) by seeing if the losses effectively decrease with the uncertainty after the try.
Ranked #26 on Image Classification on mini WebVision 1.0
no code implementations • 18 May 2021 • Bofeng Wu, guocheng niu, Jun Yu, Xinyan Xiao, Jian Zhang, Hua Wu
This paper proposes an approach to Dense Video Captioning (DVC) without pairwise event-sentence annotation.
no code implementations • ICCV 2021 • Dawei Zhou, Nannan Wang, Chunlei Peng, Xinbo Gao, Xiaoyu Wang, Jun Yu, Tongliang Liu
Then, we train a denoising model to minimize the distances between the adversarial examples and the natural examples in the class activation feature space.
1 code implementation • 25 Dec 2020 • Jun Yu, Hao Zhou, Yibing Zhan, DaCheng Tao
Essentially, DGCPN addresses the inaccurate similarity problem by exploring and exploiting the data's intrinsic relationships in a graph.
1 code implementation • NeurIPS 2020 • Cheng Meng, Jun Yu, Jingyi Zhang, Ping Ma, Wenxuan Zhong
The proposed method, named principal optimal transport direction (POTD), estimates the basis of the SDR subspace using the principal directions of the optimal transport coupling between the data respecting different response categories.
no code implementations • 21 Aug 2020 • Jinfeng Li, Weifeng Liu, Yicong Zhou, Jun Yu, Dapeng Tao
Traditional domain adaptation algorithms assume that enough labeled data, which are treated as the prior knowledge are available in the source domain.
no code implementations • 30 May 2020 • Jun Yu, Mengyan Li, Xinlong Hao, Guochen Xie
Recognizing Families In the Wild (RFIW) is a challenging kinship recognition task with multiple tracks, which is based on Families in the Wild (FIW), a large-scale and comprehensive image database for automatic kinship recognition.
no code implementations • 30 May 2020 • Jun Yu, Guochen Xie, Mengyan Li, Xinlong Hao
While in inference procedure, we try another similarity computing method by dropping the followed several fully connected layers and directly computing the cosine similarity of the two feature vectors.
no code implementations • 21 May 2020 • Jun Yu, HaiYing Wang, Mingyao Ai, Huiming Zhang
We first derive optimal Poisson subsampling probabilities in the context of quasi-likelihood estimation under the A- and L-optimality criteria.
1 code implementation • 25 Apr 2020 • Zhou Yu, Yuhao Cui, Jun Yu, Meng Wang, DaCheng Tao, Qi Tian
Most existing works focus on a single task and design neural architectures manually, which are highly task-specific and hard to generalize to different tasks.
Ranked #19 on Visual Question Answering (VQA) on VQA v2 test-std
no code implementations • 7 Apr 2020 • Maoying Qiao, Tongliang Liu, Jun Yu, Wei Bian, DaCheng Tao
To alleviate this problem, in this paper, a repulsiveness-encouraging prior is introduced among mixing components and a diversified EPCA mixture (DEPCAM) model is developed in the Bayesian framework.
no code implementations • 6 Apr 2020 • Maoying Qiao, Jun Yu, Wei Bian, DaCheng Tao
Specifically, an HMRNet is reorganized into a hierarchical structure with homogeneous networks as its layers and heterogeneous links connecting them.
no code implementations • 16 Mar 2020 • Yijun Song, Jingwen Wang, Lin Ma, Zhou Yu, Jun Yu
The task of temporally grounding textual queries in videos is to localize one video segment that semantically corresponds to the given query.
no code implementations • 12 Aug 2019 • Zhou Yu, Yuhao Cui, Jun Yu, DaCheng Tao, Qi Tian
Learning an effective attention mechanism for multimodal data is important in many vision-and-language tasks that require a synergic understanding of both the visual and textual contents.
7 code implementations • CVPR 2019 • Zhou Yu, Jun Yu, Yuhao Cui, DaCheng Tao, Qi Tian
In this paper, we propose a deep Modular Co-Attention Network (MCAN) that consists of Modular Co-Attention (MCA) layers cascaded in depth.
Ranked #7 on Question Answering on SQA3D
no code implementations • 20 Jun 2019 • Natalya Pya Arnqvist, Blaise Ngendangenzwa, Eric Lindahl, Leif Nilsson, Jun Yu
One of the primary concerns of product quality control in the automotive industry is an automated detection of defects of small sizes on specular car body surfaces.
1 code implementation • 6 Jun 2019 • Zhou Yu, Dejing Xu, Jun Yu, Ting Yu, Zhou Zhao, Yueting Zhuang, DaCheng Tao
It is both crucial and natural to extend this research direction to the video domain for video question answering (VideoQA).
Ranked #31 on Video Question Answering on ActivityNet-QA
Visual Question Answering (VQA) Zero-Shot Video Question Answer
no code implementations • 20 May 2019 • Jun Yu, Jing Li, Zhou Yu, Qingming Huang
Despite the success of existing studies, current methods only model the co-attention that characterizes the inter-modal interactions while neglecting the self-attention that characterizes the intra-modal interactions.
no code implementations • 9 May 2019 • Yinglu Liu, Hao Shen, Yue Si, Xiaobo Wang, Xiangyu Zhu, Hailin Shi, Zhibin Hong, Hanqi Guo, Ziyuan Guo, Yanqin Chen, Bi Li, Teng Xi, Jun Yu, Haonian Xie, Guochen Xie, Mengyan Li, Qing Lu, Zengfu Wang, Shenqi Lai, Zhenhua Chai, Xiaoming Wei
However, previous competitions on facial landmark localization (i. e., the 300-W, 300-VW and Menpo challenges) aim to predict 68-point landmarks, which are incompetent to depict the structure of facial components.
no code implementations • CVPR 2019 • Yibing Zhan, Jun Yu, Ting Yu, DaCheng Tao
In this paper, we explore the beneficial effect of undetermined relationships on visual relationship detection.
no code implementations • 22 Apr 2019 • Jian Zhang, Jun Yu, DaCheng Tao
Next, we exploit an affine transformation to align the local deep features of each neighbourhood with the global features.
no code implementations • 16 Apr 2019 • Jun Yu, Jinghan Yao, Jian Zhang, Zhou Yu, DaCheng Tao
In this paper, we propose a one-stage framework, SPRNet, which performs efficient instance segmentation by introducing a single pixel reconstruction (SPR) branch to off-the-shelf one-stage detectors.
no code implementations • 5 Apr 2019 • Maoying Qiao, Jun Yu, Wei Bian, Qiang Li, DaCheng Tao
Stochastic block models (SBMs) have been playing an important role in modeling clusters or community structures of network data.
no code implementations • 26 Mar 2019 • Jun Yu, Xiao-Jun Wu
With the advantage of low storage cost and high efficiency, hashing learning has received much attention in the domain of Big Data.
no code implementations • 26 Mar 2019 • Jun Yu, Xiao-Jun Wu
Our model not only considers the inter-modality correlation by maximizing the kernel correlation but also preserves the semantically structural information within each modality.
no code implementations • 6 Dec 2018 • Jun Yu, Xiao-Jun Wu, Josef Kittler
With the advantage of low storage cost and high retrieval efficiency, hashing techniques have recently been an emerging topic in cross-modal similarity search.
no code implementations • 24 Oct 2018 • Zhou Zhao, Hanbing Zhan, Lingtao Meng, Jun Xiao, Jun Yu, Min Yang, Fei Wu, Deng Cai
In this paper, we study the problem of image retweet prediction in social media, which predicts the image sharing behavior that the user reposts the image tweets from their followees.
no code implementations • 13 Aug 2018 • Jun Yu, Xiao-Jun Wu, Josef Kittler
Many hashing methods based on a single view have been extensively studied for information retrieval.
no code implementations • 19 Jun 2018 • Jun Yu, Xiao-Jun Wu, Josef Kittler
Recently, hashing techniques have gained importance in large-scale retrieval tasks because of their retrieval speed.
1 code implementation • 9 May 2018 • Zhou Yu, Jun Yu, Chenchao Xiang, Zhou Zhao, Qi Tian, DaCheng Tao
Visual grounding aims to localize an object in an image referred to by a textual query phrase.
Ranked #9 on Phrase Grounding on Flickr30k Entities Test
no code implementations • ECCV 2018 • Xiaoqing Yin, Xinchao Wang, Jun Yu, Maojun Zhang, Pascal Fua, DaCheng Tao
Images captured by fisheye lenses violate the pinhole camera assumption and suffer from distortions.
no code implementations • ICLR 2018 • Wei Zhang, Qiuyu Chen, Jun Yu, Jianping Fan
In this paper, a deep boosting algorithm is developed to learn more discriminative ensemble classifier by seamlessly combining a set of base deep CNNs (base experts) with diverse capabilities, e. g., these base deep CNNs are sequentially trained to recognize a set of object classes in an easy-to-hard way according to their learning complexities.
no code implementations • 18 Dec 2017 • Chaoqun Hong, Jun Yu
In the proposed deep learning based framework, Manifold Regularized Convolutional Layers (MRCL) improve traditional convolutional layers by learning the relationship among outputs of neurons.
2 code implementations • 4 Dec 2017 • Jun Yu, Xingxin Xu, Fei Gao, Shengjie Shi, Meng Wang, DaCheng Tao, Qingming Huang
Experimental results show that our method is capable of generating both visually comfortable and identity-preserving face sketches/photos over a wide range of challenging data.
Ranked #1 on Face Sketch Synthesis on CUFS (FID metric)
2 code implementations • 10 Aug 2017 • Zhou Yu, Jun Yu, Chenchao Xiang, Jianping Fan, DaCheng Tao
For fine-grained image and question representations, a `co-attention' mechanism is developed by using a deep neural network architecture to jointly learn the attentions for both the image and the question, which can allow us to reduce the irrelevant features effectively and obtain more discriminative features for image and question representations.
6 code implementations • ICCV 2017 • Zhou Yu, Jun Yu, Jianping Fan, DaCheng Tao
For multi-modal feature fusion, here we develop a Multi-modal Factorized Bilinear (MFB) pooling approach to efficiently and effectively combine multi-modal features, which results in superior performance for VQA compared with other bilinear pooling approaches.
no code implementations • 8 Jul 2017 • Tianyi Zhao, Baopeng Zhang, Wei zhang, Ning Zhou, Jun Yu, Jianping Fan
Our LMM model can provide an end-to-end approach for jointly learning: (a) the deep networks to extract more discriminative deep features for image and object class representation; (b) the tree classifier for recognizing large numbers of object classes hierarchically; and (c) the visual hierarchy adaptation for achieving more accurate indexing of large numbers of object classes hierarchically.
no code implementations • 24 Jun 2017 • Tianyi Zhao, Jun Yu, Zhenzhong Kuang, Wei zhang, Jianping Fan
In this paper, a deep mixture of diverse experts algorithm is developed for seamlessly combining a set of base deep CNNs (convolutional neural networks) with diverse outputs (task spaces), e. g., such base deep CNNs are trained to recognize different subsets of tens of thousands of atomic object classes.
no code implementations • 15 Nov 2016 • Ping Li, Jun Yu, Meng Wang, Luming Zhang, Deng Cai, Xuelong. Li
To achieve this goal, we cast the problem into a constrained rank minimization framework by adopting the least squares regularization.
no code implementations • 7 Jul 2016 • Anders Hildeman, David Bolin, Jonas Wallin, Adam Johansson, Tufve Nyholm, Thomas Asklund, Jun Yu
The amount of data needed to train a model for s-CT generation is of the order of 100 million voxels.
no code implementations • 8 Feb 2015 • Matt Taddy, Chun-Sheng Chen, Jun Yu, Mitch Wyle
We derive ensembles of decision trees through a nonparametric Bayesian model, allowing us to view random forests as samples from a posterior distribution.
Applications