Search Results for author: Yadan Luo

Found 37 papers, 16 papers with code

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

no code implementations20 Mar 2024 Djamahl Etchegaray, Zi Huang, Tatsuya Harada, Yadan Luo

In this work, we tackle the limitations of current LiDAR-based 3D object detection systems, which are hindered by a restricted class vocabulary and the high costs associated with annotating new object classes.

3D Object Detection object-detection

ConjNorm: Tractable Density Estimation for Out-of-Distribution Detection

no code implementations27 Feb 2024 Bo Peng, Yadan Luo, Yonggang Zhang, Yixuan Li, Zhen Fang

Extensive experiments across OOD detection benchmarks empirically demonstrate that our proposed \textsc{ConjNorm} has established a new state-of-the-art in a variety of OOD detection setups, outperforming the current best method by up to 13. 25$\%$ and 28. 19$\%$ (FPR95) on CIFAR-100 and ImageNet-1K, respectively.

Density Estimation Out-of-Distribution Detection +1

Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook

no code implementations12 Jan 2024 Ziying Song, Lin Liu, Feiyang Jia, Yadan Luo, Guoxin Zhang, Lei Yang, Li Wang, Caiyan Jia

In the realm of modern autonomous driving, the perception system is indispensable for accurately assessing the state of the surrounding environment, thereby enabling informed prediction and planning.

3D Object Detection Autonomous Driving +2

Learning Efficient Unsupervised Satellite Image-based Building Damage Detection

1 code implementation4 Dec 2023 Yiyun Zhang, Zijian Wang, Yadan Luo, Xin Yu, Zi Huang

Existing Building Damage Detection (BDD) methods always require labour-intensive pixel-level annotations of buildings and their conditions, hence largely limiting their applications.

Damaged Building Detection Disaster Response +2

In Search of Lost Online Test-time Adaptation: A Survey

1 code implementation31 Oct 2023 Zixin Wang, Yadan Luo, Liang Zheng, Zhuoxiao Chen, Sen Wang, Zi Huang

In this paper, we present a comprehensive survey on online test-time adaptation (OTTA), a paradigm focused on adapting machine learning models to novel data distributions upon batch arrival.

Test-time Adaptation

Towards Open World Active Learning for 3D Object Detection

1 code implementation16 Oct 2023 Zhuoxiao Chen, Yadan Luo, Zixin Wang, Zijian Wang, Xin Yu, Zi Huang

To seek effective solutions, we investigate a more practical yet challenging research task: Open World Active Learning for 3D Object Detection (OWAL-3D), aiming at selecting a small number of 3D boxes to annotate while maximizing detection performance on both known and unknown classes.

3D Object Detection Active Learning +3

Cal-SFDA: Source-Free Domain-adaptive Semantic Segmentation with Differentiable Expected Calibration Error

1 code implementation6 Aug 2023 Zixin Wang, Yadan Luo, Zhi Chen, Sen Wang, Zi Huang

The prevalence of domain adaptive semantic segmentation has prompted concerns regarding source domain data leakage, where private information from the source domain could inadvertently be exposed in the target domain.

Model Selection Pseudo Label +2

Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling

1 code implementation ICCV 2023 Zhuoxiao Chen, Yadan Luo, Zheng Wang, Mahsa Baktashmotlagh, Zi Huang

Unsupervised domain adaptation (DA) with the aid of pseudo labeling techniques has emerged as a crucial approach for domain-adaptive 3D object detection.

3D Object Detection object-detection +1

KECOR: Kernel Coding Rate Maximization for Active 3D Object Detection

no code implementations ICCV 2023 Yadan Luo, Zhuoxiao Chen, Zhen Fang, Zheng Zhang, Zi Huang, Mahsa Baktashmotlagh

Achieving a reliable LiDAR-based object detector in autonomous driving is paramount, but its success hinges on obtaining large amounts of precise 3D annotations.

3D Object Detection Active Learning +4

Exploring Active 3D Object Detection from a Generalization Perspective

1 code implementation23 Jan 2023 Yadan Luo, Zhuoxiao Chen, Zijian Wang, Xin Yu, Zi Huang, Mahsa Baktashmotlagh

To alleviate the high annotation cost in LiDAR-based 3D object detection, active learning is a promising solution that learns to select only a small portion of unlabeled data to annotate, without compromising model performance.

3D Object Detection Active Learning +2

How Far Pre-trained Models Are from Neural Collapse on the Target Dataset Informs their Transferability

no code implementations ICCV 2023 Zijian Wang, Yadan Luo, Liang Zheng, Zi Huang, Mahsa Baktashmotlagh

This paper focuses on model transferability estimation, i. e., assessing the performance of pre-trained models on a downstream task without performing fine-tuning.

Federated Zero-Shot Learning for Visual Recognition

no code implementations5 Sep 2022 Zhi Chen, Yadan Luo, Sen Wang, Jingjing Li, Zi Huang

We identify two key challenges in our FedZSL protocol: 1) the trained models are prone to be biased to the locally observed classes, thus failing to generalize to the unseen classes and/or seen classes appeared on other devices; 2) as each category in the training data comes from a single source, the central model is highly vulnerable to model replacement (backdoor) attacks.

Federated Learning Zero-Shot Learning

Discovering Domain Disentanglement for Generalized Multi-source Domain Adaptation

1 code implementation11 Jul 2022 Zixin Wang, Yadan Luo, Peng-Fei Zhang, Sen Wang, Zi Huang

A typical multi-source domain adaptation (MSDA) approach aims to transfer knowledge learned from a set of labeled source domains, to an unlabeled target domain.

Disentanglement Domain Adaptation

GSMFlow: Generation Shifts Mitigating Flow for Generalized Zero-Shot Learning

no code implementations5 Jul 2022 Zhi Chen, Yadan Luo, Sen Wang, Jingjing Li, Zi Huang

To address this issue, we propose a novel flow-based generative framework that consists of multiple conditional affine coupling layers for learning unseen data generation.

Attribute Generalized Zero-Shot Learning

Source-Free Progressive Graph Learning for Open-Set Domain Adaptation

2 code implementations13 Feb 2022 Yadan Luo, Zijian Wang, Zhuoxiao Chen, Zi Huang, Mahsa Baktashmotlagh

However, most existing OSDA approaches are limited due to three main reasons, including: (1) the lack of essential theoretical analysis of generalization bound, (2) the reliance on the coexistence of source and target data during adaptation, and (3) failing to accurately estimate the uncertainty of model predictions.

Action Recognition Domain Adaptation +2

RoadAtlas: Intelligent Platform for Automated Road Defect Detection and Asset Management

no code implementations8 Sep 2021 Zhuoxiao Chen, Yiyun Zhang, Yadan Luo, Zijian Wang, Jinjiang Zhong, Anthony Southon

With the rapid development of intelligent detection algorithms based on deep learning, much progress has been made in automatic road defect recognition and road marking parsing.

Asset Management Defect Detection

Conditional Extreme Value Theory for Open Set Video Domain Adaptation

1 code implementation1 Sep 2021 Zhuoxiao Chen, Yadan Luo, Mahsa Baktashmotlagh

The majority of video domain adaptation algorithms are proposed for closed-set scenarios in which all the classes are shared among the domains.

Action Recognition Domain Adaptation +1

Learning to Diversify for Single Domain Generalization

1 code implementation ICCV 2021 Zijian Wang, Yadan Luo, Ruihong Qiu, Zi Huang, Mahsa Baktashmotlagh

Domain generalization (DG) aims to generalize a model trained on multiple source (i. e., training) domains to a distributionally different target (i. e., test) domain.

Domain Generalization

Mitigating Generation Shifts for Generalized Zero-Shot Learning

1 code implementation7 Jul 2021 Zhi Chen, Yadan Luo, Sen Wang, Ruihong Qiu, Jingjing Li, Zi Huang

Generalized Zero-Shot Learning (GZSL) is the task of leveraging semantic information (e. g., attributes) to recognize the seen and unseen samples, where unseen classes are not observable during training.

Attribute Generalized Zero-Shot Learning

Context-Aware Attention-Based Data Augmentation for POI Recommendation

no code implementations30 Jun 2021 Yang Li, Yadan Luo, Zheng Zhang, Shazia W. Sadiq, Peng Cui

It aims at suggesting the next POI to a user in spatial and temporal context, which is a practical yet challenging task in various applications.

Data Augmentation

Discovering Collaborative Signals for Next POI Recommendation with Iterative Seq2Graph Augmentation

no code implementations30 Jun 2021 Yang Li, Tong Chen, Yadan Luo, Hongzhi Yin, Zi Huang

Furthermore, the sparse POI-POI transitions restrict the ability of a model to learn effective sequential patterns for recommendation.

Multi-Task Learning

Enhanced Modality Transition for Image Captioning

no code implementations23 Feb 2021 Ziwei Wang, Yadan Luo, Zi Huang

In this work, we explicitly build a Modality Transition Module (MTM) to transfer visual features into semantic representations before forwarding them to the language model.

Image Captioning Language Modelling +2

Semantics Disentangling for Generalized Zero-Shot Learning

1 code implementation ICCV 2021 Zhi Chen, Yadan Luo, Ruihong Qiu, Sen Wang, Zi Huang, Jingjing Li, Zheng Zhang

Generalized zero-shot learning (GZSL) aims to classify samples under the assumption that some classes are not observable during training.

Generalized Zero-Shot Learning Relation Network

Interpretable Signed Link Prediction with Signed Infomax Hyperbolic Graph

1 code implementation25 Nov 2020 Yadan Luo, Zi Huang, Hongxu Chen, Yang Yang, Mahsa Baktashmotlagh

Most of the prior efforts are devoted to learning node embeddings with graph neural networks (GNNs), which preserve the signed network topology by message-passing along edges to facilitate the downstream link prediction task.

Link Prediction

Adversarial Bipartite Graph Learning for Video Domain Adaptation

1 code implementation31 Jul 2020 Yadan Luo, Zi Huang, Zijian Wang, Zheng Zhang, Mahsa Baktashmotlagh

To further enhance the model capacity and testify the robustness of the proposed architecture on difficult transfer tasks, we extend our model to work in a semi-supervised setting using an additional video-level bipartite graph.

Domain Adaptation Graph Learning +1

Progressive Graph Learning for Open-Set Domain Adaptation

1 code implementation ICML 2020 Yadan Luo, Zijian Wang, Zi Huang, Mahsa Baktashmotlagh

The existing domain adaptation approaches which tackle this problem work in the closed-set setting with the assumption that the source and the target data share exactly the same classes of objects.

Domain Adaptation Graph Learning

ORD: Object Relationship Discovery for Visual Dialogue Generation

no code implementations15 Jun 2020 Ziwei Wang, Zi Huang, Yadan Luo, Huimin Lu

With the rapid advancement of image captioning and visual question answering at single-round level, the question of how to generate multi-round dialogue about visual content has not yet been well explored. Existing visual dialogue methods encode the image into a fixed feature vector directly, concatenated with the question and history embeddings to predict the response. Some recent methods tackle the co-reference resolution problem using co-attention mechanism to cross-refer relevant elements from the image, history, and the target question. However, it remains challenging to reason visual relationships, since the fine-grained object-level information is omitted before co-attentive reasoning.

Dialogue Generation Graph Attention +5

Learning from the Past: Continual Meta-Learning via Bayesian Graph Modeling

no code implementations12 Nov 2019 Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Mahsa Baktashmotlagh, Yang Yang

Meta-learning for few-shot learning allows a machine to leverage previously acquired knowledge as a prior, thus improving the performance on novel tasks with only small amounts of data.

Continual Learning Few-Shot Learning

Deep Collaborative Discrete Hashing with Semantic-Invariant Structure

no code implementations5 Nov 2019 Zijian Wang, Zheng Zhang, Yadan Luo, Zi Huang

Existing deep hashing approaches fail to fully explore semantic correlations and neglect the effect of linguistic context on visual attention learning, leading to inferior performance.

Deep Hashing

CANZSL: Cycle-Consistent Adversarial Networks for Zero-Shot Learning from Natural Language

no code implementations21 Sep 2019 Zhi Chen, Jingjing Li, Yadan Luo, Zi Huang, Yang Yang

Thus, a multi-modal cycle-consistency loss between the synthesized semantic representations and the ground truth can be learned and leveraged to enforce the generated semantic features to approximate to the real distribution in semantic space.

Generative Adversarial Network Zero-Shot Learning

Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation

no code implementations1 Aug 2019 Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Jingjing Li, Yang Yang

Visual paragraph generation aims to automatically describe a given image from different perspectives and organize sentences in a coherent way.

Imitation Learning reinforcement-learning +1

Snap and Find: Deep Discrete Cross-domain Garment Image Retrieval

no code implementations5 Apr 2019 Yadan Luo, Ziwei Wang, Zi Huang, Yang Yang, Huimin Lu

With the increasing number of online stores, there is a pressing need for intelligent search systems to understand the item photos snapped by customers and search against large-scale product databases to find their desired items.

Attribute Image Retrieval +1

Look Deeper See Richer: Depth-aware Image Paragraph Captioning

no code implementations ACM International Conference on Multimedia 2018 Ziwei Wang, Yadan Luo, Yang Li, Zi Huang, Hongzhi Yin

Existing image paragraph captioning methods give a series of sentences to represent the objects and regions of interests, where the descriptions are essentially generated by feeding the image fragments containing objects and regions into conventional image single-sentence captioning models.

Image Captioning Image Paragraph Captioning +1

Collaborative Learning for Extremely Low Bit Asymmetric Hashing

1 code implementation25 Sep 2018 Yadan Luo, Zi Huang, Yang Li, Fumin Shen, Yang Yang, Peng Cui

Hashing techniques are in great demand for a wide range of real-world applications such as image retrieval and network compression.

Image Retrieval Retrieval

Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning

no code implementations22 Aug 2018 Yadan Luo, Ziwei Wang, Zi Huang, Yang Yang, Cong Zhao

Rich high-quality annotated data is critical for semantic segmentation learning, yet acquiring dense and pixel-wise ground-truth is both labor- and time-consuming.

Segmentation Semantic Segmentation +1

Binary Subspace Coding for Query-by-Image Video Retrieval

no code implementations6 Dec 2016 Ruicong Xu, Yang Yang, Yadan Luo, Fumin Shen, Zi Huang, Heng Tao Shen

The first approach, termed Inner-product Binary Coding (IBC), preserves the inner relationships of images and videos in a common Hamming space.

Retrieval Video Retrieval

Zero-Shot Hashing via Transferring Supervised Knowledge

no code implementations16 Jun 2016 Yang Yang, Wei-Lun Chen, Yadan Luo, Fumin Shen, Jie Shao, Heng Tao Shen

Supervised knowledge e. g. semantic labels or pair-wise relationship) associated to data is capable of significantly improving the quality of hash codes and hash functions.

Image Retrieval Retrieval +1

Cannot find the paper you are looking for? You can Submit a new open access paper.