Search Results for author: Wei Xia

Found 46 papers, 14 papers with code

DeSAM: Decoupling Segment Anything Model for Generalizable Medical Image Segmentation

1 code implementation1 Jun 2023 Yifan Gao, Wei Xia, Dingdu Hu, Xin Gao

In fully automatic mode, the presence of inevitable poor prompts (such as points outside the mask or boxes significantly larger than the mask) can significantly mislead mask generation.

Domain Generalization Image Segmentation +2

Rethinking k-means from manifold learning perspective

no code implementations12 May 2023 Quanxue Gao, Qianqian Wang, Han Lu, Wei Xia, Xinbo Gao

Although numerous clustering algorithms have been developed, many existing methods still leverage k-means technique to detect clusters of data points.

Multi-View Clustering via Semi-non-negative Tensor Factorization

no code implementations29 Mar 2023 Jing Li, Quanxue Gao, Qianqian Wang, Wei Xia, Xinbo Gao

Multi-view clustering (MVC) based on non-negative matrix factorization (NMF) and its variants have received a huge amount of attention in recent years due to their advantages in clustering interpretability.

Towards Regression-Free Neural Networks for Diverse Compute Platforms

no code implementations27 Sep 2022 Rahul Duggal, Hao Zhou, Shuo Yang, Jun Fang, Yuanjun Xiong, Wei Xia

With the shift towards on-device deep learning, ensuring a consistent behavior of an AI service across diverse compute platforms becomes tremendously important.

Neural Architecture Search regression

Data-driven Attention and Data-independent DCT based Global Context Modeling for Text-independent Speaker Recognition

no code implementations4 Aug 2022 Wei Xia, John H. L. Hansen

In this study, a general global time-frequency context modeling framework is proposed to leverage the context information specifically for speaker representation modeling.

Speaker Recognition Speaker Verification +1

ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training

no code implementations12 May 2022 Yue Zhao, Yantao Shen, Yuanjun Xiong, Shuo Yang, Wei Xia, Zhuowen Tu, Bernt Schiele, Stefano Soatto

We present a method to train a classification system that achieves paragon performance in both error rate and NFR, at the inference cost of a single model.

MeMOT: Multi-Object Tracking with Memory

no code implementations CVPR 2022 Jiarui Cai, Mingze Xu, Wei Li, Yuanjun Xiong, Wei Xia, Zhuowen Tu, Stefano Soatto

We propose an online tracking algorithm that performs the object detection and data association under a common framework, capable of linking objects after a long time span.

Multi-Object Tracking object-detection +1

An Automatic Detection Method Of Cerebral Aneurysms In Time-Of-Flight Magnetic Resonance Angiography Images Based On Attention 3D U-Net

no code implementations26 Oct 2021 Chen Geng, Meng Chen, Ruoyu Di, Dongdong Wang, Liqin Yang, Wei Xia, Yuxin Li, Daoying Geng

Conclusions:Compared with the results of our previous studies and other studies, the method in this paper achieves a very competitive sensitivity with less training data and maintains a low false positive rate. As the only method currently using 3D U-Net for aneurysm detection, it proves the feasibility and superior performance of this network in aneurysm detection, and also explores the potential of the channel attention mechanism in this task.

Path Auxiliary Proposal for MCMC in Discrete Space

no code implementations ICLR 2022 Haoran Sun, Hanjun Dai, Wei Xia, Arun Ramamurthy

Energy-based Model (EBM) offers a powerful approach for modeling discrete structure, but both inference and learning of EBM are hard as it involves sampling from discrete distributions.

Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora

no code implementations23 Sep 2021 Szu-Jui Chen, Wei Xia, John H. L. Hansen

With additional techniques such as pronunciation and silence probability modeling, plus multi-style training, we achieve a +5. 42% and +3. 18% relative WER improvement for the development and evaluation sets of the Fearless Steps Corpus.

speech-recognition Speech Recognition

Effective and Efficient Graph Learning for Multi-view Clustering

no code implementations15 Aug 2021 Quanxue Gao, Wei Xia, Xinbo Gao, Xiangdong Zhang, Qin Li, DaCheng Tao

Despite the impressive clustering performance and efficiency in characterizing both the relationship between data and cluster structure, existing graph-based multi-view clustering methods still have the following drawbacks.

Graph Learning

Long Short-Term Transformer for Online Action Detection

1 code implementation NeurIPS 2021 Mingze Xu, Yuanjun Xiong, Hao Chen, Xinyu Li, Wei Xia, Zhuowen Tu, Stefano Soatto

We present Long Short-term TRansformer (LSTR), a temporal modeling algorithm for online action detection, which employs a long- and short-term memory mechanism to model prolonged sequence data.

Online Action Detection Playing the Game of 2048

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

no code implementations6 Jul 2021 Wei Li, Yuanjun Xiong, Shuo Yang, Mingze Xu, Yongxin Wang, Wei Xia

We design a new instance-to-track matching objective to learn appearance embedding that compares a candidate detection to the embedding of the tracks persisted in the tracker.

Multiple Object Tracking Representation Learning

Learning Hierarchical Graph Neural Networks for Image Clustering

2 code implementations ICCV 2021 Yifan Xing, Tong He, Tianjun Xiao, Yongxin Wang, Yuanjun Xiong, Wei Xia, David Wipf, Zheng Zhang, Stefano Soatto

Our hierarchical GNN uses a novel approach to merge connected components predicted at each level of the hierarchy to form a new graph at the next level.

Face Clustering

Harnessing Unrecognizable Faces for Improving Face Recognition

no code implementations8 Jun 2021 Siqi Deng, Yuanjun Xiong, Meng Wang, Wei Xia, Stefano Soatto

The common implementation of face recognition systems as a cascade of a detection stage and a recognition or verification stage can cause problems beyond failures of the detector.

Face Recognition Quantization

Compatibility-aware Heterogeneous Visual Search

no code implementations CVPR 2021 Rahul Duggal, Hao Zhou, Shuo Yang, Yuanjun Xiong, Wei Xia, Zhuowen Tu, Stefano Soatto

Existing systems use the same embedding model to compute representations (embeddings) for the query and gallery images.

Neural Architecture Search Retrieval

Optical manipulation of electronic dimensionality in a quantum material

no code implementations21 Jan 2021 Shaofeng Duan, Yun Cheng, Wei Xia, Yuanyuan Yang, Fengfeng Qi, Tianwei Tang, Yanfeng Guo, Dong Qian, Dao Xiang, Jie Zhang, Wentao Zhang

Exotic phenomenon can be achieved in quantum materials by confining electronic states into two dimensions.

Strongly Correlated Electrons Materials Science Superconductivity

Learning Self-Consistency for Deepfake Detection

1 code implementation ICCV 2021 Tianchen Zhao, Xiang Xu, Mingze Xu, Hui Ding, Yuanjun Xiong, Wei Xia

We propose a new method to detect deepfake images using the cue of the source feature inconsistency within the forged images.

DeepFake Detection Face Swapping +2

Self-supervised Text-independent Speaker Verification using Prototypical Momentum Contrastive Learning

1 code implementation13 Dec 2020 Wei Xia, Chunlei Zhang, Chao Weng, Meng Yu, Dong Yu

First, we examine a simple contrastive learning approach (SimCLR) with a momentum contrastive (MoCo) learning framework, where the MoCo speaker embedding system utilizes a queue to maintain a large set of negative examples.

Contrastive Learning Representation Learning +1

DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning

no code implementations12 Dec 2020 Mufan Sang, Wei Xia, John H. L. Hansen

Despite speaker verification has achieved significant performance improvement with the development of deep neural networks, domain mismatch is still a challenging problem in this field.

Disentanglement Domain Adaptation +1

Positive-Congruent Training: Towards Regression-Free Model Updates

no code implementations CVPR 2021 Sijie Yan, Yuanjun Xiong, Kaustav Kundu, Shuo Yang, Siqi Deng, Meng Wang, Wei Xia, Stefano Soatto

Reducing inconsistencies in the behavior of different versions of an AI system can be as important in practice as reducing its overall error.

Image Classification regression

SMOT: Single-Shot Multi Object Tracking

1 code implementation30 Oct 2020 Wei Li, Yuanjun Xiong, Shuo Yang, Siqi Deng, Wei Xia

We combine this scheme with SSD detectors by proposing a novel tracking anchor assignment module.

Multi-Object Tracking

3D-Aided Data Augmentation for Robust Face Understanding

no code implementations3 Oct 2020 Yifan Xing, Yuanjun Xiong, Wei Xia

Data augmentation has been highly effective in narrowing the data gap and reducing the cost for human annotation, especially for tasks where ground truth labels are difficult and expensive to acquire.

3D Face Modelling Data Augmentation +1

Open-set Short Utterance Forensic Speaker Verification using Teacher-Student Network with Explicit Inductive Bias

no code implementations21 Sep 2020 Mufan Sang, Wei Xia, John H. L. Hansen

In forensic applications, it is very common that only small naturalistic datasets consisting of short utterances in complex or unknown acoustic environments are available.

Inductive Bias Knowledge Distillation +1

Cross-domain Adaptation with Discrepancy Minimization for Text-independent Forensic Speaker Verification

no code implementations5 Sep 2020 Zhenyu Wang, Wei Xia, John H. L. Hansen

Forensic audio analysis for speaker verification offers unique challenges due to location/scenario uncertainty and diversity mismatch between reference and naturalistic field recordings.

Domain Adaptation Speaker Verification

Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations

no code implementations2 Sep 2020 Wei Xia, John H. L. Hansen

In this study, we propose the global context guided channel and time-frequency transformations to model the long-range, non-local time-frequency dependencies and channel variances in speaker representations.

Representation Learning Speaker Verification

6VecLM: Language Modeling in Vector Space for IPv6 Target Generation

no code implementations5 Aug 2020 Tianyu Cui, Gang Xiong, Gaopeng Gou, Junzheng Shi, Wei Xia

Fast IPv6 scanning is challenging in the field of network measurement as it requires exploring the whole IPv6 address space but limited by current computational power.

Language Modelling

Towards causal benchmarking of bias in face analysis algorithms

1 code implementation ECCV 2020 Guha Balakrishnan, Yuanjun Xiong, Wei Xia, Pietro Perona

To address this problem we develop an experimental method for measuring algorithmic bias of face analysis algorithms, which manipulates directly the attributes of interest, e. g., gender and skin tone, in order to reveal causal links between attribute variation and performance change.

Benchmarking Fairness

On Improving Temporal Consistency for Online Face Liveness Detection

no code implementations11 Jun 2020 Xiang Xu, Yuanjun Xiong, Wei Xia

In this paper, we focus on improving the online face liveness detection system to enhance the security of the downstream face recognition system.

Face Recognition

Towards Backward-Compatible Representation Learning

3 code implementations CVPR 2020 Yantao Shen, Yuanjun Xiong, Wei Xia, Stefano Soatto

Backward compatibility is critical to quickly deploy new embedding models that leverage ever-growing large-scale training datasets and improvements in deep learning architectures and training methods.

Face Recognition Representation Learning

Sound Event Detection in Multichannel Audio using Convolutional Time-Frequency-Channel Squeeze and Excitation

no code implementations4 Aug 2019 Wei Xia, Kazuhito Koishida

In this study, we introduce a convolutional time-frequency-channel "Squeeze and Excitation" (tfc-SE) module to explicitly model inter-dependencies between the time-frequency domain and multiple channels.

Event Detection Sound Event Detection

Segmenting Objects in Day and Night:Edge-Conditioned CNN for Thermal Image Semantic Segmentation

1 code implementation24 Jul 2019 Chenglong Li, Wei Xia, Yan Yan, Bin Luo, Jin Tang

These advantages of thermal infrared cameras make the segmentation of semantic objects in day and night.

Semantic Segmentation

Analyses of Multi-collection Corpora via Compound Topic Modeling

1 code implementation17 Jun 2019 Clint P. George, Wei Xia, George Michailidis

The usability study on some real-world corpora illustrates the superiority of cLDA to explore the underlying topics automatically but also model their connections and variations across multiple collections.

Topic Models Variational Inference

Learning Robust Search Strategies Using a Bandit-Based Approach

no code implementations10 May 2018 Wei Xia, Roland H. C. Yap

However, choosing or designing a good search heuristic is non-trivial and is often a manual process.

Correlation Heuristics for Constraint Programming

no code implementations6 May 2018 Ruiwei Wang, Wei Xia, Roland H. C. Yap

We evaluate our correlation heuristics with well known heuristics, namely, dom/wdeg, impact-based search and activity-based search.

CNN: Single-label to Multi-label

no code implementations22 Jun 2014 Yunchao Wei, Wei Xia, Junshi Huang, Bingbing Ni, Jian Dong, Yao Zhao, Shuicheng Yan

Convolutional Neural Network (CNN) has demonstrated promising performance in single-label image classification tasks.

Image Classification

Subcategory-Aware Object Classification

no code implementations CVPR 2013 Jian Dong, Wei Xia, Qiang Chen, Jianshi Feng, Zhongyang Huang, Shuicheng Yan

In this paper, we introduce a subcategory-aware object classification framework to boost category level object classification performance.

Classification General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.