1 code implementation • 9 Sep 2024 • Qiang Huang, Xiao Yan, Xin Wang, Susie Xi Rao, Zhichao Han, Fangcheng Fu, Wentao Zhang, Jiawei Jiang
We also adapt Transformer codebase to train TF-TGN efficiently with multiple GPUs.
no code implementations • 16 Aug 2024 • Qiang Huang, Chuizheng Meng, Defu Cao, Biwei Huang, Yi Chang, Yan Liu
Counterfactual estimation from observations represents a critical endeavor in numerous application fields, such as healthcare and finance, with the primary challenge being the mitigation of treatment bias.
no code implementations • 30 Apr 2024 • Qiang Huang
Therefore our experiments are geared towards recognising the source speakers given the converted voices, which are generated by using FragmentVC on the randomly paired utterances from source and target speakers.
no code implementations • 6 Mar 2024 • Yifan Bao, Yihao Ang, Qiang Huang, Anthony K. H. Tung, Zhiyong Huang
This underscores its adeptness in seamlessly integrating latent features with external conditions.
no code implementations • 21 Feb 2024 • Qiang Huang, Yanhao Wang, Yiqun Sun, Anthony K. H. Tung
To bridge this gap, we revisit and refine the diversity-aware $k$MIPS (D$k$MIPS) problem by incorporating two well-known diversity objectives -- minimizing the average and maximum pairwise item similarities within the results -- into the original relevance objective.
no code implementations • 6 Oct 2023 • Biao Wu, Qiang Huang, Anthony K. H. Tung
In this paper, we concentrate on the domain of tabular data and introduce a novel methodology, Local Distribution Shifting Synthesis (\textsc{LDSS}), to detect leaked data that are used to train classification models.
1 code implementation • 7 Sep 2023 • Yihao Ang, Qiang Huang, Yifan Bao, Anthony K. H. Tung, Zhiyong Huang
Synthetic Time Series Generation (TSG) is crucial in a range of applications, including data augmentation, anomaly detection, and privacy preservation.
1 code implementation • 31 Aug 2023 • Qiang Huang, Jiawei Jiang, Xi Susie Rao, Ce Zhang, Zhichao Han, Zitao Zhang, Xin Wang, Yongjun He, Quanqing Xu, Yang Zhao, Chuang Hu, Shuo Shang, Bo Du
To handle graphs in which features or connectivities are evolving over time, a series of temporal graph neural networks (TGNNs) have been proposed.
no code implementations • 22 Aug 2023 • Yonghe Zhao, Qiang Huang, Shuai Fu, Huiyan Sun
Most causal inference models based on the POF (CIMs-POF) are designed for eliminating confounding bias and default to an underlying assumption of Confounding Covariates.
no code implementations • 2 Aug 2023 • Yonghe Zhao, Qiang Huang, Siwei Wu, Yun Peng, Huiyan Sun
By disentangling observed and unobserved confounders, VLUCI constructs a doubly variational inference model to approximate the distribution of unobserved confounders, which are used for inferring more accurate counterfactual outcomes.
no code implementations • 24 Jul 2023 • Yonghe Zhao, Qiang Huang, Haolong Zeng, Yun Pen, Huiyan Sun
Extensive experiments on synthetic datasets show that the DRL model performs superiorly in learning de-confounding representations and outperforms state-of-the-art counterfactual inference models for continuous treatment variables.
1 code implementation • 21 Feb 2023 • Qiang Huang, Anthony K. H. Tung
Finding the nearest neighbor to a hyperplane (or Point-to-Hyperplane Nearest Neighbor Search, simply P2HNNS) is a new and challenging problem with applications in many research domains.
1 code implementation • 3 Feb 2023 • Yingying Zhu, Hongji Yang, Yuxin Lu, Qiang Huang
To address the above three challenges for cross-view image matching, we propose a new backbone network, named Simple Attention-based Image Geo-localization network (SAIG).
Ranked #2 on Image-Based Localization on VIGOR Cross Area
no code implementations • 15 Jan 2023 • Basel Barakat, Qiang Huang
In our work, the layer selection can be done in four different ways.
1 code implementation • 23 Nov 2022 • Qiang Huang, Yanhao Wang, Anthony K. H. Tung
To speed up the Maximum Inner Product Search (MIPS) on item vectors, we design a shifting-invariant asymmetric transformation and develop a novel sublinear-time Shifting-Aware Asymmetric Locality Sensitive Hashing (SA-ALSH) scheme.
no code implementations • 29 Sep 2021 • Yujie Gu, Yangkun Cao, Qiang Huang, Huiyan Sun
The other is the convolution operation for features to find the optimal solution adopting the Laplacian smoothness and the prior knowledge that nodes with many neighbors are difficult to attack.
1 code implementation • 21 Sep 2021 • Tao Zhuo, Qiang Huang, Mohan Kankanhalli
Raven's Progressive Matrices (RPM) is highly correlated with human intelligence, and it has been widely used to measure the abstract reasoning ability of humans.
no code implementations • 29 Oct 2020 • Yanpei Shi, Mingjie Chen, Qiang Huang, Thomas Hain
The use of memory mechanism could reach 10. 6% and 7. 7% relative improvement compared with not using memory mechanism.
no code implementations • 16 May 2020 • Qiang Huang, Thomas Hain
The first task is to predict an utterance quality score, and the second is to identify where an anomalous distortion takes place in a recording.
no code implementations • 15 May 2020 • Yanpei Shi, Qiang Huang, Thomas Hain
The obtained results show that the proposed approach using speaker dependent speech enhancement can yield better speaker recognition and speech enhancement performances than two baselines in various noise conditions.
no code implementations • 15 May 2020 • Yanpei Shi, Qiang Huang, Thomas Hain
To evaluate the effectiveness of the proposed approach, artificial datasets based on Switchboard Cellular part1 (SWBC) and Voxceleb1 are constructed in two conditions, where speakers' voices are overlapped and not overlapped.
2 code implementations • 17 Jan 2020 • Qiang Huang, Makoto Yamada, Yuan Tian, Dinesh Singh, Dawei Yin, Yi Chang
In this paper, we propose GraphLIME, a local interpretable model explanation for graphs using the Hilbert-Schmidt Independence Criterion (HSIC) Lasso, which is a nonlinear feature selection method.
no code implementations • 14 Jan 2020 • Yanpei Shi, Qiang Huang, Thomas Hain
Instead of individually processing speech enhancement and speaker recognition, the two modules are integrated into one framework by a joint optimisation using deep neural networks.
no code implementations • 18 Nov 2019 • Qiang Huang, Jianhui Bu, Weijian Xie, Shengwen Yang, Weijia Wu, Li-Ping Liu
Sentence matching is an essential task in the QA systems and is usually reformulated as a Paraphrase Identification (PI) problem.
Ranked #13 on Paraphrase Identification on Quora Question Pairs (Accuracy metric)
no code implementations • 17 Oct 2019 • Yanpei Shi, Qiang Huang, Thomas Hain
In the proposed approach, frame-level encoder and attention are applied on segments of an input utterance and generate individual segment vectors.
no code implementations • 24 Sep 2019 • Yanpei Shi, Qiang Huang, Thomas Hain
While the use of deep neural networks has significantly boosted speaker recognition performance, it is still challenging to separate speakers in poor acoustic environments.
no code implementations • CCL 2020 • Yuncong Li, Zhe Yang, Cunxiang Yin, Xu Pan, Lunan Cui, Qiang Huang, Ting Wei
Aspect-category sentiment analysis (ACSA) aims to predict the aspect categories mentioned in texts and their corresponding sentiment polarities.
1 code implementation • 17 Mar 2017 • Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley
Audio tagging aims to perform multi-label classification on audio chunks and it is a newly proposed task in the Detection and Classification of Acoustic Scenes and Events 2016 (DCASE 2016) challenge.
Sound
2 code implementations • 24 Feb 2017 • Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley
In this paper, we propose to use a convolutional neural network (CNN) to extract robust features from mel-filter banks (MFBs), spectrograms or even raw waveforms for audio tagging.
no code implementations • 13 Jul 2016 • Yong Xu, Qiang Huang, Wenwu Wang, Mark D. Plumbley
In this paper, we present a deep neural network (DNN)-based acoustic scene classification framework.
2 code implementations • 13 Jul 2016 • Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson, Mark D. Plumbley
For the unsupervised feature learning, we propose to use a symmetric or asymmetric deep de-noising auto-encoder (sDAE or aDAE) to generate new data-driven features from the Mel-Filter Banks (MFBs) features.
no code implementations • 24 Jun 2016 • Yong Xu, Qiang Huang, Wenwu Wang, Philip J. B. Jackson, Mark D. Plumbley
Compared with the conventional Gaussian Mixture Model (GMM) and support vector machine (SVM) methods, the proposed fully DNN-based method could well utilize the long-term temporal information with the whole chunk as the input.