Search Results for author: Lan Wang

Found 47 papers, 12 papers with code

Multimodal Transformers are Hierarchical Modal-wise Heterogeneous Graphs

no code implementations2 May 2025 Yijie Jin, Junjie Peng, Xuanchao Lin, Haochen Yuan, Lan Wang, Cangzhi Zheng

In this work, from the perspective of efficiency optimization, we propose and prove that MulTs are hierarchical modal-wise heterogeneous graphs (HMHGs), and we introduce the graph-structured representation pattern of MulTs.

Multimodal Sentiment Analysis

Water Quality Data Imputation via A Fast Latent Factorization of Tensors with PID-based Optimizer

no code implementations10 Mar 2025 Qian Liu, Lan Wang, Bing Yang, Hao Wu

Water quality data can supply a substantial decision support for water resources utilization and pollution prevention.

Imputation Missing Values

Generative Zero-Shot Composed Image Retrieval

no code implementations CVPR 2025 Lan Wang, Wei Ao, Vishnu Naresh Boddeti, Ser-Nam Lim

Composed Image Retrieval (CIR) is a vision-language task utilizing queries comprising images and textual descriptions to achieve precise image retrieval.

Image Generation Image Retrieval +1

IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks

1 code implementation21 Dec 2024 Yaming Zhang, Chenqiang Gao, Fangcen Liu, Junjie Guo, Lan Wang, Xinggan Peng, Deyu Meng

By fine-tuning approximately 3% of the backbone parameters, IV-tuning outperforms full fine-tuning across various baselines in infrared-visible semantic segmentation and object detection, as well as previous state-of-the-art methods.

object-detection Object Detection +2

OmniCreator: Self-Supervised Unified Generation with Universal Editing

no code implementations3 Dec 2024 Haodong Chen, Lan Wang, Harry Yang, Ser-Nam Lim

On the other hand, when presented with a text prompt only, OmniCreator becomes generative, producing high-quality video as a result of the semantic correspondence learned.

Denoising Semantic correspondence +2

SEAL: Semantic Attention Learning for Long Video Representation

no code implementations CVPR 2025 Lan Wang, Yujia Chen, Du Tran, Vishnu Naresh Boddeti, Wen-Sheng Chu

Long video understanding presents challenges due to the inherent high computational complexity and redundant temporal information.

Diversity Question Answering +2

Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines

1 code implementation16 Nov 2024 Yixiang Chen, Xinyu Zhang, Jinran Wang, Xurong Xie, Nan Yan, Hui Chen, Lan Wang

The Structured Dialogue System, referred to as SuDoSys, is an innovative Large Language Model (LLM)-based chatbot designed to provide psychological counseling.

Chatbot Language Modeling +3

An End-To-End Stuttering Detection Method Based On Conformer And BILSTM

no code implementations14 Nov 2024 Xiaokang Liu, Changqing Xu, Yudong Yang, Lan Wang, Nan Yan

In the SLT 2024 Stuttering Speech Challenge based on the AS-70 dataset [1], our model improved the mean F1 score by 24. 8% compared to the baseline method and achieved first place.

Event Detection Multi-Task Learning

A Tale of Two Cities: Pessimism and Opportunism in Offline Dynamic Pricing

no code implementations12 Nov 2024 Zeyu Bian, Zhengling Qi, Cong Shi, Lan Wang

We address this challenge by framing the problem to a partial identification framework.

Investigation of unsupervised and supervised hyperspectral anomaly detection

no code implementations13 Aug 2024 Mazharul Hossain, Aaron Robinson, Lan Wang, Chrysanthe Preza

We later utilized a supervised classifier to determine the weights of a voting ensemble, creating a hybrid of heterogeneous unsupervised HS-AD algorithms with a supervised classifier in a model stacking, which improved detection accuracy.

Anomaly Detection Hyperspectral Unmixing

Representation learning with CGAN for casual inference

no code implementations3 Jul 2024 Zhaotian Weng, Jianbo Hong, Lan Wang

Conditional Generative Adversarial Nets (CGAN) is often used to improve conditional image generation performance.

Causal Inference Conditional Image Generation +1

Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition

no code implementations14 Jun 2024 Yicong Jiang, Tianzi Wang, Xurong Xie, Juan Liu, Wei Sun, Nan Yan, Hui Chen, Lan Wang, Xunying Liu, Feng Tian

Disordered speech recognition profound implications for improving the quality of life for individuals afflicted with, for example, dysarthria.

speech-recognition Speech Recognition

Automatic Assessment of Dysarthria Using Audio-visual Vowel Graph Attention Network

no code implementations6 May 2024 Xiaokang Liu, Xiaoxia Du, Juan Liu, Rongfeng Su, Manwa Lawrence Ng, Yumei Zhang, Yudong Yang, Shaofeng Zhao, Lan Wang, Nan Yan

Currently, research on the automatic assessment of dysarthria primarily focuses on two approaches: one that utilizes expert features combined with machine learning, and the other that employs data-driven deep learning methods to extract representations.

Deep Learning Graph Attention

Private Optimal Inventory Policy Learning for Feature-based Newsvendor with Unknown Demand

no code implementations23 Apr 2024 Tuoyi Zhao, Wen-Xin Zhou, Lan Wang

By leveraging the structure of the newsvendor problem, we attain a faster excess population risk bound compared to that obtained from an indiscriminate application of existing results for general nonsmooth convex loss.

parameter estimation Privacy Preserving

FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs

no code implementations22 Mar 2024 Sepehr Dehdashtian, Lan Wang, Vishnu Naresh Boddeti

However, owing to the nature of their training process, these models have the potential to 1) propagate or amplify societal biases in the training data and 2) learn to rely on spurious features.

Fairness

Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions

no code implementations11 Mar 2024 Lan Wang, Vishnu Boddeti, SerNam Lim

While existing video editing tasks are limited to changes in attributes, backgrounds, and styles, our method aims to predict open-ended human action changes in video.

counterfactual Video Editing +1

An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data

no code implementations9 Mar 2024 Yudong Yang, Rongfeng Su, Xiaokang Liu, Nan Yan, Lan Wang

In this model, the inherent acoustic characteristics of individuals related to the tongue motion details are encoded by using wav2vec 2. 0, while the ASR transcriptions related to the universality of tongue motions are encoded by using BERT.

Off-policy Evaluation in Doubly Inhomogeneous Environments

1 code implementation14 Jun 2023 Zeyu Bian, Chengchun Shi, Zhengling Qi, Lan Wang

This work aims to study off-policy evaluation (OPE) under scenarios where two key reinforcement learning (RL) assumptions -- temporal stationarity and individual homogeneity are both violated.

Offline RL Off-policy evaluation

Context-aware Domain Adaptation for Time Series Anomaly Detection

no code implementations15 Apr 2023 Kwei-Herng Lai, Lan Wang, Huiyuan Chen, Kaixiong Zhou, Fei Wang, Hao Yang, Xia Hu

We formulate context sampling into the Markov decision process and exploit deep reinforcement learning to optimize the time series domain adaptation process via context sampling and design a tailored reward function to generate domain-invariant features that better align two domains for anomaly detection.

Anomaly Detection Deep Reinforcement Learning +5

ProTeGe: Untrimmed Pretraining for Video Temporal Grounding by Video Temporal Grounding

no code implementations CVPR 2023 Lan Wang, Gaurav Mittal, Sandra Sajeev, Ye Yu, Matthew Hall, Vishnu Naresh Boddeti, Mei Chen

We present ProTeGe as the first method to perform VTG-based untrimmed pretraining to bridge the gap between trimmed pretrained backbones and downstream VTG tasks.

text similarity

Quantile Off-Policy Evaluation via Deep Conditional Generative Learning

no code implementations29 Dec 2022 Yang Xu, Chengchun Shi, Shikai Luo, Lan Wang, Rui Song

Off-Policy evaluation (OPE) is concerned with evaluating a new target policy using offline data generated by a potentially different behavior policy.

Decision Making Off-policy evaluation +1

Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information

no code implementations23 Dec 2022 Zuyue Fu, Zhengling Qi, Zhuoran Yang, Zhaoran Wang, Lan Wang

To tackle the distributional mismatch, we leverage the idea of pessimism and use our OPE method to develop an off-policy learning algorithm for finding a desirable policy pair for both Alice and Bob.

Decision Making Off-policy evaluation +1

Denoising Self-attentive Sequential Recommendation

no code implementations8 Dec 2022 Huiyuan Chen, Yusan Lin, Menghai Pan, Lan Wang, Chin-Chia Michael Yeh, Xiaoting Li, Yan Zheng, Fei Wang, Hao Yang

Transformer-based sequential recommenders are very powerful for capturing both short-term and long-term sequential item dependencies.

Denoising Sequential Recommendation

Do learned representations respect causal relationships?

1 code implementation CVPR 2022 Lan Wang, Vishnu Naresh Boddeti

Second, we apply NCINet to identify the causal relations between image representations of different pairs of attributes with known and unknown causal relations between the labels.

Attribute Causal Discovery +1

A Robust Statistical Analysis of the Role of Hydropower on the System Electricity Price and Price Volatility

no code implementations4 Mar 2022 Olukunle O. Owolabi, Kathryn Lawson, Sanhita Sengupta, Yingsi Huang, Lan Wang, Chaopeng Shen, Mila Getmansky Sherman, Deborah A. Sunter

Hydroelectric power (hydropower) is unique in that it can function as both a conventional source of electricity and as backup storage (pumped hydroelectric storage) for providing energy in times of high demand on the grid.

quantile regression

Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition

no code implementations24 Jan 2022 Xurong Xie, Xiang Sui, Xunying Liu, Lan Wang

Meanwhile, approaches of multi-accent modelling including multi-style training, multi-accent decision tree state tying, DNN tandem and multi-level adaptive network (MLAN) tandem hidden Markov model (HMM) modelling are combined and compared in this paper.

Acoustic Modelling speech-recognition +1

Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition

no code implementations24 Jan 2022 Xurong Xie, Rukiye Ruzi, Xunying Liu, Lan Wang

Dysarthric speech recognition is a challenging task due to acoustic variability and limited amount of available data.

speech-recognition Speech Recognition

Analysis of animal-related electric outages using species distribution models and community science data

1 code implementation22 Dec 2021 Mei-Ling E. Feng, Olukunle O. Owolabi, Toryn L. J. Schafer, Sanhita Sengupta, Lan Wang, David S. Matteson, Judy P. Che-Castaldo, Deborah A. Sunter

These flexible, species-specific estimates can allow future animal-indicators of grid reliability to be investigated in more diverse regions and ecological communities, providing a better understanding of the variation that exists in animal-outage relationship.

Adversarial Representation Learning With Closed-Form Solvers

1 code implementation12 Sep 2021 Bashir Sadeghi, Lan Wang, Vishnu Naresh Boddeti

Adversarial representation learning aims to learn data representations for a target task while removing unwanted sensitive information at the same time.

Form Representation Learning

Unsupervised Cross-Lingual Speech Emotion Recognition Using Pseudo Multilabel

1 code implementation19 Aug 2021 Jin Li, Nan Yan, Lan Wang

However, cross-lingual SER remains a challenge in real-world applications due to a great difference between the source and target domain distributions.

Speech Emotion Recognition

FDN: Finite Difference Network with Hierarchical Convolutional Features for Text-independent Speaker Verification

1 code implementation18 Aug 2021 Jin Li, Nan Yan, Lan Wang

For example, RawNet and RawNet2 extracted speaker's feature embeddings from waveforms automatically for recognizing their voice, which can vastly reduce the front-end computation and obtain state-of-the-art performance.

Rhythm Text-Independent Speaker Verification

A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition

no code implementations18 Aug 2021 Jin Li, Rongfeng Su, Xurong Xie, Nan Yan, Lan Wang

The shallow stream is used to acquire traditional shallow features that is beneficial for the classification of phones or words while the deep stream is used to obtain utterance-level speaker-invariant deep features for improving the feature diversity.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Event2Graph: Event-driven Bipartite Graph for Multivariate Time-series Anomaly Detection

no code implementations15 Aug 2021 Yuhang Wu, Mengting Gu, Lan Wang, Yusan Lin, Fei Wang, Hao Yang

Modeling inter-dependencies between time-series is the key to achieve high performance in anomaly detection for multivariate time-series data.

Anomaly Detection Time Series +1

Generalization bounds via distillation

no code implementations ICLR 2021 Daniel Hsu, Ziwei Ji, Matus Telgarsky, Lan Wang

This paper theoretically investigates the following empirical phenomenon: given a high-complexity network with poor generalization bounds, one can distill it into a network with nearly identical predictions but low complexity and vastly smaller generalization bounds.

Data Augmentation Generalization Bounds

Critical Risk Indicators (CRIs) for the electric power grid: A survey and discussion of interconnected effects

1 code implementation19 Jan 2021 Judy P. Che-Castaldo, Rémi Cousin, Stefani Daryanto, Grace Deng, Mei-Ling E. Feng, Rajesh K. Gupta, Dezhi Hong, Ryan M. McGranaghan, Olukunle O. Owolabi, Tianyi Qu, Wei Ren, Toryn L. J. Schafer, Ashutosh Sharma, Chaopeng Shen, Mila Getmansky Sherman, Deborah A. Sunter, Lan Wang, David S. Matteson

We also provide relevant critical risk indicators (CRIs) across diverse domains that may influence electric power grid risks, including climate, ecology, hydrology, finance, space weather, and agriculture.

Applications

Bayesian Learning for Deep Neural Network Adaptation

1 code implementation14 Dec 2020 Xurong Xie, Xunying Liu, Tan Lee, Lan Wang

A key task for speech recognition systems is to reduce the mismatch between training and evaluation data that is often attributable to speaker differences.

speech-recognition Speech Recognition +1

Data Augmentation for End-to-end Code-switching Speech Recognition

no code implementations4 Nov 2020 Chenpeng Du, Hao Li, Yizhou Lu, Lan Wang, Yanmin Qian

Training a code-switching end-to-end automatic speech recognition (ASR) model normally requires a large amount of data, while code-switching data is often limited.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Resampling-based Confidence Intervals for Model-free Robust Inference on Optimal Treatment Regimes

no code implementations25 Nov 2019 Yunan Wu, Lan Wang

We first study a smoothed robust estimator that directly targets the parameter corresponding to the Bayes decision rule for optimal treatment regimes estimation.

A Survey of Tuning Parameter Selection for High-dimensional Regression

no code implementations10 Aug 2019 Yunan Wu, Lan Wang

Penalized (or regularized) regression, as represented by Lasso and its variants, has become a standard technique for analyzing high-dimensional data when the number of variables substantially exceeds the sample size.

regression Survey +1

A novel learning-based frame pooling method for Event Detection

no code implementations7 Mar 2016 Lan Wang, Chenqiang Gao, Jiang Liu, Deyu Meng

Detecting complex events in a large video collection crawled from video websites is a challenging task.

Event Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.