1 code implementation • 11 Jun 2025 • Inclusion AI, Biao Gong, Cheng Zou, Chuanyang Zheng, Chunluan Zhou, Canxiang Yan, Chunxiang Jin, Chunjie Shen, Dandan Zheng, Fudong Wang, Furong Xu, Guangming Yao, Jun Zhou, Jingdong Chen, Jianxin Sun, Jiajia Liu, Jianjiang Zhu, Jun Peng, Kaixiang Ji, Kaiyou Song, Kaimeng Ren, Libin Wang, Lixiang Ru, Lele Xie, Longhua Tan, Lyuxin Xue, Lan Wang, Mochen Bai, Ning Gao, Pei Chen, Qingpei Guo, Qinglong Zhang, Qiang Xu, Rui Liu, Ruijie Xiong, Sirui Gao, Tinghao Liu, Taisong Li, Weilong Chai, Xinyu Xiao, Xiaomei Wang, Xiaoxue Chen, Xiao Lu, Xiaoyu Li, Xingning Dong, Xuzheng Yu, Yi Yuan, Yuting Gao, Yunxiao Sun, Yipeng chen, Yifei Wu, Yongjie Lyu, Ziping Ma, Zipeng Feng, Zhijiang Fang, Zhihao Qiu, Ziyuan Huang, Zhengyu He
We propose Ming-Omni, a unified multimodal model capable of processing images, text, audio, and video, while demonstrating strong proficiency in both speech and image generation.
no code implementations • 2 May 2025 • Yijie Jin, Junjie Peng, Xuanchao Lin, Haochen Yuan, Lan Wang, Cangzhi Zheng
In this work, from the perspective of efficiency optimization, we propose and prove that MulTs are hierarchical modal-wise heterogeneous graphs (HMHGs), and we introduce the graph-structured representation pattern of MulTs.
no code implementations • 10 Mar 2025 • Qian Liu, Lan Wang, Bing Yang, Hao Wu
Water quality data can supply a substantial decision support for water resources utilization and pollution prevention.
no code implementations • CVPR 2025 • Lan Wang, Wei Ao, Vishnu Naresh Boddeti, Ser-Nam Lim
Composed Image Retrieval (CIR) is a vision-language task utilizing queries comprising images and textual descriptions to achieve precise image retrieval.
1 code implementation • 21 Dec 2024 • Yaming Zhang, Chenqiang Gao, Fangcen Liu, Junjie Guo, Lan Wang, Xinggan Peng, Deyu Meng
By fine-tuning approximately 3% of the backbone parameters, IV-tuning outperforms full fine-tuning across various baselines in infrared-visible semantic segmentation and object detection, as well as previous state-of-the-art methods.
no code implementations • 7 Dec 2024 • Zebin Wang, Yi Han, Ethan X. Fang, Lan Wang, Junwei Lu
We consider the inference for the ranking of large language models (LLMs).
no code implementations • 3 Dec 2024 • Haodong Chen, Lan Wang, Harry Yang, Ser-Nam Lim
On the other hand, when presented with a text prompt only, OmniCreator becomes generative, producing high-quality video as a result of the semantic correspondence learned.
no code implementations • CVPR 2025 • Lan Wang, Yujia Chen, Du Tran, Vishnu Naresh Boddeti, Wen-Sheng Chu
Long video understanding presents challenges due to the inherent high computational complexity and redundant temporal information.
1 code implementation • 16 Nov 2024 • Yixiang Chen, Xinyu Zhang, Jinran Wang, Xurong Xie, Nan Yan, Hui Chen, Lan Wang
The Structured Dialogue System, referred to as SuDoSys, is an innovative Large Language Model (LLM)-based chatbot designed to provide psychological counseling.
no code implementations • 14 Nov 2024 • Xiaokang Liu, Changqing Xu, Yudong Yang, Lan Wang, Nan Yan
In the SLT 2024 Stuttering Speech Challenge based on the AS-70 dataset [1], our model improved the mean F1 score by 24. 8% compared to the baseline method and achieved first place.
no code implementations • 12 Nov 2024 • Zeyu Bian, Zhengling Qi, Cong Shi, Lan Wang
We address this challenge by framing the problem to a partial identification framework.
no code implementations • 13 Aug 2024 • Mazharul Hossain, Aaron Robinson, Lan Wang, Chrysanthe Preza
We later utilized a supervised classifier to determine the weights of a voting ensemble, creating a hybrid of heterogeneous unsupervised HS-AD algorithms with a supervised classifier in a model stacking, which improved detection accuracy.
no code implementations • 3 Jul 2024 • Zhaotian Weng, Jianbo Hong, Lan Wang
Conditional Generative Adversarial Nets (CGAN) is often used to improve conditional image generation performance.
no code implementations • 14 Jun 2024 • Yicong Jiang, Tianzi Wang, Xurong Xie, Juan Liu, Wei Sun, Nan Yan, Hui Chen, Lan Wang, Xunying Liu, Feng Tian
Disordered speech recognition profound implications for improving the quality of life for individuals afflicted with, for example, dysarthria.
no code implementations • 6 May 2024 • Xiaokang Liu, Xiaoxia Du, Juan Liu, Rongfeng Su, Manwa Lawrence Ng, Yumei Zhang, Yudong Yang, Shaofeng Zhao, Lan Wang, Nan Yan
Currently, research on the automatic assessment of dysarthria primarily focuses on two approaches: one that utilizes expert features combined with machine learning, and the other that employs data-driven deep learning methods to extract representations.
no code implementations • 23 Apr 2024 • Tuoyi Zhao, Wen-Xin Zhou, Lan Wang
By leveraging the structure of the newsvendor problem, we attain a faster excess population risk bound compared to that obtained from an indiscriminate application of existing results for general nonsmooth convex loss.
no code implementations • 22 Mar 2024 • Sepehr Dehdashtian, Lan Wang, Vishnu Naresh Boddeti
However, owing to the nature of their training process, these models have the potential to 1) propagate or amplify societal biases in the training data and 2) learn to rely on spurious features.
no code implementations • 11 Mar 2024 • Lan Wang, Vishnu Boddeti, SerNam Lim
While existing video editing tasks are limited to changes in attributes, backgrounds, and styles, our method aims to predict open-ended human action changes in video.
no code implementations • 9 Mar 2024 • Yudong Yang, Rongfeng Su, Xiaokang Liu, Nan Yan, Lan Wang
In this model, the inherent acoustic characteristics of individuals related to the tongue motion details are encoded by using wav2vec 2. 0, while the ASR transcriptions related to the universality of tongue motions are encoded by using BERT.
1 code implementation • 14 Jun 2023 • Zeyu Bian, Chengchun Shi, Zhengling Qi, Lan Wang
This work aims to study off-policy evaluation (OPE) under scenarios where two key reinforcement learning (RL) assumptions -- temporal stationarity and individual homogeneity are both violated.
no code implementations • 13 Jun 2023 • Lan Wang, Ruiling He, Lili Zhao, Jia Wang, Zhengzi Geng, Tao Ren, Guo Zhang, Peng Zhang, Kaiqiang Tang, Chaofei Gao, Fei Chen, Liting Zhang, Yonghe Zhou, Xin Li, Fanbin He, Hui Huan, Wenjuan Wang, Yunxiao Liang, Juan Tang, Fang Ai, Tingyu Wang, Liyun Zheng, Zhongwei Zhao, Jiansong Ji, Wei Liu, Jiaojiao Xu, Bo Liu, Xuemei Wang, Yao Zhang, Qiong Yan, Muhan Lv, Xiaomei Chen, Shuhua Zhang, Yihua Wang, Yang Liu, Li Yin, Yanni Liu, Yanqing Huang, Yunfang Liu, Kun Wang, Meiqin Su, Li Bian, Ping An, Xin Zhang, Linxue Qian, Shao Li, Xiaolong Qi
Validation analysis revealed that the AUCs of DLRP were 0. 91 for GEV (95% CI 0. 90 to 0. 93, p < 0. 05) and 0. 88 for HRV (95% CI 0. 86 to 0. 89, p < 0. 01), which were significantly and robustly better than canonical risk indicators, including the value of LSM and SSM.
no code implementations • 15 Apr 2023 • Kwei-Herng Lai, Lan Wang, Huiyuan Chen, Kaixiong Zhou, Fei Wang, Hao Yang, Xia Hu
We formulate context sampling into the Markov decision process and exploit deep reinforcement learning to optimize the time series domain adaptation process via context sampling and design a tailored reward function to generate domain-invariant features that better align two domains for anomaly detection.
no code implementations • CVPR 2023 • Lan Wang, Gaurav Mittal, Sandra Sajeev, Ye Yu, Matthew Hall, Vishnu Naresh Boddeti, Mei Chen
We present ProTeGe as the first method to perform VTG-based untrimmed pretraining to bridge the gap between trimmed pretrained backbones and downstream VTG tasks.
no code implementations • 29 Dec 2022 • Yang Xu, Chengchun Shi, Shikai Luo, Lan Wang, Rui Song
Off-Policy evaluation (OPE) is concerned with evaluating a new target policy using offline data generated by a potentially different behavior policy.
no code implementations • 23 Dec 2022 • Zuyue Fu, Zhengling Qi, Zhuoran Yang, Zhaoran Wang, Lan Wang
To tackle the distributional mismatch, we leverage the idea of pessimism and use our OPE method to develop an off-policy learning algorithm for finding a desirable policy pair for both Alice and Bob.
no code implementations • 8 Dec 2022 • Huiyuan Chen, Yusan Lin, Menghai Pan, Lan Wang, Chin-Chia Michael Yeh, Xiaoting Li, Yan Zheng, Fei Wang, Hao Yang
Transformer-based sequential recommenders are very powerful for capturing both short-term and long-term sequential item dependencies.
1 code implementation • CVPR 2022 • Lan Wang, Vishnu Naresh Boddeti
Second, we apply NCINet to identify the causal relations between image representations of different pairs of attributes with known and unknown causal relations between the labels.
no code implementations • 4 Mar 2022 • Olukunle O. Owolabi, Kathryn Lawson, Sanhita Sengupta, Yingsi Huang, Lan Wang, Chaopeng Shen, Mila Getmansky Sherman, Deborah A. Sunter
Hydroelectric power (hydropower) is unique in that it can function as both a conventional source of electricity and as backup storage (pumped hydroelectric storage) for providing energy in times of high demand on the grid.
no code implementations • 24 Jan 2022 • Xurong Xie, Xiang Sui, Xunying Liu, Lan Wang
Meanwhile, approaches of multi-accent modelling including multi-style training, multi-accent decision tree state tying, DNN tandem and multi-level adaptive network (MLAN) tandem hidden Markov model (HMM) modelling are combined and compared in this paper.
no code implementations • 24 Jan 2022 • Xurong Xie, Rukiye Ruzi, Xunying Liu, Lan Wang
Dysarthric speech recognition is a challenging task due to acoustic variability and limited amount of available data.
no code implementations • 13 Jan 2022 • Lan Wang, Yusan Lin, Yuhang Wu, Huiyuan Chen, Fei Wang, Hao Yang
Today's cyber-world is vastly multivariate.
1 code implementation • 22 Dec 2021 • Mei-Ling E. Feng, Olukunle O. Owolabi, Toryn L. J. Schafer, Sanhita Sengupta, Lan Wang, David S. Matteson, Judy P. Che-Castaldo, Deborah A. Sunter
These flexible, species-specific estimates can allow future animal-indicators of grid reliability to be investigated in more diverse regions and ecological communities, providing a better understanding of the variation that exists in animal-outage relationship.
no code implementations • 10 Nov 2021 • Olukunle O. Owolabi, Toryn L. J. Schafer, Georgia E. Smits, Sanhita Sengupta, Sean E. Ryan, Lan Wang, David S. Matteson, Mila Getmansky Sherman, Deborah A. Sunter
After correcting for temporal effects, we found an increase in VRE penetration is associated with decrease in system electricity price in all ISOs studied.
1 code implementation • 7 Oct 2021 • Jin Li, Haibin Liu, Nan Yan, Lan Wang
Symbolic melodies generation is one of the essential tasks for automatic music generation.
1 code implementation • 12 Sep 2021 • Bashir Sadeghi, Lan Wang, Vishnu Naresh Boddeti
Adversarial representation learning aims to learn data representations for a target task while removing unwanted sensitive information at the same time.
1 code implementation • 19 Aug 2021 • Jin Li, Nan Yan, Lan Wang
However, cross-lingual SER remains a challenge in real-world applications due to a great difference between the source and target domain distributions.
1 code implementation • 18 Aug 2021 • Jin Li, Nan Yan, Lan Wang
For example, RawNet and RawNet2 extracted speaker's feature embeddings from waveforms automatically for recognizing their voice, which can vastly reduce the front-end computation and obtain state-of-the-art performance.
no code implementations • 18 Aug 2021 • Jin Li, Rongfeng Su, Xurong Xie, Nan Yan, Lan Wang
The shallow stream is used to acquire traditional shallow features that is beneficial for the classification of phones or words while the deep stream is used to obtain utterance-level speaker-invariant deep features for improving the feature diversity.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+4
no code implementations • 15 Aug 2021 • Yuhang Wu, Mengting Gu, Lan Wang, Yusan Lin, Fei Wang, Hao Yang
Modeling inter-dependencies between time-series is the key to achieve high performance in anomaly detection for multivariate time-series data.
no code implementations • ICLR 2021 • Daniel Hsu, Ziwei Ji, Matus Telgarsky, Lan Wang
This paper theoretically investigates the following empirical phenomenon: given a high-complexity network with poor generalization bounds, one can distill it into a network with nearly identical predictions but low complexity and vastly smaller generalization bounds.
1 code implementation • 19 Jan 2021 • Judy P. Che-Castaldo, Rémi Cousin, Stefani Daryanto, Grace Deng, Mei-Ling E. Feng, Rajesh K. Gupta, Dezhi Hong, Ryan M. McGranaghan, Olukunle O. Owolabi, Tianyi Qu, Wei Ren, Toryn L. J. Schafer, Ashutosh Sharma, Chaopeng Shen, Mila Getmansky Sherman, Deborah A. Sunter, Lan Wang, David S. Matteson
We also provide relevant critical risk indicators (CRIs) across diverse domains that may influence electric power grid risks, including climate, ecology, hydrology, finance, space weather, and agriculture.
Applications
1 code implementation • 14 Dec 2020 • Xurong Xie, Xunying Liu, Tan Lee, Lan Wang
A key task for speech recognition systems is to reduce the mismatch between training and evaluation data that is often attributable to speaker differences.
no code implementations • 4 Nov 2020 • Chenpeng Du, Hao Li, Yizhou Lu, Lan Wang, Yanmin Qian
Training a code-switching end-to-end automatic speech recognition (ASR) model normally requires a large amount of data, while code-switching data is often limited.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+4
no code implementations • 25 Nov 2019 • Yunan Wu, Lan Wang
We first study a smoothed robust estimator that directly targets the parameter corresponding to the Bayes decision rule for optimal treatment regimes estimation.
no code implementations • 10 Aug 2019 • Yunan Wu, Lan Wang
Penalized (or regularized) regression, as represented by Lasso and its variants, has become a standard technique for analyzing high-dimensional data when the number of variables substantially exceeds the sample size.
no code implementations • ECCV 2018 • Lan Wang, Chenqiang Gao, Luyu Yang, Yue Zhao, WangMeng Zuo, Deyu Meng
As a result, using partial data channels to build a full representation of multi-modalities is clearly desired.
no code implementations • 7 Mar 2016 • Lan Wang, Chenqiang Gao, Jiang Liu, Deyu Meng
Detecting complex events in a large video collection crawled from video websites is a challenging task.