1 code implementation • 22 Sep 2024 • Le Yang, Ziwei Zheng, Yizeng Han, Shiji Song, Gao Huang, Fan Li
Differentiable architecture search (DARTS) has emerged as a promising technique for effective neural architecture search, and it mainly contains two steps to find the high-performance architecture: First, the DARTS supernet that consists of mixed operations will be optimized via gradient descent.
1 code implementation • 17 Jul 2024 • Ziwei Zheng, Zechuan Zhang, Yulin Wang, Shiji Song, Gao Huang, Le Yang
Generic event boundary detection (GEBD), inspired by human visual cognitive behaviors of consistently segmenting videos into meaningful temporal chunks, finds utility in various applications such as video editing and.
no code implementations • 5 Jul 2024 • Ziwei Zheng, Lijun He, Le Yang, Fan Li
Generic event boundary detection (GEBD) aims at pinpointing event boundaries naturally perceived by humans, playing a crucial role in understanding long-form videos.
1 code implementation • 3 Jul 2024 • Le Yang, Ziwei Zheng, Yizeng Han, Hao Cheng, Shiji Song, Gao Huang, Fan Li
Based on DFA, the proposed dynamic encoder layer aggregates the temporal features within the action time ranges and guarantees the discriminability of the extracted representations.
Ranked #3 on Temporal Action Localization on HACS
no code implementations • 30 May 2024 • Hao Cheng, Erjia Xiao, Jiahang Cao, Le Yang, Kaidi Xu, Jindong Gu, Renjing Xu
Following the advent of the Artificial Intelligence (AI) era of large models, Multimodal Large Language Models (MLLMs) with the ability to understand cross-modal interactions between vision and text have attracted wide attention.
no code implementations • 15 Apr 2024 • Mufan Liu, Le Yang, Yiling Xu, Ye-kui Wang, Jenq-Neng Hwang
Neural representation for video (NeRV), which embeds the video content into neural network weights, allows video reconstruction with incomplete models.
no code implementations • 22 Mar 2024 • Yinggui Wang, Wei Huang, Le Yang
Thus, the SLU system needs to ensure that a potential malicious attacker cannot deduce the sensitive attributes of the users, while it should avoid greatly compromising the SLU accuracy.
no code implementations • 14 Mar 2024 • Yinggui Wang, Yuanqing Huang, Jianshu Li, Le Yang, Kai Song, Lei Wang
Specifically, face images are masked in the frequency domain using an adaptive MixUp strategy.
no code implementations • 29 Feb 2024 • Hao Cheng, Erjia Xiao, Jindong Gu, Le Yang, Jinhao Duan, Jize Zhang, Jiahang Cao, Kaidi Xu, Renjing Xu
Large Vision-Language Models (LVLMs) rely on vision encoders and Large Language Models (LLMs) to exhibit remarkable capabilities on various multi-modal tasks in the joint space of vision and language.
no code implementations • 27 Feb 2024 • Jiajian Zheng, Duan Xin, Qishuo Cheng, Miao Tian, Le Yang
The stock market is a crucial component of the financial market, playing a vital role in wealth accumulation for investors, financing costs for listed companies, and the stable development of the national macroeconomy.
no code implementations • 27 Feb 2024 • Le Yang, Miao Tian, Duan Xin, Qishuo Cheng, Jiajian Zheng
It achieves personal data privacy protection and detection through the use of machine learning's differential privacy protection algorithm.
no code implementations • 25 Feb 2024 • Qishuo Cheng, Le Yang, Jiajian Zheng, Miao Tian, Duan Xin
Finally, in this paper, the strategy is implemented by selecting the assets and actions with the largest Q value.
no code implementations • 18 Nov 2023 • Hao Cheng, Jiahang Cao, Erjia Xiao, Mengshu Sun, Le Yang, Jize Zhang, Xue Lin, Bhavya Kailkhura, Kaidi Xu, Renjing Xu
It posits that within dense neural networks, there exist winning tickets or subnetworks that are sparser but do not compromise performance.
no code implementations • 4 Jul 2023 • Yipeng Liu, Qi Yang, Yujie Zhang, Yiling Xu, Le Yang, Xiaozhong Xu, Shan Liu
Second, to reduce the significant domain discrepancy, we establish an intermediate domain, the description domain, based on insights from subjective experiments, by considering the domain relevance among samples located in the perception domain and learning a structured latent space.
1 code implementation • 13 Feb 2023 • Lassi Meronen, Martin Trapp, Andrea Pilzer, Le Yang, Arno Solin
Dynamic neural networks are a recent technique that promises a remedy for the increasing size of modern deep learning models by dynamically adapting their computational cost to the difficulty of the inputs.
no code implementations • 3 Feb 2023 • Chaowei Fang, Dingwen Zhang, Wen Zheng, Xue Li, Le Yang, Lechao Cheng, Junwei Han
We set up novel evaluation benchmarks based on a series of testing sets with evolving distributions.
Ranked #65 on Long-tail Learning on CIFAR-100-LT (ρ=100)
no code implementations • 22 Oct 2022 • Zhen Qin, Jun Tao, Le Yang, Ming Jiang
Motivated by the success of our recently proposed proportionate recursive least squares (PRLS) algorithm for sparse system identification, we propose to introduce the proportionate updating (PU) mechanism into the RMCC, leading to two sparsity-aware RMCC algorithms: the proportionate recursive MCC (PRMCC) algorithm and the combinational PRMCC (CPRMCC) algorithm.
1 code implementation • 10 Oct 2022 • Yujie Zhang, Qi Yang, Yifei Zhou, Xiaozhong Xu, Le Yang, Yiling Xu
The goal of objective point cloud quality assessment (PCQA) research is to develop quantitative metrics that measure point cloud quality in a perceptually consistent manner.
2 code implementations • 20 May 2022 • Le Yang, Junwei Han, Tao Zhao, Nian Liu, Dingwen Zhang
To tackle this issue, we make an early effort to study temporal action localization from the perspective of multi-modality feature learning, based on the observation that different actions exhibit specific preferences to appearance or motion modality.
1 code implementation • CVPR 2022 • Le Yang, Junwei Han, Dingwen Zhang
Based on the exemplar-consultation mechanism, the long-term dependencies can be captured by regarding historical frames as exemplars, while the category-level modeling can be achieved by regarding representative frames from a category as exemplars.
Ranked #6 on Online Action Detection on TVSeries
1 code implementation • 24 Nov 2021 • Le Yang, Junwei Han, Tao Zhao, Tianwei Lin, Dingwen Zhang, Jianxin Chen
Weakly supervised temporal action localization aims at learning the instance-level action pattern from the video-level labels, where a significant challenge is action-context confusion.
1 code implementation • CVPR 2021 • Le Yang, Haojun Jiang, Ruojin Cai, Yulin Wang, Shiji Song, Gao Huang, Qi Tian
Reusing features in deep networks through dense connectivity is an effective way to achieve high computational efficiency.
no code implementations • 17 Feb 2021 • Le Yang, Gabriele Vajente, Mariana Fazio, Alena Ananyeva, GariLynn Billingsley, Ashot Markosyan, Riccardo Bassiri, Kiran Prasai, Martin M. Fejer, Carmen S. Menoni
Herein, we show the atomic arrangement of strong network forming GeO2 glass is modified at medium range (< 2 nm) through vapor deposition at elevated temperatures.
Materials Science
no code implementations • 9 Feb 2021 • Yizeng Han, Gao Huang, Shiji Song, Le Yang, Honghui Wang, Yulin Wang
Dynamic neural network is an emerging research topic in deep learning.
1 code implementation • 26 Jan 2021 • Yulin Wang, Zanlin Ni, Shiji Song, Le Yang, Gao Huang
Due to the need to store the intermediate activations for back-propagation, end-to-end (E2E) training of deep networks usually suffers from high GPUs memory footprint.
no code implementations • ICLR 2021 • Yulin Wang, Zanlin Ni, Shiji Song, Le Yang, Gao Huang
As InfoPro loss is difficult to compute in its original form, we derive a feasible upper bound as a surrogate optimization objective, yielding a simple but effective algorithm.
1 code implementation • 22 Dec 2020 • Yipeng Liu, Qi Yang, Yiling Xu, Le Yang
Full-reference (FR) point cloud quality assessment (PCQA) has achieved impressive progress in recent years.
Ranked #5 on Point Cloud Quality Assessment on WPC
1 code implementation • NeurIPS 2020 • Yulin Wang, Kangchen Lv, Rui Huang, Shiji Song, Le Yang, Gao Huang
The accuracy of deep convolutional neural networks (CNNs) generally improves when fueled with high resolution images.
no code implementations • 8 Oct 2020 • Le Yang, Mariana Fazio, Gabriele Vajente, Alena Ananyeva, GariLynn Billingsley, Ashot Markosyan, Riccardo Bassiri, Martin M. Fejer, Carmen S. Menoni
Internal friction in oxide thin films imposes a critical limitation to the sensitivity and stability of ultra-high finesse optical cavities for gravitational wave detectors.
Materials Science
1 code implementation • 22 Aug 2020 • Le Yang, Houwen Peng, Dingwen Zhang, Jianlong Fu, Junwei Han
To address this problem, this paper proposes a novel anchor-free action localization module that assists action localization by temporal points.
no code implementations • 18 Aug 2020 • Tao Zhao, Junwei Han, Le Yang, Dingwen Zhang
The existing methods can be categorized into two localization-by-classification pipelines, i. e., the pre-classification pipeline and the post-classification pipeline.
2 code implementations • CVPR 2020 • Le Yang, Yizeng Han, Xi Chen, Shiji Song, Jifeng Dai, Gao Huang
Adaptive inference is an effective mechanism to achieve a dynamic tradeoff between accuracy and computational cost in deep networks.
2 code implementations • 8 Dec 2019 • Songyang Zhang, Houwen Peng, Le Yang, Jianlong Fu, Jiebo Luo
In this report, we introduce the Winner method for HACS Temporal Action Localization Challenge 2019.
no code implementations • 9 Jul 2018 • Danil Kuzin, Le Yang, Olga Isupova, Lyudmila Mihaylova
The ensemble Kalman filter reduces the computational complexity required to obtain predictions with Gaussian processes preserving the accuracy level of these predictions.
no code implementations • CVPR 2018 • Junwei Han, Le Yang, Dingwen Zhang, Xiaojun Chang, Xiaodan Liang
In this paper, we formulate this problem as a Markov Decision Process, where agents are learned to segment object regions under a deep reinforcement learning framework.
no code implementations • CVPR 2017 • Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, Junwei Han
Object segmentation in weakly labelled videos is an interesting yet challenging task, which aims at learning to perform category-specific video object segmentation by only using video-level tags.