Search Results for author: Le Yang

Found 36 papers, 15 papers with code

OStr-DARTS: Differentiable Neural Architecture Search based on Operation Strength

1 code implementation • 22 Sep 2024 • Le Yang, Ziwei Zheng, Yizeng Han, Shiji Song, Gao Huang, Fan Li

Differentiable architecture search (DARTS) has emerged as a promising technique for effective neural architecture search, and it mainly contains two steps to find the high-performance architecture: First, the DARTS supernet that consists of mixed operations will be optimized via gradient descent.

Attribute Neural Architecture Search

Rethinking the Architecture Design for Efficient Generic Event Boundary Detection

1 code implementation • 17 Jul 2024 • Ziwei Zheng, Zechuan Zhang, Yulin Wang, Shiji Song, Gao Huang, Le Yang

Generic event boundary detection (GEBD), inspired by human visual cognitive behaviors of consistently segmenting videos into meaningful temporal chunks, finds utility in various applications such as video editing and.

Boundary Detection Generic Event Boundary Detection +1

Fine-grained Dynamic Network for Generic Event Boundary Detection

no code implementations • 5 Jul 2024 • Ziwei Zheng, Lijun He, Le Yang, Fan Li

Generic event boundary detection (GEBD) aims at pinpointing event boundaries naturally perceived by humans, playing a crucial role in understanding long-form videos.

Boundary Detection Generic Event Boundary Detection

DyFADet: Dynamic Feature Aggregation for Temporal Action Detection

1 code implementation • 3 Jul 2024 • Le Yang, Ziwei Zheng, Yizeng Han, Hao Cheng, Shiji Song, Gao Huang, Fan Li

Based on DFA, the proposed dynamic encoder layer aggregates the temporal features within the action time ranges and guarantees the discriminability of the extracted representations.

Action Detection Temporal Action Localization

Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models

no code implementations • 30 May 2024 • Hao Cheng, Erjia Xiao, Jiahang Cao, Le Yang, Kaidi Xu, Jindong Gu, Renjing Xu

Following the advent of the Artificial Intelligence (AI) era of large models, Multimodal Large Language Models (MLLMs) with the ability to understand cross-modal interactions between vision and text have attracted wide attention.

Diversity

EVAN: Evolutional Video Streaming Adaptation via Neural Representation

no code implementations • 15 Apr 2024 • Mufan Liu, Le Yang, Yiling Xu, Ye-kui Wang, Jenq-Neng Hwang

Neural representation for video (NeRV), which embeds the video content into neural network weights, allows video reconstruction with incomplete models.

Video Reconstruction

Privacy-Preserving End-to-End Spoken Language Understanding

no code implementations • 22 Mar 2024 • Yinggui Wang, Wei Huang, Le Yang

Thus, the SLU system needs to ensure that a potential malicious attacker cannot deduce the sensitive attributes of the users, while it should avoid greatly compromising the SLU accuracy.

Privacy Preserving speech-recognition +2

Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model

no code implementations • 29 Feb 2024 • Hao Cheng, Erjia Xiao, Jindong Gu, Le Yang, Jinhao Duan, Jize Zhang, Jiahang Cao, Kaidi Xu, Renjing Xu

Large Vision-Language Models (LVLMs) rely on vision encoders and Large Language Models (LLMs) to exhibit remarkable capabilities on various multi-modal tasks in the joint space of vision and language.

Language Modelling Object Recognition +1

The Random Forest Model for Analyzing and Forecasting the US Stock Market in the Context of Smart Finance

no code implementations • 27 Feb 2024 • Jiajian Zheng, Duan Xin, Qishuo Cheng, Miao Tian, Le Yang

The stock market is a crucial component of the financial market, playing a vital role in wealth accumulation for investors, financing costs for listed companies, and the stable development of the national macroeconomy.

AI-Driven Anonymization: Protecting Personal Data Privacy While Leveraging Machine Learning

no code implementations • 27 Feb 2024 • Le Yang, Miao Tian, Duan Xin, Qishuo Cheng, Jiajian Zheng

It achieves personal data privacy protection and detection through the use of machine learning's differential privacy protection algorithm.

Pursing the Sparse Limitation of Spiking Deep Learning Structures

no code implementations • 18 Nov 2023 • Hao Cheng, Jiahang Cao, Erjia Xiao, Mengshu Sun, Le Yang, Jize Zhang, Xue Lin, Bhavya Kailkhura, Kaidi Xu, Renjing Xu

It posits that within dense neural networks, there exist winning tickets or subnetworks that are sparser but do not compromise performance.

Once-Training-All-Fine: No-Reference Point Cloud Quality Assessment via Domain-relevance Degradation Description

no code implementations • 4 Jul 2023 • Yipeng Liu, Qi Yang, Yujie Zhang, Yiling Xu, Le Yang, Xiaozhong Xu, Shan Liu

Second, to reduce the significant domain discrepancy, we establish an intermediate domain, the description domain, based on insights from subjective experiments, by considering the domain relevance among samples located in the perception domain and learning a structured latent space.

Point Cloud Quality Assessment regression

Fixing Overconfidence in Dynamic Neural Networks

1 code implementation • 13 Feb 2023 • Lassi Meronen, Martin Trapp, Andrea Pilzer, Le Yang, Arno Solin

Dynamic neural networks are a recent technique that promises a remedy for the increasing size of modern deep learning models by dynamically adapting their computational cost to the difficulty of the inputs.

Decision Making Uncertainty Quantification

Proportionate Recursive Maximum Correntropy Criterion Adaptive Filtering Algorithms and their Performance Analysis

no code implementations • 22 Oct 2022 • Zhen Qin, Jun Tao, Le Yang, Ming Jiang

Motivated by the success of our recently proposed proportionate recursive least squares (PRLS) algorithm for sparse system identification, we propose to introduce the proportionate updating (PU) mechanism into the RMCC, leading to two sparsity-aware RMCC algorithms: the proportionate recursive MCC (PRMCC) algorithm and the combinational PRMCC (CPRMCC) algorithm.

TCDM: Transformational Complexity Based Distortion Metric for Perceptual Point Cloud Quality Assessment

1 code implementation • 10 Oct 2022 • Yujie Zhang, Qi Yang, Yifei Zhou, Xiaozhong Xu, Le Yang, Yiling Xu

The goal of objective point cloud quality assessment (PCQA) research is to develop quantitative metrics that measure point cloud quality in a perceptually consistent manner.

Point Cloud Quality Assessment

Structured Attention Composition for Temporal Action Localization

2 code implementations • 20 May 2022 • Le Yang, Junwei Han, Tao Zhao, Nian Liu, Dingwen Zhang

To tackle this issue, we make an early effort to study temporal action localization from the perspective of multi-modality feature learning, based on the observation that different actions exhibit specific preferences to appearance or motion modality.

Action Detection Temporal Action Localization

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars

1 code implementation • CVPR 2022 • Le Yang, Junwei Han, Dingwen Zhang

Based on the exemplar-consultation mechanism, the long-term dependencies can be captured by regarding historical frames as exemplars, while the category-level modeling can be achieved by regarding representative frames from a category as exemplars.

Online Action Detection

Background-Click Supervision for Temporal Action Localization

1 code implementation • 24 Nov 2021 • Le Yang, Junwei Han, Tao Zhao, Tianwei Lin, Dingwen Zhang, Jianxin Chen

Weakly supervised temporal action localization aims at learning the instance-level action pattern from the video-level labels, where a significant challenge is action-context confusion.

Position Weakly-supervised Temporal Action Localization +1

Enhanced Medium Range Order in Vapor Deposited Germania Glasses at Elevated Temperatures

no code implementations • 17 Feb 2021 • Le Yang, Gabriele Vajente, Mariana Fazio, Alena Ananyeva, GariLynn Billingsley, Ashot Markosyan, Riccardo Bassiri, Kiran Prasai, Martin M. Fejer, Carmen S. Menoni

Herein, we show the atomic arrangement of strong network forming GeO2 glass is modified at medium range (< 2 nm) through vapor deposition at elevated temperatures.

Materials Science

Revisiting Locally Supervised Learning: an Alternative to End-to-end Training

1 code implementation • 26 Jan 2021 • Yulin Wang, Zanlin Ni, Shiji Song, Le Yang, Gao Huang

Due to the need to store the intermediate activations for back-propagation, end-to-end (E2E) training of deep networks usually suffers from a high GPU memory footprint.

Revisiting Locally Supervised Training of Deep Neural Networks

no code implementations • ICLR 2021 • Yulin Wang, Zanlin Ni, Shiji Song, Le Yang, Gao Huang

As InfoPro loss is difficult to compute in its original form, we derive a feasible upper bound as a surrogate optimization objective, yielding a simple but effective algorithm.

Structural evolution of binary oxide nanolaminates with annealing and its impact on room-temperature internal friction

no code implementations • 8 Oct 2020 • Le Yang, Mariana Fazio, Gabriele Vajente, Alena Ananyeva, GariLynn Billingsley, Ashot Markosyan, Riccardo Bassiri, Martin M. Fejer, Carmen S. Menoni

Internal friction in oxide thin films imposes a critical limitation to the sensitivity and stability of ultra-high finesse optical cavities for gravitational wave detectors.

Materials Science

Revisiting Anchor Mechanisms for Temporal Action Localization

1 code implementation • 22 Aug 2020 • Le Yang, Houwen Peng, Dingwen Zhang, Jianlong Fu, Junwei Han

To address this problem, this paper proposes a novel anchor-free action localization module that assists action localization by temporal points.

Temporal Action Localization

Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization

no code implementations • 18 Aug 2020 • Tao Zhao, Junwei Han, Le Yang, Dingwen Zhang

The existing methods can be categorized into two localization-by-classification pipelines, i.e., the pre-classification pipeline and the post-classification pipeline.

Classification General Classification +2

Resolution Adaptive Networks for Efficient Inference

2 code implementations • CVPR 2020 • Le Yang, Yizeng Han, Xi Chen, Shiji Song, Jifeng Dai, Gao Huang

Adaptive inference is an effective mechanism to achieve a dynamic tradeoff between accuracy and computational cost in deep networks.

Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization

2 code implementations • 8 Dec 2019 • Songyang Zhang, Houwen Peng, Le Yang, Jianlong Fu, Jiebo Luo

In this report, we introduce the Winner method for HACS Temporal Action Localization Challenge 2019.

Temporal Action Localization

Ensemble Kalman Filtering for Online Gaussian Process Regression and Learning

no code implementations • 9 Jul 2018 • Danil Kuzin, Le Yang, Olga Isupova, Lyudmila Mihaylova

The ensemble Kalman filter reduces the computational complexity required to obtain predictions with Gaussian processes while preserving the accuracy level of these predictions.

Gaussian Processes regression

Reinforcement Cutting-Agent Learning for Video Object Segmentation

no code implementations • CVPR 2018 • Junwei Han, Le Yang, Dingwen Zhang, Xiaojun Chang, Xiaodan Liang

In this paper, we formulate this problem as a Markov Decision Process, where agents are learned to segment object regions under a deep reinforcement learning framework.

Decision Making Object +6

SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos

no code implementations • CVPR 2017 • Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, Junwei Han

Object segmentation in weakly labelled videos is an interesting yet challenging task, which aims at learning to perform category-specific video object segmentation by only using video-level tags.

Object Semantic Segmentation +3
