Search Results for author: Yanan sun

Found 67 papers, 27 papers with code

CN: Channel Normalization For Point Cloud Recognition

no code implementations ECCV 2020 Zetong Yang, Yanan sun, Shu Liu, Xiaojuan Qi, Jiaya Jia

In 3D recognition, to fuse multi-scale structure information, existing methods apply hierarchical frameworks stacked by multiple fusion layers for integrating current relative locations with structure information from the previous level.

Rethinking the Stability-Plasticity Trade-off in Continual Learning from an Architectural Perspective

1 code implementation4 Jun 2025 Aojun Lu, Hangjie Yuan, Tao Feng, Yanan sun

While numerous CL methods aim to achieve this trade-off, they often overlook the impact of network architecture on stability and plasticity, restricting the trade-off to the parameter level.

Continual Learning

Adapt before Continual Learning

1 code implementation4 Jun 2025 Aojun Lu, Tao Feng, Hangjie Yuan, Chunhui Ding, Yanan sun

Continual Learning (CL) seeks to enable neural networks to incrementally acquire new knowledge (plasticity) while retaining existing knowledge (stability).

Continual Learning

LAS: Loss-less ANN-SNN Conversion for Fully Spike-Driven Large Language Models

1 code implementation14 May 2025 Long Chen, Xiaotian Song, Yanan sun

Experimental results on six language models and two vision-language models demonstrate that LAS achieves loss-less conversion.

Detect Anything 3D in the Wild

1 code implementation10 Apr 2025 Hanxue Zhang, Haoran Jiang, Qingsong Yao, Yanan sun, Renrui Zhang, Hao Zhao, Hongyang Li, Hongzi Zhu, Zetong Yang

Despite the success of deep learning in close-set 3D object detection, existing approaches struggle with zero-shot generalization to novel objects and camera configurations.

3D Object Detection Autonomous Driving +3

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

no code implementations25 Mar 2025 Kexian Tang, Junyao Gao, Yanhong Zeng, Haodong Duan, Yanan sun, Zhening Xing, Wenran Liu, Kaifeng Lyu, Kai Chen

Multi-step spatial reasoning entails understanding and reasoning about spatial relationships across multiple sequential steps, which is crucial for tackling complex real-world applications, such as robotic manipulation, autonomous navigation, and automated assembly.

Autonomous Navigation Question Answering +1

FaceShot: Bring Any Character into Life

no code implementations2 Mar 2025 Junyao Gao, Yanan sun, Fei Shen, Xin Jiang, Zhening Xing, Kai Chen, Cairong Zhao

With this powerful generalization capability, FaceShot can significantly extend the application of portrait animation by breaking the limitation of realistic portrait landmark detection for any stylized character and driven video.

Portrait Animation

FAS: Fast ANN-SNN Conversion for Spiking Large Language Models

1 code implementation6 Feb 2025 Long Chen, Xiaotian Song, Andy Song, Badong Chen, Jiancheng Lv, Yanan sun

Existing methods for creating Spiking LLMs, i. e., direct training and ANN-SNN conversion, often suffer from performance degradation and relatively high computational costs.

E-3SFC: Communication-Efficient Federated Learning with Double-way Features Synthesizing

1 code implementation5 Feb 2025 Yuhao Zhou, Yuxin Tian, Mingjia Shi, Yuanxi Li, Yanan sun, Qing Ye, Jiancheng Lv

Specifically, we propose a systematical algorithm termed Extended Single-Step Synthetic Features Compressing (E-3SFC), which consists of three sub-components, i. e., the Single-Step Synthetic Features Compressor (3SFC), a double-way compression algorithm, and a communication budget scheduler.

Federated Learning Scheduling

Test-time Correction with Human Feedback: An Online 3D Detection System via Visual Prompting

no code implementations10 Dec 2024 Zetong Yang, Hanxue Zhang, Yanan sun, Li Chen, Fei Xia, Fatma Güney, Hongyang Li

This paper introduces Test-time Correction (TTC) system, a novel online 3D detection system designated for online correction of test-time errors via human feedback, to guarantee the safety of deployed autonomous driving systems.

Autonomous Driving Visual Prompting

An Architectural Approach to Enhance Deep Long-Tailed Learning

no code implementations9 Nov 2024 Yuhan Pan, Yanan sun, Wei Gong

This ensures that the architecture obtained through our search process incorporates superior components.

StyleShot: A Snapshot on Any Style

2 code implementations1 Jul 2024 Junyao Gao, Yanchen Liu, Yanan sun, Yinhao Tang, Yanhong Zeng, Kai Chen, Cairong Zhao

In this paper, we show that, a good style representation is crucial and sufficient for generalized style transfer without test-time tuning.

Image Generation Style Transfer

AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation

1 code implementation27 Jun 2024 Yanan sun, Yanchen Liu, Yinhao Tang, Wenjie Pei, Kai Chen

To address these challenges, we propose AnyControl, a multi-control image synthesis framework that supports arbitrary combinations of diverse control signals.

Text to Image Generation Text-to-Image Generation

CAP: A Context-Aware Neural Predictor for NAS

1 code implementation4 Jun 2024 Han Ji, Yuqi Feng, Yanan sun

Neural predictors are effective in boosting the time-consuming performance evaluation stage in neural architecture search (NAS), owing to their direct estimation of unseen architectures.

Neural Architecture Search

Towards Accurate and Robust Architectures via Neural Architecture Search

no code implementations CVPR 2024 Yuwei Ou, Yuqi Feng, Yanan sun

To defend deep neural networks from adversarial attacks, adversarial training has been drawing increasing attention for its effectiveness.

Neural Architecture Search

Revisiting Neural Networks for Continual Learning: An Architectural Perspective

1 code implementation23 Apr 2024 Aojun Lu, Tao Feng, Hangjie Yuan, Xiaotian Song, Yanan sun

This paper seeks to bridge this gap between network architecture design and CL, and to present a holistic study on the impact of network architectures on CL.

Class Incremental Learning Incremental Learning

Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models

no code implementations20 Mar 2024 Chengzhe Feng, Yanan sun, Ke Li, Pan Zhou, Jiancheng Lv, Aojun Lu

We conduct GenAP on three popular code intelligence PLMs with three canonical code intelligence tasks including defect prediction, code summarization, and code translation.

Code Summarization Code Translation +1

A First Step Towards Runtime Analysis of Evolutionary Neural Architecture Search

no code implementations22 Jan 2024 Zeqiong Lv, Chao Qian, Yanan sun

Evolutionary neural architecture search (ENAS) employs evolutionary algorithms to find high-performing neural architectures automatically, and has achieved great success.

Binary Classification Evolutionary Algorithms +1

Visual Point Cloud Forecasting enables Scalable Autonomous Driving

1 code implementation CVPR 2024 Zetong Yang, Li Chen, Yanan sun, Hongyang Li

To resolve this, we bring up a new pre-training task termed as visual point cloud forecasting - predicting future point clouds from historical visual input.

3D geometry Motion Forecasting

Exploring the Robustness of Decentralized Training for Large Language Models

no code implementations1 Dec 2023 Lin Lu, Chenxi Dai, Wangcheng Tao, Binhang Yuan, Yanan sun, Pan Zhou

Decentralized training of large language models has emerged as an effective way to democratize this technology.

Federated Learning

TL-nvSRAM-CIM: Ultra-High-Density Three-Level ReRAM-Assisted Computing-in-nvSRAM with DC-Power Free Restore and Ternary MAC Operations

no code implementations6 Jul 2023 Dengfeng Wang, Liukai Xu, Songyuan Liu, Zhi Li, Yiming Chen, Weifeng He, Xueqing Li, Yanan sun

Accommodating all the weights on-chip for large-scale NNs remains a great challenge for SRAM based computing-in-memory (SRAM-CIM) with limited on-chip capacity.

UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks

no code implementations7 Jun 2023 Yanan sun, Zihan Zhong, Qi Fan, Chi-Keung Tang, Yu-Wing Tai

Our thorough studies validate that models pre-trained as such can learn rich representations of both modalities, improving their ability to understand how images and text relate to each other.

Semantic Segmentation

Differentiable Genetic Programming for High-dimensional Symbolic Regression

no code implementations18 Apr 2023 Peng Zeng, Xiaotian Song, Andrew Lensen, Yuwei Ou, Yanan sun, Mengjie Zhang, Jiancheng Lv

With these designs, the proposed DGP method can efficiently search for the GP trees with higher performance, thus being capable of dealing with high-dimensional SR. To demonstrate the effectiveness of DGP, we conducted various experiments against the state of the arts based on both GP and deep neural networks.

Interpretable Machine Learning regression +2

Efficient Evaluation Methods for Neural Architecture Search: A Survey

no code implementations14 Jan 2023 Xiaotian Song, Xiangning Xie, Zeqiong Lv, Gary G. Yen, Weiping Ding, Jiancheng Lv, Yanan sun

In surveying each category, we further discuss the design principles and analyze the strengths and weaknesses to clarify the landscape of existing EEMs, thus making easily understanding the research trends of EEMs.

Neural Architecture Search Survey

Differentiable Search of Accurate and Robust Architectures

no code implementations28 Dec 2022 Yuwei Ou, Xiangning Xie, Shangce Gao, Yanan sun, Kay Chen Tan, Jiancheng Lv

Deep neural networks (DNNs) are found to be vulnerable to adversarial attacks, and various methods have been proposed for the defense.

DAS: Neural Architecture Search via Distinguishing Activation Score

no code implementations23 Dec 2022 Yuqiao Liu, Haipeng Li, Yanan sun, Shuaicheng Liu

NAS without training (WOT) score is such a metric, which estimates the final trained accuracy of the architecture through the ability to distinguish different inputs in the activation layer.

Neural Architecture Search

H-VFI: Hierarchical Frame Interpolation for Videos with Large Motions

no code implementations21 Nov 2022 Changlin Li, Guangyang Wu, Yanan sun, Xin Tao, Chi-Keung Tang, Yu-Wing Tai

The learnt deformable kernel is then utilized in convolving the input frames for predicting the interpolated frame.

Video Frame Interpolation

Bridge the Gap Between Architecture Spaces via A Cross-Domain Predictor

2 code implementations NeurIPS 2022 Yuqiao Liu, Yehui Tang ~Yehui_Tang1, Zeqiong Lv, Yunhe Wang, Yanan sun

To solve this issue, we propose a Cross-Domain Predictor (CDP), which is trained based on the existing NAS benchmark datasets (e. g., NAS-Bench-101), but can be used to find high-performance architectures in large-scale search spaces.

Neural Architecture Search

Analyzing the Expected Hitting Time of Evolutionary Computation-based Neural Architecture Search Algorithms

no code implementations11 Oct 2022 Zeqiong Lv, Chao Qian, Gary G. Yen, Yanan sun

Evolutionary computation-based neural architecture search (ENAS) is a popular technique for automating architecture design of deep neural networks.

Neural Architecture Search

Continuously Controllable Facial Expression Editing in Talking Face Videos

no code implementations17 Sep 2022 Zhiyao Sun, Yu-Hui Wen, Tian Lv, Yanan sun, Ziyang Zhang, Yaoyuan Wang, Yong-Jin Liu

In this paper, we propose a high-quality facial expression editing method for talking face videos, allowing the user to control the target emotion in the edited video continuously.

Image-to-Image Translation Video Generation

Architecture Augmentation for Performance Predictor Based on Graph Isomorphism

no code implementations3 Jul 2022 Xiangning Xie, Yuqiao Liu, Yanan sun, Mengjie Zhang, Kay Chen Tan

Performance predictors can greatly alleviate the prohibitive cost of NAS by directly predicting the performance of DNNs.

Neural Architecture Search

Human Instance Matting via Mutual Guidance and Multi-Instance Refinement

1 code implementation CVPR 2022 Yanan sun, Chi-Keung Tang, Yu-Wing Tai

A new instance matting metric called instance matting quality (IMQ) is proposed, which addresses the absence of a unified and fair means of evaluation emphasizing both instance recognition and matting quality.

Image Matting Instance Segmentation +1

Automating Neural Architecture Design without Search

no code implementations21 Apr 2022 Zixuan Liang, Yanan sun

Specifically, the proposed approach is built by learning the knowledge of high-level experts in designing state-of-the-art architectures, and then the new architecture is directly generated upon the knowledge learned.

Graph Neural Network Link Prediction

Dynamic Neural Textures: Generating Talking-Face Videos with Continuously Controllable Expressions

no code implementations13 Apr 2022 Zipeng Ye, Zhiyao Sun, Yu-Hui Wen, Yanan sun, Tian Lv, Ran Yi, Yong-Jin Liu

In this paper, we propose a method to generate talking-face videos with continuously controllable expressions in real-time.

Video Generation

A Unified Query-based Paradigm for Point Cloud Understanding

1 code implementation CVPR 2022 Zetong Yang, Li Jiang, Yanan sun, Bernt Schiele, Jiaya Jia

This is achieved by introducing an intermediate representation, i. e., Q-representation, in the querying stage to serve as a bridge between the embedding stage and task heads.

Autonomous Driving object-detection +2

Evolving Deep Neural Networks for Collaborative Filtering

no code implementations15 Nov 2021 Yuhan Fang, Yuqiao Liu, Yanan sun

As a consequence, it requires the designers to develop expertise in both CF and DNNs, which limits the application of deep learning methods in CF and the accuracy of recommended results.

Collaborative Filtering Recommendation Systems

BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search

1 code implementation9 Aug 2021 Xiangning Xie, Yuqiao Liu, Yanan sun, Gary G. Yen, Bing Xue, Mengjie Zhang

The paper conducts efficient comparison experiments on eight ENAS algorithms with high GPU utilization on this platform.

Benchmarking Neural Architecture Search

Homogeneous Architecture Augmentation for Neural Predictor

1 code implementation ICCV 2021 Yuqiao Liu, Yehui Tang, Yanan sun

Specifically, a homogeneous architecture augmentation algorithm is proposed in HAAP to generate sufficient training data taking the use of homogeneous representation.

Neural Architecture Search

Autoregressive Stylized Motion Synthesis With Generative Flow

no code implementations CVPR 2021 Yu-Hui Wen, Zhipeng Yang, Hongbo Fu, Lin Gao, Yanan sun, Yong-Jin Liu

Motion style transfer is an important problem in many computer graphics and computer vision applications, including human animation, games, and robotics.

Human Animation Motion Style Transfer +1

Heart-Darts: Classification of Heartbeats Using Differentiable Architecture Search

no code implementations3 May 2021 Jindi Lv, Qing Ye, Yanan sun, Juan Zhao, Jiancheng Lv

In this paper, we propose a novel approach, Heart-Darts, to efficiently classify the ECG signals by automatically designing the CNN model with the differentiable architecture search (i. e., Darts, a cell-based neural architecture search method).

Arrhythmia Detection Classification +4

Deep Video Matting via Spatio-Temporal Alignment and Aggregation

1 code implementation CVPR 2021 Yanan sun, Guanzhi Wang, Qiao Gu, Chi-Keung Tang, Yu-Wing Tai

Despite the significant progress made by deep learning in natural image matting, there has been so far no representative work on deep learning for video matting due to the inherent technical challenges in reasoning temporal domain and lack of large-scale video matting datasets.

Decoder Deep Learning +3

Semantic Image Matting

1 code implementation CVPR 2021 Yanan sun, Chi-Keung Tang, Yu-Wing Tai

Specifically, we consider and learn 20 classes of matting patterns, and propose to extend the conventional trimap to semantic trimap.

Semantic Image Matting Transparent objects

A Novel Training Protocol for Performance Predictors of Evolutionary Neural Architecture Search Algorithms

no code implementations30 Aug 2020 Yanan Sun, Xian Sun, Yuhan Fang, Gary Yen

Performance predictors are a type of regression models which can assist to accomplish the search, while without exerting much computational resource.

Neural Architecture Search regression

Evolving Deep Convolutional Neural Networks for Hyperspectral Image Denoising

no code implementations15 Aug 2020 Yuqiao Liu, Yanan sun, Bing Xue, Mengjie Zhang

Hyperspectral images (HSIs) are susceptible to various noise factors leading to the loss of information, and the noise restricts the subsequent HSIs object detection and classification tasks.

Hyperspectral Image Denoising Image Denoising +2

DBS: Dynamic Batch Size For Distributed Deep Neural Network Training

1 code implementation23 Jul 2020 Qing Ye, Yuhao Zhou, Mingjia Shi, Yanan sun, Jiancheng Lv

Specifically, the performance of each worker is evaluatedfirst based on the fact in the previous epoch, and then the batch size and datasetpartition are dynamically adjusted in consideration of the current performanceof the worker, thereby improving the utilization of the cluster.

3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Face Photos

1 code implementation15 Mar 2020 Zipeng Ye, Mengfei Xia, Yanan sun, Ran Yi, MinJing Yu, Juyong Zhang, Yu-Kun Lai, Yong-Jin Liu

The most challenging issue for our system is that the source domain of face photos (characterized by normal 2D faces) is significantly different from the target domain of 3D caricatures (characterized by 3D exaggerated face shapes and textures).

Caricature

3DSSD: Point-based 3D Single Stage Object Detector

2 code implementations CVPR 2020 Zetong Yang, Yanan sun, Shu Liu, Jiaya Jia

Our method outperforms all state-of-the-art voxel-based single stage methods by a large margin, and has comparable performance to two stage point-based methods as well, with inference speed more than 25 FPS, 2x faster than former state-of-the-art point-based methods.

Object

ArcText: A Unified Text Approach to Describing Convolutional Neural Network Architectures

no code implementations16 Feb 2020 Yanan Sun, Ziyao Ren, Gary G. Yen, Bing Xue, Mengjie Zhang, Jiancheng Lv

Data mining on existing CNN can discover useful patterns and fundamental sub-comments from their architectures, providing researchers with strong prior knowledge to design proper CNN architectures when they have no expertise in CNNs.

Evolving Deep Neural Networks by Multi-objective Particle Swarm Optimization for Image Classification

1 code implementation21 Mar 2019 Bin Wang, Yanan sun, Bing Xue, Mengjie Zhang

In recent years, convolutional neural networks (CNNs) have become deeper in order to achieve better classification accuracy in image classification.

Classification General Classification +2

A Hybrid GA-PSO Method for Evolving Architecture and Short Connections of Deep Convolutional Neural Networks

no code implementations10 Mar 2019 Bin Wang, Yanan sun, Bing Xue, Mengjie Zhang

Three major contributions of this work are: Firstly, a new encoding strategy is proposed to encode a CNN, where the architecture and the shortcut connections are encoded separately; Secondly, a hybrid two-level EC method, which combines particle swarm optimisation and genetic algorithms, is developed to search for the optimal CNNs; Lastly, an adjustable learning rate is introduced for the fitness evaluations, which provides a better learning rate for the training process given a fixed number of epochs.

General Classification image-classification +1

Automatically Evolving CNN Architectures Based on Blocks

no code implementations28 Oct 2018 Yanan Sun, Bing Xue, Mengjie Zhang, Gary G. Yen

The proposed algorithm is evaluated on CIFAR10 and CIFAR100 against 18 state-of-the-art peer competitors.

General Classification

A Hybrid Differential Evolution Approach to Designing Deep Convolutional Neural Networks for Image Classification

no code implementations20 Aug 2018 Bin Wang, Yanan sun, Bing Xue, Mengjie Zhang

In this paper, a new hybrid differential evolution (DE) algorithm with a newly added crossover operator is proposed to evolve the architectures of CNNs of any lengths, which is named DECNN.

General Classification image-classification +1

Automatically designing CNN architectures using genetic algorithm for image classification

4 code implementations11 Aug 2018 Yanan Sun, Bing Xue, Mengjie Zhang, Gary G. Yen

Convolutional Neural Networks (CNNs) have gained a remarkable success on many image classification tasks in recent years.

Classification General Classification +2

Evolving Deep Convolutional Neural Networks by Variable-length Particle Swarm Optimization for Image Classification

no code implementations17 Mar 2018 Bin Wang, Yanan sun, Bing Xue, Mengjie Zhang

Convolutional neural networks (CNNs) are one of the most effective deep learning methods to solve image classification problems, but the best architecture of a CNN to solve a specific problem can be extremely complicated and hard to design.

General Classification image-classification +1

IGD Indicator-based Evolutionary Algorithm for Many-objective Optimization Problems

no code implementations24 Feb 2018 Yanan Sun, Gary G. Yen, Zhang Yi

Inverted Generational Distance (IGD) has been widely considered as a reliable performance indicator to concurrently quantify the convergence and diversity of multi- and many-objective evolutionary algorithms.

Diversity Evolutionary Algorithms

Improved Regularity Model-based EDA for Many-objective Optimization

no code implementations24 Feb 2018 Yanan Sun, Gary G. Yen, Zhang Yi

Finally, by assigning the Pareto-optimal solutions to the uniformly distributed reference vectors, a set of solutions with excellent diversity and convergence is obtained.

Dimensionality Reduction Diversity +1

Evolving Unsupervised Deep Neural Networks for Learning Meaningful Representations

no code implementations13 Dec 2017 Yanan Sun, Gary G. Yen, Zhang Yi

Specifically, error classification rate on MNIST with $1. 15\%$ is reached by the proposed algorithm consistently, which is a very promising result against state-of-the-art unsupervised DL algorithms.

Evolutionary Algorithms General Classification +1

A Particle Swarm Optimization-based Flexible Convolutional Auto-Encoder for Image Classification

1 code implementation13 Dec 2017 Yanan Sun, Bing Xue, Mengjie Zhang, Gary G. Yen

Convolutional auto-encoders have shown their remarkable performance in stacking to deep convolutional neural networks for classifying image data during past several years.

General Classification image-classification +1

Evolving Deep Convolutional Neural Networks for Image Classification

1 code implementation30 Oct 2017 Yanan Sun, Bing Xue, Mengjie Zhang, Gary G. Yen

Evolutionary computation methods have been successfully applied to neural networks since two decades ago, while those methods cannot scale well to the modern deep neural networks due to the complicated architectures and large quantities of connection weights.

Classification General Classification +3

Cannot find the paper you are looking for? You can Submit a new open access paper.