Search Results for author: Jia Ning

Found 7 papers, 5 papers with code

BCN: Batch Channel Normalization for Image Classification

1 code implementation1 Dec 2023 Afifa Khaled, Chao Li, Jia Ning, Kun He

Normalization techniques have been widely used in the field of deep learning due to their capability of enabling higher learning rates and are less careful in initialization.

Classification Image Classification

GMConv: Modulating Effective Receptive Fields for Convolutional Kernels

no code implementations9 Feb 2023 Qi Chen, Chao Li, Jia Ning, Stephen Lin, Kun He

Inspired by the property that ERFs typically exhibit a Gaussian distribution, we propose a Gaussian Mask convolutional kernel (GMConv) in this work.

Image Classification object-detection +1

All in Tokens: Unifying Output Space of Visual Tasks via Soft Token

1 code implementation ICCV 2023 Jia Ning, Chen Li, Zheng Zhang, Zigang Geng, Qi Dai, Kun He, Han Hu

With these new techniques and other designs, we show that the proposed general-purpose task-solver can perform both instance segmentation and depth estimation well.

Instance Segmentation Monocular Depth Estimation +1

Enhancing the Robustness, Efficiency, and Diversity of Differentiable Architecture Search

no code implementations10 Apr 2022 Chao Li, Jia Ning, Han Hu, Kun He

Differentiable architecture search (DARTS) has attracted much attention due to its simplicity and significant improvement in efficiency.

Swin Transformer V2: Scaling Up Capacity and Resolution

19 code implementations CVPR 2022 Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao, Zhenda Xie, Yixuan Wei, Jia Ning, Yue Cao, Zheng Zhang, Li Dong, Furu Wei, Baining Guo

Three main techniques are proposed: 1) a residual-post-norm method combined with cosine attention to improve training stability; 2) A log-spaced continuous position bias method to effectively transfer models pre-trained using low-resolution images to downstream tasks with high-resolution inputs; 3) A self-supervised pre-training method, SimMIM, to reduce the needs of vast labeled images.

Ranked #4 on Image Classification on ImageNet V2 (using extra training data)

Action Classification Image Classification +3

Video Swin Transformer

14 code implementations CVPR 2022 Ze Liu, Jia Ning, Yue Cao, Yixuan Wei, Zheng Zhang, Stephen Lin, Han Hu

The vision community is witnessing a modeling shift from CNNs to Transformers, where pure Transformer architectures have attained top accuracy on the major video recognition benchmarks.

Ranked #28 on Action Classification on Kinetics-600 (using extra training data)

Action Classification Action Recognition +5

Cannot find the paper you are looking for? You can Submit a new open access paper.