Search Results for author: Jingwen Fu

Found 8 papers, 0 papers with code

Closing the Gap Between the Upper Bound and the Lower Bound of Adam's Iteration Complexity

no code implementations 27 Oct 2023 Bohan Wang, Jingwen Fu, Huishuai Zhang, Nanning Zheng, Wei Chen

Recently, Arjevani et al. [1] established a lower bound of iteration complexity for the first-order optimization under an $L$-smooth condition and a bounded noise variance assumption.

LEMMA valid

Breaking through the learning plateaus of in-context learning in Transformer

no code implementations 12 Sep 2023 Jingwen Fu, Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng

To study the mechanism behind the learning plateaus, we conceptually separate a component within the model's internal representation that is exclusively affected by the model's weights.

In-Context Learning Representation Learning

Generalization error bounds for iterative learning algorithms with bounded updates

no code implementations 10 Sep 2023 Jingwen Fu, Nanning Zheng

This paper explores the generalization characteristics of iterative learning algorithms with bounded updates for non-convex loss functions, employing information-theoretic techniques.

When and Why Momentum Accelerates SGD: An Empirical Study

no code implementations 15 Jun 2023 Jingwen Fu, Bohan Wang, Huishuai Zhang, Zhizheng Zhang, Wei Chen, Nanning Zheng

In the comparison of SGDM and SGD with the same effective learning rate and the same batch size, we observe a consistent pattern: when $\eta_{ef}$ is small, SGDM and SGD experience almost the same empirical training losses; when $\eta_{ef}$ surpasses a certain threshold, SGDM begins to perform better than SGD.

StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition

no code implementations CVPR 2023 Yanqing Shen, Sanping Zhou, Jingwen Fu, Ruotong Wang, Shitao Chen, Nanning Zheng

In this paper, we propose StructVPR, a novel training architecture for VPR, to enhance structural knowledge in RGB global features and thus improve feature stability in a constantly changing environment.

Image Retrieval Knowledge Distillation +3

Understanding Mobile GUI: from Pixel-Words to Screen-Sentences

no code implementations 25 May 2021 Jingwen Fu, Xiaoyi Zhang, Yuwang Wang, Wenjun Zeng, Sam Yang, Grayson Hilliard

A dataset, RICO-PW, of screenshots with Pixel-Words annotations is built based on the public RICO dataset, which will be released to help to address the lack of high-quality training data in this area.

Retrieval Sentence

Model Adaption Object Detection System for Robot

no code implementations 7 Nov 2019 Jingwen Fu, Licheng Zong, Yinbing Li, Ke Li, Bingqian Yang, Xibei Liu

Object detection for robot guidance is a crucial task for autonomous robots and has attracted extensive attention from researchers.

Object object-detection +2

Recognition Of Surface Defects On Steel Sheet Using Transfer Learning

no code implementations 7 Sep 2019 Jingwen Fu, Xiaoyan Zhu, Yingbin Li

Automatic defect recognition is one of the research hotspots in steel production, but most current methods extract features manually and use machine learning classifiers to recognize defects, so they cannot handle cases where little training data is available and they are confined to a particular scene.

Transfer Learning
