Search Results for author: Mingze Wang

Found 11 papers, 3 papers with code

The Implicit Bias of Gradient Noise: A Symmetry Perspective

no code implementations11 Feb 2024 Liu Ziyin, Mingze Wang, Lei Wu

For one class of symmetry, SGD naturally converges to solutions that have balanced and aligned gradient noise.

Understanding the Expressive Power and Mechanisms of Transformer for Sequence Modeling

no code implementations1 Feb 2024 Mingze Wang, Weinan E

We conduct a systematic study of the approximation properties of Transformer for sequence modeling with long, sparse and complicated memory.

Achieving Margin Maximization Exponentially Fast via Progressive Norm Rescaling

no code implementations24 Nov 2023 Mingze Wang, Zeping Min, Lei Wu

Inspired by this analysis, we propose a novel algorithm called Progressive Rescaling Gradient Descent (PRGD) and show that PRGD can maximize the margin at an exponential rate.
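The progressive-rescaling idea lends itself to a small illustration. The sketch below is a minimal, hypothetical example on toy data with exponential loss and an assumed rescaling schedule, not the paper's exact PRGD update: gradient steps are interleaved with periodic enlargement of the parameter norm, and the normalized margin is tracked as the quantity of interest.

```python
# Hypothetical illustration only: gradient descent on exponential loss,
# interleaved with periodic rescaling of the parameter norm. The schedule,
# rescaling factor, and constants are assumptions for this sketch and are
# not the exact PRGD update from the paper.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = np.sign(X @ rng.normal(size=5))          # linearly separable toy labels

def grad(w):
    margins = y * (X @ w)
    return -(np.exp(-margins) * y) @ X / len(y)   # gradient of mean exp-loss

w = 0.01 * rng.normal(size=5)
lr, rescale_every, factor = 0.1, 200, 1.5
for t in range(1, 2001):
    w -= lr * grad(w)
    if t % rescale_every == 0:
        w *= factor                          # progressively enlarge ||w||
print("normalized margin:", (y * (X @ w)).min() / np.linalg.norm(w))
```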

A Theoretical Analysis of Noise Geometry in Stochastic Gradient Descent

no code implementations1 Oct 2023 Mingze Wang, Lei Wu

In this paper, we provide a theoretical study of noise geometry for minibatch stochastic gradient descent (SGD), a phenomenon where the noise aligns favorably with the geometry of the local landscape.
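As a rough illustration of the alignment phenomenon, the sketch below estimates the minibatch-gradient noise covariance on a toy anisotropic least-squares problem and compares the noise variance along the sharpest Hessian direction with an isotropic reference; the setup and metric are assumptions for illustration, not the quantities defined in the paper.

```python
# Hypothetical illustration only: on a toy least-squares problem, compare how
# much minibatch gradient noise lies along the sharpest Hessian direction
# versus an isotropic reference (trace/d).
import numpy as np

rng = np.random.default_rng(0)
n, d, batch = 1000, 10, 32
X = rng.normal(size=(n, d)) * np.linspace(0.2, 3.0, d)   # anisotropic features
y = X @ rng.normal(size=d) + 0.5 * rng.normal(size=n)

w = np.zeros(d)
H = X.T @ X / n                                 # Hessian of the quadratic loss
full_grad = X.T @ (X @ w - y) / n

noises = []
for _ in range(2000):                           # empirical noise covariance
    idx = rng.choice(n, size=batch, replace=False)
    g = X[idx].T @ (X[idx] @ w - y[idx]) / batch
    noises.append(g - full_grad)
Sigma = np.cov(np.array(noises).T)

v_sharp = np.linalg.eigh(H)[1][:, -1]           # sharpest curvature direction
print("noise variance along sharpest direction:", v_sharp @ Sigma @ v_sharp)
print("isotropic reference (trace/d):", np.trace(Sigma) / d)
```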


The alignment property of SGD noise and how it helps select flat minima: A stability analysis

no code implementations6 Jul 2022 Lei Wu, Mingze Wang, Weijie Su

In this paper, we provide an explanation of this striking phenomenon by relating the particular noise structure of SGD to its linear stability (Wu et al., 2018).

Incorporating Voice Instructions in Model-Based Reinforcement Learning for Self-Driving Cars

no code implementations21 Jun 2022 Mingze Wang, Ziyang Zhang, Grace Hui Yang

This paper presents a novel approach that supports natural language voice instructions to guide deep reinforcement learning (DRL) algorithms when training self-driving cars.

Tasks: Model-based Reinforcement Learning, Reinforcement Learning, +2

Generalization Error Bounds for Deep Neural Networks Trained by SGD

no code implementations7 Jun 2022 Mingze Wang, Chao Ma

Generalization error bounds for deep neural networks trained by stochastic gradient descent (SGD) are derived by combining dynamical control of an appropriate parameter norm with a Rademacher complexity estimate based on parameter norms.

Early Stage Convergence and Global Convergence of Training Mildly Parameterized Neural Networks

1 code implementation5 Jun 2022 Mingze Wang, Chao Ma

We study the convergence of GD and SGD when training mildly parameterized neural networks from random initialization.
