1 code implementation • 22 Feb 2024 • Xuxi Chen, Zhendong Wang, Daouda Sow, Junjie Yang, Tianlong Chen, Yingbin Liang, Mingyuan Zhou, Zhangyang Wang
Our study starts from an empirical strategy for the light continual training of LLMs using their original pre-training datasets, with a specific focus on selective retention of samples that incur moderately high losses.
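A minimal sketch of the selection idea described above: keep only the samples whose per-sample loss falls in a moderately high band, discarding both easy samples and extreme outliers. The percentile band and helper name are illustrative assumptions, not the paper's exact rule.

```python
import numpy as np

def select_moderate_high_loss(losses, low_pct=50.0, high_pct=90.0):
    """Return indices of samples in a moderately high loss band.

    Hypothetical sketch: drops easy samples (below the low_pct percentile)
    and extreme outliers (above the high_pct percentile), retaining the
    mid-to-high-loss band for continual training.
    """
    losses = np.asarray(losses, dtype=float)
    lo, hi = np.percentile(losses, [low_pct, high_pct])
    mask = (losses >= lo) & (losses <= hi)
    return np.flatnonzero(mask)

# Toy per-sample losses; very low and very high values are filtered out.
losses = [0.1, 0.2, 2.5, 1.1, 0.9, 5.0, 1.4, 0.3]
kept = select_moderate_high_loss(losses)
```

Here `kept` holds the indices of the retained samples; in practice the band would be tuned per model and dataset.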
no code implementations • 1 Aug 2023 • Daouda Sow, Sen Lin, Zhangyang Wang, Yingbin Liang
Experiments on standard classification datasets demonstrate that our proposed approach outperforms related state-of-the-art baselines in average robust performance, while also improving robustness against attacks on the weakest data points.
no code implementations • 2 Feb 2023 • Daouda Sow, Sen Lin, Yingbin Liang, Junshan Zhang
More specifically, we first propose two simple but effective detection mechanisms, grounded in empirical observations, for task switches and distribution shifts; these serve as key building blocks for more principled online model updates in our algorithm. The task-switch detection mechanism allows reuse of the best model available for the current task, while the distribution-shift detection mechanism differentiates the meta-model update so as to preserve knowledge of in-distribution tasks and quickly learn new knowledge for out-of-distribution tasks.
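The two detection mechanisms above can be sketched as follows. This is a hypothetical illustration, not the paper's algorithm: the `eval_loss` callback, threshold values, and model-cache layout are all assumptions. A task switch is flagged when some other cached model fits the incoming batch noticeably better; a distribution shift is flagged when even the best cached model fits poorly.

```python
def detect(batch, cached_models, eval_loss, current_id,
           switch_tol=0.5, shift_tol=2.0):
    """Hypothetical task-switch and distribution-shift tests.

    cached_models: dict mapping task id -> model (one per seen task).
    eval_loss(model, batch): callback returning the model's loss on batch.
    Returns (best_id, task_switched, out_of_distribution).
    """
    # Evaluate every cached model on the new batch.
    losses = {tid: eval_loss(m, batch) for tid, m in cached_models.items()}
    best_id = min(losses, key=losses.get)
    # Task switch: another cached model beats the current one by a margin.
    task_switched = (best_id != current_id
                     and losses[current_id] - losses[best_id] > switch_tol)
    # Distribution shift: even the best cached model fits the batch poorly.
    out_of_dist = losses[best_id] > shift_tol
    return best_id, task_switched, out_of_dist

# Toy usage: models are stand-in loss tables keyed by batch id.
cached = {"A": {"x": 3.0, "y": 0.3}, "B": {"x": 0.4, "y": 2.5}}
result = detect("x", cached, lambda m, b: m[b], current_id="A")
```

On detecting a switch, the learner would reuse the best cached model; on detecting a shift, it would adapt the meta-model update instead, as the abstract describes.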
no code implementations • 1 Mar 2022 • Daouda Sow, Kaiyi Ji, Ziwei Guan, Yingbin Liang
Existing algorithms designed for this problem are applicable only to restricted settings and do not come with full convergence guarantees.
1 code implementation • 13 Oct 2021 • Daouda Sow, Kaiyi Ji, Yingbin Liang
Bilevel optimization has arisen as a powerful tool in modern machine learning.
no code implementations • 29 Sep 2021 • Daouda Sow, Kaiyi Ji, Yingbin Liang
Bilevel optimization (BO) has arisen as a powerful tool for solving many modern machine learning problems.
no code implementations • 1 Nov 2018 • Daouda Sow, Zengchang Qin, Mouhamed Niasse, Tao Wan
Recent advances of deep learning in both computer vision (CV) and natural language processing (NLP) provide us with a new way of understanding semantics, enabling more challenging tasks such as automatic description generation from natural images.