no code implementations • 9 May 2025 • Yuxin Zhou, Zheng Li, Jun Zhang, Jue Wang, Yiping Wang, Zhongle Xie, Ke Chen, Lidan Shou
With the widespread adoption of Mixture-of-Experts (MoE) models, there is a growing demand for efficient inference on memory-constrained devices.
1 code implementation • 29 Apr 2025 • Yiping Wang, Qing Yang, Zhiyuan Zeng, Liliang Ren, Lucas Liu, Baolin Peng, Hao Cheng, Xuehai He, Kuan Wang, Jianfeng Gao, Weizhu Chen, Shuohang Wang, Simon Shaolei Du, Yelong Shen
We also show the critical role of promoting exploration (e. g., by adding entropy loss with an appropriate coefficient) in 1-shot RLVR training.
no code implementations • 11 Apr 2025 • Jiaqi He, Xiangwen Luo, Yiping Wang
At the current stage, deep learning-based methods have demonstrated excellent capabilities in evaluating aerodynamic performance, significantly reducing the time and cost required for traditional computational fluid dynamics (CFD) simulations.
no code implementations • 17 Dec 2024 • Yiping Wang, Xuehai He, Kuan Wang, Luyao Ma, Jianwei Yang, Shuohang Wang, Simon Shaolei Du, Yelong Shen
However, they still struggle to coherently present multiple sequential events in the stories specified by the prompts, which is foreseeable an essential capability for future long video generation scenarios.
no code implementations • 12 Dec 2024 • Xuehai He, Shuohang Wang, Jianwei Yang, Xiaoxia Wu, Yiping Wang, Kuan Wang, Zheng Zhan, Olatunji Ruwase, Yelong Shen, Xin Eric Wang
Recent advancements in diffusion models have shown great promise in producing high-quality video content.
1 code implementation • 26 Sep 2024 • Yanming Wan, Yue Wu, Yiping Wang, Jiayuan Mao, Natasha Jaques
We propose a new framework, Follow Instructions with Social and Embodied Reasoning (FISER), aiming for better natural language instruction following in collaborative embodied tasks.
no code implementations • 12 Jun 2024 • Yiping Wang, Yanhao Wang, Cen Chen
The sliding window model of computation captures scenarios in which data are continually arriving in the form of a stream, and only the most recent $w$ items are used for analysis.
2 code implementations • 29 May 2024 • Yiping Wang, Yifang Chen, Wendan Yan, Alex Fang, Wenjing Zhou, Kevin Jamieson, Simon Shaolei Du
Three main data selection approaches are: (1) leveraging external non-CLIP models to aid data selection, (2) training new CLIP-style embedding models that are more effective at selecting high-quality data than the original OpenAI CLIP model, and (3) designing better metrics or strategies universally applicable to any CLIP embedding without requiring specific model properties (e. g., CLIPScore is one popular metric).
2 code implementations • 3 Feb 2024 • Yiping Wang, Yifang Chen, Wendan Yan, Kevin Jamieson, Simon Shaolei Du
In recent years, data selection has emerged as a core issue for large-scale visual-language model pretraining, especially on noisy web-curated datasets.
1 code implementation • 1 Oct 2023 • Yuandong Tian, Yiping Wang, Zhenyu Zhang, Beidi Chen, Simon Du
We propose Joint MLP/Attention (JoMA) dynamics, a novel mathematical framework to understand the training procedure of multilayer Transformer architectures.
no code implementations • 5 Jun 2023 • Yiping Wang, Yifang Chen, Kevin Jamieson, Simon S. Du
In addition to our sample complexity results, we also characterize the potential of our $\nu^1$-based strategy in sample-cost-sensitive settings.
1 code implementation • 11 Oct 2022 • Huaxiu Yao, Yiping Wang, Linjun Zhang, James Zou, Chelsea Finn
In this paper, we propose a simple yet powerful algorithm, C-Mixup, to improve generalization on regression tasks.
no code implementations • 14 Oct 2021 • Ahmad Pesaranghader, Yiping Wang, Mohammad Havaei
Diversity in data is critical for the successful training of deep learning models.
no code implementations • ICML Workshop URL 2021 • Yiping Wang, Michael Brandon Haworth
We qualitatively and quantitatively demonstrate that, in terms of multi-agent ($\geq$ 8 agents) navigation and steering, $\textit{Students}$ trained by our approach outperform agents using heuristic search, as well as agents trained by domain randomization.
no code implementations • 8 Dec 2020 • Mohammad Havaei, Ximeng Mao, Yiping Wang, Qicheng Lao
Current practices in using cGANs for medical image generation, only use a single variable for image generation (i. e., content) and therefore, do not provide much flexibility nor control over the generated image.
1 code implementation • MIDL 2019 • Yiping Wang, David Farnell, Hossein Farahani, Mitchell Nursey, Basile Tessier-Cloutier, Steven J. M. Jones, David G. Huntsman, C. Blake Gilks, Ali Bashashati
The proposed algorithm achieved a mean accuracy of $87. 54\%$ and Cohen's kappa of $0. 8106$ in the slide-level classification of $305$ WSIs; performing better than a standard CNN and pathologists without gynecology-specific training.