1 code implementation • 21 Mar 2025 • Mingyang Song, Mao Zheng, Zheng Li, Wenjie Yang, Xuan Luo, Yue Pan, Feng Zhang
In this paper, we propose \textbf{\textsc{FastCuRL}}, a simple yet efficient \textbf{Cu}rriculum \textbf{R}einforcement \textbf{L}earning approach with context window extending strategy to accelerate the reinforcement learning training efficiency for R1-like reasoning models while enhancing their performance in tackling complex reasoning tasks with long chain-of-thought rationales, particularly with a 1. 5B parameter language model.
no code implementations • 8 Mar 2025 • Mingyang Song, Mao Zheng, Xuan Luo
To address the above issue, in this paper, we propose a Goal-Reversed Prompting (GRP) approach for pairwise evaluation that shifts the original task from selecting the better answer to choosing the worse one.
no code implementations • 25 Nov 2024 • John Flynn, Michael Broxton, Lukas Murmann, Lucy Chai, Matthew DuVall, Clément Godard, Kathryn Heal, Srinivas Kaza, Stephen Lombardi, Xuan Luo, Supreeth Achar, Kira Prabhu, Tiancheng Sun, Lynn Tsai, Ryan Overbeck
Our feed-forward network generalizes across a wide variety of datasets and scenes and produces state-of-the-art quality for a real-time method.
no code implementations • 17 Jun 2024 • Mingyang Song, Mao Zheng, Xuan Luo, Yue Pan
However, this kind of evaluation approach is affected by potential biases within LLMs, raising concerns about the accuracy and reliability of the evaluation results of LLMs.
1 code implementation • 6 Jun 2024 • Chenxin Tao, Xizhou Zhu, Shiqian Su, Lewei Lu, Changyao Tian, Xuan Luo, Gao Huang, Hongsheng Li, Yu Qiao, Jie zhou, Jifeng Dai
The issue of "over-focus" hinders the model's ability to extract diverse visual features and to receive effective gradients for optimization.
1 code implementation • 18 Mar 2024 • Mingyang Song, Mao Zheng, Xuan Luo
Despite recent efforts to develop large language models with robust long-context capabilities, the lack of long-context benchmarks means that relatively little is known about their performance.
no code implementations • 10 Oct 2023 • Xuan Luo, Mingqing Huang, Rui Lv, Hui Zhao
Sequential location recommendation plays a huge role in modern life, which can enhance user experience, bring more profit to businesses and assist in government administration.
1 code implementation • 10 May 2023 • Hong Wang, Xuan Luo, Weizhi Wang, Xifeng Yan
Large language models (LLMs) like GPT-4 have recently demonstrated impressive capabilities in natural language understanding and generation.
no code implementations • 20 Feb 2023 • Weihong Zhong, Mao Zheng, Duyu Tang, Xuan Luo, Heng Gong, Xiaocheng Feng, Bing Qin
Although large-scale video-language pre-training models, which usually build a global alignment between the video and the text, have achieved remarkable progress on various downstream tasks, the idea of adopting fine-grained information during the pre-training stage is not well explored.
1 code implementation • 6 Jan 2022 • Xuan Luo, Zhen Han, Lingkang Yang, Lingling Zhang
Recently, attentional arbitrary style transfer methods have been proposed to achieve fine-grained results, which manipulates the point-wise similarity between content and style features for stylization.
1 code implementation • CVPR 2022 • Roy Or-El, Xuan Luo, Mengyi Shan, Eli Shechtman, Jeong Joon Park, Ira Kemelmacher-Shlizerman
We introduce a high resolution, 3D-consistent image and shape generation technique which we call StyleSDF.
no code implementations • 18 Aug 2021 • Zicun Cong, Xuan Luo, Pei Jian, Feida Zhu, Yong Zhang
We also investigate pricing in the step of collaborative training of machine learning models, and overview pricing machine learning models for end users in the step of machine learning deployment.
1 code implementation • CVPR 2021 • Tong Wu, Junshi Huang, Guangyu Gao, Xiaoming Wei, Xiaolin Wei, Xuan Luo, Chi Harold Liu
In inference, we directly use the activation masks from the DA layer as pseudo-labels for segmentation.
no code implementations • 12 Mar 2021 • Archana Tiwari, Fangchu Chen, Shazhou Zhong, Elizabeth Drueke, Jahyun Koo, Austin Kaczmarek, Cong Xiao, Jingjing Gao, Xuan Luo, Qian Niu, Yuping Sun, Binghai Yan, Liuyan Zhao, Adam W. Tsen
While the anomalous Hall effect can manifest even without an external magnetic field, time reversal symmetry is nonetheless still broken by the internal magnetization of the sample.
Mesoscale and Nanoscale Physics
no code implementations • 11 Mar 2021 • Chenhaoping Wen, Jingjing Gao, Yuan Xie, Qing Zhang, Pengfei Kong, Jinghui Wang, Yilan Jiang, Xuan Luo, Jun Li, Wenjian Lu, Yu-Ping Sun, Shichao Yan
In contrast, in bulk 1T-TaS$_2$, there is an interlayer CDW coupling induced insulating gap.
Mesoscale and Nanoscale Physics Materials Science
no code implementations • 10 Mar 2021 • Shaowei Wang, Lingling Zhang, Xuan Luo, Yi Yang, Xin Hu, Jun Liu
Another type of diagrams such as from Computer Science is composed of graphics containing complex topologies and relations, and research on this type of diagrams is still blank.
1 code implementation • 22 Dec 2020 • Xuan Luo, Xuaner Zhang, Paul Yoo, Ricardo Martin-Brualla, Jason Lawrence, Steven M. Seitz
Many historical people were only ever captured by old, faded, black and white photos, that are distorted due to the limitations of early cameras and the passage of time.
3 code implementations • 30 Apr 2020 • Xuan Luo, Jia-Bin Huang, Richard Szeliski, Kevin Matzen, Johannes Kopf
We present an algorithm for reconstructing dense, geometrically consistent depth for all pixels in a monocular video.
no code implementations • 21 Aug 2019 • Xuan Luo, Yanmeng Kong, Jason Lawrence, Ricardo Martin-Brualla, Steve Seitz
This paper introduces the largest and most diverse collection of rectified stereo image pairs to the research community, KeystoneDepth, consisting of tens of thousands of stereographs of historical people, events, objects, and scenes between 1860 and 1963.
no code implementations • 17 Jan 2019 • Taylor Sweet, Austin Rothwell, Xuan Luo
The social media revolution has changed the way that brands interact with consumers.
no code implementations • 28 Sep 2015 • Xuan Luo, Xuejiao Bai, Shuo Li, Hongtao Lu, Sei-ichiro Kamata
This is partially because our DTs overcome the extreme greediness of the MST.
1 code implementation • 19 Dec 2014 • Min Lin, Shuo Li, Xuan Luo, Shuicheng Yan
In this paper, we introduce a novel deep learning framework, termed Purine.