Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization

Alibaba-NLP/CHRONOS 1 Jan 2025

In the fast-changing realm of information, the capacity to construct coherent timelines from extensive event-related content has become increasingly significant and challenging.

News Retrieval Retrieval +1

54
0.37 stars / hour

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

openrlhf/openrlhf 4 Jan 2025

Reinforcement Learning from Human Feedback (RLHF) has emerged as a critical approach for aligning large language models with human preferences, witnessing rapid algorithmic evolution through methods such as Proximal Policy Optimization (PPO), Direct Preference Optimization (DPO), REINFORCE Leave One-Out (RLOO), ReMax, and Group Relative Policy Optimization (GRPO).

Computational Efficiency

3,703
0.47 stars / hour

Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO

OpenLLMAI/OpenRLHF 25 May 2020

We study the roots of algorithmic progress in deep policy gradient algorithms through a case study on two popular algorithms: Proximal Policy Optimization (PPO) and Trust Region Policy Optimization (TRPO).

Deep Reinforcement Learning reinforcement-learning +1

3,705
0.47 stars / hour

Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

FoundationVision/Infinity 5 Dec 2024

We present Infinity, a Bitwise Visual AutoRegressive Modeling capable of generating high-resolution, photorealistic images following language instruction.

Image Generation

844
0.46 stars / hour

PLAPT: Protein-Ligand Binding Affinity Prediction Using Pretrained Transformers

trrt-good/WELP-PLAPT bioRxiv 2024

Understanding protein-ligand binding affinity is crucial for drug discovery, enabling the identification of promising drug candidates efficiently.

Drug Discovery Protein-Ligand Affinity Prediction +1

65
0.46 stars / hour

SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution

internlm/swe-fixer 9 Jan 2025

The retrieval module employs BM25 along with a lightweight LLM model to achieve coarse-to-fine file retrieval.

GitHub issue resolution Retrieval

39
0.54 stars / hour

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

hustvl/LightningDiT 2 Jan 2025

The integrated system achieves state-of-the-art (SOTA) performance on ImageNet 256x256 generation with an FID score of 1. 35 while demonstrating remarkable training efficiency by reaching an FID score of 2. 11 in just 64 epochs--representing an over 21 times convergence speedup compared to the original DiT.

193
0.35 stars / hour

The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features

liam-sbhoo/tabpfn-time-series 6 Jan 2025

Foundation models have become popular in forecasting due to their ability to make accurate predictions, even with minimal fine-tuning on specific datasets.

Feature Engineering Time Series +1

30
0.31 stars / hour

SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models

snowfallingplum/shmt 15 Dec 2024

To address these issues, we propose a novel Self-supervised Hierarchical Makeup Transfer (SHMT) method via latent diffusion models.

127
0.31 stars / hour

ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle

sdwyc/rolo 4 Jan 2025

In this article, a LiDAR-based SLAM method is presented to improve the accuracy of pose estimations for ground vehicles in rough terrains, which is termed Rotation-Optimized LiDAR-Only (ROLO) SLAM.

Pose Estimation

43
0.31 stars / hour