NdLinear Is All You Need for Representation Learning

ensemble-core/ndlinear 21 Mar 2025

We propose NdLinear as a drop-in replacement for standard linear layers -- marking an important step toward next-generation neural architectures.

All Representation Learning

200
0.66 stars / hour

Less-to-More Generalization: Unlocking More Controllability by In-Context Generation

bytedance/uno 2 Apr 2025

In this study, we propose a highly consistent data synthesis pipeline to tackle this challenge.

Conditional Image Generation Personalized Image Generation +1

789
0.62 stars / hour

LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion

robfiras/loco-mujoco 4 Nov 2023

Imitation Learning (IL) holds great promise for enabling agile locomotion in embodied agents.

Benchmarking Imitation Learning

756
0.60 stars / hour

IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

index-tts/index-tts 8 Feb 2025

Recently, large language model (LLM) based text-to-speech (TTS) systems have gradually become the mainstream in the industry due to their high naturalness and powerful zero-shot voice cloning capabilities. Here, we introduce the IndexTTS system, which is mainly based on the XTTS and Tortoise models.

Decoder Language Modeling +5

1,008
0.58 stars / hour

Affordable AI Assistants with Knowledge Graph of Thoughts

spcl/knowledge-graph-of-thoughts 3 Apr 2025

Such structured representation of task-relevant knowledge enables low-cost models to solve complex tasks effectively.

Knowledge Graphs Math

67
0.58 stars / hour

MonSter: Marry Monodepth to Stereo Unleashes Power

junda24/monster 15 Jan 2025

The refined monodepth in turn guides stereo effectively at ill-posed regions.

Monocular Depth Estimation Stereo Matching +1

319
0.56 stars / hour

VSLAM-LAB: A Comprehensive Framework for Visual SLAM Methods and Datasets

alejandrofontan/vslam-lab 6 Apr 2025

Visual Simultaneous Localization and Mapping (VSLAM) research faces significant challenges due to fragmented toolchains, complex system configurations, and inconsistent evaluation methodologies.

Simultaneous Localization and Mapping

168
0.51 stars / hour

OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation

octree-nn/octgpt 14 Apr 2025

In this paper, we introduce OctGPT, a novel multiscale autoregressive model for 3D shape generation that dramatically improves the efficiency and performance of prior 3D autoregressive approaches, while rivaling or surpassing state-of-the-art diffusion models.

3D Shape Generation

89
0.51 stars / hour

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

rlhflow/minimal-rl 15 Apr 2025

In this work, we revisit GRPO from a reinforce-like algorithm perspective and analyze its core components.

Reinforcement Learning (RL)

66
0.50 stars / hour