LettuceDetect: A Hallucination Detection Framework for RAG Applications

KRLabsOrg/LettuceDetect 24 Feb 2025

Retrieval Augmented Generation (RAG) systems remain vulnerable to hallucinated answers despite incorporating external knowledge sources.

8k Hallucination +3

167
0.66 stars / hour

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

allenai/olmocr 25 Feb 2025

PDF documents have the potential to provide trillions of novel, high-quality tokens for training language models.

Diversity Language Modeling +1

9,848
0.66 stars / hour

ELIZA Reanimated: The world's first chatbot restored on the world's first time sharing system

rupertl/eliza-ctss 12 Jan 2025

The entire stack is open source, so that any user of a unix-like OS can run the world's first chatbot on the world's first time-sharing system.

Chatbot

213
0.62 stars / hour

GENERator: A Long-Context Generative Genomic Foundation Model

generteam/generator 11 Feb 2025

Recent developments in genomic language models have underscored the potential of LLMs in deciphering DNA sequences.

model

322
0.62 stars / hour

Underwater Organism Color Enhancement via Color Code Decomposition, Adaptation and Interpolation

xiaofeng-life/colorcode 29 Sep 2024

ColorCode offers three key features: 1) color enhancement, producing an enhanced image with a fixed color; 2) color adaptation, enabling controllable adjustments of long-wavelength color components using guidance images; and 3) color interpolation, allowing for the smooth generation of multiple colors through continuous sampling of the color code.

Image Enhancement

61
0.62 stars / hour

Online 3D Bin Packing with Constrained Deep Reinforcement Learning

alexfrom0815/Online-3D-BPP-DRL 26 Jun 2020

We solve a challenging yet practically useful variant of 3D Bin Packing Problem (3D-BPP).

3D Bin Packing Collision Avoidance +3

430
0.61 stars / hour

Liquid: Language Models are Scalable Multi-modal Generators

foundationvision/liquid 5 Dec 2024

We present Liquid, an auto-regressive generation paradigm that seamlessly integrates visual comprehension and generation by tokenizing images into discrete codes and learning these code embeddings alongside text tokens within a shared feature space for both vision and language.

Language Modeling Language Modelling +2

222
0.71 stars / hour

CorruptEncoder: Data Poisoning based Backdoor Attacks to Contrastive Learning

jsrdcht/SSL-Backdoor 15 Nov 2022

In this work, we take the first step to analyze the limitations of existing backdoor attacks and propose new DPBAs called CorruptEncoder to CL.

Backdoor Attack Contrastive Learning +2

65
0.41 stars / hour

AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

opendrivelab/agibot-world 9 Mar 2025

Introducing AgiBot World, a large-scale platform comprising over 1 million trajectories across 217 tasks in five deployment scenarios, we achieve an order-of-magnitude increase in data scale compared to existing datasets.

1,761
0.56 stars / hour

Fooling the Image Dehazing Models by First Order Gradient

guijiejie/aadn 30 Mar 2023

In this paper, we focus on designing a group of attack methods based on first order gradient to verify the robustness of the existing dehazing algorithms.

Adversarial Attack Image Dehazing +1

54
0.55 stars / hour