CursorCore: Assist Programming through Aligning Anything

TechxGenus/CursorCore 9 Oct 2024

In this work, we propose a new conversational framework that comprehensively integrates these information sources, collect data to train our models and evaluate their performance.

Code Completion

46
0.50 stars / hour

"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models

verazuo/jailbreak_llms 7 Aug 2023

We hope that our study can facilitate the research community and LLM vendors in promoting safer and regulated LLMs.

Community Detection

2,583
0.49 stars / hour

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

mathllm/mathcoder2 10 Oct 2024

Training several popular base models with this corpus significantly improves their mathematical abilities, leading to the creation of the MathCoder2 family of models.

Math Mathematical Reasoning

29
0.43 stars / hour

NTK-Guided Few-Shot Class Incremental Learning

Programmergg/NTK-FSCIL 19 Mar 2024

Through the combined effects of these measures, our network acquires robust NTK properties, ensuring optimal convergence and stability of the NTK matrix and minimizing the NTK-related generalization loss, significantly enhancing its theoretical generalization.

class-incremental learning Few-Shot Class-Incremental Learning +2

12
0.38 stars / hour
41
0.37 stars / hour

ToolGen: Unified Tool Retrieval and Calling via Generation

Reason-Wang/ToolGen 4 Oct 2024

As large language models (LLMs) advance, their inability to autonomously execute tasks by directly interacting with external tools remains a critical limitation.

Retrieval Text Generation

37
0.37 stars / hour

QT-DoG: Quantization-aware Training for Domain Generalization

saqibjaved1/QT-DoG 8 Oct 2024

In this work, we propose Quantization-aware Training for Domain Generalization (QT-DoG) and demonstrate that weight quantization effectively leads to flatter minima in the loss landscape, thereby enhancing domain generalization.

Domain Generalization Model Compression +1

10
0.36 stars / hour

Conditional Image Synthesis with Diffusion Models: A Survey

zju-pi/awesome-conditional-diffusion-models 28 Sep 2024

In this survey, we categorize existing works based on how conditions are integrated into the two fundamental components of diffusion-based modeling, i. e., the denoising network and the sampling process.

Denoising Diversity +2

47
0.35 stars / hour

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

qwenlm/qwen2-vl 18 Sep 2024

We present the Qwen2-VL Series, an advanced upgrade of the previous Qwen-VL models that redefines the conventional predetermined-resolution approach in visual processing.

Temporal Relation Extraction Visual Question Answering

2,590
0.34 stars / hour

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

ultralytics/ultralytics 21 Feb 2024

It can be used to obtain complete information, so that train-from-scratch models can achieve better results than state-of-the-art models pre-trained using large datasets, the comparison results are shown in Figure 1.

object-detection Real-Time Object Detection

30,663
0.33 stars / hour