LLMs as Hackers: Autonomous Linux Privilege Escalation Attacks

ipa-lab/hackingBuddyGPT 17 Oct 2023

We explore the intersection of LLMs and penetration testing to gain insight into their capabilities and challenges in the context of privilege escalation.

In-Context Learning

178
0.35 stars / hour

Wav-KAN: Wavelet Kolmogorov-Arnold Networks

zavareh1/Wav-KAN 21 May 2024

In this paper , we introduce Wav-KAN, an innovative neural network architecture that leverages the Wavelet Kolmogorov-Arnold Networks (Wav-KAN) framework to enhance interpretability and performance.

Computational Efficiency

22
0.35 stars / hour

RLHF Workflow: From Reward Modeling to Online RLHF

RLHFlow/RLHF-Reward-Modeling 13 May 2024

We present the workflow of Online Iterative Reinforcement Learning from Human Feedback (RLHF) in this technical report, which is widely reported to outperform its offline counterpart by a large margin in the recent large language model (LLM) literature.

Chatbot Language Modelling +1

219
0.34 stars / hour

Internet sentiment exacerbates intraday overtrading, evidence from A-Share market

HeliumPeng/Eastmoney.guba.com-sentiment 18 Apr 2024

Additionally, the effect of sentiment on overtrading is observed to be more pronounced among individual investors in large-cap stocks compared to small- and mid-cap stocks.

34
0.34 stars / hour

GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

GaussianObject/GaussianObject 15 Feb 2024

Then we construct a Gaussian repair model based on diffusion models to supplement the omitted object information, where Gaussians are further refined.

Neural Rendering Object

689
0.33 stars / hour

4D Panoptic Scene Graph Generation

jingkang50/psg4d NeurIPS 2023

To facilitate research in this new area, we build a richly annotated PSG-4D dataset consisting of 3K RGB-D videos with a total of 1M frames, each of which is labeled with 4D panoptic segmentation masks as well as fine-grained, dynamic scene graphs.

4D Panoptic Segmentation Graph Generation +5

65
0.31 stars / hour

UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization

intellisensing/uav-visloc 20 May 2024

Our dataset includes 6, 742 drone images and 11 satellite maps, with metadata such as latitude, longitude, altitude, and capture date.

Visual Localization

52
0.30 stars / hour

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

ibm-granite/granite-code-models 7 May 2024

Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers, and LLM-based agents are beginning to show promise for handling complex tasks autonomously.

Code Generation Decoder

830
0.29 stars / hour

Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity

vikparuchuri/marker 11 Jan 2021

We design models based off T5-Base and T5-Large to obtain up to 7x increases in pre-training speed with the same computational resources.

Language Modelling Question Answering

9,077
0.29 stars / hour

Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations

facebookresearch/generative-recommenders 27 Feb 2024

Large-scale recommendation systems are characterized by their reliance on high cardinality, heterogeneous features and the need to handle tens of billions of user actions on a daily basis.

 Ranked #1 on Recommendation Systems on Amazon-Book (HR@10 metric)

Recommendation Systems

358
0.29 stars / hour