LLM real-life tasks
2 papers with code • 0 benchmarks • 0 datasets
This task has no description! Would you like to contribute one?
Benchmarks
These leaderboards are used to track progress in LLM real-life tasks
No evaluation results yet. Help compare methods by
submitting
evaluation metrics.
Most implemented papers
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
LLaVA-Plus is a general-purpose multimodal assistant that expands the capabilities of large multimodal models.
AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
Through conducting extensive experiments on a large scale of harmful and safe prompts, we validate the effectiveness of the proposed AutoDefense in improving the robustness against jailbreak attacks, while maintaining the performance at normal user request.