Search Results for author: Carlo Siebenschuh

Found 3 papers, 1 papers with code

Connecting Large Language Model Agent to High Performance Computing Resource

no code implementations17 Feb 2025 Heng Ma, Alexander Brace, Carlo Siebenschuh, Greg Pauloski, Ian Foster, Arvind Ramanathan

The Large Language Model agent workflow enables the LLM to invoke tool functions to increase the performance on specific scientific domain questions.

Language Modeling Language Modelling +1

LSHBloom: Memory-efficient, Extreme-scale Document Deduplication

no code implementations6 Nov 2024 Arham Khan, Robert Underwood, Carlo Siebenschuh, Yadu Babuji, Aswathy Ajith, Kyle Hippe, Ozan Gokdemir, Alexander Brace, Kyle Chard, Ian Foster

Deduplication is a major focus for assembling and curating training datasets for large language models (LLM) -- detecting and eliminating additional instances of the same content -- in large collections of technical documents.

Memorization

Cannot find the paper you are looking for? You can Submit a new open access paper.