/u/AbheekG's posts
The software pain of running local LLMs finally got to me, so I made my own inferencing server. You don't need to compile or update it every time a new model or tokenizer drops, and you don't need to quantize or even download your LLMs: just give it a name and run models the moment they're posted on HuggingFace.
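A minimal sketch of what "run by name" can look like, assuming the server wraps Hugging Face's transformers Auto classes (an assumption; the post doesn't name its backend). The repo ID, function names, and prompt below are illustrative only:

```python
# Sketch: loading and running a model purely by its Hub repo ID,
# so no manual download, conversion, or per-architecture rebuild is needed.
# Assumes `transformers` and `accelerate` are installed; not the author's code.
from transformers import AutoModelForCausalLM, AutoTokenizer

def load_by_name(model_id: str):
    """Fetch tokenizer and weights straight from the Hugging Face Hub by name."""
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
    return tokenizer, model

def generate(model_id: str, prompt: str, max_new_tokens: int = 128) -> str:
    """One-shot text generation against a model named at call time."""
    tokenizer, model = load_by_name(model_id)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output[0], skip_special_tokens=True)

if __name__ == "__main__":
    # "Qwen/Qwen2.5-0.5B-Instruct" is just an example repo ID; swap in any
    # causal LM the moment it appears on the Hub.
    print(generate("Qwen/Qwen2.5-0.5B-Instruct", "Explain local LLM serving in one sentence."))
```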