/r/LocalLLaMA
Context caching for different LLM user sessions, performance of CPU inference, and blending CPU and GPU inference. What results can I expect? Or should I just use an API?
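For reference, a minimal sketch of what per-session context caching combined with partial CPU/GPU offload can look like with llama-cpp-python (llama.cpp backend). The model path, `n_gpu_layers`, thread count, and the session dictionary are illustrative assumptions, not a benchmark or a recommendation:

```python
# Sketch: per-session KV-cache reuse plus CPU/GPU layer offload
# using llama-cpp-python. Values are placeholders, not tuned settings.
from llama_cpp import Llama

llm = Llama(
    model_path="./models/model-q4_k_m.gguf",  # any GGUF model
    n_ctx=4096,
    n_gpu_layers=20,   # layers offloaded to the GPU; the rest run on CPU
    n_threads=8,       # CPU threads for the non-offloaded layers
)

session_states = {}  # user_id -> saved llama.cpp state (KV cache + position)

def chat(user_id: str, prompt: str) -> str:
    # Restore this user's cached context so earlier tokens in the session
    # are not re-evaluated on every turn.
    if user_id in session_states:
        llm.load_state(session_states[user_id])
    else:
        llm.reset()

    out = llm(prompt, max_tokens=256)

    # Snapshot the KV cache for this session's next turn.
    session_states[user_id] = llm.save_state()
    return out["choices"][0]["text"]

print(chat("alice", "Summarize the benefits of prompt caching in one sentence."))
```

Note that each saved state holds a full copy of that session's KV cache in RAM, so memory grows with the number of concurrent sessions; whether that beats simply calling a hosted API depends on your hardware and request volume.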