raketenkater / llm-server
PublicSmart launcher for llama.cpp / ik_llama.cpp — auto-detects GPUs, optimizes MoE placement, crash recovery
A user-friendly launcher that automatically configures and starts AI language model servers based on your hardware, with built-in model downloading.
How It Works
You find this handy tool on GitHub that makes running powerful AI chats on your own computer super easy, without fiddling with settings.
You grab the files and run a simple setup script that puts everything in place on your computer.
Tell it a model name, and it smartly picks the best version for your computer's memory and downloads it smoothly.
Point it to a model file you already have, and it takes care of the rest.
Hit go, and it automatically detects your hardware, tunes everything perfectly, and starts your personal AI server in seconds.
Connect to your AI and enjoy super-fast responses, benchmarks, or even vision features if your model supports it.
Now you have a blazing-fast, private AI assistant running on your machine, optimized just for you—ready for endless conversations!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.