alekk89 / llama-cpp-windows-manager
PublicWindows desktop console for llama.cpp runtimes, models, and local coding workflows
llama.cpp Windows Manager is a desktop application that helps you run powerful AI models entirely on your own computer. It downloads and manages AI model files, installs the necessary runtime software optimized for your hardware (whether you have a regular processor, NVIDIA GPU, or other acceleration), and starts the model server so other programs can chat with your AI. You can run multiple models at once, each on their own private port, and everything stays secure on your local machine by default. The app includes safety features like automatic API key generation, local-only network binding, and protection for your settings and data.
How It Works
You download the installer or portable zip and run it on your Windows computer to get started.
You open the Runtimes section and install an official prebuilt runtime for your hardware - whether you use a regular processor, NVIDIA GPU, or other acceleration.
You search Hugging Face directly from the app, pick a model file, and download it with automatic verification.
You choose the runtime, adjust settings like memory usage and token limits, and save everything for that specific model.
You click Load and watch as your model starts up, with live status updates showing progress and resource usage.
Use any chat interface that supports custom API endpoints to talk to your model directly.
Add your local model to OpenCode so an AI coding assistant can help you work on your projects.
Your model is running privately on your computer, protected by a secure key, and ready to help you whenever you need it.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.