Model swapping for llama.cpp (or any local OpenAI API-compatible server)
#Large language models# ✨ Kubectl plugin to create manifests with LLMs
#Large language models# The easiest way to use Ollama in .NET
#Large language models# 🏗️ Fine-tune, build, and deploy open-source LLMs easily!
#Large language models# Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
#Natural language processing# [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
#Natural language processing# AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.
#Large language models# Social and customizable AI writing assistant! ✍️
#Large language models# 🏗️ Build, fine-tune, and run generative models locally!
#Large language models# A local and uncensored AI entity.
LLM RAG Application with Cross-Encoders Re-ranking for YouTube video 🎥
#Large language models# Run GGUF LLM models in the latest versions of TextGen-webui and koboldcpp
Secure Flutter desktop app connecting Auth0 authentication with local Ollama AI models via encrypted tunneling. Access your private AI instances remotely while keeping data on your hardware.
Full featured demo application for OllamaSharp
#Large language models# A control server for managing multiple Llama Server instances with a web-based dashboard.
#Large language models# 📚 LocalLLaMA Archive: community-powered static archive for r/LocalLLaMA
Copilot hack for running local copilot without auth and proxying
#Large language models# A set of guides for fully contained, daemonless, secure methods of storing and using LLMs locally on a mounted SSD. Uses Podman, supports AMD with Vulkan, uses llama.cpp, llamafiles, ollama w/ Openhan...
Lightweight Python tool using Optuna for tuning llama.cpp flags: towards optimal tok/s for your machine