#Large Language Models# ✨ Kubectl plugin to create manifests with LLMs
Model swapping for llama.cpp (or any local OpenAI-compatible server)
#Large Language Models# The easiest way to use Ollama in .NET
#Large Language Models# 🏗️ Fine-tune, build, and deploy open-source LLMs easily!
#Large Language Models# Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
#Natural Language Processing# [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
#Natural Language Processing# AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.
#Large Language Models# Social and customizable AI writing assistant! ✍️
#Large Language Models# 🏗️ Build, fine-tune, and run generative models locally!
#Large Language Models# A local and uncensored AI entity.
LLM RAG application with cross-encoder re-ranking for YouTube videos 🎥
Full featured demo application for OllamaSharp
#Large Language Models# Run GGUF LLM models in the latest version of TextGen-webui
#Large Language Models# 📚 LocalLLaMA Archive: community-powered static archive for r/LocalLLaMA
Copilot hack for running local copilot without auth and proxying
#Large Language Models# A chat interface in Streamlit for LLMs using Ollama.
#Search# Local AI search assistant (web or CLI) for Ollama and llama.cpp. Lightweight and easy to run, providing a Perplexity-like experience.
Lightweight Python tool using Optuna for tuning llama.cpp flags: towards optimal tok/s for your machine
A powerful shell that's powered by a locally running LLM (ideally Llama 3.x or Qwen 2.5)