#LLM# A C#/.NET library to run LLMs (🦙 LLaMA/LLaVA) efficiently on your local device.
#Android# Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
#LLM# Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.
prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters
#LLM# A self-evaluating interview for AI coders.
#LLM# Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit keys & 4-bit values, reducing memory by 59% with <1% quality loss.
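The memory arithmetic behind that claim can be sketched as follows, assuming an FP16 baseline and an illustrative Llama-style cache geometry (the model dimensions below are assumptions, not KVSplit's benchmark setup; the raw 8+4-bit figure is 62.5%, and the reported 59% reflects quantization block overhead such as per-block scales):

```python
# Rough KV-cache memory estimate for differentiated key/value precision.
# All model dimensions here are illustrative assumptions.

def kv_cache_bytes(n_layers, n_kv_heads, head_dim, ctx_len,
                   key_bits, value_bits):
    """Bytes needed for the KV cache at a given context length."""
    elems = n_layers * n_kv_heads * head_dim * ctx_len  # per K (or V) tensor
    return elems * (key_bits + value_bits) / 8          # K and V together

# Assumed Llama-style geometry with grouped-query attention
geom = dict(n_layers=32, n_kv_heads=8, head_dim=128, ctx_len=8192)

fp16 = kv_cache_bytes(**geom, key_bits=16, value_bits=16)
k8v4 = kv_cache_bytes(**geom, key_bits=8, value_bits=4)
print(f"FP16: {fp16/2**20:.0f} MiB, K8V4: {k8v4/2**20:.0f} MiB, "
      f"saving {1 - k8v4/fp16:.1%}")
# → FP16: 1024 MiB, K8V4: 384 MiB, saving 62.5%
```

The asymmetry (more bits for keys than values) matters because keys enter the attention dot product directly, so they are more sensitive to quantization error than values.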
This repo showcases how to run a model locally and offline, free of OpenAI dependencies.
Review/check GGUF files and estimate memory usage and maximum tokens per second.
#NLP# Local ML voice chat using high-end models.
#Android# Making offline AI models accessible to all types of edge devices.
A workbench for learning and practising AI tech in real scenarios on Android devices, powered by GGML (Georgi Gerganov Machine Learning) and FFmpeg.
LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.