#

gguf

https://static.github-zh.com/github_avatars/menloresearch?size=40
C++ 2.76 k
2 个月前
Mobile-Artificial-Intelligence/maid
https://static.github-zh.com/github_avatars/Mobile-Artificial-Intelligence?size=40

#安卓#Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

Dart 2.15 k
2 个月前
https://static.github-zh.com/github_avatars/heshengtao?size=40

LLM Agent Framework in ComfyUI includes MCP sever, Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfaces,...

Python 1.91 k
8 天前
https://static.github-zh.com/github_avatars/datawhalechina?size=40

#大语言模型#动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/

Jupyter Notebook 1.87 k
3 个月前
withcatai/node-llama-cpp
https://static.github-zh.com/github_avatars/withcatai?size=40

#大语言模型#Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level

TypeScript 1.66 k
6 天前
https://static.github-zh.com/github_avatars/edwko?size=40
Python 1.38 k
3 个月前
https://static.github-zh.com/github_avatars/Michael-A-Kuykendall?size=40

#计算机科学#⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

Rust 857
14 小时前
https://static.github-zh.com/github_avatars/eastriverlee?size=40

#IOS#LLM.swift is a simple and readable library that allows you to interact with large language models locally with ease for macOS, iOS, watchOS, tvOS, and visionOS.

C 726
1 个月前
https://static.github-zh.com/github_avatars/kelindar?size=40

#搜索#Go library for embedded vector search and semantic embeddings using llama.cpp

Go 485
3 个月前
https://static.github-zh.com/github_avatars/antirez?size=40

#大语言模型#GGUF implementation in C as a library and a tools CLI program

C 289
18 天前
https://static.github-zh.com/github_avatars/OEvortex?size=40

Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and Phind; access cutting-edge AI models; transcribe YouTube videos; generate temporary emails an...

Python 286
3 天前
https://static.github-zh.com/github_avatars/gpustack?size=40

LM inference server implementation based on *.cpp.

C++ 274
1 个月前
https://static.github-zh.com/github_avatars/ShelbyJenkins?size=40

#大语言模型#The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes

Rust 234
1 个月前
https://static.github-zh.com/github_avatars/gpustack?size=40

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

Go 205
1 个月前
loading...
Website
Wikipedia