#大语言模型#SGLang is a fast serving framework for large language models and vision language models.
#大语言模型#A Next-Generation Training Engine Built for Ultra-Large MoE Models
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance
#大语言模型#🚀 DeepSeek-V3 & R1大模型逆向API【特长:良心厂商】(官方贼便宜,建议直接走官方),支持高速流式输出、多轮对话,联网搜索,R1深度思考,零配置部署,多路token支持,仅供测试,如需商用请前往官方开放平台。
娜迦本地智能体,基于多智能体与多MCP兼容架构的通用型 AI 助手
一个用于管理和切换 Claude Code 和 Codex 不同供应商配置的桌面应用
#大语言模型#A powerful toolkit for compressing large models including LLM, VLM, and video generation models.
ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling
#自然语言处理#[EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
#自然语言处理#⚡️ A robust and developer-friendly, and community-driven PHP Client that provides a clean, extensible interface for integrating with the DeepSeek AI API.
#大语言模型#AI coding agent for your terminal.
#大语言模型#A tinystruct-based chat module which integrated with @OpenAI GPT-4 / 3.5-turbo / ChatGPT. @tinystruct
#大语言模型#多平台模型接入,可扩展,多种输出格式,提供大语言模型聊天服务的插件 | A bot plugin for LLM chat with multi-model integration, extensibility, and various output formats
#大语言模型#The open source implementation of DeepSeek-R1. 开源复现 DeepSeek-R1
Model Context Protocol server for DeepSeek's advanced language models
🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.
Deepseek V3 and R1 private API, deep thinking, search, full requests. pow challenge reversed. deepseek api.
Go (Golang) client for Deepseek API. Deepseek Go supports DeepSeek-V3, DeepSeek-R1 and more