#Large Language Models# A high-throughput and memory-efficient inference and serving engine for LLMs
A C++ port of the OpenAI Whisper speech recognition model.
#Computer Science# DeepSpeed Chat: one-click RLHF training that makes your ChatGPT-like, hundred-billion-parameter models 15x faster and cheaper to train
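#Android# MediaPipe is a cross-platform machine learning solution for live and streaming media. It provides face recognition, human pose recognition and tracking, object detection, selfie segmentation, instant motion tracking, and more.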
#Computer Science# Faster Whisper transcription with CTranslate2
#Large Language Models# SGLang is a fast serving framework for large language models and vision language models.
#Large Language Models# Machine Learning Engineering Open Book
🎨 The exhaustive Pattern Matching library for TypeScript, with smart type inference.
#Computer Science# NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
#Computer Science# Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
#Computer Science# The Triton Inference Server provides an optimized cloud and edge inferencing solution.
#Natural Language Processing# OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
#Computer Science# Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!
#Large Language Models# Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
#Face Recognition# 💎 1MB lightweight face detection model