GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

inference

Website
Wikipedia
https://static.github-zh.com/github_avatars/vllm-project?size=40
vllm-project / vllm

#大语言模型#A high-throughput and memory-efficient inference and serving engine for LLMs

gpt大语言模型PyTorchllmopsmlopsmodel-servingtransformerllm-servinginferencellamaamdrocmCUDAinferentiatrainiumtpuxpuhpudeepseekqwen
Python 49.62 k
15 小时前
hpcaitech/ColossalAI
https://static.github-zh.com/github_avatars/hpcaitech?size=40
hpcaitech / ColossalAI

#计算机科学#一个整合高效并行技术的AI大模型训练系统。

深度学习hpclarge-scaledata-parallelismpipeline-parallelismmodel-parallelism人工智能big-modeldistributed-computinginferenceheterogeneous-trainingfoundation-models
Python 40.96 k
2 天前
ggml-org/whisper.cpp
https://static.github-zh.com/github_avatars/ggml-org?size=40
ggml-org / whisper.cpp

OpenAI Whisper语音识别模型,C++移植版本。

openaispeech-to-texttransformerWhisperinferencespeech-recognition
C++ 40.79 k
2 天前
https://static.github-zh.com/github_avatars/deepspeedai?size=40
deepspeedai / DeepSpeed

#计算机科学#DeepSpeed Chat: 一键式RLHF训练,让你的类ChatGPT千亿大模型提速省钱15倍

深度学习PyTorchgpu机器学习billion-parametersdata-parallelismmodel-parallelisminferencepipeline-parallelismcompressionmixture-of-expertstrillion-parameterszero
Python 38.87 k
15 小时前
https://static.github-zh.com/github_avatars/google-ai-edge?size=40
google-ai-edge / mediapipe

#安卓#MediaPipe 是一个跨平台实时、流媒体机器学习解决方案。提供了人脸识别、人体姿势识别与跟踪、物体检测、自拍分割、即时运动跟踪等功能

mediapipeC++机器视觉深度学习Androidvideo-processingaudio-processingmobile-development机器学习inferencegraph-frameworkgraph-basedcalculator框架pipeline-frameworkstream-processingperception
C++ 30.22 k
2 天前
https://static.github-zh.com/github_avatars/Tencent?size=40
Tencent / ncnn

#安卓#ncnn 是一个为手机端极致优化的高性能神经网络前向计算框架

inferencehigh-preformancesimdarm-neon深度学习人工智能AndroidiOSncnnvulkan神经网络caffemxnetPyTorchonnxdarknetTensorflowmlirKerasRISC-V
C++ 21.63 k
1 天前
https://static.github-zh.com/github_avatars/SYSTRAN?size=40
SYSTRAN / faster-whisper

#计算机科学#Faster Whisper transcription with CTranslate2

深度学习inferencequantizationspeech-recognitionspeech-to-texttransformerWhisperopenai
Python 16.55 k
13 天前
https://static.github-zh.com/github_avatars/sgl-project?size=40
sgl-project / sglang

#大语言模型#SGLang is a fast serving framework for large language models and vision language models.

CUDAinferencellamallava大语言模型llm-servingmoePyTorchtransformervlmllama3llama3-1deepseekdeepseek-llmdeepseek-v3deepseek-r1deepseek-r1-zeroqwen3llama4
Python 15.14 k
4 小时前
https://static.github-zh.com/github_avatars/stas00?size=40
stas00 / ml-engineering

#大语言模型#Machine Learning Engineering Open Book

PyTorchslurmlarge-language-models大语言模型机器学习scalabilitytransformersmachine-learning-engineeringmlops人工智能inferencetraining
Python 14.03 k
6 天前
gvergnaud/ts-pattern
https://static.github-zh.com/github_avatars/gvergnaud?size=40
gvergnaud / ts-pattern

🎨 The exhaustive Pattern Matching library for TypeScript, with smart type inference.

pattern-matchingTypeScripttspatternmatchinginferencetype-inferenceexhaustiveconditionsbranchingJavaScript
TypeScript 13.67 k
1 个月前
https://static.github-zh.com/github_avatars/NVIDIA?size=40
NVIDIA / TensorRT

#计算机科学#NVIDIA®TensorRT™是一款用于在NVIDIA GPU上进行高性能深度学习推理的SDK。此存储库包含TensorRT的开源组件。

tensorrtNvidia深度学习inferencegpu-acceleration
C++ 11.72 k
25 天前
https://static.github-zh.com/github_avatars/aws?size=40
aws / amazon-sagemaker-examples

#计算机科学#Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

sagemakerAmazon Web Servicesreinforcement-learning机器学习深度学习ExampleJupyter Notebookmlops数据科学traininginference
Jupyter Notebook 10.55 k
3 个月前
https://static.github-zh.com/github_avatars/huggingface?size=40
huggingface / text-generation-inference

#自然语言处理#Large Language Model Text Generation Inference

bloom自然语言处理PyTorchinferencegpt深度学习transformerfalconstarcoder
Python 10.22 k
2 天前
https://static.github-zh.com/github_avatars/triton-inference-server?size=40
triton-inference-server / server

#计算机科学#The Triton Inference Server provides an optimized cloud and edge inferencing solution.

inferencegpu机器学习深度学习clouddatacenterEdge
Python 9.34 k
2 天前
https://static.github-zh.com/github_avatars/openvinotoolkit?size=40
openvinotoolkit / openvino

#自然语言处理#OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

inference深度学习openvino人工智能机器视觉diffusion-modelsgenerative-aillm-inference自然语言处理performance-boostspeech-recognitionstable-diffusiondeploy-aioptimize-aitransformersyolorecommendation-systemgood-first-issue
C++ 8.43 k
16 小时前
dusty-nv/jetson-inference
https://static.github-zh.com/github_avatars/dusty-nv?size=40
dusty-nv / jetson-inference

#计算机科学#Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

深度学习inference机器视觉embeddedimage-recognitionobject-detectionsegmentationjetsonjetson-tx1jetson-tx2jetson-xavierNvidiatensorrtcaffevideo-analyticsRobotics机器学习jetson-nano
C++ 8.33 k
8 个月前
https://static.github-zh.com/github_avatars/oumi-ai?size=40
oumi-ai / oumi

Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!

dpoevaluationfine-tuninginferencellama大语言模型sftvlms
Python 8.18 k
1 天前
https://static.github-zh.com/github_avatars/xorbitsai?size=40
xorbitsai / inference

#大语言模型#Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...

ggmlPyTorchchatglm部署flan-t5大语言模型wizardlm人工智能机器学习Whisperinferenceopenai-apimistralgemmallamallamacppvllmqwenllama3glm4
Python 8.03 k
2 天前
https://static.github-zh.com/github_avatars/Linzaer?size=40
Linzaer / Ultra-Light-Fast-Generic-Face-Detector-1MB

#人脸识别# 💎1MB lightweight face detection model (1MB轻量级人脸检测模型)

face-detectionarminferencemnnncnn
Python 7.36 k
1 年前
https://static.github-zh.com/github_avatars/gcanti?size=40
gcanti / io-ts

Runtime type system for IO decoding/encoding

TypeScriptvalidationinferencetypesruntime
TypeScript 6.78 k
6 个月前
loading...