GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

inferentia

Website
Wikipedia
https://static.github-zh.com/github_avatars/vllm-project?size=40
vllm-project / vllm

#大语言模型#vLLM 是一个高效的开源库,用于加速大语言模型推理,通过优化内存管理和分布式处理实现高吞吐量和低延迟。

gpt大语言模型PyTorchllmopsmlopsmodel-servingtransformerllm-servinginferencellamaamdrocmCUDAinferentiatrainiumtpuxpuhpudeepseekqwen
Python 53.63 k
2 小时前
https://static.github-zh.com/github_avatars/aphrodite-engine?size=40
aphrodite-engine / aphrodite-engine

#计算机科学#Large-scale LLM inference engine

APIinference-engine机器学习CUDAinferentiarocmintelloraspeculative-decodingtpu
C++ 1.49 k
9 天前
https://static.github-zh.com/github_avatars/aws-samples?size=40
aws-samples / foundation-model-benchmarking-tool

Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.

benchmarkingfoundation-modelsinferentiallama2sagemakergenerative-aibenchmarkbedrockllama3trainiumevaluation-metricsdeepseekdeepseek-r1
Jupyter Notebook 247
4 个月前
https://static.github-zh.com/github_avatars/aws-solutions-library-samples?size=40
aws-solutions-library-samples / guidance-for-machine-learning-inference-on-aws

This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It addresses the basic implementation requirements as well as ways...

inferentia机器学习
Shell 44
2 个月前
https://static.github-zh.com/github_avatars/aws-samples?size=40
aws-samples / aws-inferentia-huggingface-workshop

#自然语言处理#CMP314 Optimizing NLP models with Amazon EC2 Inf1 instances in Amazon Sagemaker

自然语言处理sagemakerinferentia
Jupyter Notebook 14
2 年前
https://static.github-zh.com/github_avatars/aws-samples?size=40
aws-samples / awsome-fmops

Collection of bet practices, reference architectures, examples, and utilities for foundation model development and deployment on AWS.

eksgenerative-aigpuinferentiakserveKubernetesllm-inferenceTerraformllm-trainingPyTorch
HCL 12
9 天前
https://static.github-zh.com/github_avatars/daekeun-ml?size=40
daekeun-ml / aws-inferentia

This repository provides an easy hands-on way to get started with AWS Inferentia. A demonstration of this hands-on can be seen in the AWS Innovate 2023 - AIML Edition session.

inferentia
Jupyter Notebook 8
2 年前
https://static.github-zh.com/github_avatars/DarkSector?size=40
DarkSector / inf1-sentence-transformers

Sentence Transformers on EC2 Inf1

Amazon Web Servicesinferentia
Jupyter Notebook 1
2 年前
https://static.github-zh.com/github_avatars/fereydoonboroojerdi?size=40
fereydoonboroojerdi / multimodal-customer-insights-generator

#计算机科学#Scalable multimodal AI system combining FSDP, RLHF, and Inferentia optimization for customer insights generation.

Amazon Web Servicescustomer-insights深度学习inferentiaPyTorchrlhfsagemaker
Python 1
3 个月前
https://static.github-zh.com/github_avatars/windson?size=40
windson / inferentia-deployments

#大语言模型#Deploy Large Models on AWS Inferentia (Inf2) instances.

Amazon Web Servicesinferentia大语言模型
Jupyter Notebook 0
2 年前
https://static.github-zh.com/github_avatars/Frrietzh?size=40
Frrietzh / vllm

#大语言模型#A high-throughput and memory-efficient inference and serving engine for LLMs

CUDAhpuhuggingface-transformersinferenceinferentialanguage-model大语言模型机器学习model-servingollamaPythonqwenraylibtpu
Python 0
3 个月前