#

ai-infra

https://static.github-zh.com/github_avatars/HuaizhengZhang?size=40

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys...

3.27 k
2 个月前
https://static.github-zh.com/github_avatars/Tencent?size=40

#大语言模型#A.I.G (AI-Infra-Guard) is a comprehensive, intelligent, and easy-to-use AI Red Teaming platform developed by Tencent Zhuque Lab.

Python 1.72 k
4 天前
https://static.github-zh.com/github_avatars/thu-ml?size=40

#大语言模型#SpargeAttention: A training-free sparse attention that can accelerate any model inference.

Cuda 713
1 个月前
https://static.github-zh.com/github_avatars/ForceInjection?size=40

AI 基础知识 - GPU 架构、CUDA 编程、大模型基础及AI Agent 相关知识

HTML 455
9 小时前
https://static.github-zh.com/github_avatars/RLinf?size=40

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 367
4 天前
https://static.github-zh.com/github_avatars/tensorchord?size=40

This is a landscape of the infrastructure that powers the generative AI ecosystem

HTML 149
1 年前
https://static.github-zh.com/github_avatars/OpenMLIR?size=40

Triton multi-level runner, include cubin, ptx, ttgir etc.

Python 36
4 天前
https://static.github-zh.com/github_avatars/jinbooooom?size=40

#大语言模型#hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等

Cuda 35
20 天前
https://static.github-zh.com/github_avatars/biubiutomato?size=40

#大语言模型#TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks.

Python 33
1 个月前
https://static.github-zh.com/github_avatars/NexusGPU?size=40

vgpu.rs is the fractional GPU & vgpu-hypervisor implementation written in Rust

Rust 24
1 天前
https://static.github-zh.com/github_avatars/awesomelistsio?size=40

#Awesome#A curated list of awesome tools, frameworks, platforms, and resources for building scalable and efficient AI infrastructure, including distributed training, model serving, MLOps, and deployment.

Python 18
2 个月前
https://static.github-zh.com/github_avatars/OpenMLIR?size=40

Triton for OpenCL backend, and use mlir-translate to get source OpenCL code

MLIR 11
19 天前
https://static.github-zh.com/github_avatars/jinbooooom?size=40

OriginDL: A distributed deep learning framework Built from scratch

C++ 10
6 天前
https://static.github-zh.com/github_avatars/oliverlabs?size=40
10
4 个月前
https://static.github-zh.com/github_avatars/memas-ai?size=40

Memory Management Service, a Long Term Memory Solution for AI

Python 8
1 年前
https://static.github-zh.com/github_avatars/leonardocremasco?size=40

#大语言模型#TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks. DAG upgrade in progress.

Python 1
4 个月前
loading...
Website
Wikipedia