GitHub Chinese Community

©2025 GitHub Chinese Community

#cutlass

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

cutlass, PyTorch, CUDA, gpu
C++ 1.04k
2 days ago
coderonion / awesome-cuda-and-hpc

#Large Language Models# 🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.

CUDA, cublas, tensorrt, Awesome Lists, Large Language Models, gpu, blas, PyTorch, hpc, gemm, llama, cudnn, triton, tensorrt-llm, cutlass, mlir, tvm, deepseek, ptx, vlm
300
9 days ago
DD-DuDa / Cute-Learning

Examples of CUDA implementations using CUTLASS CuTe

CUDA, cutlass, gpu
Makefile 212
1 month ago
leimao / CUTLASS-Examples

CUTLASS and CuTe Examples

CUDA, cutlass, Docker
Cuda 64
16 days ago
Bruce-Lee-LY / flash_attention_inference

#Large Language Models# Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.

CUDA, flash-attention, gpu, inference, Large Language Models, Nvidia, cutlass, mha
C++ 39
5 months ago
YashasSamaga / ConvolutionBuildingBlocks

#Computer Science# GEMM and Winograd based convolutions using CUTLASS

Deep Learning, convolution, CUDA, cutlass
Cuda 26
5 years ago
yester31 / Cutlass_EX

A study of CUTLASS

cmake, C++, CUDA, cutlass, parallel-programming
Cuda 22
9 months ago
Bruce-Lee-LY / cutlass_gemm

#Large Language Models# Multiple GEMM operators are constructed with CUTLASS to support LLM inference.

cublas, cutlass, Large Language Models, Nvidia, gemm, gpu
C++ 18
14 days ago
sgl-project / whl

Kernel Library Wheel for SGLang

CUDA, cutlass
HTML 10
1 day ago
qdLMF / LightGlue-with-FlashAttentionV2-TensorRT

A CUTLASS CuTe implementation of a head-dim-64 FlashAttention v2 TensorRT plugin for LightGlue. Runs on Jetson Orin NX 8GB with TensorRT 8.5.2.

cute, cutlass, tensorrt, feature-matching, CUDA, flash-attention, multihead-attention, transformer, superpoint
Cuda 9
5 months ago
cjmcv / ai-infra-notes

#Large Language Models# Reading notes on the open source code of AI infrastructure (sglang, llm, cutlass, hpc, etc.)

CUDA, cutlass, hpc, inference, Large Language Models, mlsys, simd, gpu
3
13 days ago
digital-nomad-cheng / tvm_project_course

Compiler, CUDA, cutlass, Neural Networks, tensorrt, tvm
Python 1
2 years ago
Routhleck / blocksparse-pytorch-implement

A block-sparse implementation in PyTorch

CUDA, cutlass, matrix-multiplication, Python, PyTorch
C++ 1
2 years ago
prateekshukla1108 / cutlass3

Docs

cutlass
HTML 0
3 months ago
peterlau123 / Lolly

Lightweight, production-level open-source C++ library

C++, C, CUDA, cutlass
C++ 0
3 months ago
jiaau / kernels

This repository showcases common optimization techniques for kernels.

C++, CUDA, cute, cutlass, hpc, Kernel
Cuda 0
2 months ago