GitHub Chinese Community
#cublas

cupy / cupy

NumPy & SciPy for GPU

CUDA, cudnn, cublas, cusolver, nccl, Python, NumPy, cupy, curand, cusparse, gpu, SciPy, tensor, rocm
Python 10.28 k
2 days ago

kevmo314 / scuda

SCUDA is a GPU-over-IP bridge that allows GPUs on remote machines to be attached to CPU-only machines.

CUDA, gpu, remote-access, Network, cublas, cudnn, nvml, mlops
C++ 1.73 k
6 days ago

lebedov / scikit-cuda

Python interface to GPU-powered libraries

Python, gpu, CUDA, blas, lapack, numerical, cublas, cusolver
Python 991
2 years ago

coreylowman / cudarc

Safe Rust wrapper around the CUDA toolkit

CUDA, cuda-programming, gpu, gpu-acceleration, Rust, cublas, curand, cuda-kernels, cudnn, nccl
Rust 857
1 month ago

Bruce-Lee-LY / cuda_hgemm

Several optimization methods for half-precision general matrix multiplication (HGEMM) using Tensor Cores with the WMMA API and MMA PTX instructions.

CUDA, gemm, cublas, Nvidia, gpu
Cuda 418
9 months ago

coderonion / awesome-cuda-and-hpc

#Large language models# 🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.

CUDA, cublas, tensorrt, Awesome Lists, large language models, gpu, blas, PyTorch, hpc, gemm, llama, cudnn, triton, tensorrt-llm, cutlass, mlir, tvm, deepseek, ptx, vlm
278
16 days ago

Bruce-Lee-LY / cuda_hook

Hooks CUDA-related dynamic libraries using automated code generation tools.

nvml, gpu, Nvidia, cublas, cudnn, curand, cusolver, cusparse
C 157
2 years ago

sasagawa888 / deeppipe2

#Computer science# Deep Learning library using GPU (CUDA/cuBLAS)

gpu, CUDA, cublas, deep learning, Elixir
Elixir 94
4 years ago

rxwei / cuda-swift

Parallel Computing Library for Linux and macOS & NVIDIA CUDA Wrapper

CUDA, cublas, gpu, Swift, parallel
Swift 82
8 years ago

bokutotu / zenu

#Computer science# A Deep Learning framework with very few dependencies, written in Rust

artificial intelligence, autograd, deep learning, deep neural networks, Rust, blas, cublas, CUDA, cudnn, gpu-computing, hpc
Rust 63
4 months ago

Bruce-Lee-LY / cuda_hgemv

Several optimization methods for half-precision general matrix-vector multiplication (HGEMV) using CUDA cores.

cublas, CUDA, gemm, gpu, Nvidia
Cuda 62
9 months ago

rbaygildin / learn-gpgpu

Algorithms implemented in CUDA + resources about GPGPU

gpu, gpu-computing, Nvidia, CUDA, curand, cublas, parallel-computing, opencl, gpgpu, image processing
Cuda 56
3 years ago

hma02 / cublasHgemm-P100

Code for testing native float16 matrix multiplication performance on Tesla P100 and V100 GPUs, based on cublasHgemm

gpu, precision, float16, cublas, gemm
Cuda 34
6 years ago

eth-cscs / Tiled-MM

Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.

CUDA, gpu, matrix-multiplication, rocm, Nvidia, amd, cublas
C++ 32
2 months ago

hma02 / cublasgemm-benchmark

Code for benchmarking GPU performance based on cublasSgemm and cublasHgemm

gpu, benchmarking, gemm, cublas, CUDA
Cuda 31
3 years ago

coderonion / cuda-beginner-course-cpp-version

Companion code for the bilibili video course "Introduction to CUDA 12.x Parallel Programming (C++ Edition)"

C++, CUDA, cuda-programming, gpu, gpu-programming, Nvidia, parallel-programming, Rust, cudnn, cublas, Python
Cuda 29
10 months ago

conradsnicta / bandicoot-code

#Computer science# Bandicoot: C++ library for GPU linear algebra & scientific computing - https://coot.sourceforge.io

C++, opencl, gpu, matrix-functions, matrix-library, linear-algebra, linear-algebra-library, scientific-computing, machine learning, cublas, CUDA, cuda-kernels, cusolver, gpu-acceleration, gpu-computing
29
2 years ago

mnicely / nvml_examples

Examples showing how to utilize the NVML library for GPU monitoring

CUDA, cublas, Nvidia, nvml
C++ 28
3 years ago

jagennath-hari / CUDA-Accelerated-Visual-Inertial-Odometry-Fusion

Harness the power of GPU acceleration for fusing visual odometry and IMU data with an advanced Unscented Kalman Filter (UKF) implementation. Developed in C++ and utilizing CUDA, cuBLAS, and cuSOLVER, ...

cublas, CUDA, cusolver, kalman-filter, Robotics, sensor-fusion, state-estimation, visual-inertial-odometry, visual-odometry, unscented-kalman-filter, ros2
Cuda 27
1 year ago

gritukan / hamkaas

#Computer science#

cublas, CUDA, cudnn, diy, deep learning, inference
C++ 26
9 months ago