GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

slurm

Website
Wikipedia
https://static.github-zh.com/github_avatars/stas00?size=40
stas00 / ml-engineering

#大语言模型#Machine Learning Engineering Open Book

PyTorchslurmlarge-language-models大语言模型机器学习scalabilitytransformersmachine-learning-engineeringmlops人工智能inferencetraining
Python 14.03 k
6 天前
https://static.github-zh.com/github_avatars/SchedMD?size=40
SchedMD / slurm

Slurm: A Highly Scalable Workload Manager

slurmslurm-job-schedulerslurm-workload-manager
C 3.1 k
4 天前
nextflow-io/nextflow
https://static.github-zh.com/github_avatars/nextflow-io?size=40
nextflow-io / nextflow

A DSL for data-driven computational pipelines

Bioinformaticsworkflow-enginepipelinepipeline-frameworknextflowcloudGroovyslurmAmazon Web ServicesDockersingularityhpcreproducible-sciencereproducible-researchdataflow
Groovy 2.96 k
6 天前
https://static.github-zh.com/github_avatars/dstackai?size=40
dstackai / dstack

#计算机科学#dstack is an open-source alternative to Kubernetes and Slurm, designed to simplify GPU allocation and AI workload orchestration for ML teams across top clouds, on-prem clusters, and accelerators.

机器学习PythonAmazon Web ServicesAzureGoogle 云gpu大语言模型cloudorchestrationfine-tuningtrainingKubernetesamdDockerinferenceNvidiaslurm
Python 1.8 k
3 天前
https://static.github-zh.com/github_avatars/facebookincubator?size=40
facebookincubator / submitit

Python 3.8+ toolbox for submitting jobs to Slurm

slurmPythonclusters
Python 1.45 k
1 个月前
https://static.github-zh.com/github_avatars/DataBiosphere?size=40
DataBiosphere / toil

A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.

Common Workflow LanguagePythonmesosslurmworkflowpipelineAmazon Web ServicesKubernetes
Python 912
6 天前
https://static.github-zh.com/github_avatars/PySlurm?size=40
PySlurm / pyslurm

Python Interface to Slurm

slurmcythonPythonhpccluster
Cython 519
1 个月前
https://static.github-zh.com/github_avatars/rackslab?size=40
rackslab / Slurm-web

Open source web interface for Slurm HPC & AI clusters

dashboardhpcslurmwebui人工智能
Python 438
10 天前
https://static.github-zh.com/github_avatars/LambdaLabsML?size=40
LambdaLabsML / distributed-training-guide

Best practices & guides on how to write distributed pytorch training code

CUDAdeepspeeddistributed-traininggpugpu-clusterkuberentesncclPyTorchslurmclustermpisharding
Python 435
4 个月前
https://static.github-zh.com/github_avatars/giovtorres?size=40
giovtorres / slurm-docker-cluster

A Slurm cluster using docker-compose

hpcslurmDocker Compose
Dockerfile 373
9 个月前
https://static.github-zh.com/github_avatars/pipefunc?size=40
pipefunc / pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

pipeline-frameworkpipelinesreproducible-researchdaghpcparallel-computingslurmworkflow-engine
Python 371
6 天前
https://static.github-zh.com/github_avatars/pytorch?size=40
pytorch / torchx

#计算机科学#TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

PyTorch机器学习Kubernetesslurmdistributed-trainingpipelinescomponents深度学习Pythonrayairflow
Python 366
9 天前
https://static.github-zh.com/github_avatars/elasticluster?size=40
elasticluster / elasticluster

Create clusters of VMs on the cloud and configure them with Ansible.

PythonclusterslurmhadoopApache Sparkec2cloudGoogle 云AzureclusteringhpcAnsible
Python 337
2 年前
https://static.github-zh.com/github_avatars/justanhduc?size=40
justanhduc / task-spooler

A scheduler for GPU/CPU tasks

slurmslurm-job-schedulerjob-schedulerC++LinuxDebianCmakefile
C 334
1 年前
https://static.github-zh.com/github_avatars/Azure?size=40
Azure / batch-shipyard

Simplify HPC and Batch workloads on Azure

DockerhpcmpigpuinfinibandrdmaAzurebatch-processingnfsglusterfsazure-functionscontainerssingularitywindows-containersServerlessslurm
Python 278
2 年前
https://static.github-zh.com/github_avatars/zhenrong-wang?size=40
zhenrong-wang / hpc-now

A Cross-Platform, Multi-Cloud High-Performance Computing Platform

aliyunAmazon Web ServicesAzurebaiduyunCcloudGoogle 云hpchuaweicloudScripttencent-cloudTerraformclusterLinuxslurmopentofuDevOps
C 259
5 个月前
https://static.github-zh.com/github_avatars/dell?size=40
dell / omnia

An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.

k8s-clusterKubernetesslurmdell-emcAnsiblehpc
YAML 254
5 天前
https://static.github-zh.com/github_avatars/vpenso?size=40
vpenso / prometheus-slurm-exporter

Prometheus exporter for performance metrics from Slurm.

prometheus-exporterslurm
Go 253
1 年前
https://static.github-zh.com/github_avatars/nebius?size=40
nebius / soperator

#计算机科学#Run Slurm in Kubernetes

hpcKubernetes机器学习model-trainingslurmhigh-performance-computing
Go 233
9 天前
https://static.github-zh.com/github_avatars/TUM-DAML?size=40
TUM-DAML / seml

SEML: Slurm Experiment Management Library

Utility Softwareslurmslurm-workload-managerexperiment-trackinghyperparameter-optimizationexperiment-managerorchestration
Python 182
23 天前
loading...