GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

checkpointing

Website
Wikipedia
https://static.github-zh.com/github_avatars/kakaobrain?size=40
kakaobrain / torchgpipe

#计算机科学#A GPipe implementation in PyTorch

深度学习PyTorchgpipemodel-parallelismpipeline-parallelismparallelismcheckpointing
Python 843
1 年前
https://static.github-zh.com/github_avatars/argonne-lcf?size=40
argonne-lcf / dlio_benchmark

#大语言模型#An I/O benchmark for deep Learning applications

人工智能data-management深度学习storagePyTorchTensorflowcheckpointing大语言模型
Python 87
1 个月前
https://static.github-zh.com/github_avatars/cedana?size=40
cedana / cedana-cli

Cedana: Access and run on compute anywhere in the world, on any provider. Migrate seamlessly between providers, arbitraging price/performance in realtime to maximize pure runtime.

checkpointingcpugpu人工智能DockerLinux
Go 58
1 个月前
https://static.github-zh.com/github_avatars/ECP-VeloC?size=40
ECP-VeloC / VELOC

Very-Low Overhead Checkpointing System

checkpointingasync-storage
C++ 57
5 个月前
https://static.github-zh.com/github_avatars/dorukkarinca?size=40
dorukkarinca / keras-buoy

#计算机科学#Keras wrapper that autosaves what ModelCheckpoint cannot.

Kerascheckpointing数据科学机器学习colab-notebookcolaboratorycolab
Python 24
3 年前
https://static.github-zh.com/github_avatars/jorgensd?size=40
jorgensd / adios4dolfinx

Extending DOLFINx with checkpointing functionality

checkpointing
Python 24
2 个月前
https://static.github-zh.com/github_avatars/alex-w-99?size=40
alex-w-99 / Checkpointing-Program

A lightweight checkpointing program written in C.

Ccheckpointingld-preloadsignal-processingldpreloadmmap
C 2
2 年前
https://static.github-zh.com/github_avatars/EJASKHAN?size=40
EJASKHAN / flink-consume-produce-ek

This FLINK project will consume streams from an azure event-hub and produce to a different event-hub ,and the config files for deploying the same in kubernetes

KubernetesflinkJavak8s-deploymentk8s-clusterkubernetes-deploymentcheckpointingAzureDockerflink-examples
Java 2
4 年前
https://static.github-zh.com/github_avatars/f-dangel?size=40
f-dangel / wandb_preempt

Code and tutorial on integrating wandb sweeps with Slurm pre-emption

checkpointingPyTorchslurmwandb
Python 2
9 个月前
https://static.github-zh.com/github_avatars/rubrikinc?size=40
rubrikinc / sysfail

A shared library to help test your code with failure-injection

checkpointingfailure-injectionprogress
C++ 1
5 个月前
https://static.github-zh.com/github_avatars/Christopher-K-Long?size=40
Christopher-K-Long / thread-chunks

A python package for performing memory intensive computations in parallel using chunks and checkpointing.

checkpointingchunkingmultithreadingparallel
Python 1
2 个月前
https://static.github-zh.com/github_avatars/Shaswat-G?size=40
Shaswat-G / PyRecover

#计算机科学#Robust distributed checkpointing and job management system for multi-GPU SLURM workloads

checkpointingcluster-computing深度学习distributed-trainingfault-tolerancegpu-computinghigh-performance-computinghpcjob-managementPyTorchslurm
Python 1
14 天前
https://static.github-zh.com/github_avatars/gulabpatel?size=40
gulabpatel / Model_Checkpoingting

checkpointcheckpointing
Jupyter Notebook 1
3 年前
https://static.github-zh.com/github_avatars/EJASKHAN?size=40
EJASKHAN / flink-producer

This is a standalone flink producer using for testing the flink-consume-produce-ek repo contents

flinkKubernetesapache-flink部署checkpointing
Java 1
4 年前
https://static.github-zh.com/github_avatars/jrwellshpc?size=40
jrwellshpc / dmtcp_scripts

DMTCP scripts to get Python scripts working with SLURM.

slurmdmtcpgpucounter人工智能机器学习checkpointcheckpointing
Shell 1
1 年前
https://static.github-zh.com/github_avatars/Christopher-K-Long?size=40
Christopher-K-Long / saveable-objects

A python package for checkpointing, saving, and loading objects.

checkpointcheckpointingloadsave
Python 0
2 个月前
https://static.github-zh.com/github_avatars/grebtsew?size=40
grebtsew / AlbumOrganizer

#人脸识别#A digital album face recognition manager, that isolates images of a specified person from a digital album.

机器视觉DockerDocker Composeface-recognitioncheckpointingmultiprocess
Python 0
1 年前
https://static.github-zh.com/github_avatars/AD1024?size=40
AD1024 / torch-checkpointing

Compile a torch model to a checkpointed model

checkpointing
Python 0
5 年前
https://static.github-zh.com/github_avatars/kamangir?size=40
kamangir / blue-objects-2024-09-05-a

🌀 data objects for Bash (attempt one).

人工智能Bashcheckpointing
0
9 个月前
https://static.github-zh.com/github_avatars/SanjithChockan?size=40
SanjithChockan / CheckPointing-Recovery

Koo and Toueg’s checkpointing and recovery protocol

checkpointingdistributed-systemsrecovery
Java 0
2 年前
loading...