The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...

PyTorch resnet pretrained-models pretrained-weights distributed-training

Python 35.42 k

1 day ago

diffusers

Hugging Face @huggingface

#Computer Science# 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

deep-learning diffusion image-generation PyTorch score-based-generative-modeling

Python 31.01 k

13 hours ago

smolagents

Hugging Face @huggingface

🤗 smolagents: a barebones library for agents that think in code.

Python 22.9 k

20 days ago

You may be interested in

grok-1

@xai-org

Open-source release of the Grok-1 large model

Python 50.53 k

1 year ago

VoiceCraft

@jasonppy

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8.4 k

7 months ago

insanely-fast-whisper

@Vaibhavs10

Jupyter Notebook 8.6 k

1 year ago

faster-whisper

@SYSTRAN

#Computer Science# Faster Whisper transcription with CTranslate2

deep-learning inference quantization speech-recognition speech-to-text

Python 18.44 k

2 months ago

devika

@stitionai

Devika is now Opcode

Python 19.49 k

13 days ago

whisperX

@m-bain

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asr speech speech-recognition speech-to-text Whisper

Python 18.05 k

5 days ago

whisper-jax

@sanchit-gandhi

#Computer Science# JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

deep-learning jax speech-recognition speech-to-text Whisper

Jupyter Notebook 4.64 k

2 years ago

OpenHands

@All-Hands-AI

#Large Language Models# 🙌 OpenHands: Code Less, Make More

agent artificial-intelligence large-language-models ChatGPT claude-ai

Python 64.01 k

4 hours ago

maestro

@Doriandarko

A framework for Claude Opus to intelligently orchestrate subagents.

Python 4.28 k

1 year ago

skyvern

@Skyvern-AI

#Large Language Models# Automate browser-based workflows with LLMs and Computer Vision

API automation browser computer gpt

Python 14.53 k

2 hours ago

Open-Sora

@hpcaitech

Open-Sora: a fully open-source, efficient reproduction of Sora-like video generation

Python 27.33 k

5 months ago

whisper

OpenAI @openai

Whisper is a general-purpose speech recognition model

Python 89.09 k

1 month ago

MeloTTS

@myshell-ai

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

text-to-speech tts chinese english french

Python 6.84 k

9 months ago

whisper.cpp

@ggml-org

A C++ port of OpenAI's Whisper speech recognition model.

openai speech-to-text transformer Whisper inference

C++ 43.68 k

2 days ago

AniPortrait

@Zejun-Yang

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 5 k

1 year ago

SWE-agent

@SWE-agent

#大语言模型#SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

agent artificial-intelligence developer-tools large-language-models agent-based-model

Python 17.52 k

1 day ago