#Natural Language Processing# State-of-the-art natural language processing for JAX, PyTorch, and TensorFlow
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
#Computer Science# 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
This repository contains the Hugging Face Agents Course.
Open-Sora: A fully open-source, efficient reproduction of Sora-like video generation
[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Reproduction of AnimateAnyone
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
Fast and memory-efficient exact attention
MiniSora: A community that aims to explore the implementation path and future development direction of Sora.
Automatic masking and inpainting for people, faces, and hands. Resizes images using a detection model.
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
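#Interview# A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, also including new ideas, new questions, new resources, and new projects from work and research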
Official code for "Style Aligned Image Generation via Shared Attention"
#Computer Science# Easily compute CLIP embeddings and build a CLIP retrieval system with them
Official implementation of the research paper "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" (CVPR 2024)
Collection of generative models in TensorFlow
#Large Language Models# This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.