GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

text-to-image-generation

Website
Wikipedia
https://static.github-zh.com/github_avatars/NVlabs?size=40
NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

diffusionditPyTorchsanatext-to-image-generationtransformers
Python 4.49 k
1 天前
https://static.github-zh.com/github_avatars/Lightricks?size=40
Lightricks / ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

comfyuidiffusion-modelsditimage-to-videoimage-to-video-generationtext-to-imagetext-to-image-generation
Python 2.34 k
2 个月前
https://static.github-zh.com/github_avatars/adobe-research?size=40
adobe-research / custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

customizationfine-tuningtext-to-image-generation机器视觉diffusion-modelsfew-shotPyTorch
Python 1.96 k
2 年前
https://static.github-zh.com/github_avatars/FoundationVision?size=40
FoundationVision / Infinity

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

auto-regressive-modelautoregressive-modelsgenerative-modelgptgpt-2image-generationtext-to-imagetext-to-image-generationtransformers
Python 1.43 k
3 个月前
https://static.github-zh.com/github_avatars/muzishen?size=40
muzishen / IMAGDressing

#数据仓库#[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high f...

数据集diffusion-modelstext-to-image-generation
Python 1.28 k
23 天前
https://static.github-zh.com/github_avatars/songweige?size=40
songweige / rich-text-to-image

Rich-Text-to-Image Generation

机器视觉diffusion-modelsPyTorchrich-texttext-to-image-generation
Python 801
2 年前
https://static.github-zh.com/github_avatars/PKU-YuanGroup?size=40
PKU-YuanGroup / UniWorld-V1

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

diffusionhigh-level-featureimage-editingimage-understandinglow-level-visiontext-to-image-generationunifyunify-aivlm
Python 701
1 个月前
https://static.github-zh.com/github_avatars/AIDC-AI?size=40
AIDC-AI / Awesome-Unified-Multimodal-Models

Awesome Unified Multimodal Models

multimodal-large-language-modelstext-to-image-generationmultimodal-modelsvision-language-model
690
1 个月前
https://static.github-zh.com/github_avatars/FoundationVision?size=40
FoundationVision / Liquid

Liquid: Language Models are Scalable and Unified Multi-modal Generators

generativegenerative-ai大语言模型text-to-imagetext-to-image-generationautoregressive-modelslarge-language-modelsmultimodal-large-language-models
Python 613
5 个月前
https://static.github-zh.com/github_avatars/donahowe?size=40
donahowe / AutoStudio

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

image-generationtext-to-image-generation
Jupyter Notebook 446
5 个月前
https://static.github-zh.com/github_avatars/Paranioar?size=40
Paranioar / Awesome_Matching_Pretraining_Transfering

#Awesome#The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insigh...

cross-modal-retrieval教程Awesome Listsimage-text-matchingimage-text-retrievallarge-language-modelslarge-vision-language-modelsmultimodal-pretrainingparameter-efficient-fine-tuningvision-and-languagemultimodal-large-language-models大语言模型text-to-image-generationtext-to-image-synthesistext-to-video-generation
428
9 个月前
https://static.github-zh.com/github_avatars/ByteVisionLab?size=40
ByteVisionLab / TokenFlow

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

large-language-modelstext-to-image-generation
Python 377
1 个月前
https://static.github-zh.com/github_avatars/OSU-NLP-Group?size=40
OSU-NLP-Group / MagicBrush

[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

diffusion-modelsimage-editingimage-generationimage-synthesisinstruction-followingtext-to-imagetext-to-image-generationtext-to-image-synthesis
Python 375
7 个月前
https://static.github-zh.com/github_avatars/woctezuma?size=40
woctezuma / stable-diffusion-colab

#计算机科学#Colab notebook for Stable Diffusion Hyper-SDXL.

colabcolab-notebookcolaboratorystable-diffusionhuggingface-diffusersdiffusiondiffusion-modelstext-to-imagetext-to-image-generationtext-to-image-synthesisdiffusersgoogle-colabgoogle-colab-notebookimage-generationtext2image深度学习
Jupyter Notebook 326
5 个月前
https://static.github-zh.com/github_avatars/RockeyCoss?size=40
RockeyCoss / SPO

[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization

diffusion-modelsdposdxltext-to-imagetext-to-image-generation
Python 245
5 个月前
https://static.github-zh.com/github_avatars/huggingface?size=40
huggingface / diffusion-fast

Faster generation with text-to-image diffusion models.

diffusersdiffusion-modelsPyTorchsdxltext-to-image-generation
Python 226
3 个月前
https://static.github-zh.com/github_avatars/CFGpp-diffusion?size=40
CFGpp-diffusion / CFGpp

Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR2025)

diffusion-modelimage-editing机器学习PyTorchtext-to-imagetext-to-image-generation
Python 224
6 个月前
https://static.github-zh.com/github_avatars/yunqing-me?size=40
yunqing-me / AttackVLM

[NeurIPS-2023] Annual Conference on Neural Information Processing Systems

generative-aitext-to-image-generationfoundation-modelslarge-language-modelsvision-language-model
Python 211
9 个月前
https://static.github-zh.com/github_avatars/tsunghan-wu?size=40
tsunghan-wu / SLD

🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)

diffusion-modelsimage-editingtext-to-image-generationdalle-3stable-diffusion
Python 182
1 年前
https://static.github-zh.com/github_avatars/markfulton?size=40
markfulton / NanoBananaEditor

The most advanced Nano Banana image generator and editor application. Your central hub for AI image generation and revisions. Intuitive UI features reference images, editing with image masks, version ...

boltimageeditortext-to-imagetext-to-image-generationvibecoding
TypeScript 180
13 天前
loading...