Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Official code for the CVPR 2025 paper "SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models."
#计算机科学#Segment Anything combined with CLIP
#大语言模型#Complex question answering in LLMs with enhanced reasoning and information-seeking capabilities.
#自然语言处理#extending stable diffusion prompts with suitable style cues using text generation
A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.
#自然语言处理#NLP model that predicts subreddit based on the title of a post
#计算机科学#Implementation of MagicMix: Semantic Mixing with Diffusion Models paper
#自然语言处理#Fact checking baseline combining dense retrieval and textual entailment
#计算机科学#An image classifier to classify things as huggable or not.
:hugs: A multilabel lymph node segmentation dataset from contrast CT
#计算机科学#Colorize your black and white images and YouTube videos for free. Streamlit application based on CNN deployed on Hugging Face.
#自然语言处理#NLP model that determines whether a plot is anime enough
Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generation.
#大语言模型#This project employs the GFPGAN algorithm to upscale and restore images. The tool leverages state-of-the-art deep learning models to enhance image quality and potentially restore degraded parts of the...
Automagically create flashcards from text
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
Your open-source alternative to AlphaFold3🚀