The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
#向量搜索引擎#向量相似性搜索库,为稠密向量提供高效相似度搜索和聚类
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Fairseq 是一个Python编写的 Seq2seq 建模工具包,可用于翻译、摘要、语言建模和其他文本生成任务训练自定义模型
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Open-Sora: 完全开源的高效复现类Sora视频生成方案
VMamba: Visual State Space Models,code is based on mamba
Flops counter for neural networks in pytorch framework
Mamba SSM architecture
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Puter 是一个运行在浏览器上的OS。功能丰富、快速、可扩展性强。它可以用于构建远程桌面环境,也可以作为云存储服务、远程服务器、网络托管平台等的接口。
#数据仓库#TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
#计算机科学#Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory when ...
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
#计算机科学#Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Efficient vision foundation models for high-resolution generation and perception.
0 条讨论