关于

Contrastive Language-Audio Pretraining

arxiv.org

创建时间

2022-03-06

是否国产

否

语言

Python97.2%
Shell2.8%

该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README

0 条讨论

登录后发表评论

LAION-AI 的其他开源项目

Open-Assistant

@LAION-AI

#大语言模型#面向所有人的对话式 AI，我们相信我们即将创造一场革命，正如 Stable Diffusion 改变了现代艺术的创作过程, 我们将透过对话式 AI 来改变世界.

ChatGPT language-model rlhf 人工智能 assistant

Python37.34 k

8 个月前

CLIP_benchmark

@LAION-AI

CLIP-like model evaluation

Jupyter Notebook697

1 个月前

audio-dataset

@LAION-AI

Audio Dataset for training CLAP and other models

Python667

1 年前

您可能感兴趣的

Open-Sora

@hpcaitech

Open-Sora：完全开源的高效复现类Sora视频生成方案

Python26.31 k

21 小时前

grok-1

@xai-org

大模型Grok-1开源

Python50.24 k

8 个月前

Open-Sora-Plan

@PKU-YuanGroup

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python11.95 k

1 个月前

VoiceCraft

@jasonppy

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook8.25 k

2 个月前

transformer-debugger

OpenAI@openai

Python4.08 k

1 年前

DiT

@facebookresearch • Meta

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python7.22 k

1 年前

EMO

@HumanAIGC

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7.63 k

8 个月前

musiclm-pytorch

Phil Wang@lucidrains

#计算机科学#Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

人工智能 attention-mechanisms 深度学习 music-synthesis transformers

Python3.25 k

2 年前

T-Rex

@IDEA-Research

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

interactive object-counting object-detection open-set visual-prompt

Python2.47 k

9 天前

audioset_tagging_cnn

@qiuqiangkong

Python1.46 k

9 个月前

low_cost_robot

@AlexanderKoch-Koch

Python3.27 k

7 个月前

CLAP

Microsoft@microsoft

Learning audio concepts from natural language supervision

Python550

7 个月前

serenity

SerenityOS@SerenityOS

#操作系统#SerenityOS 是一款基于X86架构的类 Unix 的图形化操作系统，其UI界面仿90年代设计。

操作系统 C++Unix desktop-environment

C++31.58 k

2 天前

encodec

@facebookresearch • Meta

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python3.66 k

1 年前

audiocraft

@facebookresearch • Meta

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable mu...

Jupyter Notebook21.91 k

2 个月前

FeatUp

@mhamilton723

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Jupyter Notebook1.46 k

10 个月前

generative-models

Stability AI@Stability-AI

Generative Models by Stability AI

Python25.78 k

1 个月前

jepa

@facebookresearch • Meta

PyTorch code and models for V-JEPA self-supervised learning from video.

Python2.96 k

2 个月前

LAION-AI / CLAP

关于

创建时间

是否国产

修改时间

语言

0 条讨论

LAION-AI 的其他开源项目

您可能感兴趣的

LAION-AI / CLAP

关于

创建时间

是否国产

修改时间

语言

自述文件

0 条讨论

LAION-AI 的其他开源项目

您可能感兴趣的