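#Android#MediaPipe is a cross-platform machine learning solution for live and streaming media. It provides face recognition, human pose recognition and tracking, object detection, selfie segmentation, instant motion tracking, and more.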
#Computer Science#An audio track separation tool built on TensorFlow; a single command separates a song's vocals from its various instrument sounds.
#Computer Science#A PyTorch-based Speech Toolkit
THIS REPO IS NOT MAINTAINED ANYMORE. Please see https://codeberg.org/tenacityteam/tenacity for Tenacity, which is maintained.
macOS System-wide Audio Equalizer & Volume Mixer 🎧
#Computer Science#🎛 🔊 A Python library for audio.
#Computer Science#A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
An implementation of Shazam's song recognition algorithm.
Auto-Editor: Efficient media analysis and rendering
#Computer Science#A library for audio and music analysis and feature extraction.
#Awesome#List of articles related to deep learning applied to music
#Computer Science#Isolate vocals, drums, bass, and other instrumental stems from any song
🎵 🌈 Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi
#Computer Science#Data manipulation and transformation for audio signal processing, powered by PyTorch
Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
#Face Recognition#The collection of pre-trained, state-of-the-art AI models for ailia SDK
A little package that brings sound to any Go application. Suitable for playback and audio-processing.
Open-Source Large Vocabulary Continuous Speech Recognition Engine
#Natural Language Processing#Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow and scikit-learn.