#学习与技能提升#免费公共 API 集合
#数据仓库#Label Studio is a multi-type data labeling and annotation tool with standardized output format
Faker is a Python package that generates fake data for you.
#计算机科学#pix2tex: Using a ViT to convert images of equations into LaTeX code.
#计算机科学#CVAT 是一个领先的工业级用于机器学习的图片、视频标注工具。
#计算机科学#A MNIST-like fashion product database. Benchmark 👇
#自然语言处理#Open source annotation tool for machine learning practitioners.
#自然语言处理#大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
#数据仓库#Techniques for deep learning with satellite & aerial imagery
#大语言模型#A powerful tool for creating fine-tuning datasets for LLM
#Awesome#Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
#计算机科学#PHP-ML - Machine Learning library for PHP
Documentation on how to access and use the Quick, Draw! Dataset.
Browser compatibility data for Web technologies as displayed on MDN
#自然语言处理#Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
esProc SPL is a JVM-based programming language designed for structured data computation, serving as both a data analysis tool and an embedded computing engine.
#数据仓库#TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard