#Data Warehouse#Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
#Computer Science#A system for quickly generating training data with weak supervision
#Natural Language Processing#Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
#Natural Language Processing#skweak: A software toolkit for weak supervision applied to NLP tasks
#Natural Language Processing#BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
#Computer Science#Manga and comic text detection
#Search#Labelling platform for text using weak supervision.
#Natural Language Processing#[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark
[NAACL 2021] This is the code for our paper "Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach".
Implementation of CRAFT Text Detection
#Computer Science#A curated list of programmatic weak supervision papers and resources
Labeling is boring. Use this tool to speed up your next object detection project!
#Computer Science#Weakly Supervised End-to-End Learning (NeurIPS 2021)
#Computer Science#Dataset for the paper "Weak Supervision for Fake News Detection via Reinforcement Learning", published at AAAI 2020.
#Natural Language Processing#Framework to learn Named Entity Recognition models without labelled data using weak supervision.
#Computer Science#SPEAR: Programmatically label and build training data quickly.
#Computer Science#A PyTorch-based open-source framework that provides methods for improving weakly annotated data and lets researchers efficiently develop and compare their own methods.
Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data
#Computer Science#Official implementation of SuperSimpleNet