#

data-centric-ai

https://static.github-zh.com/github_avatars/dcai-course?size=40

#计算机科学#Lab assignments for Introduction to Data-Centric AI, MIT IAP 2024 👩🏽‍💻

Jupyter Notebook 472
7 个月前
https://static.github-zh.com/github_avatars/gszfwsb?size=40

Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in CVPR 2025 (Highlight).

Python 381
11 天前
https://static.github-zh.com/github_avatars/GAIR-NLP?size=40

#大语言模型#[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale

Python 260
2 个月前
https://static.github-zh.com/github_avatars/yueyu1030?size=40

#自然语言处理#[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.

Python 153
2 年前
https://static.github-zh.com/github_avatars/aai-institute?size=40

#计算机科学#pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation

Python 136
8 天前
https://static.github-zh.com/github_avatars/dcai-course?size=40
CSS 103
3 个月前
https://static.github-zh.com/github_avatars/opendataval?size=40

#计算机科学#OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)

Python 99
7 个月前
https://static.github-zh.com/github_avatars/OFA-Sys?size=40
Python 84
2 年前
https://static.github-zh.com/github_avatars/NextBrain-ai?size=40

nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasets

Jupyter Notebook 68
3 年前
loading...
Website
Wikipedia