#

data-centric-machine-learning

https://static.github-zh.com/github_avatars/sangmichaelxie?size=40

#自然语言处理#Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

HTML 341
2 年前
https://static.github-zh.com/github_avatars/microsoft?size=40

#计算机科学#Contains implementations of data-centric approaches for improving semantic segmentation on satellite imagery.

Python 42
5 个月前
https://static.github-zh.com/github_avatars/luo-junyu?size=40

#大语言模型#A list of data-efficient and data-centric LLM (Large Language Model) papers. Our Survey Paper: Towards Efficient LLM Post Training: A Data-centric Perspective

37
7 个月前
https://static.github-zh.com/github_avatars/seedatnabeel?size=40
Jupyter Notebook 16
2 年前
https://static.github-zh.com/github_avatars/mashijie1028?size=40

#计算机科学#(Pattern Recognition 2025) Towards Trustworthy Dataset Distillation

Python 14
9 个月前
https://static.github-zh.com/github_avatars/seedatnabeel?size=40

TRIAGE: Characterizing and auditing training data for improved regression (NeurIPS 2023)

Jupyter Notebook 11
2 年前
https://static.github-zh.com/github_avatars/seedatnabeel?size=40
Jupyter Notebook 9
3 年前
https://static.github-zh.com/github_avatars/seedatnabeel?size=40

You can’t handle the (dirty) truth: Data-centric insights improve pseudo-labeling

Jupyter Notebook 7
1 年前
https://static.github-zh.com/github_avatars/Decentralized-AI-Reserach-Lab?size=40

Collaboratively Learning Federated Models from Noisy Decentralized Data

Python 1
6 个月前
https://static.github-zh.com/github_avatars/miriamspsantos?size=40
1
1 年前
Website
Wikipedia