pengr/DataMan
By pengr
Code for the ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".
GitHub
Python
Updated 09 Jun 2026
Live Snapshot
Stars: 125
Forks: 2
License: Unknown
Type: Python
About this open-source project
Live information fetched from GitHub.
Default Branch: main
Open Issues: 3
Watchers: 125