🌱 Open Source
Back to Open Source

🐙 GitHub Detail

D

pengr/DataMan

By pengr

Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".

GitHub Python Updated 09 Jun 2026

Live Snapshot

Stars

125

🍴

Forks

2

📄

License

Unknown

🧩

Type

Python

📘

About this open-source project

Live information fetched from GitHub.

Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".

🌿

Default Branch

main

🐞

Open Issues

3

👀

Watchers

125