๐ŸŒ Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

🔎 Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Live Results

GitHub Open Source Repositories

Search: language-data

Page 23

Showing 10 of 1,706 results

jasondelaat/pymonad

GitHub · Python · BSD 3-Clause "New" or "Revised" License

PyMonad implements data structures typically available in pure functional or functional first programming languages like Haskell and F#. Included are Monad and Monoid data types with several common monads included - such as Maybe and State - as well as some useful tools such as the @curry decorat...

★ 211 · Forks 24 · jasondelaat · Updated 29 Mar 2026

openai/safety-rbr-code-and-data

GitHub · Jupyter Notebook · MIT License

Code and example data for the paper: Rule Based Rewards for Language Model Safety

★ 209 · Forks 22 · openai · Updated 25 Jun 2026

avvorstenbosch/Masterclass-LLMs-for-Data-Science

GitHub · HTML · Other

The masterclass "Large Language Models for Data Science" explains what LLMs are, what they can and cannot do, and what they can be used for.

★ 208 · Forks 20 · avvorstenbosch · Updated 21 Jun 2026

princeton-nlp/QuRating

GitHub · Python

[ICML 2024] Selecting High-Quality Data for Training Language Models

★ 204 · Forks 14 · princeton-nlp · Updated 07 Jun 2026

ibestvina/datasloth

GitHub · Python · MIT License

Natural language Pandas queries and data generation powered by GPT-3

★ 200 · Forks 9 · ibestvina · Updated 13 Feb 2026

taxilang/taxilang

GitHub · Kotlin · Apache License 2.0

Taxi is a language for describing APIs, data models, and how everything relates

★ 199 · Forks 7 · taxilang · Updated 27 May 2026

msg-systems/coreferee

GitHub · Python · MIT License

Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further languages

★ 198 · Forks 20 · msg-systems · Updated 24 May 2026

Salma0-8/Finance-Odyssey

GitHub

This repository contains curated roadmaps, case studies, and learning resources for finance careers involving programming and data. Each file offers a structured guide to mastering the tools, languages, and concepts needed in that field. Ideal for students, aspiring quants, and finance profession...

★ 197 · Forks 24 · Salma0-8 · Updated 25 Jun 2026

naptha/tessdata

GitHub · Shell · Apache License 2.0

Tesseract Language Trained Data

★ 196 · Forks 70 · naptha · Updated 13 Dec 2025

tanfiona/LLM-on-Tabular-Data-Prediction-Table-Understanding-Data-Generation

GitHub

Repository for collecting and categorizing papers outlined in our survey paper: "Large Language Models on Tabular Data -- A Survey".

★ 196 · Forks 41 · tanfiona · Updated 23 Jun 2026

Pagination: Page 23 of 100

10 results on this page · 1,706 total found

Only the first 1,000 GitHub results are accessible, so pagination stops at page 100 even though 1,706 results were found.
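The page counts above follow from simple arithmetic: 10 results per page and a cap of 1,000 accessible results give 100 pages, even though 1,706 results were found. The sketch below illustrates that calculation. The function and constant names are hypothetical and are not taken from the explorer's actual code; only the numbers come from this page.

```python
import math

PAGE_SIZE = 10       # results per page, as stated on this page
ACCESS_CAP = 1_000   # "first 1,000 accessible GitHub results"


def page_count(total_found: int, page_size: int = PAGE_SIZE, cap: int = ACCESS_CAP) -> int:
    """Number of pages that can be reached, given the accessibility cap."""
    reachable = min(total_found, cap)
    return math.ceil(reachable / page_size)


def page_range(page: int, page_size: int = PAGE_SIZE) -> tuple[int, int]:
    """1-based positions of the first and last result on a full page."""
    first = (page - 1) * page_size + 1
    return first, first + page_size - 1


print(page_count(1706))  # 100
print(page_range(23))    # (221, 230)
```

Under these assumptions, page 23 covers results 221 to 230 of the 1,000 that can be reached.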