๐ŸŒฑ Open Source โ–พ

๐ŸŒ Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

๐Ÿ”Ž Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Reset Search โ†ป
๐Ÿ”Ž
๐ŸŒ

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: language-data

Page 19

Showing 10 results from 1,706

D

div-bargali/Data-Structures-and-Algorithms

GitHub C++ MIT License

Data Structures and Algorithms implemented In Python, C, C++, Java or any other languages. Aimed to help strengthen the concepts of DSA. Give a Star ๐ŸŒŸ if it helps you.

โ˜… 272 Forks 390 div-bargali Updated 18 May 2026
C

Monarobase/country-list

GitHub PHP

List of all countries with names and ISO 3166-1 codes in all languages and data formats

โ˜… 270 Forks 40 Monarobase Updated 01 Jun 2026
G

JohannLai/gptcli

GitHub TypeScript MIT License

๐Ÿ”ฅ All-in-one ChatGPT CLI - Build custom AI tools like GitHub Actions. Features AI commit, natural language commands, translation, and gitmoji. Create powerful CLI tools in seconds, bridging natural language and structured data.

โ˜… 270 Forks 15 JohannLai Updated 25 Jun 2026
G

OFA-Sys/gsm8k-ScRel

GitHub Python

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

โ˜… 269 Forks 19 OFA-Sys Updated 10 Jun 2026
F

yeslogic/fathom

GitHub Rust Apache License 2.0

๐Ÿšง (Alpha stage software) A declarative data definition language for formally specifying binary data formats. ๐Ÿšง

โ˜… 267 Forks 12 yeslogic Updated 28 May 2026
T

sharmaroshan/Twitter-Sentiment-Analysis

GitHub Jupyter Notebook GNU General Public License v3.0

It is a Natural Language Processing Problem where Sentiment Analysis is done by Classifying the Positive tweets from negative tweets by machine learning models for classification, text mining, text analysis, data analysis and data visualization

โ˜… 268 Forks 128 sharmaroshan Updated 10 May 2026
D

alon-albalak/data-selection-survey

GitHub Creative Commons Zero v1.0 Universal

A Survey on Data Selection for Language Models

โ˜… 261 Forks 16 alon-albalak Updated 25 May 2026
M

jam-cc/MMAD

GitHub Python

The Codes and Data of A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection [ICLR'25]

โ˜… 261 Forks 19 jam-cc Updated 24 Jun 2026
C

finos/common-domain-model

GitHub Java Other

The CDM is a model for financial products, trades in those products, and the lifecycle events of those trades. It is an open source standard that aligns data, systems and processes and is available as code in multiple languages for easy implementation across technologies.

โ˜… 260 Forks 123 finos Updated 25 Jun 2026
D

hexinfo/dat

GitHub Java Apache License 2.0

Asking yours data in a natural language way through pre-modeling (data models and semantic models).

โ˜… 253 Forks 40 hexinfo Updated 22 Jun 2026
Pagination Page 19 of 100

10 results on this page ยท 1,706 total found

Showing first 1,000 accessible GitHub results.