
๐ŸŒ Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories on GitHub and AI models on Hugging Face. Each page shows 10 results.

🔎 Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.


Try keywords like automation, CRM, analytics, chatbot, llama or workflow.


Live Results

GitHub Open Source Repositories

Search: language-data

Page 25

Showing 10 results from 1,706

yaph/programming-languages-influence

GitHub Python

Code to retrieve data for the programming languages influence visualizations from Freebase

★ 186 Forks 54 yaph Updated 12 Jun 2026

Esri/joint-military-symbology-xml

GitHub C# Apache License 2.0

Joint Military Symbology Markup Language is a data encapsulation of MIL-STD-2525D and APP-6(D).

★ 186 Forks 65 Esri Updated 21 Jun 2026

JD-P/minihf

GitHub Python Apache License 2.0

MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user develop their prompts into full models.

★ 184 Forks 12 JD-P Updated 04 Jun 2026

kaitoInfra/fast-twitter-api

GitHub MIT License

Simple yet powerful Twitter data retrieval SDK with multi-language support. No Limits, No Auth Required

★ 183 Forks 34 kaitoInfra Updated 28 May 2026

secretflow/scql

GitHub Go Apache License 2.0

SCQL (Secure Collaborative Query Language) is a system that allows multiple distrusting parties to run joint analysis without revealing their private data.

★ 181 Forks 72 secretflow Updated 11 Jun 2026

huggingface/olm-datasets

GitHub Python Apache License 2.0

Pipeline for pulling and processing online language model pretraining data from the web

★ 179 Forks 22 huggingface Updated 11 Jun 2026

jvalue/jayvee

GitHub TypeScript

Jayvee is a domain-specific language and runtime for automated processing of data pipelines

★ 178 Forks 14 jvalue Updated 05 Jun 2026

lupantech/InterGPS

GitHub Python MIT License

Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"

★ 178 Forks 35 lupantech Updated 23 Jun 2026

Azure99/BlossomLM

GitHub Python Apache License 2.0

A powerful, open-source large language model, including the training data.

★ 177 Forks 7 Azure99 Updated 15 Jun 2026

abdurakhmonoff/data-structures-and-algorithms-kotlin

GitHub Kotlin MIT License

Implementation and explanation of Data Structures and Algorithms (DSA) in Kotlin programming language

★ 175 Forks 63 abdurakhmonoff Updated 02 Jun 2026
Page 25 of 100

10 results on this page · 1,706 total found

Showing first 1,000 accessible GitHub results.
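
The page counts above (1,706 found, 100 pages, 10 per page) are consistent with capping the accessible results at 1,000 before paginating. A minimal sketch of that arithmetic follows. The function names are hypothetical, and it assumes pages are numbered from 1 and filled in order, which the page itself does not confirm.

```python
import math

PAGE_SIZE = 10       # results per page, as stated on this page
ACCESS_CAP = 1_000   # "first 1,000 accessible GitHub results"


def page_count(total_found: int, page_size: int = PAGE_SIZE,
               cap: int = ACCESS_CAP) -> int:
    """Number of pages once the total is capped at the accessible limit."""
    accessible = min(total_found, cap)
    return math.ceil(accessible / page_size)


def page_bounds(page: int, page_size: int = PAGE_SIZE) -> tuple[int, int]:
    """1-based rank of the first and last result on a given page (assumed layout)."""
    first = (page - 1) * page_size + 1
    return first, first + page_size - 1


print(page_count(1706))  # 100
print(page_bounds(25))   # (241, 250)
```

Under these assumptions, the 706 results beyond the cap are counted in the total but cannot be reached through pagination.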