Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

🔎 Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Live Results

GitHub Open Source Repositories

Search: language-data

Page 32

Showing 10 results from 1,706

kavgan/phrase-at-scale

GitHub Python

Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English

★ 132 Forks 45 kavgan Updated 12 Jun 2026

cognitect-labs/fern

GitHub Clojure Eclipse Public License 1.0

Easy language for data

★ 130 Forks 9 cognitect-labs Updated 22 Mar 2026

mistyreed63849/Graph-LLM

GitHub Python

GraphLLM: Boosting Graph Reasoning Ability of Large Language Model (IEEE Transactions on Big Data)

★ 129 Forks 18 mistyreed63849 Updated 26 May 2026

lanbaoshen/mcp-jenkins

GitHub Python MIT License

The Model Context Protocol (MCP) is an open-source implementation that bridges Jenkins with AI language models following Anthropic's MCP specification. This project enables secure, contextual AI interactions with Jenkins tools while maintaining data privacy and security.

★ 129 Forks 58 lanbaoshen Updated 24 Jun 2026

kehanlu/DeSTA2

GitHub HTML Other

Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"

★ 127 Forks 10 kehanlu Updated 12 Jun 2026

pd-l2ork/pd

GitHub C

This is the old 1.x L2Ork version of Pure-Data real-time digital signal processing language developed and maintained at DISIS, Virginia Tech. It has been superseded by the cross-platform 2.x version of Pd-l2Ork found at https://github.com/pd-l2ork/pd-l2ork

★ 127 Forks 62 pd-l2ork Updated 11 Feb 2025

SeeSharpOpenSource/SeeSharpTools

GitHub C# GNU General Public License v3.0

This open source repository is developed and maintained by the JYTEK company, Shanghai, China, for the purpose of facilitating the development of the test and measurement using C# programming language. The project follows the GNU GPL V3.0 license and provides many commonly used T&M GUIs, utilit...

★ 127 Forks 64 SeeSharpOpenSource Updated 26 Jun 2026

mojo-lang/mojo

GitHub Go

A data oriented, simple but powerful DSL language.

★ 126 Forks 8 mojo-lang Updated 25 Jun 2026

anweiss/cddl

GitHub Rust MIT License

Concise data definition language (RFC 8610) implementation and JSON and CBOR validator in Rust

★ 125 Forks 19 anweiss Updated 22 Jun 2026

pengr/DataMan

GitHub Python

Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".

★ 125 Forks 2 pengr Updated 09 Jun 2026
Page 32 of 100

10 results on this page · 1,706 total found

Showing first 1,000 accessible GitHub results.
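
The page figures above (1,706 results found, 10 per page, 100 pages) are consistent with the page count being computed from the first 1,000 accessible results rather than from the full total. A minimal sketch of that arithmetic follows; the function names `page_count` and `page_bounds` are hypothetical and not taken from the site, and the capping rule is an assumption inferred from the numbers shown.

```python
import math


def page_count(total_found: int, per_page: int, accessible_cap: int) -> int:
    """Number of pages, assuming only the first `accessible_cap` results are reachable."""
    reachable = min(total_found, accessible_cap)
    return math.ceil(reachable / per_page)


def page_bounds(page: int, per_page: int) -> tuple[int, int]:
    """1-based rank of the first and last result on a full page."""
    first = (page - 1) * per_page + 1
    return first, first + per_page - 1


if __name__ == "__main__":
    print(page_count(1706, 10, 1000))  # pages available under the assumed cap
    print(page_bounds(32, 10))         # result ranks shown on page 32
```

Under these assumptions, page 32 would hold results 311 to 320 in the search ranking, and results beyond the 1,000th would not be reachable through pagination.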