Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Live Results

GitHub Open Source Repositories

Search: language-data

Page 22

Showing 10 results from 1,706

naver/tamgu

GitHub · C++ · BSD 3-Clause "New" or "Revised" License

Tamgu (탐구), a FIL programming language: Functional, Imperative, Logical all in one for annotation and data augmentation

★ 224 · Forks 13 · naver · Updated 16 Jun 2026

pikelang/Pike

GitHub · C · Other

Pike is a dynamic programming language with a syntax similar to Java and C. It is simple to learn, does not require long compilation passes and has powerful built-in data types allowing simple and really fast data manipulation.

★ 220 · Forks 41 · pikelang · Updated 26 Jun 2026

emanuelhuber/RGPR

GitHub · R

Ground-penetrating radar (GPR) data processing and visualisation: a free and open-source software package (R language)

★ 219 · Forks 66 · emanuelhuber · Updated 26 Jun 2026

taylor-arnold/rpkg

GitHub · HTML

A collection of R packages spanning natural language processing, statistical analysis, data visualization, and text analysis

★ 218 · Forks 36 · taylor-arnold · Updated 05 Apr 2026

the-momentum/apple-health-mcp-server

GitHub · Python · MIT License

MCP server for querying Apple Health data with natural language using DuckDB under the hood.

★ 217 · Forks 16 · the-momentum · Updated 21 Jun 2026

adl-lang/adl

GitHub · C++ · Other

ADL (Algebraic Data Language)

★ 214 · Forks 22 · adl-lang · Updated 05 Jan 2026

epigen/CellWhisperer

GitHub · Jupyter Notebook · GNU General Public License v3.0

CellWhisperer bridges the gap between transcriptomics data and natural language, enabling intuitive interaction with scRNA-seq datasets

★ 214 · Forks 30 · epigen · Updated 25 Jun 2026

bittorf/kalua

GitHub · Shell · Other

Hardware-independent OpenWrt extension (using POSIX shell as its main language) for setting up, monitoring and managing many large Wi-Fi mesh networks in different locations, including billing, captive portal / splash screen / web login, accounting, data retention and layer 7/8 QoS

★ 214 · Forks 27 · bittorf · Updated 17 Feb 2026

Netflix/staash

GitHub · Java · Apache License 2.0

A language-agnostic and storage-agnostic web interface for storing data in persistent storage systems. The metadata layer abstracts away many storage details, and the pattern automation APIs automate common data access patterns.

★ 213 · Forks 39 · Netflix · Updated 30 Mar 2026

google-research-datasets/dakshina

GitHub

The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the dataset includes a large collection of native script Wikipedia text, a romanization lexicon of words in the native script with attested romanizations, and some full s...

★ 211 · Forks 18 · google-research-datasets · Updated 25 Jun 2026
Page 22 of 100

10 results on this page · 1,706 total found

Showing first 1,000 accessible GitHub results.
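
The figures above (1,706 total found, 10 results per page, 100 pages) are consistent with capping the browsable set at the first 1,000 results. A minimal Python sketch of that arithmetic follows. The function names `page_count` and `page_bounds` are hypothetical, and this is an illustration of the numbers shown, not the site's actual implementation.

```python
import math


def page_count(total_found: int, per_page: int = 10, accessible_cap: int = 1000) -> int:
    """Number of browsable pages when only the first `accessible_cap` results can be fetched."""
    # Assumption: results beyond the cap are counted in the total but never paged.
    accessible = min(total_found, accessible_cap)
    return math.ceil(accessible / per_page)


def page_bounds(page: int, per_page: int = 10) -> tuple[int, int]:
    """1-based rank of the first and last result on a 1-based page number."""
    first = (page - 1) * per_page + 1
    return first, first + per_page - 1


print(page_count(1706))  # 100
print(page_bounds(22))   # (211, 220)
```

Under these assumptions, page 22 holds the results ranked 211 to 220, and the 706 results beyond the cap cannot be reached by paging.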