๐ Live Open Source Explorer
Explore live open-source projects and AI models.
Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.
๐ Live Search
Search live open-source data
Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.
Live Results
GitHub Open Source Repositories
Search: language-data
Page 15
Showing 10 results from 1,706
F33RNI/GPT-Telegramus
GitHub Python GNU Affero General Public License v3.0๐ The best free Telegram bot for ChatGPT, Microsoft Copilot (aka Bing AI / Sidney / EdgeGPT), Microsoft Copilot Designer (aka BingImageCreator), Gemini and Groq with stream writing, requests with images, multiple languages, admin control, data logging and more!
External source
GitHub
zverok/wikipedia_ql
GitHub Python MIT LicenseQuery language for efficient data extraction from Wikipedia
External source
GitHub
zszszszsz/.config
GitHub Shell MIT License# # Automatically generated file; DO NOT EDIT. # OpenWrt Configuration # CONFIG_MODULES=y CONFIG_HAVE_DOT_CONFIG=y # CONFIG_TARGET_sunxi is not set # CONFIG_TARGET_apm821xx is not set # CONFIG_TARGET_ath25 is not set CONFIG_TARGET_ar71xx=y # CONFIG_TARGET_ath79 is not set # CONFIG_TARGET_bcm27xx ... Read more
External source
GitHub
huggingface/datablations
GitHub Jupyter Notebook Apache License 2.0Scaling Data-Constrained Language Models
External source
GitHub
ZigeW/data_management_LLM
GitHubCollection of training data management explorations for large language models
External source
GitHub
dwyl/learn-postgresql
GitHub JavaScript๐ Learn how to use PostgreSQL and Structured Query Language (SQL) to store and query your relational data. ๐
External source
GitHub
JinjieNi/MegaDLMs
GitHub PythonGPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.
External source
GitHub
oyekanmiayo/dsa-all-langs
GitHub Java MIT LicenseA repository that contains explanations and (custom) implementations of the most popular data structures and algorithms in various languages.
External source
GitHub
mochilang/mochi
GitHub Scheme MIT LicenseMochi is a small, fast, embeddable programming language designed for agents, data, and AI. It combines functional syntax, stream-first semantics, and native support for datasets, graphs, and simulation.
External source
GitHub
opencybersecurityalliance/kestrel-lang
GitHub Python Apache License 2.0Kestrel threat hunting language: building reusable, composable, and shareable huntflows across different data sources and threat intel.
External source
GitHub
10 results on this page ยท 1,706 total found
Showing first 1,000 accessible GitHub results.