🌱 Open Source β–Ύ

🌍 Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

πŸ”Ž Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Reset Search ↻
πŸ”Ž
🌐

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: language-data

Page 21

Showing 10 results from 1,706

D

guosyjlu/DS-Agent

GitHub Python

Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24

β˜… 234 Forks 35 guosyjlu Updated 18 Jun 2026
P

probcomp/PClean

GitHub Julia Apache License 2.0

A domain-specific probabilistic programming language for scalable Bayesian data cleaning

β˜… 231 Forks 34 probcomp Updated 22 Jun 2026
L

LostXine/LLaRA

GitHub Python Apache License 2.0

[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

β˜… 229 Forks 8 LostXine Updated 19 Apr 2026
D

JinjieNi/dlms-are-super-data-learners

GitHub Python

The official github repo for "Diffusion Language Models are Super Data Learners".

β˜… 228 Forks 8 JinjieNi Updated 24 Apr 2026
A

code-shoily/algorithms-in-dart

GitHub Dart MIT License

Implementation of data structures and algorithms in Dart programming language.

β˜… 226 Forks 76 code-shoily Updated 30 May 2026
D

SeekingDream/DyCodeEval

GitHub Python

Official repository of the ICML2025 paper β€œDynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination”

β˜… 226 Forks 18 SeekingDream Updated 07 Jun 2026
S

wz0919/ScaleVLN

GitHub Python

[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation

β˜… 225 Forks 5 wz0919 Updated 23 Jun 2026
T

Tablam/TablaM

GitHub Rust Mozilla Public License 2.0

The practical relational programing language for data-oriented applications

β˜… 225 Forks 2 Tablam Updated 25 May 2026
H

BaReinhard/Hacktoberfest-Data-Structure-and-Algorithms

GitHub Java GNU General Public License v3.0

A repo to contains data structure s and algorithms, regardless of language.

β˜… 224 Forks 696 BaReinhard Updated 06 Jun 2026
P

SignLLM/Prompt2Sign

GitHub Python Other

[ICCV 2025 CV411y] The Data and Code of Prompt2Sign: A Comprehensive Multilingual Sign Language Dataset.

β˜… 224 Forks 26 SignLLM Updated 22 Jun 2026
Pagination Page 21 of 100

10 results on this page Β· 1,706 total found

Showing first 1,000 accessible GitHub results.