
Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

🔎 Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.


Try keywords like automation, CRM, analytics, chatbot, llama or workflow.


Live Results

GitHub Open Source Repositories

Search: language-data

Page 8

Showing 10 results from 1,705


iiitv/algos

GitHub Java MIT License

Popular Algorithms and Data Structures implemented in popular languages

★ 1,073 Forks 488 iiitv Updated 06 Jun 2026

OpenBioLink/ThoughtSource

GitHub Jupyter Notebook MIT License

A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/

★ 1,015 Forks 82 OpenBioLink Updated 24 Jun 2026

grailbio/reflow

GitHub Go Apache License 2.0

A language and runtime for distributed, incremental data processing in the cloud

★ 975 Forks 50 grailbio Updated 04 Jun 2026

fizx/parsley

GitHub Shell MIT License

Parsley is a simple language for extracting structured data from web pages. Parsley consists of a powerful selector language wrapped with a JSON structure that can represent page-wide formatting.

★ 906 Forks 37 fizx Updated 24 Jun 2026

PrismarineJS/minecraft-data

GitHub JavaScript

Language independent module providing minecraft data for minecraft clients, servers and libraries.

★ 880 Forks 266 PrismarineJS Updated 25 Jun 2026

nashsu/autocli-skill

GitHub

The perfect companion for ClaudeCode/OpenClaw/Agent. Give your AI Agent the ability to reach information across the entire web, fetching real-time data from Bilibili, Zhihu, Twitter/X, YouTube, Weibo, Reddit, Facebook, Instagram, TikTok, Notion, Cursor and 55+ platforms with natural language, re...

★ 877 Forks 88 nashsu Updated 25 Jun 2026

The-FinAI/PIXIU

GitHub Jupyter Notebook MIT License

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artifici...

★ 871 Forks 117 The-FinAI Updated 22 Jun 2026

tesseract-ocr/langdata

GitHub Apache License 2.0

Source training data for Tesseract for lots of languages

★ 868 Forks 872 tesseract-ocr Updated 18 Jun 2026

mtahiraslan/data-analyst-roadmap

GitHub MIT License

Based on my own experience, I believe this roadmap provides clear answers to all the key questions about becoming a data analyst from scratch, including which technologies and programming languages to learn, what soft skills are essential, and how to start a professional career in this field.

★ 864 Forks 113 mtahiraslan Updated 21 Jun 2026

JackieTseng/conference_call_for_paper

GitHub HTML

2021-2022 International Conferences in Artificial Intelligence, Machine Learning, Computer Vision, Data Mining, Natural Language Processing and Robotics

★ 852 Forks 124 JackieTseng Updated 15 May 2026
Page 8 of 100

10 results on this page · 1,705 total found

Showing first 1,000 accessible GitHub results.
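
The page counts above fit together as follows: 1,705 repositories matched, but only the first 1,000 are accessible, so at 10 results per page the listing stops at page 100. A minimal sketch of that arithmetic, assuming the cap and page size stated on this page; the function names `page_count` and `page_range` are hypothetical and not part of the site:

```python
import math


def page_count(total_found: int, per_page: int = 10, accessible_cap: int = 1000) -> int:
    """Number of pages that can be browsed when results are capped."""
    # Only the first `accessible_cap` results can be paged through.
    accessible = min(total_found, accessible_cap)
    return math.ceil(accessible / per_page)


def page_range(page: int, per_page: int = 10) -> tuple[int, int]:
    """1-based positions of the first and last result on a full page."""
    first = (page - 1) * per_page + 1
    return first, first + per_page - 1


print(page_count(1705))  # 100
print(page_range(8))     # (71, 80)
```

Under these assumptions, page 8 holds results 71 to 80 of the accessible 1,000.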