๐ŸŒฑ Open Source โ–พ

๐ŸŒ Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

๐Ÿ”Ž Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Reset Search โ†ป
๐Ÿ”Ž
๐ŸŒ

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: language-data

Page 33

Showing 10 results from 1,706

B

MESAInternational/B2MML-BatchML

GitHub Other

B2MML is an XML implementation of the ANSI/ISA-95, Enterprise-Control System Integration, family of standards (ISA-95), known internationally as IEC/ISO 62264. B2MML consists of a set of XML schemas written using the World Wide Web Consortium's XML Schema language (XSD) that implement the data m... Read more

โ˜… 125 Forks 46 MESAInternational Updated 25 Jun 2026
I

THUDM/iPrompt

GitHub Python

Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting

โ˜… 124 Forks 20 THUDM Updated 29 Oct 2025
H

TAbhay/Hactoberfest-2021

GitHub C++

Make your first PR! ~ A beginner friendly repository made specifically for open source beginners. Add your profile, a blog or any program under any language (it can be anything from a hello-world program to a complex data structure algorithm) or update the existing one. Just make sure to add the ... Read more

โ˜… 121 Forks 488 TAbhay Updated 14 Apr 2026
H

ChicagoHAI/hypothesis-generation

GitHub Python MIT License

This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools that leverage large language models to generate hypothesis for open-domain research. For more details, please see the original paper using the link below.

โ˜… 120 Forks 15 ChicagoHAI Updated 25 Jun 2026
K

young-geng/koala_data_pipeline

GitHub Python Apache License 2.0

The data processing pipeline for the Koala chatbot language model

โ˜… 118 Forks 12 young-geng Updated 07 Aug 2025
D

kmeanskaran/Data-Science-Projects

GitHub Jupyter Notebook Apache License 2.0

Data Science projects on various problem statements and datasets using Data Analysis, Machine Learning Algorithms, Deep Learning Algorithms, Natural Language Processing, Business Intelligence concepts by Python

โ˜… 117 Forks 22 kmeanskaran Updated 16 Jun 2026
D

danicat/datasus

GitHub R GNU General Public License v3.0

An Interface for the Brazilian Public Healthcare Data Repository (DATASUS) for the R Language

โ˜… 116 Forks 27 danicat Updated 12 May 2026
S

HelloWorldLTY/scEval

GitHub Jupyter Notebook

Codes for paper: Evaluating the Utilities of Large Language Models in Single-cell Data Analysis.

โ˜… 116 Forks 12 HelloWorldLTY Updated 11 Jun 2026
L

DAMO-NLP-SG/LLM-Sentiment

GitHub Python MIT License

[NAACL 2024] Data and code for our paper "Sentiment Analysis in the Era of Large Language Models: A Reality Check"

โ˜… 116 Forks 21 DAMO-NLP-SG Updated 13 May 2026
F

tzano/fountain

GitHub Python MIT License

Natural Language Data Augmentation Tool for Conversational Systems

โ˜… 115 Forks 9 tzano Updated 03 Dec 2024
Pagination Page 33 of 100

10 results on this page ยท 1,706 total found

Showing first 1,000 accessible GitHub results.