🌍 Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

Live sources

🐙 GitHub: Open repos
🤗 Hugging Face: AI models
🐳 Self Hosted: GitHub search
🧠 LLM: Model search

🔎 Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.
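As a rough illustration of the kind of live query described above, the sketch below builds a request URL for GitHub's public repository search endpoint. It is a minimal sketch, not this site's actual implementation: the helper name `build_search_url`, the sort-by-stars ordering and the page size of 10 are assumptions chosen to match what the results page displays.

```python
from urllib.parse import urlencode

# Public GitHub REST endpoint for repository search.
GITHUB_SEARCH_URL = "https://api.github.com/search/repositories"


def build_search_url(keyword: str, page: int = 1, per_page: int = 10) -> str:
    """Build a repository search URL (hypothetical helper, for illustration only)."""
    if page < 1:
        raise ValueError("page must be 1 or greater")
    params = {
        "q": keyword,          # search keyword, e.g. "mlx-local-inference"
        "sort": "stars",       # assumption: results ordered by star count
        "order": "desc",
        "per_page": per_page,  # assumption: 10 results per page, as shown on this site
        "page": page,
    }
    return f"{GITHUB_SEARCH_URL}?{urlencode(params)}"


print(build_search_url("mlx-local-inference"))
```

Fetching the resulting URL returns JSON with a `total_count` field and an `items` list, which is where values such as stars, forks and the last update date would come from.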

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Live Results

GitHub Open Source Repositories

Search: mlx-local-inference

Page 1

Showing 10 of 14 results

signerlabs/Klee

GitHub Swift

A native macOS AI chat app powered by MLX. 100% local inference on Apple Silicon, no cloud required. Built with ShipSwift.

★ 1,737 Forks 136 signerlabs Updated 28 May 2026

madroidmaq/mlx-omni-server

GitHub Python MIT License

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seamless integration with existing OpenAI SDK clients while leveraging the power of local ML inference.

★ 720 Forks 88 madroidmaq Updated 29 May 2026

scouzi1966/maclocal-api

GitHub Swift MIT License

'afm' command-line tool: a macOS server and single-prompt mode that exposes Apple's Foundation and MLX models and other APIs running on your Mac through a single aggregated OpenAI-compatible API endpoint. Also supports Apple Vision and single-command (non-server) inference with piping. Now with Web B...

★ 292 Forks 15 scouzi1966 Updated 27 May 2026

Epistates/pmetal

GitHub Rust Other

PMetal: high-performance Apple Silicon framework for local LLM inference, LoRA/QLoRA fine-tuning, serving, quantization, and MLX/Metal acceleration.

★ 291 Forks 19 Epistates Updated 29 May 2026

defilantech/LLMKube

GitHub Go Apache License 2.0

Kubernetes operator for local LLM inference with llama.cpp, vLLM, TGI, and mlx-server — multi-GPU NVIDIA + Apple Silicon Metal, autoscaling, air-gapped, production-ready

★ 114 Forks 17 defilantech Updated 29 May 2026

sgInnora/sentinel-reverse

GitHub Python MIT License

AI-Powered Autonomous Binary Reverse Engineering CLI — the native reverse engine from Innora-Sentinel. Local LLM inference (MLX), MPS GPU acceleration, multi-round iterative analysis, zero API cost.

★ 73 Forks 23 sgInnora Updated 27 May 2026

mu-hashmi/personaplex-mlx

GitHub Python MIT License

PersonaPlex on Apple Silicon: an MLX port of NVIDIA’s full-duplex speech-to-speech model with realtime local/web modes and offline WAV inference.

★ 68 Forks 12 mu-hashmi Updated 24 May 2026

dnakov/llm-asi-arch

GitHub Python MIT License

🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architecture discovery with local LLM inference on Apple Silicon.

★ 29 Forks 13 dnakov Updated 08 Apr 2026

bendusy/mlx-local-inference

GitHub Python MIT License

Full local AI inference stack on Apple Silicon via MLX — LLM, ASR, Embedding, OCR, TTS, Transcription

★ 14 Forks 1 bendusy Updated 07 May 2026

wesleysimplicio/ds4-simplicio-apple-v6

GitHub C++

Ultra-fast 100% on-device Universal State Runtime for LLMs on Apple Silicon (M1–M5). Runs DeepSeekV4 and other models on a Mac M3 with 48 GB RAM. Powered by MLX + Metal + ANE: extreme local inference, zero cloud, built for AI agents.

★ 14 Forks 1 wesleysimplicio Updated 29 May 2026

Page 1 of 2 · 10 results on this page · 14 total found

Showing first 14 accessible GitHub results.
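The pagination figures above (14 results, 10 per page, 2 pages) follow from simple ceiling division. The sketch below shows that arithmetic; the function name `page_bounds` is hypothetical, and the page size of 10 is taken from the page's own description.

```python
import math


def page_bounds(total: int, page: int, per_page: int = 10) -> tuple[int, int, int]:
    """Return (total_pages, start_index, end_index) for a 1-based page number.

    start_index is inclusive and end_index is exclusive, so
    results[start_index:end_index] is the slice shown on that page.
    """
    total_pages = max(1, math.ceil(total / per_page))
    if not 1 <= page <= total_pages:
        raise ValueError(f"page must be between 1 and {total_pages}")
    start = (page - 1) * per_page
    end = min(start + per_page, total)
    return total_pages, start, end


print(page_bounds(14, 1))  # first page: 2 pages in total, items 0 to 9
print(page_bounds(14, 2))  # second page: the remaining 4 items
```

With 14 results, page 1 holds 10 items and page 2 holds the remaining 4.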