Menu ☰

🌍 Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

πŸ”Ž Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Reset Search ↻
πŸ”Ž
🌐

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: mlx-local-inference

Page 1

Showing 10 results from 14

K

signerlabs/Klee

GitHub Swift

A native macOS AI chat app powered by MLX. 100% local inference on Apple Silicon, no cloud required. Built with ShipSwift.

β˜… 1,737 Forks 136 signerlabs Updated 28 May 2026
M

madroidmaq/mlx-omni-server

GitHub Python MIT License

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seamless integration with existing OpenAI SDK clients while leveraging the power of local ML inference.

β˜… 720 Forks 88 madroidmaq Updated 29 May 2026
M

scouzi1966/maclocal-api

GitHub Swift MIT License

'afm' command cli: macOS server and single prompt mode that exposes Apple's Foundation and MLX Models and other APIs running on your Mac through a single aggregated OpenAI-compatible API endpoint. Supports Apple Vision and single command (non-server) inference with piping as well . Now with Web B... Read more

β˜… 292 Forks 15 scouzi1966 Updated 27 May 2026
P

Epistates/pmetal

GitHub Rust Other

PMetal: high-performance Apple Silicon framework for local LLM inference, LoRA/QLoRA fine-tuning, serving, quantization, and MLX/Metal acceleration.

β˜… 291 Forks 19 Epistates Updated 29 May 2026
L

defilantech/LLMKube

GitHub Go Apache License 2.0

Kubernetes operator for local LLM inference with llama.cpp, vLLM, TGI, and mlx-server β€” multi-GPU NVIDIA + Apple Silicon Metal, autoscaling, air-gapped, production-ready

β˜… 114 Forks 17 defilantech Updated 29 May 2026
S

sgInnora/sentinel-reverse

GitHub Python MIT License

AI-Powered Autonomous Binary Reverse Engineering CLI β€” the native reverse engine from Innora-Sentinel. Local LLM inference (MLX), MPS GPU acceleration, multi-round iterative analysis, zero API cost.

β˜… 73 Forks 23 sgInnora Updated 27 May 2026
P

mu-hashmi/personaplex-mlx

GitHub Python MIT License

PersonaPlex on Apple Silicon: an MLX port of NVIDIA’s full-duplex speech-to-speech model with realtime local/web modes and offline WAV inference.

β˜… 68 Forks 12 mu-hashmi Updated 24 May 2026
L

dnakov/llm-asi-arch

GitHub Python MIT License

πŸ€– Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architecture discovery with local LLM inference on Apple Silicon.

β˜… 29 Forks 13 dnakov Updated 08 Apr 2026
M

bendusy/mlx-local-inference

GitHub Python MIT License

Full local AI inference stack on Apple Silicon via MLX β€” LLM, ASR, Embedding, OCR, TTS, Transcription

β˜… 14 Forks 1 bendusy Updated 07 May 2026
D

wesleysimplicio/ds4-simplicio-apple-v6

GitHub C++

Ultra-fast 100% on-device Universal State Runtime for LLMs on Apple Silicon (M1–M5). DeepSeekV4/others on Mac M3 48gb RAM. Powered by MLX + Metal + ANE β€” extreme local inference, zero cloud, built for AI agents.

β˜… 14 Forks 1 wesleysimplicio Updated 29 May 2026
Pagination Page 1 of 2

10 results on this page Β· 14 total found

Showing first 14 accessible GitHub results.