Live Open Source Explorer
Explore live open-source projects and AI models.
Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.
Live Search
Search live open-source data
Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.
Live Results
GitHub Open Source Repositories
Search: mlx-local-inference
Page 1
Showing 10 of 14 results
signerlabs/Klee
GitHub · Swift
A native macOS AI chat app powered by MLX. 100% local inference on Apple Silicon, no cloud required. Built with ShipSwift.
madroidmaq/mlx-omni-server
GitHub · Python · MIT License
MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seamless integration with existing OpenAI SDK clients while leveraging the power of local ML inference.
scouzi1966/maclocal-api
GitHub · Swift · MIT License
'afm' command-line tool: a macOS server and single-prompt mode that exposes Apple's Foundation and MLX models and other APIs running on your Mac through a single aggregated OpenAI-compatible API endpoint. Supports Apple Vision and single-command (non-server) inference with piping as well. Now with Web B...
Epistates/pmetal
GitHub · Rust · Other
PMetal: high-performance Apple Silicon framework for local LLM inference, LoRA/QLoRA fine-tuning, serving, quantization, and MLX/Metal acceleration.
defilantech/LLMKube
GitHub · Go · Apache License 2.0
Kubernetes operator for local LLM inference with llama.cpp, vLLM, TGI, and mlx-server: multi-GPU NVIDIA + Apple Silicon Metal, autoscaling, air-gapped, production-ready.
sgInnora/sentinel-reverse
GitHub · Python · MIT License
AI-Powered Autonomous Binary Reverse Engineering CLI, the native reverse engine from Innora-Sentinel. Local LLM inference (MLX), MPS GPU acceleration, multi-round iterative analysis, zero API cost.
mu-hashmi/personaplex-mlx
GitHub · Python · MIT License
PersonaPlex on Apple Silicon: an MLX port of NVIDIA's full-duplex speech-to-speech model with realtime local/web modes and offline WAV inference.
dnakov/llm-asi-arch
GitHub · Python · MIT License
Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architecture discovery with local LLM inference on Apple Silicon.
bendusy/mlx-local-inference
GitHub · Python · MIT License
Full local AI inference stack on Apple Silicon via MLX: LLM, ASR, Embedding, OCR, TTS, Transcription.
wesleysimplicio/ds4-simplicio-apple-v6
GitHub · C++
Ultra-fast 100% on-device Universal State Runtime for LLMs on Apple Silicon (M1-M5). DeepSeekV4/others on a Mac M3 with 48 GB RAM. Powered by MLX + Metal + ANE: extreme local inference, zero cloud, built for AI agents.
10 results on this page · 14 total found
Showing first 14 accessible GitHub results.