Menu ☰

🌍 Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

πŸ”Ž Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Reset Search ↻
πŸ”Ž
🌐

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: bench

Page 2

Showing 10 results from 5,177

T

OpenBMB/ToolBench

GitHub Python Apache License 2.0

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

β˜… 5,655 Forks 485 OpenBMB Updated 01 Jun 2026
B

bestiejs/benchmark.js

GitHub JavaScript Other

A benchmarking library. As used on jsPerf.com.

β˜… 5,466 Forks 332 bestiejs Updated 01 Jun 2026
S

SWE-bench/SWE-bench

GitHub Python MIT License

SWE-bench: Can Language Models Resolve Real-world Github Issues?

β˜… 5,059 Forks 875 SWE-bench Updated 01 Jun 2026
C

Kodezi/Chronos

GitHub Java Other

Kodezi Chronos is a debugging-first language model that achieves state-of-the-art results on SWE-bench Lite (80.33%) and 67% real-world fix accuracy, over six times better than GPT-4. Built with Adaptive Graph-Guided Retrieval and Persistent Debug Memory. Model available Q1 2026 via Kodezi OS.

β˜… 4,950 Forks 215 Kodezi Updated 31 May 2026
M

SWE-agent/mini-swe-agent

GitHub Python MIT License

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepoβ€”but scores >74% on SWE-bench verified!

β˜… 4,791 Forks 654 SWE-agent Updated 01 Jun 2026
B

foolwood/benchmark_results

GitHub

Visual Tracking Paper List

β˜… 3,950 Forks 1,031 foolwood Updated 23 May 2026
D

clovaai/deep-text-recognition-benchmark

GitHub Jupyter Notebook Apache License 2.0

Text recognition (optical character recognition) with deep learning methods, ICCV 2019

β˜… 3,934 Forks 1,133 clovaai Updated 30 May 2026
A

denji/awesome-http-benchmark

GitHub MIT License

HTTP(S) benchmark tools, testing/debugging, & restAPI (RESTful)

β˜… 3,755 Forks 255 denji Updated 29 May 2026
X

devMEremenko/XcodeBenchmark

GitHub Swift MIT License

XcodeBenchmark measures the compilation time of a large codebase on iMac, MacBook, and Mac Pro

β˜… 3,645 Forks 417 devMEremenko Updated 30 May 2026
A

THUDM/AgentBench

GitHub Python Apache License 2.0

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

β˜… 3,463 Forks 259 THUDM Updated 01 Jun 2026
Pagination Page 2 of 100

10 results on this page Β· 5,177 total found

Showing first 1,000 accessible GitHub results.