tgi
Here are 15 public repositories matching this topic...
Kubernetes operator for local LLM inference with llama.cpp, vLLM, TGI, and mlx-server — multi-GPU NVIDIA + Apple Silicon Metal, autoscaling, air-gapped, production-ready
Updated May 23, 2026 - Go
Accelerating large-model inference frameworks to make LLMs fly
Updated May 10, 2024 - Python
Bench360 is a modular benchmarking suite for local LLM deployments. It offers a full-stack, extensible pipeline to evaluate the latency, throughput, quality, and cost of LLM inference on consumer and enterprise GPUs. Bench360 supports flexible backends, tasks and scenarios, enabling fair and reproducible comparisons for researchers & practitioners.
Updated Feb 18, 2026 - Python
Curated list of model-agnostic LLM serving runtimes, routers, evaluators, and standards. Run LLMs without locking into one vendor.
Updated May 23, 2026
LLM Inference performance harness
Updated Dec 29, 2025 - Python
AWS deployment stack for Gemma 3 on SageMaker with HuggingFace TGI, OpenAI-compatible API (Lambda + API Gateway), and OpenWebUI chat interface
Updated Mar 25, 2026 - Python
Throughput + latency benchmark for OpenAI-compatible LLM endpoints (vLLM, TGI, llama.cpp, Ollama). TTFT, TPOT, throughput, percentiles. Model-agnostic.
Updated May 23, 2026 - Python
Pyxis architecture - public design notes, model-agnostic LLM serving infrastructure, and the operating-model argument behind the platform.
Updated May 23, 2026
Bridge GitHub Copilot Chat with local vLLM/TGI servers and HuggingFace cloud models. Enterprise-ready VS Code extension for air-gapped AI coding.
Updated Sep 29, 2025 - TypeScript
Self-hosted FastAPI gateway exposing OpenAI and Anthropic Messages APIs in front of any open-source LLM runtime (vLLM, Ollama, llama.cpp, TGI, SGLang, LocalAI, LM Studio). Streaming, embeddings, metrics, auth, rate limiting.
Updated Apr 22, 2026 - Python
Lightweight HTML form with a Python Flask app and accompanying scripts for quickly testing interactions with the SEA-LION family of LLMs.
Updated Aug 2, 2024 - Python