gpu-benchmarking

Here are 9 public repositories matching this topic...

Cre4T3Tiv3 / jetson-orin-matmul-analysis

CUDA matrix multiplication benchmarking on Jetson Orin Nano. Four implementations, three power modes, five matrix sizes. 99.5% mathematical validation. C++/CUDA and Python.

Updated Apr 2, 2026
Python

lokeshpuma / Deep_Learning

Star

Hands-on Jupyter notebooks for deep learning with TensorFlow, covering fundamental concepts, model training, and applied tabular projects.

machine-learning deep-learning tensorflow jupyter-notebook neural-networks tensorboard gradient-descent gpu-benchmarking

Updated May 29, 2026
Jupyter Notebook

kevinbazira / llm-rocm-benchmarks

Star

Standalone LLM inference benchmarking pipelines on AMD GPUs using ROCm, vLLM, MAD, and data visualization scripts.

performance-engineering machine-learning rocm model-serving amd-gpu mlops inference-optimization llm vllm llm-inference llm-benchmarking gpu-benchmarking

Updated Feb 21, 2026
Python

saminkhan1 / llm-serving-benchmark-lab

Star

Artifact-backed LLM serving performance lab for vLLM baselines, official metrics, GuideLLM checks, and SGLang/PD scaffolding

python performance-engineering modal prometheus artifact-evaluation llm llm-serving vllm llm-inference sglang llm-performance gpu-benchmarking guidellm inference-benchmarking serving-metrics

Updated May 21, 2026
Python

tdiprima / run_system_checks

Star

One-shot script to audit GPU, CUDA, PyTorch, CPU, and disk performance before debugging a slow or broken ML environment.

machine-learning cuda pytorch system-diagnostics gpu-benchmarking

Updated Apr 3, 2026
Shell

FluidNumerics / gpu-microbenchmarks

Star

gpu benchmarks gpu-benchmarking

Updated Jan 19, 2022
C++

Tennisee-data / benchHUB

Star

benchHUB is a Python-based project to parse, aggregate, and visualize system and performance benchmarks. It includes a Streamlit dashboard to display and compare results.