test_gpu_scripts/reports_single_gpu_aikubeworker0016.md

1.8 KiB

GPU Test Report

  • Date: 2026-05-22 15:27:53
  • Host: aikubeworker0016
  • GPU: NVIDIA H100 80GB HBM3 x8
  • Driver: 580.159.03 | CUDA: 13.0

Summary

Test Result
GPU Info PASS (8 GPUs detected)
Memory Bandwidth WARN (829 GB/s via PyTorch fallback)
Compute Throughput FAIL (worst TF32 358 vs >= 444)

GPU Information

GPU Model VRAM Temp Power SM Clock
0 NVIDIA H100 80GB HBM3 81559 MB 20C 70/700W 345 MHz
1 NVIDIA H100 80GB HBM3 81559 MB 20C 67/700W 345 MHz
2 NVIDIA H100 80GB HBM3 81559 MB 21C 67/700W 345 MHz
3 NVIDIA H100 80GB HBM3 81559 MB 20C 67/700W 345 MHz
4 NVIDIA H100 80GB HBM3 81559 MB 20C 67/700W 345 MHz
5 NVIDIA H100 80GB HBM3 81559 MB 22C 69/700W 345 MHz
6 NVIDIA H100 80GB HBM3 81559 MB 20C 68/700W 345 MHz
7 NVIDIA H100 80GB HBM3 81559 MB 20C 66/700W 345 MHz

Memory Bandwidth

Source: pytorch

Metric Value Peak Efficiency
H2D (PCIe) 11.8 GB/s 0 GB/s 0.0%
D2H (PCIe) 10.1 GB/s 0 GB/s 0.0%
D2D (NVLink) 829.0 GB/s 3400 GB/s 24.4%

Verdict: WARN (D2D 829.0 GB/s via PyTorch fallback; nvbandwidth unavailable — figure is indicative only, not a true HBM peak)

Compute Throughput

DType Achieved (TFLOPS) Peak Threshold Status
FP32 51.9 67 >= 54 WARN
TF32 357.8 495 >= 444 FAIL
FP16 667.2 990 >= 734 WARN
BF16 699.1 990 >= 745 WARN
FP8 1146.2 1979 >= 1400 FAIL

Verdict: FAIL (absolute TFLOPS thresholds; worst efficiency 57.9%)


Generated by GPU Test Suite v0.2.0