GPU Test Report

  • Date: 2026-05-22 15:27:51
  • Host: aikubeworker0012
  • GPU: NVIDIA H100 80GB HBM3 x8
  • Driver: 580.159.03 | CUDA: 13.0

Summary

| Test | Result |
|---|---|
| GPU Info | PASS (8 GPUs detected) |
| Memory Bandwidth | WARN (D2D 829 GB/s via PyTorch fallback) |
| Compute Throughput | FAIL (worst vs threshold: TF32 362 TFLOPS vs >= 444) |

GPU Information

| GPU | Model | VRAM | Temp | Power (draw/limit) | SM Clock |
|---|---|---|---|---|---|
| 0 | NVIDIA H100 80GB HBM3 | 81559 MB | 25 °C | 70/700 W | 345 MHz |
| 1 | NVIDIA H100 80GB HBM3 | 81559 MB | 25 °C | 73/700 W | 345 MHz |
| 2 | NVIDIA H100 80GB HBM3 | 81559 MB | 26 °C | 69/700 W | 345 MHz |
| 3 | NVIDIA H100 80GB HBM3 | 81559 MB | 25 °C | 70/700 W | 345 MHz |
| 4 | NVIDIA H100 80GB HBM3 | 81559 MB | 24 °C | 69/700 W | 345 MHz |
| 5 | NVIDIA H100 80GB HBM3 | 81559 MB | 27 °C | 70/700 W | 345 MHz |
| 6 | NVIDIA H100 80GB HBM3 | 81559 MB | 25 °C | 70/700 W | 345 MHz |
| 7 | NVIDIA H100 80GB HBM3 | 81559 MB | 24 °C | 72/700 W | 345 MHz |

Memory Bandwidth

Source: pytorch

| Metric | Value | Peak | Efficiency |
|---|---|---|---|
| H2D (PCIe) | 11.8 GB/s | 0 GB/s | 0.0% |
| D2H (PCIe) | 9.9 GB/s | 0 GB/s | 0.0% |
| D2D (NVLink) | 829.1 GB/s | 3400 GB/s | 24.4% |

Verdict: WARN (D2D 829.1 GB/s via PyTorch fallback; nvbandwidth unavailable, so the figure is indicative only and not a true HBM peak)
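
The Efficiency column above appears to be achieved bandwidth divided by the listed peak. The following is a minimal sketch of that arithmetic using the table's own figures; the function name `efficiency_pct` is hypothetical, and treating a peak of 0 GB/s as "no reference peak configured" is an assumption, not something the report states.

```python
# Values copied from the Memory Bandwidth table: (achieved GB/s, peak GB/s).
bandwidth = {
    "H2D (PCIe)": (11.8, 0.0),
    "D2H (PCIe)": (9.9, 0.0),
    "D2D (NVLink)": (829.1, 3400.0),
}


def efficiency_pct(achieved, peak):
    """Return achieved/peak as a percentage rounded to one decimal.

    Returns None when the peak is zero or negative. Assumption: a peak
    of 0 means no reference peak was configured for that metric.
    """
    if peak <= 0:
        return None
    return round(100.0 * achieved / peak, 1)


for name, (achieved, peak) in bandwidth.items():
    eff = efficiency_pct(achieved, peak)
    print(f"{name}: {'n/a' if eff is None else f'{eff}%'}")
```

Under this reading, the 0.0% shown for the two PCIe rows would reflect a missing reference peak rather than a measured efficiency of zero.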

Compute Throughput

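| DType | Achieved (TFLOPS) | Peak (TFLOPS) | Threshold (TFLOPS) | Status |
|---|---|---|---|---|
| FP32 | 52.0 | 67 | >= 54 | WARN |
| TF32 | 362.3 | 495 | >= 444 | FAIL |
| FP16 | 691.0 | 990 | >= 734 | WARN |
| BF16 | 713.0 | 990 | >= 745 | WARN |
| FP8 | 1148.8 | 1979 | >= 1400 | FAIL |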

Verdict: FAIL (absolute TFLOPS thresholds; worst efficiency 58.0%, FP8 at 1148.8 of 1979 TFLOPS peak)
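
The report uses two different "worst" measures: the summary names TF32 (furthest below its absolute threshold), while the verdict quotes a worst efficiency of 58.0% (achieved versus peak). The sketch below recomputes both from the table values to show how they can point at different dtypes. The variable names are hypothetical, and the rule the suite uses to separate WARN from FAIL is not stated in the report, so it is not modelled here.

```python
# Values copied from the Compute Throughput table:
# dtype -> (achieved TFLOPS, peak TFLOPS, threshold TFLOPS).
rows = {
    "FP32": (52.0, 67.0, 54.0),
    "TF32": (362.3, 495.0, 444.0),
    "FP16": (691.0, 990.0, 734.0),
    "BF16": (713.0, 990.0, 745.0),
    "FP8": (1148.8, 1979.0, 1400.0),
}

# Efficiency: achieved as a percentage of the listed peak.
efficiency = {d: 100.0 * a / p for d, (a, p, _) in rows.items()}

# Threshold ratio: achieved as a percentage of the absolute threshold.
threshold_ratio = {d: 100.0 * a / t for d, (a, _, t) in rows.items()}

worst_by_efficiency = min(efficiency, key=efficiency.get)
worst_by_threshold = min(threshold_ratio, key=threshold_ratio.get)

for dtype in rows:
    print(
        f"{dtype}: {efficiency[dtype]:.1f}% of peak, "
        f"{threshold_ratio[dtype]:.1f}% of threshold"
    )
print("worst efficiency:", worst_by_efficiency)
print("worst vs threshold:", worst_by_threshold)
```

With these inputs the lowest efficiency belongs to FP8 and the lowest threshold ratio to TF32, which matches the two figures quoted in the verdict and the summary.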


Generated by GPU Test Suite v0.2.0