1.8 KiB
1.8 KiB
GPU Test Report
- Date: 2026-05-22 15:27:51
- Host: aikubeworker0012
- GPU: NVIDIA H100 80GB HBM3 x8
- Driver: 580.159.03 | CUDA: 13.0
Summary
| Test | Result |
|---|---|
| GPU Info | PASS (8 GPUs detected) |
| Memory Bandwidth | WARN (829 GB/s via PyTorch fallback) |
| Compute Throughput | FAIL (worst TF32 362 vs >= 444) |
GPU Information
| GPU | Model | VRAM | Temp | Power | SM Clock |
|---|---|---|---|---|---|
| 0 | NVIDIA H100 80GB HBM3 | 81559 MB | 25C | 70/700W | 345 MHz |
| 1 | NVIDIA H100 80GB HBM3 | 81559 MB | 25C | 73/700W | 345 MHz |
| 2 | NVIDIA H100 80GB HBM3 | 81559 MB | 26C | 69/700W | 345 MHz |
| 3 | NVIDIA H100 80GB HBM3 | 81559 MB | 25C | 70/700W | 345 MHz |
| 4 | NVIDIA H100 80GB HBM3 | 81559 MB | 24C | 69/700W | 345 MHz |
| 5 | NVIDIA H100 80GB HBM3 | 81559 MB | 27C | 70/700W | 345 MHz |
| 6 | NVIDIA H100 80GB HBM3 | 81559 MB | 25C | 70/700W | 345 MHz |
| 7 | NVIDIA H100 80GB HBM3 | 81559 MB | 24C | 72/700W | 345 MHz |
Memory Bandwidth
Source: pytorch
| Metric | Value | Peak | Efficiency |
|---|---|---|---|
| H2D (PCIe) | 11.8 GB/s | 0 GB/s | 0.0% |
| D2H (PCIe) | 9.9 GB/s | 0 GB/s | 0.0% |
| D2D (NVLink) | 829.1 GB/s | 3400 GB/s | 24.4% |
Verdict: WARN (D2D 829.1 GB/s via PyTorch fallback; nvbandwidth unavailable — figure is indicative only, not a true HBM peak)
Compute Throughput
| DType | Achieved (TFLOPS) | Peak | Threshold | Status |
|---|---|---|---|---|
| FP32 | 52.0 | 67 | >= 54 | WARN |
| TF32 | 362.3 | 495 | >= 444 | FAIL |
| FP16 | 691.0 | 990 | >= 734 | WARN |
| BF16 | 713.0 | 990 | >= 745 | WARN |
| FP8 | 1148.8 | 1979 | >= 1400 | FAIL |
Verdict: FAIL (absolute TFLOPS thresholds; worst efficiency 58.0%)
Generated by GPU Test Suite v0.2.0