Skip to content

Commit 2f21f02

Browse files
committed
Fix typo
1 parent 3678702 commit 2f21f02

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

_posts/2025-02-24-ptpc-fp8-rocm.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -64,7 +64,7 @@ The illustration shows two quantization approaches:
6464

6565
**Scaling Factors:**
6666
- **Top (Per-Tensor)**: Single scalars ΔX[1] and ΔW[1] for entire tensors
67-
- **Bottom (PTPC)**: Vector ΔX[T×1] with one scale per token and ΔW[1×Co] with one scale per output channel
67+
- **Bottom (PTPC)**: Vector ΔX[T×1] with one scale per token and ΔW[1×Co] with one scale per input channel
6868

6969
This granular scaling approach allows PTPC-FP8 to achieve accuracy close to BF16 while maintaining the speed and memory benefits of 8-bit computation.
7070

0 commit comments

Comments
 (0)