Skip to content

Commit 4c6c47e

Browse files
committed
Add references to lightning-thunder
1 parent b5e6175 commit 4c6c47e

File tree

2 files changed

+6
-0
lines changed

2 files changed

+6
-0
lines changed

benchmarks/python/benchmark_inference.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,9 @@
77
- Latency (ms/token)
88
- Time to First Token (TTFT)
99
- Time Between Output Tokens (TBOT)
10+
11+
Pulled from the lightning-thunder repo. Reference:
12+
https://github.com/Lightning-AI/lightning-thunder/blob/4d3a3c3a7481efdc6a23cdeea99c3ffd31af5e78/thunder/benchmarks/benchmark_inference.py
1013
"""
1114

1215
# fmt: off

benchmarks/python/layers_for_inference_benchmark.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -11,6 +11,9 @@
1111
# SPDX-License-Identifier: BSD-3-Clause
1212
#
1313
# NOTE: `pytorch_nvfp4_quantize` and `linear_to_swizzled_128_4` are copied from NVIDIA's Fuser's test code.
14+
#
15+
# Pulled from the lightning-thunder repo. Reference:
16+
# https://github.com/Lightning-AI/lightning-thunder/blob/4d3a3c3a7481efdc6a23cdeea99c3ffd31af5e78/thunder/benchmarks/layers_for_inference_benchmark.py
1417

1518
# fmt: off
1619

0 commit comments

Comments
 (0)