Open
Description
I have 2 separate questions which I could not find an answer yet, so post it here hope someone can answer:
-
When doing TRT conversion from torchscript to trt. Would nav call
polygraphy surgeon sanitize
to do things like constant folding? This is helpful when dealing with larger size models. It seems nav underlying uses polygraphy but want to check if it also sanitizes. -
There's an alpha release for TRT-LLM tool which combines TensorRT and FasterTransformer. Is this tool on your roadmap to support it? As a user for nav, I like the simpler interface it provides compared to do compilation/conversion in multiple steps. It would be great to see future support related to LLM.