Hi @sogartar
Could you show me how to generate this fp8 MLIR, in which only the matmuls are fp8 and the other ops stay in f32?
https://sharkpublic.blob.core.windows.net/sharkpublic/dan/fp8_prefill.mlir
I cannot find any documentation describing how this MLIR was generated. Could you point me to the docs, if there are any? : )
Thanks
Hong-Rong