
Commit 49e641a

Author: yanjun.qiu
misc: update submodule tools
1 parent e2038e5 commit 49e641a

3 files changed: +9 -8 lines changed

Diff for: .dev/update_submodules.sh

+1-1
@@ -4,5 +4,5 @@ git submodule init
 git submodule update --remote # update all submodule
 # git submodule update --remote ffpa-attn-mma # only update ffpa-attn-mma
 git add .
-git commit -m "Automated submodule update"
+git commit -m "misc: Automated submodule update"
 set +x

Diff for: .github/.gitignore

+2-1
@@ -22,4 +22,5 @@ bin
 *.log
 *.txt
 *.tex
-tmp*
+tmp*
+pdfs

Diff for: kernels/hgemm/README.md

+6-6
@@ -3,7 +3,7 @@
 
 ![toy-hgemm-library](https://github.com/user-attachments/assets/962bda14-b494-4423-b8eb-775da9f5503d)
 
-[📖Toy-HGEMM Library⚡️⚡️](./kernels/hgemm) is a library that write many HGEMM kernels from scratch using Tensor Cores with WMMA, MMA PTX and CuTe API, thus, can achieve `98%~100%` performance of **cuBLAS**. The codes here are source from 📖[CUDA-Learn-Notes](https://github.com/DefTruth/CUDA-Learn-Notes) ![](https://img.shields.io/github/stars/DefTruth/CUDA-Learn-Notes.svg?style=social) and exported as a standalone library, please checkout [CUDA-Learn-Notes](https://github.com/DefTruth/CUDA-Learn-Notes) for latest updates. Welcome to 🌟👆🏻star this repo to support me, many thanks ~ 🎉🎉
+[📖Toy-HGEMM Library⚡️⚡️](./kernels/hgemm) is a library that write many HGEMM kernels from scratch using Tensor Cores with WMMA, MMA PTX and CuTe API, thus, can achieve `98%~100%` performance of **cuBLAS**. The codes here are source from 📖[CUDA-Learn-Notes](https://github.com/xlite-dev/CUDA-Learn-Notes) ![](https://img.shields.io/github/stars/xlite-dev/CUDA-Learn-Notes.svg?style=social) and exported as a standalone library, please checkout [CUDA-Learn-Notes](https://github.com/xlite-dev/CUDA-Learn-Notes) for latest updates. Welcome to 🌟👆🏻star this repo to support me, many thanks ~ 🎉🎉
 
 <div id="hgemm-sgemm"></div>
 
@@ -27,11 +27,11 @@ Currently, on NVIDIA L20, RTX 4090 and RTX 3080 Laptop, compared with cuBLAS's d
 ## ©️Citations🎉🎉
 
 ```BibTeX
-@misc{hgemm-mma@2024,
-  title={hgemm-mma: Write HGEMM from scratch using Tensor Cores with WMMA, MMA PTX and CuTe API.},
-  url={https://github.com/DefTruth/hgemm-mma},
-  note={Open-source software available at https://github.com/DefTruth/hgemm-mma},
-  author={DefTruth etc},
+@misc{hgemm-tensorcores-mma@2024,
+  title={hgemm-tensorcores-mma: Write HGEMM from scratch using Tensor Cores with WMMA, MMA PTX and CuTe API.},
+  url={https://github.com/xlite-dev/hgemm-tensorcores-mma},
+  note={Open-source software available at https://github.com/xlite-dev/hgemm-tensorcores-mma},
+  author={xlite-dev etc},
  year={2024}
 }
 ```
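Note on the README change above: the paragraph it touches describes HGEMM kernels built from scratch on Tensor Cores via WMMA, MMA PTX and the CuTe API, reaching `98%~100%` of cuBLAS. As a rough, hypothetical illustration of the simplest of those approaches — a naive WMMA kernel with no shared-memory tiling, not taken from kernels/hgemm, with kernel name and tile sizes chosen here for the example — the core pattern looks something like this:

```cuda
// Minimal, self-contained WMMA HGEMM sketch (illustrative only; NOT code from
// kernels/hgemm). Computes C = A * B with half inputs and float accumulation.
// Assumes row-major A (MxK), B (KxN), C (MxN), with M, N, K multiples of 16.
// Requires sm_70 or newer.
#include <cuda_fp16.h>
#include <mma.h>

using namespace nvcuda;

__global__ void hgemm_wmma_naive(const half* A, const half* B, float* C,
                                 int M, int N, int K) {
  // One warp owns one 16x16 tile of C.
  int warp_n = (blockIdx.x * blockDim.x + threadIdx.x) / warpSize; // tile column
  int warp_m = blockIdx.y * blockDim.y + threadIdx.y;              // tile row
  if (warp_m * 16 >= M || warp_n * 16 >= N) return;

  wmma::fragment<wmma::matrix_a, 16, 16, 16, half, wmma::row_major> a_frag;
  wmma::fragment<wmma::matrix_b, 16, 16, 16, half, wmma::row_major> b_frag;
  wmma::fragment<wmma::accumulator, 16, 16, 16, float> acc;
  wmma::fill_fragment(acc, 0.0f);

  // Walk the K dimension in 16-wide steps, one Tensor Core MMA per step.
  for (int k = 0; k < K; k += 16) {
    wmma::load_matrix_sync(a_frag, A + warp_m * 16 * K + k, K); // leading dim K
    wmma::load_matrix_sync(b_frag, B + k * N + warp_n * 16, N); // leading dim N
    wmma::mma_sync(acc, a_frag, b_frag, acc);
  }

  // Store the finished 16x16 output tile.
  wmma::store_matrix_sync(C + warp_m * 16 * N + warp_n * 16, acc, N,
                          wmma::mem_row_major);
}
```

Under the same assumptions, a launch of `dim3 block(128, 4); dim3 grid(N / 64, M / 64);` would give each block a 4x4 grid of warps covering a 64x64 patch of C. The actual kernels in kernels/hgemm layer MMA PTX and CuTe implementations and further tuning on top of this basic pattern to reach the cuBLAS-level numbers quoted in the README.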
