Open
Description
Since the DeepSeek-V3 Hugging Face Transformers modeling file does not include MTP implementations, the llm-compressor cannot recognize and load MTP weights. After compression, the MTP layer will not be preserved. Could we provide an API to track all unused tensors and save them to a new safetensors file?