Add HQQ to weight compression algorithms for LLMs

### 🚀 Feature request

[HQQ](https://github.com/mobiusml/hqq) is a popular data-free weight quantization algorithm for LLMs. It would be super cool to add it NNCF's weight compression algorithms. I would like to work on this myself. I understand I need to create my `hqq.py` file inside `nncf/quantization/algorithms/weight_compression` dir & I'm currently diving into the implementations of `awq` and `gptq`. Currently, I'm having trouble understanding the `NNCFGraph` object which needs to be passed to the `apply` method. Are there some docs on how to understand this Graph object? It would also be super helpful if you guys can point me to some code/docs that I can look into to understand the workflow better. Looking forward to contributing 🚀 

### Feature Use Case

HQQ is a fast and accurate model quantizer that skips the need for calibration data. It offers compression quality competitive with that of calibration-based methods. For instance, HQQ takes less than 5 minutes to process the colossal Llama-2-70B, that’s over 50x faster compared to the widely adopted GPTQ

### Are you going to submit a PR?

- [x] Yes I'd like to help by submitting a PR!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add HQQ to weight compression algorithms for LLMs #3347

🚀 Feature request

Feature Use Case

Are you going to submit a PR?

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Add HQQ to weight compression algorithms for LLMs #3347

Description

🚀 Feature request

Feature Use Case

Are you going to submit a PR?

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions