Description
Context
A “stateful model” is a model that implicitly preserves data between two consecutive inference calls, such as the KV cache for LLMs. Using a stateful model for inference minimizes the overhead of processing the KV cache and, together with additional optimizations, significantly speeds up inference. OpenVINO currently exports LLMs from PyTorch to OpenVINO IR as stateful models by default, so NNCF should demonstrate this default flow in its examples as well.
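The difference can be sketched with a toy, non-OpenVINO example (class names and the dummy decoding logic below are purely illustrative): a stateless decoder forces the caller to pass the entire, ever-growing KV cache across the inference boundary on every call, while a stateful decoder keeps the cache inside the model so only the new token crosses the boundary.

```python
class StatelessDecoder:
    """Stateless: the caller owns the KV cache and must hand it back each call."""

    def step(self, token, kv_cache):
        # The whole cache is transferred in and out every call;
        # this overhead grows with sequence length.
        kv_cache = kv_cache + [token]
        return sum(kv_cache) % 7, kv_cache  # dummy "logits" for illustration


class StatefulDecoder:
    """Stateful: the KV cache is implicitly preserved between inference calls."""

    def __init__(self):
        self._kv_cache = []  # cache lives inside the model

    def step(self, token):
        # Only the new token crosses the model boundary.
        self._kv_cache.append(token)
        return sum(self._kv_cache) % 7
```

Both decoders produce identical outputs for the same token sequence; the stateful variant simply avoids shuttling the cache back and forth, which is the overhead a stateful OpenVINO model eliminates.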
What needs to be done?
Update the following LLM compression examples to use a stateful model:
- Large Language Models FP8 Compression Example
- Find the appropriate hyperparameters to compress the TinyLLama model
- Compress TinyLLama model using synthetic data
Example Pull Requests
Resources
- Contribution guide - start here!
- Intel DevHub Discord channel - engage in discussions, ask questions and talk to OpenVINO developers
- How to link your Pull Request to an issue
Contact points
Ticket