Open
Description
I tried your example code in gemma3.
But I encounted
NVMLError_NotSupported Traceback (most recent call last)
Cell In[1], line 42
29 recipe = [
30 GPTQModifier(
31 targets="Linear",
(...)
38 ),
39 ]
41 # Perform oneshot
---> 42 oneshot(
43 model=model,
44 tokenizer=model_id,
45 dataset=DATASET_ID,
46 splits=DATASET_SPLIT,
47 recipe=recipe,
48 max_seq_length=MAX_SEQUENCE_LENGTH,
49 num_calibration_samples=NUM_CALIBRATION_SAMPLES,
50 trust_remote_code_model=True,
51 data_collator=data_collator,
52 )
54 # Confirm generations of the quantized model look sane.
55 print("========== SAMPLE GENERATION ==============")
I used AGX Orin 64GB.
llmcompressor 0.6.0