Skip to content

Getting error while finetuning: ZeroDivisionError: division by zero #338

@indranildeveloper

Description

@indranildeveloper

Need Help

The command I am running:

accelerate launch --mixed_precision=fp16 --num_processes=1 train_finetune_accelerate.py --config_path ./Configs/config_ft.yml

Here is the full Error I am getting:

ipex flag is deprecated, will be removed in Accelerate v1.10. From 2.7.0, PyTorch has all needed optimizations for Intel CPU and XPU.
The following values were not passed to `accelerate launch` and had defaults used instead:
        `--num_machines` was set to a value of `1`
        `--dynamo_backend` was set to a value of `'no'`
To avoid this warning pass in values for each of the problematic parameters or run `accelerate config`.
bert loaded
bert_encoder loaded
predictor loaded
decoder loaded
text_encoder loaded
predictor_encoder loaded
style_encoder loaded
diffusion loaded
text_aligner loaded
pitch_extractor loaded
mpd loaded
msd loaded
wd loaded
BERT AdamW (
Parameter Group 0
    amsgrad: False
    base_momentum: 0.85
    betas: (0.9, 0.99)
    capturable: False
    decoupled_weight_decay: True
    differentiable: False
    eps: 1e-09
    foreach: None
    fused: None
    initial_lr: 1e-05
    lr: 1e-05
    max_lr: 2e-05
    max_momentum: 0.95
    maximize: False
    min_lr: 0
    weight_decay: 0.01
)
decoder AdamW (
Parameter Group 0
    amsgrad: False
    base_momentum: 0.85
    betas: (0.0, 0.99)
    capturable: False
    decoupled_weight_decay: True
    differentiable: False
    eps: 1e-09
    foreach: None
    fused: None
    initial_lr: 0.0001
    lr: 0.0001
    max_lr: 0.0002
    max_momentum: 0.95
    maximize: False
    min_lr: 0
    weight_decay: 0.0001
)
output_84.wav 44100
output_133.wav 44100
output_31.wav 44100
output_140.wav 44100
output_140.wav 44100
output_139.wav 44100
output_133.wav 44100
output_118.wav 44100
output_126.wav 44100
output_139.wav 44100
output_139.wav 44100
output_152.wav 44100
output_140.wav 44100
output_133.wav 44100
output_95.wav 44100
output_118.wav 44100
output_168.wav 44100
output_168.wav 44100
Epochs: 1
Traceback (most recent call last):
  File "/home/indranil/projects/phlama/StyleTTS2/train_finetune_accelerate.py", line 843, in <module>
    main()
  File "/home/indranil/projects/phlama/StyleTTS2/.venv/lib/python3.10/site-packages/click/core.py", line 1442, in __call__
    return self.main(*args, **kwargs)
  File "/home/indranil/projects/phlama/StyleTTS2/.venv/lib/python3.10/site-packages/click/core.py", line 1363, in main
    rv = self.invoke(ctx)
  File "/home/indranil/projects/phlama/StyleTTS2/.venv/lib/python3.10/site-packages/click/core.py", line 1226, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/home/indranil/projects/phlama/StyleTTS2/.venv/lib/python3.10/site-packages/click/core.py", line 794, in invoke
    return callback(*args, **kwargs)
  File "/home/indranil/projects/phlama/StyleTTS2/train_finetune_accelerate.py", line 810, in main
    % (loss_test / iters_test, loss_align / iters_test, loss_f / iters_test)
ZeroDivisionError: division by zero
Traceback (most recent call last):
  File "/home/indranil/projects/phlama/StyleTTS2/.venv/bin/accelerate", line 8, in <module>
    sys.exit(main())
  File "/home/indranil/projects/phlama/StyleTTS2/.venv/lib/python3.10/site-packages/accelerate/commands/accelerate_cli.py", line 50, in main
    args.func(args)
  File "/home/indranil/projects/phlama/StyleTTS2/.venv/lib/python3.10/site-packages/accelerate/commands/launch.py", line 1199, in launch_command
    simple_launcher(args)
  File "/home/indranil/projects/phlama/StyleTTS2/.venv/lib/python3.10/site-packages/accelerate/commands/launch.py", line 785, in simple_launcher
    raise subprocess.CalledProcessError(returncode=process.returncode, cmd=cmd)
subprocess.CalledProcessError: Command '['/home/indranil/projects/phlama/StyleTTS2/.venv/bin/python3.10', 'train_finetune_accelerate.py', '--config_path', './Configs/config_ft.yml']' returned non-zero exit status 1.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions