Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any optimizations that can be done 30+GB VRAM during training #3

Open
gt732 opened this issue Oct 2, 2024 · 7 comments
Open

Any optimizations that can be done 30+GB VRAM during training #3

gt732 opened this issue Oct 2, 2024 · 7 comments

Comments

@gt732
Copy link

gt732 commented Oct 2, 2024

Hi,

Is there any optimizations or settings I can change to get this running on my 3090 24GB? I'm using the photos included in the demo of Yuilang to test the code.

image

Thanks!

@gt732 gt732 changed the title Any optimizations that can be done - 30+GB VRAM during training Any optimizations that can be done 30+GB VRAM during training Oct 2, 2024
@YuliangXiu
Copy link
Owner

YuliangXiu commented Oct 2, 2024

Hi @gt732

I updated the last reply #2 (comment), it could even get reduced to 8GB GPU memory.

@gt732
Copy link
Author

gt732 commented Oct 2, 2024

@YuliangXiu thanks for the update I’ll try following the documentation and see if I can reduce the VRAM usage during training.

@gt732
Copy link
Author

gt732 commented Oct 3, 2024

@YuliangXiu I was able to get the initial model trained using the following arguments. The VRAM usage spiked between 8-16GB.

accelerate launch multi_concepts/train.py \
  --pretrained_model_name_or_path $BASE_MODEL \
  --project_name ${SUBJECT_NAME} \
  --instance_data_dir ${INPUT_DIR}  \
  --output_dir ${EXP_DIR} \
  --class_data_dir data/multi_concepts_data \
  --train_batch_size 1  \
  --phase1_train_steps 1000 \
  --phase2_train_steps 4000 \
  --lr_step_rules "1:2000,0.1" \
  --initial_learning_rate 5e-4 \
  --learning_rate 2e-6 \
  --prior_loss_weight 1.0 \
  --syn_loss_weight "2.0,2.0" \
  --mask_loss_weight 1.0 \
  --lambda_attention 1e-2 \
  --img_log_steps 1000 \
  --checkpointing_steps 1000 \
  --use_view_prompt \
  --log_checkpoints \
  --boft_block_num=8 \
  --boft_block_size=0 \
  --boft_n_butterfly_factor=1 \
  --lora_r=32 \
  --enable_xformers_memory_efficient_attention \
  --use_peft ${peft_type} \
  --wandb_mode "offline" \
  --use_view_prompt \
  --do_not_apply_masked_prior \
  --mixed_precision fp16 \
  --gradient_checkpointing \
  --use_8bit_adam \
  --set_grads_to_none \

Now the last challenge is getting this step to run

python cores/main_mc.py \
 --config configs/tech_mc_geometry.yaml \
 --exp_dir ${EXP_DIR} \
 --sub_name ${SUBJECT_NAME} \
 --use_peft ${peft_type} \
 --use_shape_description \

I'm running into compiling issues when running the trainer. I tried a ton of different methods to try and fix it but there's something wrong with the gcc libraries in my conda env. This is being tested on Windows 10 WSL Ubuntu 24.04.1 LTS

ERROR

Memory usage statistics:

  Maximum number of tetrahedra:  5333413
  Maximum number of tet blocks (blocksize = 8188):  652
  Approximate memory for tetrahedral mesh (bytes):  752,983,904
  Approximate memory for extra pointers (bytes):  12,066,080
  Approximate memory for algorithms (bytes):  134,400
  Approximate memory for working arrays (bytes):  210,901,848
  Approximate total used memory (bytes):  976,086,232

shape of vertices: (834173, 3), shape of grids: (4986585, 4)
MESA: error: ZINK: failed to choose pdev
glx: failed to create drisw screen
100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1000/1000 [03:15<00:00,  5.11it/s]
fitted mesh with num_vertex 481862, num_faces 890574
[INFO] loading stable diffusion...
[INFO] using hugging face custom model key: results/human/yuliang
Loading pipeline components...: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 6/6 [00:00<00:00, 16.43it/s]
Added 7 tokens
[INFO] loaded PEFT adapters!
[INFO] loaded stable diffusion!
get rgb text prompt
get normal text prompt
[INFO] Trainer: df | 2024-10-03_10-02-14 | cuda | fp32 | results/human/yuliang/geometry
[INFO] #parameters: 11480403
[INFO] Loading latest checkpoint ...
[WARN] No checkpoint found, model randomly initialized.
++> Evaluate results/human/yuliang/geometry at epoch 0 ...
  0% 0/10 [00:00<?, ?it/s]/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/variables/functions.py:663: UserWarning: Graph break due to unsupported builtin _gridencoder.PyCapsule.grid_encode_forward. This function is either a Python builtin (e.g. _warnings.warn) or a third-party C/C++ Python extension (perhaps created with pybind). If it is a Python builtin, please file an issue on GitHub so the PyTorch team can add support for it and see the next case for a workaround. If it is a third-party C/C++ Python extension, please either wrap it into a PyTorch-understood custom operator (see https://pytorch.org/tutorials/advanced/custom_ops_landing_page.html for more details) or, if it is traceable, use torch.compiler.allow_in_graph.
  torch._dynamo.utils.warn_once(msg)
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: cannot find /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
collect2: error: ld returned 1 exit status
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: cannot find /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
collect2: error: ld returned 1 exit status
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: cannot find /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
collect2: error: ld returned 1 exit status
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: cannot find /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
collect2: error: ld returned 1 exit status
Traceback (most recent call last):
  File "/home/scheme/PuzzleAvatar/cores/main_mc.py", line 379, in <module>
    trainer.train(train_loader, valid_loader, max_epoch)
  File "/home/scheme/PuzzleAvatar/cores/lib/trainer.py", line 723, in train
    self.evaluate_one_epoch(valid_loader)
  File "/home/scheme/PuzzleAvatar/cores/lib/trainer.py", line 1032, in evaluate_one_epoch
    preds, preds_depth, preds_normal, preds_alpha, loss = self.eval_step(data)
  File "/home/scheme/PuzzleAvatar/cores/lib/trainer.py", line 594, in eval_step
    outputs = self.model(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1562, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 433, in _fn
    return fn(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1562, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 1116, in __call__
    return self._torchdynamo_orig_callable(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 948, in __call__
    result = self._inner_convert(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 472, in __call__
    return _compile(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_utils_internal.py", line 84, in wrapper_function
    return StrobelightCompileTimeProfiler.profile_compile_time(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_strobelight/compile_time_profiler.py", line 129, in profile_compile_time
    return func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 817, in _compile
    guarded_code = compile_inner(code, one_graph, hooks, transform)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 636, in compile_inner
    out_code = transform_code_object(code, transform)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/bytecode_transformation.py", line 1185, in transform_code_object
    transformations(instructions, code_options)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 178, in _fn
    return fn(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 582, in transform
    tracer.run()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/symbolic_convert.py", line 2451, in run
    super().run()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/symbolic_convert.py", line 893, in run
    while self.step():
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/symbolic_convert.py", line 805, in step
    self.dispatch_table[inst.opcode](self, inst)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/symbolic_convert.py", line 497, in wrapper
    return handle_graph_break(self, inst, speculation.reason)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/symbolic_convert.py", line 566, in handle_graph_break
    self.output.compile_subgraph(self, reason=reason)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/output_graph.py", line 1123, in compile_subgraph
    self.compile_and_call_fx_graph(tx, pass2.graph_output_vars(), root)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/output_graph.py", line 1318, in compile_and_call_fx_graph
    compiled_fn = self.call_user_compiler(gm)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/output_graph.py", line 1409, in call_user_compiler
    raise BackendCompilerFailed(self.compiler_fn, e).with_traceback(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/output_graph.py", line 1390, in call_user_compiler
    compiled_fn = compiler_fn(gm, self.example_inputs())
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/repro/after_dynamo.py", line 129, in __call__
    compiled_gm = compiler_fn(gm, example_inputs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/__init__.py", line 1951, in __call__
    return compile_fx(model_, inputs_, config_patches=self.config)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1505, in compile_fx
    return aot_autograd(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/backends/common.py", line 69, in __call__
    cg = aot_module_simplified(gm, example_inputs, **self.kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_functorch/aot_autograd.py", line 954, in aot_module_simplified
    compiled_fn, _ = create_aot_dispatcher_function(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_functorch/aot_autograd.py", line 687, in create_aot_dispatcher_function
    compiled_fn, fw_metadata = compiler_fn(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/jit_compile_runtime_wrappers.py", line 168, in aot_dispatch_base
    compiled_fw = compiler(fw_module, updated_flat_args)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1410, in fw_compiler_base
    return inner_compile(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 84, in debug_wrapper
    inner_compiled_fn = compiler_fn(gm, example_inputs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/debug.py", line 304, in inner
    return fn(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 527, in compile_fx_inner
    compiled_graph = fx_codegen_and_compile(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 831, in fx_codegen_and_compile
    compiled_fn = graph.compile_to_fn()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/graph.py", line 1749, in compile_to_fn
    return self.compile_to_module().call
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/graph.py", line 1678, in compile_to_module
    self.codegen_with_cpp_wrapper() if self.cpp_wrapper else self.codegen()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/graph.py", line 1638, in codegen
    self.scheduler.codegen()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/scheduler.py", line 2741, in codegen
    self.get_backend(device).codegen_node(node)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/codegen/cuda_combined_scheduling.py", line 69, in codegen_node
    return self._triton_scheduling.codegen_node(node)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/codegen/simd.py", line 1148, in codegen_node
    return self.codegen_node_schedule(node_schedule, buf_accesses, numel, rnumel)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/codegen/simd.py", line 1317, in codegen_node_schedule
    src_code = kernel.codegen_kernel()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/codegen/triton.py", line 2159, in codegen_kernel
    **self.inductor_meta_common(),
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/codegen/triton.py", line 2047, in inductor_meta_common
    "backend_hash": torch.utils._triton.triton_hash_with_backend(),
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/utils/_triton.py", line 63, in triton_hash_with_backend
    backend = triton_backend()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/utils/_triton.py", line 49, in triton_backend
    target = driver.active.get_current_target()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/runtime/driver.py", line 23, in __getattr__
    self._initialize_obj()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/runtime/driver.py", line 20, in _initialize_obj
    self._obj = self._init_fn()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/runtime/driver.py", line 9, in _create_driver
    return actives[0]()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/backends/nvidia/driver.py", line 371, in __init__
    self.utils = CudaUtils()  # TODO: make static
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/backends/nvidia/driver.py", line 80, in __init__
    mod = compile_module_from_src(Path(os.path.join(dirname, "driver.c")).read_text(), "cuda_utils")
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/backends/nvidia/driver.py", line 57, in compile_module_from_src
    so = _build(name, src_path, tmpdir, library_dirs(), include_dir, libraries)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/runtime/build.py", line 48, in _build
    ret = subprocess.check_call(cc_cmd)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/subprocess.py", line 369, in check_call
    raise CalledProcessError(retcode, cmd)
torch._dynamo.exc.BackendCompilerFailed: backend='inductor' raised:
CalledProcessError: Command '['/home/scheme/anaconda3/envs/PuzzleAvatar/bin/x86_64-conda-linux-gnu-cc', '/tmp/tmp6m8__m0s/main.c', '-O3', '-shared', '-fPIC', '-o', '/tmp/tmp6m8__m0s/cuda_utils.cpython-310-x86_64-linux-gnu.so', '-lcuda', '-L/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/backends/nvidia/lib', '-L/usr/lib/wsl/lib', '-L/lib/x86_64-linux-gnu', '-I/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/backends/nvidia/include', '-I/tmp/tmp6m8__m0s', '-I/home/scheme/anaconda3/envs/PuzzleAvatar/include/python3.10']' returned non-zero exit status 1.

Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information


You can suppress this exception and fall back to eager by setting:
    import torch._dynamo
    torch._dynamo.config.suppress_errors = True

  0% 0/10 [00:03<?, ?it/s]

ENVIRONMENT INFO

System Information

OS Information

Distributor ID: Ubuntu
Description: Ubuntu 24.04.1 LTS
Release: 24.04
Codename: noble
5.15.153.1-microsoft-standard-WSL2

Python Version

Python 3.10.14

Conda Environment Packages

packages in environment at /home/scheme/anaconda3/envs/PuzzleAvatar:

Name Version Build Channel

_libgcc_mutex 0.1 conda_forge conda-forge
_openmp_mutex 4.5 2_gnu conda-forge
_sysroot_linux-64_curr_repodata_hack 3 haa98f57_10
absl-py 2.1.0 pypi_0 pypi
accelerate 0.34.2 pypi_0 pypi
addict 2.4.0 pypi_0 pypi
aiohappyeyeballs 2.4.3 pypi_0 pypi
aiohttp 3.10.8 pypi_0 pypi
aiosignal 1.3.1 pypi_0 pypi
albucore 0.0.17 pypi_0 pypi
albumentations 1.4.17 pypi_0 pypi
altair 5.4.1 pypi_0 pypi
annotated-types 0.7.0 pypi_0 pypi
antlr4-python3-runtime 4.9.3 pypi_0 pypi
anyio 4.6.0 pypi_0 pypi
asttokens 2.4.1 pypi_0 pypi
async-timeout 4.0.3 pypi_0 pypi
attrs 24.2.0 pypi_0 pypi
autoflake 2.3.1 pypi_0 pypi
binutils 2.36.1 hdd6e379_2 conda-forge
binutils_impl_linux-64 2.36.1 h193b22a_2 conda-forge
binutils_linux-64 2.36 hf3e587d_10 conda-forge
bitsandbytes 0.44.1 pypi_0 pypi
blinker 1.8.2 pypi_0 pypi
boto3 1.35.31 pypi_0 pypi
botocore 1.35.31 pypi_0 pypi
bzip2 1.0.8 h5eee18b_6
ca-certificates 2024.9.24 h06a4308_0
cachetools 5.5.0 pypi_0 pypi
cchardet 2.1.7 pypi_0 pypi
certifi 2022.12.7 pypi_0 pypi
cffi 1.17.1 pypi_0 pypi
chardet 5.2.0 pypi_0 pypi
charset-normalizer 2.1.1 pypi_0 pypi
chumpy 0.70 pypi_0 pypi
click 8.1.7 pypi_0 pypi
clip 1.0 pypi_0 pypi
comm 0.2.2 pypi_0 pypi
configargparse 1.7 pypi_0 pypi
contourpy 1.3.0 pypi_0 pypi
cycler 0.12.1 pypi_0 pypi
cython 3.0.11 pypi_0 pypi
dash 2.18.1 pypi_0 pypi
dash-core-components 2.0.0 pypi_0 pypi
dash-html-components 2.0.0 pypi_0 pypi
dash-table 5.0.0 pypi_0 pypi
dataclasses 0.6 pypi_0 pypi
dataclasses-json 0.6.7 pypi_0 pypi
datasets 3.0.1 pypi_0 pypi
dearpygui 1.11.1 pypi_0 pypi
decorator 5.1.1 pypi_0 pypi
deepspeed 0.15.1 pypi_0 pypi
defusedxml 0.7.1 pypi_0 pypi
deprecated 1.2.14 pypi_0 pypi
diffusers 0.30.3 pypi_0 pypi
dill 0.3.8 pypi_0 pypi
distro 1.9.0 pypi_0 pypi
docker-pycreds 0.4.0 pypi_0 pypi
einops 0.8.0 pypi_0 pypi
entrypoints 0.4 pypi_0 pypi
et-xmlfile 1.1.0 pypi_0 pypi
eval-type-backport 0.2.0 pypi_0 pypi
exceptiongroup 1.2.2 pypi_0 pypi
executing 2.1.0 pypi_0 pypi
face-alignment 1.4.1 pypi_0 pypi
fastjsonschema 2.20.0 pypi_0 pypi
filelock 3.13.1 pypi_0 pypi
flash-attn 2.6.3 pypi_0 pypi
flask 3.0.3 pypi_0 pypi
flatbuffers 24.3.25 pypi_0 pypi
fonttools 4.54.1 pypi_0 pypi
freetype-py 2.5.1 pypi_0 pypi
freqencoder 0.0.0 dev_0
frozenlist 1.4.1 pypi_0 pypi
fsspec 2024.2.0 pypi_0 pypi
ftfy 6.2.3 pypi_0 pypi
func-timeout 4.3.5 pypi_0 pypi
future 1.0.0 pypi_0 pypi
fvcore 0.1.5.post20221221 pypi_0 pypi
gcc 11.2.0 h702ea55_10 conda-forge
gcc_impl_linux-64 11.2.0 h82a94d6_16 conda-forge
gcc_linux-64 11.2.0 h39a9532_10 conda-forge
gitdb 4.0.11 pypi_0 pypi
gitpython 3.1.43 pypi_0 pypi
gridencoder 0.0.0 dev_0
groundingdino 0.1.0 dev_0
grpcio 1.66.2 pypi_0 pypi
gxx_impl_linux-64 11.2.0 h82a94d6_16 conda-forge
gxx_linux-64 11.2.0 hacbe6df_10 conda-forge
h11 0.14.0 pypi_0 pypi
hjson 3.1.0 pypi_0 pypi
httpcore 1.0.6 pypi_0 pypi
httpx 0.27.2 pypi_0 pypi
huggingface-hub 0.25.1 pypi_0 pypi
idna 3.4 pypi_0 pypi
imageio 2.35.1 pypi_0 pypi
imageio-ffmpeg 0.5.1 pypi_0 pypi
importlib-metadata 8.5.0 pypi_0 pypi
iopath 0.1.10 pypi_0 pypi
ipycanvas 0.13.3 pypi_0 pypi
ipyevents 2.0.2 pypi_0 pypi
ipython 8.28.0 pypi_0 pypi
ipywidgets 8.1.5 pypi_0 pypi
itsdangerous 2.2.0 pypi_0 pypi
jax 0.4.33 pypi_0 pypi
jaxlib 0.4.33 pypi_0 pypi
jedi 0.19.1 pypi_0 pypi
jinja2 3.1.3 pypi_0 pypi
jiter 0.5.0 pypi_0 pypi
jmespath 1.0.1 pypi_0 pypi
joblib 1.4.2 pypi_0 pypi
jsonschema 4.23.0 pypi_0 pypi
jsonschema-specifications 2023.12.1 pypi_0 pypi
jupyter-client 7.4.9 pypi_0 pypi
jupyter-core 5.7.2 pypi_0 pypi
jupyterlab-widgets 3.0.13 pypi_0 pypi
kaolin 0.16.0 pypi_0 pypi
kernel-headers_linux-64 3.10.0 h57e8cba_10
kiwisolver 1.4.7 pypi_0 pypi
kornia 0.7.3 pypi_0 pypi
kornia-rs 0.1.5 pypi_0 pypi
lazy-loader 0.4 pypi_0 pypi
ld_impl_linux-64 2.36.1 hea4e1c9_2 conda-forge
libffi 3.4.4 h6a678d5_1
libgcc 5.2.0 0 conda-forge
libgcc-devel_linux-64 11.2.0 h0952999_16 conda-forge
libgcc-ng 14.1.0 h77fa898_0 conda-forge
libgomp 14.1.0 h77fa898_0 conda-forge
libsanitizer 11.2.0 he4da1e4_16 conda-forge
libstdcxx-devel_linux-64 11.2.0 h0952999_16 conda-forge
libstdcxx-ng 14.1.0 hc0a3c3a_0 conda-forge
libuuid 1.41.5 h5eee18b_0
lightning-utilities 0.11.7 pypi_0 pypi
llvmlite 0.43.0 pypi_0 pypi
lpips 0.1.4 pypi_0 pypi
markdown 3.7 pypi_0 pypi
markdown-it-py 3.0.0 pypi_0 pypi
markupsafe 2.1.5 pypi_0 pypi
marshmallow 3.22.0 pypi_0 pypi
matplotlib 3.9.2 pypi_0 pypi
matplotlib-inline 0.1.7 pypi_0 pypi
mdurl 0.1.2 pypi_0 pypi
mediapipe 0.10.15 pypi_0 pypi
mesh-to-sdf 0.0.15 pypi_0 pypi
ml-dtypes 0.5.0 pypi_0 pypi
mpmath 1.3.0 pypi_0 pypi
multidict 6.1.0 pypi_0 pypi
multipledispatch 1.0.0 pypi_0 pypi
multiprocess 0.70.16 pypi_0 pypi
mypy-extensions 1.0.0 pypi_0 pypi
narwhals 1.9.0 pypi_0 pypi
nbformat 5.10.4 pypi_0 pypi
ncurses 6.4 h6a678d5_0
nest-asyncio 1.6.0 pypi_0 pypi
networkx 3.2.1 pypi_0 pypi
ninja 1.11.1.1 pypi_0 pypi
numba 0.60.0 pypi_0 pypi
numpy 1.26.3 pypi_0 pypi
nvdiffrast 0.3.3 dev_0
nvidia-cublas-cu12 12.1.3.1 pypi_0 pypi
nvidia-cuda-cupti-cu12 12.1.105 pypi_0 pypi
nvidia-cuda-nvrtc-cu12 12.1.105 pypi_0 pypi
nvidia-cuda-runtime-cu12 12.1.105 pypi_0 pypi
nvidia-cudnn-cu12 9.1.0.70 pypi_0 pypi
nvidia-cufft-cu12 11.0.2.54 pypi_0 pypi
nvidia-curand-cu12 10.3.2.106 pypi_0 pypi
nvidia-cusolver-cu12 11.4.5.107 pypi_0 pypi
nvidia-cusparse-cu12 12.1.0.106 pypi_0 pypi
nvidia-ml-py 12.560.30 pypi_0 pypi
nvidia-nccl-cu12 2.20.5 pypi_0 pypi
nvidia-nvjitlink-cu12 12.6.77 pypi_0 pypi
nvidia-nvtx-cu12 12.1.105 pypi_0 pypi
omegaconf 2.3.0 pypi_0 pypi
open3d 0.18.0 pypi_0 pypi
openai 1.51.0 pypi_0 pypi
opencv-contrib-python 4.10.0.84 pypi_0 pypi
opencv-python 4.10.0.84 pypi_0 pypi
opencv-python-headless 4.10.0.84 pypi_0 pypi
openpyxl 3.1.5 pypi_0 pypi
openssl 3.3.1 hb9d3cd8_3 conda-forge
opt-einsum 3.4.0 pypi_0 pypi
packaging 24.1 pypi_0 pypi
pandas 2.2.3 pypi_0 pypi
parso 0.8.4 pypi_0 pypi
peft 0.13.1.dev0 pypi_0 pypi
pexpect 4.9.0 pypi_0 pypi
pillow 10.2.0 pypi_0 pypi
pip 24.2 py310h06a4308_0
platformdirs 4.3.6 pypi_0 pypi
plotly 5.24.1 pypi_0 pypi
pooch 1.8.2 pypi_0 pypi
portalocker 2.10.1 pypi_0 pypi
prompt-toolkit 3.0.48 pypi_0 pypi
protobuf 4.25.5 pypi_0 pypi
psutil 6.0.0 pypi_0 pypi
ptyprocess 0.7.0 pypi_0 pypi
pudb 2024.1.2 pypi_0 pypi
pure-eval 0.2.3 pypi_0 pypi
py-cpuinfo 9.0.0 pypi_0 pypi
pyarrow 17.0.0 pypi_0 pypi
pybind11 2.13.6 pypi_0 pypi
pycocotools 2.0.8 pypi_0 pypi
pycparser 2.22 pypi_0 pypi
pydantic 2.9.2 pypi_0 pypi
pydantic-core 2.23.4 pypi_0 pypi
pydeck 0.9.1 pypi_0 pypi
pyflakes 3.2.0 pypi_0 pypi
pyfqmr 0.2.1 pypi_0 pypi
pyglet 2.0.17 pypi_0 pypi
pygltflib 1.16.2 pypi_0 pypi
pygments 2.18.0 pypi_0 pypi
pymcubes 0.1.6 pypi_0 pypi
pymeshfix 0.17.0 pypi_0 pypi
pymeshlab 2023.12.post2 pypi_0 pypi
pyopengl 3.1.0 pypi_0 pypi
pyparsing 3.1.4 pypi_0 pypi
pyquaternion 0.9.9 pypi_0 pypi
pyrender 0.1.45 pypi_0 pypi
pyrr 0.10.3 pypi_0 pypi
python 3.10.14 h955ad1f_1
python-dateutil 2.9.0.post0 pypi_0 pypi
pytorch-lightning 2.1.0 pypi_0 pypi
pytorch3d 0.7.8 pypi_0 pypi
pytz 2024.2 pypi_0 pypi
pyvista 0.44.1 pypi_0 pypi
pyyaml 6.0.2 pypi_0 pypi
pyzmq 26.2.0 pypi_0 pypi
readline 8.2 h5eee18b_0
rectangle-packer 2.0.2 pypi_0 pypi
referencing 0.35.1 pypi_0 pypi
regex 2024.9.11 pypi_0 pypi
replicate 0.34.1 pypi_0 pypi
requests 2.32.3 pypi_0 pypi
retrying 1.3.4 pypi_0 pypi
rich 13.9.1 pypi_0 pypi
rpds-py 0.20.0 pypi_0 pypi
rtree 1.3.0 pypi_0 pypi
s3transfer 0.10.2 pypi_0 pypi
safetensors 0.4.5 pypi_0 pypi
scikit-image 0.24.0 pypi_0 pypi
scikit-learn 1.5.2 pypi_0 pypi
scipy 1.14.1 pypi_0 pypi
scooby 0.10.0 pypi_0 pypi
seaborn 0.13.2 pypi_0 pypi
segment-anything 1.0 pypi_0 pypi
sentencepiece 0.2.0 pypi_0 pypi
sentry-sdk 2.15.0 pypi_0 pypi
setproctitle 1.3.3 pypi_0 pypi
setuptools 75.1.0 py310h06a4308_0
six 1.16.0 pypi_0 pypi
smmap 5.0.1 pypi_0 pypi
sniffio 1.3.1 pypi_0 pypi
sounddevice 0.5.0 pypi_0 pypi
sqlite 3.45.3 h5eee18b_0
stack-data 0.6.3 pypi_0 pypi
streamlit 1.39.0 pypi_0 pypi
supervision 0.4.0 pypi_0 pypi
sympy 1.12 pypi_0 pypi
sysroot_linux-64 2.17 h57e8cba_10
tabulate 0.9.0 pypi_0 pypi
taming-transformers 0.0.1 pypi_0 pypi
tenacity 9.0.0 pypi_0 pypi
tensorboard 2.18.0 pypi_0 pypi
tensorboard-data-server 0.7.2 pypi_0 pypi
tensorboardx 2.6.2.2 pypi_0 pypi
termcolor 2.4.0 pypi_0 pypi
test-tube 0.7.5 pypi_0 pypi
tetgen 0.6.4 pypi_0 pypi
threadpoolctl 3.5.0 pypi_0 pypi
tifffile 2024.9.20 pypi_0 pypi
timm 1.0.9 pypi_0 pypi
tk 8.6.14 h39e8969_0
tokenizers 0.20.0 pypi_0 pypi
toml 0.10.2 pypi_0 pypi
tomli 2.0.2 pypi_0 pypi
torch 2.4.0+cu121 pypi_0 pypi
torch-ema 0.3 pypi_0 pypi
torch-fidelity 0.3.0 pypi_0 pypi
torchaudio 2.4.0+cu121 pypi_0 pypi
torchmetrics 1.4.2 pypi_0 pypi
torchvision 0.19.0+cu121 pypi_0 pypi
tornado 6.4.1 pypi_0 pypi
tqdm 4.66.5 pypi_0 pypi
traitlets 5.14.3 pypi_0 pypi
transformers 4.45.1 pypi_0 pypi
trimesh 4.4.9 pypi_0 pypi
triton 3.0.0 pypi_0 pypi
typing-extensions 4.12.2 pypi_0 pypi
typing-inspect 0.9.0 pypi_0 pypi
tzdata 2024.2 pypi_0 pypi
ultralytics 8.3.3 pypi_0 pypi
ultralytics-thop 2.0.8 pypi_0 pypi
urllib3 1.26.13 pypi_0 pypi
urwid 2.6.15 pypi_0 pypi
urwid-readline 0.15.1 pypi_0 pypi
usd-core 24.8 pypi_0 pypi
vtk 9.3.1 pypi_0 pypi
wandb 0.18.3 pypi_0 pypi
watchdog 5.0.3 pypi_0 pypi
wcwidth 0.2.13 pypi_0 pypi
werkzeug 3.0.4 pypi_0 pypi
wheel 0.44.0 py310h06a4308_0
widgetsnbextension 4.0.13 pypi_0 pypi
wrapt 1.16.0 pypi_0 pypi
xatlas 0.0.9 pypi_0 pypi
xformers 0.0.28.post1 pypi_0 pypi
xxhash 3.5.0 pypi_0 pypi
xz 5.4.6 h5eee18b_1
yacs 0.1.8 pypi_0 pypi
yapf 0.40.2 pypi_0 pypi
yarl 1.13.1 pypi_0 pypi
zipp 3.20.2 pypi_0 pypi
zlib 1.2.13 h5eee18b_1

CUDA Version

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2023 NVIDIA Corporation
Built on Tue_Feb__7_19:32:13_PST_2023
Cuda compilation tools, release 12.1, V12.1.66
Build cuda_12.1.r12.1/compiler.32415258_0
Thu Oct 3 13:53:40 2024
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 565.51.01 Driver Version: 565.90 CUDA Version: 12.7 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 3090 On | 00000000:2B:00.0 On | N/A |
| 0% 33C P0 104W / 350W | 1394MiB / 24576MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| 0 N/A N/A 33 G /Xwayland N/A |
+-----------------------------------------------------------------------------------------+

PyTorch Version and CUDA Support

PyTorch Version: 2.4.0+cu121
CUDA Version: 12.1
CUDNN Version: 90100

GCC Version

gcc (conda-forge gcc 11.2.0-16) 11.2.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

glibc Version

ldd (Ubuntu GLIBC 2.39-0ubuntu8.3) 2.39
Copyright (C) 2024 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
Written by Roland McGrath and Ulrich Drepper.

Installed PyTorch Packages

pytorch-lightning 2.1.0
pytorch3d 0.7.8
torch 2.4.0+cu121
torch-ema 0.3
torch-fidelity 0.3.0
torchaudio 2.4.0+cu121
torchmetrics 1.4.2
torchvision 0.19.0+cu121

pip freeze Output

absl-py==2.1.0
accelerate==0.34.2
addict==2.4.0
aiohappyeyeballs==2.4.3
aiohttp==3.10.8
aiosignal==1.3.1
albucore==0.0.17
albumentations==1.4.17
altair==5.4.1
annotated-types==0.7.0
antlr4-python3-runtime==4.9.3
anyio==4.6.0
asttokens==2.4.1
async-timeout==4.0.3
attrs==24.2.0
autocommand==2.2.2
autoflake==2.3.1
backports.tarfile==1.2.0
bitsandbytes==0.44.1
blinker==1.8.2
boto3==1.35.31
botocore==1.35.31
cachetools==5.5.0
cchardet==2.1.7
certifi==2022.12.7
cffi==1.17.1
chardet==5.2.0
charset-normalizer==2.1.1
chumpy==0.70
click==8.1.7
clip @ git+https://github.com/openai/CLIP.git@dcba3cb2e2827b402d2701e7e1c7d9fed8a20ef1
comm==0.2.2
ConfigArgParse==1.7
contourpy==1.3.0
cycler==0.12.1
Cython==3.0.11
dash==2.18.1
dash-core-components==2.0.0
dash-html-components==2.0.0
dash-table==5.0.0
dataclasses==0.6
dataclasses-json==0.6.7
datasets==3.0.1
dearpygui==1.11.1
decorator==5.1.1
deepspeed==0.15.1
defusedxml==0.7.1
Deprecated==1.2.14
diffusers==0.30.3
dill==0.3.8
distro==1.9.0
docker-pycreds==0.4.0
einops==0.8.0
entrypoints==0.4
et-xmlfile==1.1.0
eval_type_backport==0.2.0
exceptiongroup==1.2.2
executing==2.1.0
face-alignment==1.4.1
fastjsonschema==2.20.0
filelock==3.13.1
flash-attn==2.6.3
Flask==3.0.3
flatbuffers==24.3.25
fonttools==4.54.1
freetype-py==2.5.1
-e git+https://github.com/YuliangXiu/PuzzleAvatar.git@e50e4f10aa2d31103e58bf49b2568c9773a28a35#egg=freqencoder&subdirectory=cores/lib/freqencoder
frozenlist==1.4.1
fsspec==2024.2.0
ftfy==6.2.3
func_timeout==4.3.5
future==1.0.0
fvcore==0.1.5.post20221221
gitdb==4.0.11
GitPython==3.1.43
-e git+https://github.com/YuliangXiu/PuzzleAvatar.git@e50e4f10aa2d31103e58bf49b2568c9773a28a35#egg=gridencoder&subdirectory=cores/lib/gridencoder
-e git+https://github.com/IDEA-Research/GroundingDINO.git@57535c5a79791cb76e36fdb64975271354f10251#egg=groundingdino
grpcio==1.66.2
h11==0.14.0
hjson==3.1.0
httpcore==1.0.6
httpx==0.27.2
huggingface-hub==0.25.1
idna==3.4
imageio==2.35.1
imageio-ffmpeg==0.5.1
importlib_metadata==8.5.0
importlib_resources==6.4.0
inflect==7.3.1
iopath==0.1.10
ipycanvas==0.13.3
ipyevents==2.0.2
ipython==8.28.0
ipywidgets==8.1.5
itsdangerous==2.2.0
jaraco.collections==5.1.0
jaraco.context==5.3.0
jaraco.functools==4.0.1
jaraco.text==3.12.1
jax==0.4.33
jaxlib==0.4.33
jedi==0.19.1
Jinja2==3.1.3
jiter==0.5.0
jmespath==1.0.1
joblib==1.4.2
jsonschema==4.23.0
jsonschema-specifications==2023.12.1
jupyter_client==7.4.9
jupyter_core==5.7.2
jupyterlab_widgets==3.0.13
kaolin==0.16.0
kiwisolver==1.4.7
kornia==0.7.3
kornia_rs==0.1.5
lazy_loader==0.4
lightning-utilities==0.11.7
llvmlite==0.43.0
lpips==0.1.4
Markdown==3.7
markdown-it-py==3.0.0
MarkupSafe==2.1.5
marshmallow==3.22.0
matplotlib==3.9.2
matplotlib-inline==0.1.7
mdurl==0.1.2
mediapipe==0.10.15
mesh-to-sdf @ git+https://github.com/marian42/mesh_to_sdf.git@c9f26e6399f7fd8deb40c7fba02c7e74aca6c657
ml_dtypes==0.5.0
more-itertools==10.3.0
mpmath==1.3.0
multidict==6.1.0
multipledispatch==1.0.0
multiprocess==0.70.16
mypy-extensions==1.0.0
narwhals==1.9.0
nbformat==5.10.4
nest-asyncio==1.6.0
networkx==3.2.1
ninja==1.11.1.1
numba==0.60.0
numpy==1.26.3
-e git+https://github.com/NVlabs/nvdiffrast.git@729261dc64c4241ea36efda84fbf532cc8b425b8#egg=nvdiffrast
nvidia-cublas-cu12==12.1.3.1
nvidia-cuda-cupti-cu12==12.1.105
nvidia-cuda-nvrtc-cu12==12.1.105
nvidia-cuda-runtime-cu12==12.1.105
nvidia-cudnn-cu12==9.1.0.70
nvidia-cufft-cu12==11.0.2.54
nvidia-curand-cu12==10.3.2.106
nvidia-cusolver-cu12==11.4.5.107
nvidia-cusparse-cu12==12.1.0.106
nvidia-ml-py==12.560.30
nvidia-nccl-cu12==2.20.5
nvidia-nvjitlink-cu12==12.6.77
nvidia-nvtx-cu12==12.1.105
omegaconf==2.3.0
open3d==0.18.0
openai==1.51.0
opencv-contrib-python==4.10.0.84
opencv-python==4.10.0.84
opencv-python-headless==4.10.0.84
openpyxl==3.1.5
opt_einsum==3.4.0
packaging==24.1
pandas==2.2.3
parso==0.8.4
-e git+https://github.com/huggingface/peft.git@ca8462bb68b48d3cc613f3aafc81eb50634549d1#egg=peft
pexpect==4.9.0
pillow==10.2.0
platformdirs==4.3.6
plotly==5.24.1
pooch==1.8.2
portalocker==2.10.1
prompt_toolkit==3.0.48
protobuf==4.25.5
psutil==6.0.0
ptyprocess==0.7.0
pudb==2024.1.2
pure_eval==0.2.3
py-cpuinfo==9.0.0
pyarrow==17.0.0
pybind11==2.13.6
pycocotools==2.0.8
pycparser==2.22
pydantic==2.9.2
pydantic_core==2.23.4
pydeck==0.9.1
pyflakes==3.2.0
pyfqmr==0.2.1
pyglet==2.0.17
pygltflib==1.16.2
Pygments==2.18.0
PyMCubes==0.1.6
pymeshfix==0.17.0
pymeshlab==2023.12.post2
PyOpenGL==3.1.0
pyparsing==3.1.4
pyquaternion==0.9.9
pyrender==0.1.45
pyrr==0.10.3
python-dateutil==2.9.0.post0
pytorch-lightning==2.1.0
pytorch3d==0.7.8
pytz==2024.2
pyvista==0.44.1
PyYAML==6.0.2
pyzmq==26.2.0
rectangle-packer==2.0.2
referencing==0.35.1
regex==2024.9.11
replicate==0.34.1
requests==2.32.3
retrying==1.3.4
rich==13.9.1
rpds-py==0.20.0
Rtree==1.3.0
s3transfer==0.10.2
safetensors==0.4.5
scikit-image==0.24.0
scikit-learn==1.5.2
scipy==1.14.1
scooby==0.10.0
seaborn==0.13.2
segment_anything @ git+https://github.com/facebookresearch/segment-anything.git@dca509fe793f601edb92606367a655c15ac00fdf
sentencepiece==0.2.0
sentry-sdk==2.15.0
setproctitle==1.3.3
six==1.16.0
smmap==5.0.1
sniffio==1.3.1
sounddevice==0.5.0
stack-data==0.6.3
streamlit==1.39.0
supervision==0.4.0
sympy==1.12
tabulate==0.9.0
taming-transformers @ git+https://github.com/YuliangXiu/taming-transformers.git@97c8078024f32c86518dda684caac5b6c548140f
tenacity==9.0.0
tensorboard==2.18.0
tensorboard-data-server==0.7.2
tensorboardX==2.6.2.2
termcolor==2.4.0
test_tube==0.7.5
tetgen==0.6.4
threadpoolctl==3.5.0
tifffile==2024.9.20
timm==1.0.9
tokenizers==0.20.0
toml==0.10.2
tomli==2.0.2
torch==2.4.0+cu121
torch-ema==0.3
torch-fidelity==0.3.0
torchaudio==2.4.0+cu121
torchmetrics==1.4.2
torchvision==0.19.0+cu121
tornado==6.4.1
tqdm==4.66.5
traitlets==5.14.3
transformers==4.45.1
trimesh==4.4.9
triton==3.0.0
typeguard==4.3.0
typing-inspect==0.9.0
typing_extensions==4.12.2
tzdata==2024.2
ultralytics==8.3.3
ultralytics-thop==2.0.8
urllib3==1.26.13
urwid==2.6.15
urwid_readline==0.15.1
usd-core==24.8
vtk==9.3.1
wandb==0.18.3
watchdog==5.0.3
wcwidth==0.2.13
Werkzeug==3.0.4
widgetsnbextension==4.0.13
wrapt==1.16.0
xatlas==0.0.9
xformers==0.0.28.post1
xxhash==3.5.0
yacs==0.1.8
yapf==0.40.2
yarl==1.13.1
zipp==3.20.2

NVIDIA Driver Version

Thu Oct  3 13:53:44 2024       
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 565.51.01              Driver Version: 565.90         CUDA Version: 12.7     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce RTX 3090        On  |   00000000:2B:00.0  On |                  N/A |
|  0%   32C    P8             34W /  350W |    1394MiB /  24576MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
                                                                                         
+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|    0   N/A  N/A        33      G   /Xwayland                                   N/A      |
+-----------------------------------------------------------------------------------------+

@YuliangXiu
Copy link
Owner

How about removing the torch.compile at https://github.com/YuliangXiu/PuzzleAvatar/blob/main/cores/lib/trainer.py#L113? I have not tested my code on the windows platform.

@gt732
Copy link
Author

gt732 commented Oct 3, 2024

@YuliangXiu

NICE!!!!! That worked! I'll keep you posted!

image

@YuliangXiu
Copy link
Owner

YuliangXiu commented Oct 3, 2024

@YuliangXiu

NICE!!!!! That worked! I'll keep you posted!

image

Great, congrats @gt732 ! Could you please submit a pull request to reduce VRAM usage and maybe other small changes to run the code?

This would be really helpful for users with limited GPU resources. Thanks so much.

@YuliangXiu YuliangXiu reopened this Oct 17, 2024
@jim-1ee
Copy link

jim-1ee commented Nov 28, 2024

@YuliangXiu I was able to get the initial model trained using the following arguments. The VRAM usage spiked between 8-16GB.

accelerate launch multi_concepts/train.py \
  --pretrained_model_name_or_path $BASE_MODEL \
  --project_name ${SUBJECT_NAME} \
  --instance_data_dir ${INPUT_DIR}  \
  --output_dir ${EXP_DIR} \
  --class_data_dir data/multi_concepts_data \
  --train_batch_size 1  \
  --phase1_train_steps 1000 \
  --phase2_train_steps 4000 \
  --lr_step_rules "1:2000,0.1" \
  --initial_learning_rate 5e-4 \
  --learning_rate 2e-6 \
  --prior_loss_weight 1.0 \
  --syn_loss_weight "2.0,2.0" \
  --mask_loss_weight 1.0 \
  --lambda_attention 1e-2 \
  --img_log_steps 1000 \
  --checkpointing_steps 1000 \
  --use_view_prompt \
  --log_checkpoints \
  --boft_block_num=8 \
  --boft_block_size=0 \
  --boft_n_butterfly_factor=1 \
  --lora_r=32 \
  --enable_xformers_memory_efficient_attention \
  --use_peft ${peft_type} \
  --wandb_mode "offline" \
  --use_view_prompt \
  --do_not_apply_masked_prior \
  --mixed_precision fp16 \
  --gradient_checkpointing \
  --use_8bit_adam \
  --set_grads_to_none \

Now the last challenge is getting this step to run

python cores/main_mc.py \
 --config configs/tech_mc_geometry.yaml \
 --exp_dir ${EXP_DIR} \
 --sub_name ${SUBJECT_NAME} \
 --use_peft ${peft_type} \
 --use_shape_description \

I'm running into compiling issues when running the trainer. I tried a ton of different methods to try and fix it but there's something wrong with the gcc libraries in my conda env. This is being tested on Windows 10 WSL Ubuntu 24.04.1 LTS

ERROR

Memory usage statistics:

  Maximum number of tetrahedra:  5333413
  Maximum number of tet blocks (blocksize = 8188):  652
  Approximate memory for tetrahedral mesh (bytes):  752,983,904
  Approximate memory for extra pointers (bytes):  12,066,080
  Approximate memory for algorithms (bytes):  134,400
  Approximate memory for working arrays (bytes):  210,901,848
  Approximate total used memory (bytes):  976,086,232

shape of vertices: (834173, 3), shape of grids: (4986585, 4)
MESA: error: ZINK: failed to choose pdev
glx: failed to create drisw screen
100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1000/1000 [03:15<00:00,  5.11it/s]
fitted mesh with num_vertex 481862, num_faces 890574
[INFO] loading stable diffusion...
[INFO] using hugging face custom model key: results/human/yuliang
Loading pipeline components...: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 6/6 [00:00<00:00, 16.43it/s]
Added 7 tokens
[INFO] loaded PEFT adapters!
[INFO] loaded stable diffusion!
get rgb text prompt
get normal text prompt
[INFO] Trainer: df | 2024-10-03_10-02-14 | cuda | fp32 | results/human/yuliang/geometry
[INFO] #parameters: 11480403
[INFO] Loading latest checkpoint ...
[WARN] No checkpoint found, model randomly initialized.
++> Evaluate results/human/yuliang/geometry at epoch 0 ...
  0% 0/10 [00:00<?, ?it/s]/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/variables/functions.py:663: UserWarning: Graph break due to unsupported builtin _gridencoder.PyCapsule.grid_encode_forward. This function is either a Python builtin (e.g. _warnings.warn) or a third-party C/C++ Python extension (perhaps created with pybind). If it is a Python builtin, please file an issue on GitHub so the PyTorch team can add support for it and see the next case for a workaround. If it is a third-party C/C++ Python extension, please either wrap it into a PyTorch-understood custom operator (see https://pytorch.org/tutorials/advanced/custom_ops_landing_page.html for more details) or, if it is traceable, use torch.compiler.allow_in_graph.
  torch._dynamo.utils.warn_once(msg)
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: cannot find /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
collect2: error: ld returned 1 exit status
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: cannot find /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
collect2: error: ld returned 1 exit status
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: cannot find /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
collect2: error: ld returned 1 exit status
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: cannot find /lib/x86_64-linux-gnu/libc.so.6
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: /lib/x86_64-linux-gnu/libc.so.6: unknown type [0x13] section `.relr.dyn'
/home/scheme/anaconda3/envs/PuzzleAvatar/bin/../lib/gcc/x86_64-conda-linux-gnu/11.2.0/../../../../x86_64-conda-linux-gnu/bin/ld: skipping incompatible /lib/x86_64-linux-gnu/libc.so.6 when searching for /lib/x86_64-linux-gnu/libc.so.6
collect2: error: ld returned 1 exit status
Traceback (most recent call last):
  File "/home/scheme/PuzzleAvatar/cores/main_mc.py", line 379, in <module>
    trainer.train(train_loader, valid_loader, max_epoch)
  File "/home/scheme/PuzzleAvatar/cores/lib/trainer.py", line 723, in train
    self.evaluate_one_epoch(valid_loader)
  File "/home/scheme/PuzzleAvatar/cores/lib/trainer.py", line 1032, in evaluate_one_epoch
    preds, preds_depth, preds_normal, preds_alpha, loss = self.eval_step(data)
  File "/home/scheme/PuzzleAvatar/cores/lib/trainer.py", line 594, in eval_step
    outputs = self.model(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1562, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 433, in _fn
    return fn(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1562, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 1116, in __call__
    return self._torchdynamo_orig_callable(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 948, in __call__
    result = self._inner_convert(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 472, in __call__
    return _compile(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_utils_internal.py", line 84, in wrapper_function
    return StrobelightCompileTimeProfiler.profile_compile_time(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_strobelight/compile_time_profiler.py", line 129, in profile_compile_time
    return func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 817, in _compile
    guarded_code = compile_inner(code, one_graph, hooks, transform)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 636, in compile_inner
    out_code = transform_code_object(code, transform)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/bytecode_transformation.py", line 1185, in transform_code_object
    transformations(instructions, code_options)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 178, in _fn
    return fn(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/convert_frame.py", line 582, in transform
    tracer.run()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/symbolic_convert.py", line 2451, in run
    super().run()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/symbolic_convert.py", line 893, in run
    while self.step():
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/symbolic_convert.py", line 805, in step
    self.dispatch_table[inst.opcode](self, inst)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/symbolic_convert.py", line 497, in wrapper
    return handle_graph_break(self, inst, speculation.reason)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/symbolic_convert.py", line 566, in handle_graph_break
    self.output.compile_subgraph(self, reason=reason)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/output_graph.py", line 1123, in compile_subgraph
    self.compile_and_call_fx_graph(tx, pass2.graph_output_vars(), root)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/output_graph.py", line 1318, in compile_and_call_fx_graph
    compiled_fn = self.call_user_compiler(gm)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/output_graph.py", line 1409, in call_user_compiler
    raise BackendCompilerFailed(self.compiler_fn, e).with_traceback(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/output_graph.py", line 1390, in call_user_compiler
    compiled_fn = compiler_fn(gm, self.example_inputs())
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/repro/after_dynamo.py", line 129, in __call__
    compiled_gm = compiler_fn(gm, example_inputs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/__init__.py", line 1951, in __call__
    return compile_fx(model_, inputs_, config_patches=self.config)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1505, in compile_fx
    return aot_autograd(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/backends/common.py", line 69, in __call__
    cg = aot_module_simplified(gm, example_inputs, **self.kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_functorch/aot_autograd.py", line 954, in aot_module_simplified
    compiled_fn, _ = create_aot_dispatcher_function(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_functorch/aot_autograd.py", line 687, in create_aot_dispatcher_function
    compiled_fn, fw_metadata = compiler_fn(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/jit_compile_runtime_wrappers.py", line 168, in aot_dispatch_base
    compiled_fw = compiler(fw_module, updated_flat_args)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1410, in fw_compiler_base
    return inner_compile(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 84, in debug_wrapper
    inner_compiled_fn = compiler_fn(gm, example_inputs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/debug.py", line 304, in inner
    return fn(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 527, in compile_fx_inner
    compiled_graph = fx_codegen_and_compile(
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/contextlib.py", line 79, in inner
    return func(*args, **kwds)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 831, in fx_codegen_and_compile
    compiled_fn = graph.compile_to_fn()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/graph.py", line 1749, in compile_to_fn
    return self.compile_to_module().call
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/graph.py", line 1678, in compile_to_module
    self.codegen_with_cpp_wrapper() if self.cpp_wrapper else self.codegen()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/graph.py", line 1638, in codegen
    self.scheduler.codegen()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_dynamo/utils.py", line 231, in time_wrapper
    r = func(*args, **kwargs)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/scheduler.py", line 2741, in codegen
    self.get_backend(device).codegen_node(node)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/codegen/cuda_combined_scheduling.py", line 69, in codegen_node
    return self._triton_scheduling.codegen_node(node)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/codegen/simd.py", line 1148, in codegen_node
    return self.codegen_node_schedule(node_schedule, buf_accesses, numel, rnumel)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/codegen/simd.py", line 1317, in codegen_node_schedule
    src_code = kernel.codegen_kernel()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/codegen/triton.py", line 2159, in codegen_kernel
    **self.inductor_meta_common(),
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/_inductor/codegen/triton.py", line 2047, in inductor_meta_common
    "backend_hash": torch.utils._triton.triton_hash_with_backend(),
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/utils/_triton.py", line 63, in triton_hash_with_backend
    backend = triton_backend()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/torch/utils/_triton.py", line 49, in triton_backend
    target = driver.active.get_current_target()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/runtime/driver.py", line 23, in __getattr__
    self._initialize_obj()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/runtime/driver.py", line 20, in _initialize_obj
    self._obj = self._init_fn()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/runtime/driver.py", line 9, in _create_driver
    return actives[0]()
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/backends/nvidia/driver.py", line 371, in __init__
    self.utils = CudaUtils()  # TODO: make static
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/backends/nvidia/driver.py", line 80, in __init__
    mod = compile_module_from_src(Path(os.path.join(dirname, "driver.c")).read_text(), "cuda_utils")
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/backends/nvidia/driver.py", line 57, in compile_module_from_src
    so = _build(name, src_path, tmpdir, library_dirs(), include_dir, libraries)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/runtime/build.py", line 48, in _build
    ret = subprocess.check_call(cc_cmd)
  File "/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/subprocess.py", line 369, in check_call
    raise CalledProcessError(retcode, cmd)
torch._dynamo.exc.BackendCompilerFailed: backend='inductor' raised:
CalledProcessError: Command '['/home/scheme/anaconda3/envs/PuzzleAvatar/bin/x86_64-conda-linux-gnu-cc', '/tmp/tmp6m8__m0s/main.c', '-O3', '-shared', '-fPIC', '-o', '/tmp/tmp6m8__m0s/cuda_utils.cpython-310-x86_64-linux-gnu.so', '-lcuda', '-L/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/backends/nvidia/lib', '-L/usr/lib/wsl/lib', '-L/lib/x86_64-linux-gnu', '-I/home/scheme/anaconda3/envs/PuzzleAvatar/lib/python3.10/site-packages/triton/backends/nvidia/include', '-I/tmp/tmp6m8__m0s', '-I/home/scheme/anaconda3/envs/PuzzleAvatar/include/python3.10']' returned non-zero exit status 1.

Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information


You can suppress this exception and fall back to eager by setting:
    import torch._dynamo
    torch._dynamo.config.suppress_errors = True

  0% 0/10 [00:03<?, ?it/s]

ENVIRONMENT INFO

System Information

OS Information

Distributor ID: Ubuntu Description: Ubuntu 24.04.1 LTS Release: 24.04 Codename: noble 5.15.153.1-microsoft-standard-WSL2

Python Version

Python 3.10.14

Conda Environment Packages

packages in environment at /home/scheme/anaconda3/envs/PuzzleAvatar:

Name Version Build Channel

_libgcc_mutex 0.1 conda_forge conda-forge _openmp_mutex 4.5 2_gnu conda-forge _sysroot_linux-64_curr_repodata_hack 3 haa98f57_10 absl-py 2.1.0 pypi_0 pypi accelerate 0.34.2 pypi_0 pypi addict 2.4.0 pypi_0 pypi aiohappyeyeballs 2.4.3 pypi_0 pypi aiohttp 3.10.8 pypi_0 pypi aiosignal 1.3.1 pypi_0 pypi albucore 0.0.17 pypi_0 pypi albumentations 1.4.17 pypi_0 pypi altair 5.4.1 pypi_0 pypi annotated-types 0.7.0 pypi_0 pypi antlr4-python3-runtime 4.9.3 pypi_0 pypi anyio 4.6.0 pypi_0 pypi asttokens 2.4.1 pypi_0 pypi async-timeout 4.0.3 pypi_0 pypi attrs 24.2.0 pypi_0 pypi autoflake 2.3.1 pypi_0 pypi binutils 2.36.1 hdd6e379_2 conda-forge binutils_impl_linux-64 2.36.1 h193b22a_2 conda-forge binutils_linux-64 2.36 hf3e587d_10 conda-forge bitsandbytes 0.44.1 pypi_0 pypi blinker 1.8.2 pypi_0 pypi boto3 1.35.31 pypi_0 pypi botocore 1.35.31 pypi_0 pypi bzip2 1.0.8 h5eee18b_6 ca-certificates 2024.9.24 h06a4308_0 cachetools 5.5.0 pypi_0 pypi cchardet 2.1.7 pypi_0 pypi certifi 2022.12.7 pypi_0 pypi cffi 1.17.1 pypi_0 pypi chardet 5.2.0 pypi_0 pypi charset-normalizer 2.1.1 pypi_0 pypi chumpy 0.70 pypi_0 pypi click 8.1.7 pypi_0 pypi clip 1.0 pypi_0 pypi comm 0.2.2 pypi_0 pypi configargparse 1.7 pypi_0 pypi contourpy 1.3.0 pypi_0 pypi cycler 0.12.1 pypi_0 pypi cython 3.0.11 pypi_0 pypi dash 2.18.1 pypi_0 pypi dash-core-components 2.0.0 pypi_0 pypi dash-html-components 2.0.0 pypi_0 pypi dash-table 5.0.0 pypi_0 pypi dataclasses 0.6 pypi_0 pypi dataclasses-json 0.6.7 pypi_0 pypi datasets 3.0.1 pypi_0 pypi dearpygui 1.11.1 pypi_0 pypi decorator 5.1.1 pypi_0 pypi deepspeed 0.15.1 pypi_0 pypi defusedxml 0.7.1 pypi_0 pypi deprecated 1.2.14 pypi_0 pypi diffusers 0.30.3 pypi_0 pypi dill 0.3.8 pypi_0 pypi distro 1.9.0 pypi_0 pypi docker-pycreds 0.4.0 pypi_0 pypi einops 0.8.0 pypi_0 pypi entrypoints 0.4 pypi_0 pypi et-xmlfile 1.1.0 pypi_0 pypi eval-type-backport 0.2.0 pypi_0 pypi exceptiongroup 1.2.2 pypi_0 pypi executing 2.1.0 pypi_0 pypi face-alignment 1.4.1 pypi_0 pypi fastjsonschema 2.20.0 pypi_0 pypi filelock 3.13.1 pypi_0 pypi flash-attn 2.6.3 pypi_0 pypi flask 3.0.3 pypi_0 pypi flatbuffers 24.3.25 pypi_0 pypi fonttools 4.54.1 pypi_0 pypi freetype-py 2.5.1 pypi_0 pypi freqencoder 0.0.0 dev_0 frozenlist 1.4.1 pypi_0 pypi fsspec 2024.2.0 pypi_0 pypi ftfy 6.2.3 pypi_0 pypi func-timeout 4.3.5 pypi_0 pypi future 1.0.0 pypi_0 pypi fvcore 0.1.5.post20221221 pypi_0 pypi gcc 11.2.0 h702ea55_10 conda-forge gcc_impl_linux-64 11.2.0 h82a94d6_16 conda-forge gcc_linux-64 11.2.0 h39a9532_10 conda-forge gitdb 4.0.11 pypi_0 pypi gitpython 3.1.43 pypi_0 pypi gridencoder 0.0.0 dev_0 groundingdino 0.1.0 dev_0 grpcio 1.66.2 pypi_0 pypi gxx_impl_linux-64 11.2.0 h82a94d6_16 conda-forge gxx_linux-64 11.2.0 hacbe6df_10 conda-forge h11 0.14.0 pypi_0 pypi hjson 3.1.0 pypi_0 pypi httpcore 1.0.6 pypi_0 pypi httpx 0.27.2 pypi_0 pypi huggingface-hub 0.25.1 pypi_0 pypi idna 3.4 pypi_0 pypi imageio 2.35.1 pypi_0 pypi imageio-ffmpeg 0.5.1 pypi_0 pypi importlib-metadata 8.5.0 pypi_0 pypi iopath 0.1.10 pypi_0 pypi ipycanvas 0.13.3 pypi_0 pypi ipyevents 2.0.2 pypi_0 pypi ipython 8.28.0 pypi_0 pypi ipywidgets 8.1.5 pypi_0 pypi itsdangerous 2.2.0 pypi_0 pypi jax 0.4.33 pypi_0 pypi jaxlib 0.4.33 pypi_0 pypi jedi 0.19.1 pypi_0 pypi jinja2 3.1.3 pypi_0 pypi jiter 0.5.0 pypi_0 pypi jmespath 1.0.1 pypi_0 pypi joblib 1.4.2 pypi_0 pypi jsonschema 4.23.0 pypi_0 pypi jsonschema-specifications 2023.12.1 pypi_0 pypi jupyter-client 7.4.9 pypi_0 pypi jupyter-core 5.7.2 pypi_0 pypi jupyterlab-widgets 3.0.13 pypi_0 pypi kaolin 0.16.0 pypi_0 pypi kernel-headers_linux-64 3.10.0 h57e8cba_10 kiwisolver 1.4.7 pypi_0 pypi kornia 0.7.3 pypi_0 pypi kornia-rs 0.1.5 pypi_0 pypi lazy-loader 0.4 pypi_0 pypi ld_impl_linux-64 2.36.1 hea4e1c9_2 conda-forge libffi 3.4.4 h6a678d5_1 libgcc 5.2.0 0 conda-forge libgcc-devel_linux-64 11.2.0 h0952999_16 conda-forge libgcc-ng 14.1.0 h77fa898_0 conda-forge libgomp 14.1.0 h77fa898_0 conda-forge libsanitizer 11.2.0 he4da1e4_16 conda-forge libstdcxx-devel_linux-64 11.2.0 h0952999_16 conda-forge libstdcxx-ng 14.1.0 hc0a3c3a_0 conda-forge libuuid 1.41.5 h5eee18b_0 lightning-utilities 0.11.7 pypi_0 pypi llvmlite 0.43.0 pypi_0 pypi lpips 0.1.4 pypi_0 pypi markdown 3.7 pypi_0 pypi markdown-it-py 3.0.0 pypi_0 pypi markupsafe 2.1.5 pypi_0 pypi marshmallow 3.22.0 pypi_0 pypi matplotlib 3.9.2 pypi_0 pypi matplotlib-inline 0.1.7 pypi_0 pypi mdurl 0.1.2 pypi_0 pypi mediapipe 0.10.15 pypi_0 pypi mesh-to-sdf 0.0.15 pypi_0 pypi ml-dtypes 0.5.0 pypi_0 pypi mpmath 1.3.0 pypi_0 pypi multidict 6.1.0 pypi_0 pypi multipledispatch 1.0.0 pypi_0 pypi multiprocess 0.70.16 pypi_0 pypi mypy-extensions 1.0.0 pypi_0 pypi narwhals 1.9.0 pypi_0 pypi nbformat 5.10.4 pypi_0 pypi ncurses 6.4 h6a678d5_0 nest-asyncio 1.6.0 pypi_0 pypi networkx 3.2.1 pypi_0 pypi ninja 1.11.1.1 pypi_0 pypi numba 0.60.0 pypi_0 pypi numpy 1.26.3 pypi_0 pypi nvdiffrast 0.3.3 dev_0 nvidia-cublas-cu12 12.1.3.1 pypi_0 pypi nvidia-cuda-cupti-cu12 12.1.105 pypi_0 pypi nvidia-cuda-nvrtc-cu12 12.1.105 pypi_0 pypi nvidia-cuda-runtime-cu12 12.1.105 pypi_0 pypi nvidia-cudnn-cu12 9.1.0.70 pypi_0 pypi nvidia-cufft-cu12 11.0.2.54 pypi_0 pypi nvidia-curand-cu12 10.3.2.106 pypi_0 pypi nvidia-cusolver-cu12 11.4.5.107 pypi_0 pypi nvidia-cusparse-cu12 12.1.0.106 pypi_0 pypi nvidia-ml-py 12.560.30 pypi_0 pypi nvidia-nccl-cu12 2.20.5 pypi_0 pypi nvidia-nvjitlink-cu12 12.6.77 pypi_0 pypi nvidia-nvtx-cu12 12.1.105 pypi_0 pypi omegaconf 2.3.0 pypi_0 pypi open3d 0.18.0 pypi_0 pypi openai 1.51.0 pypi_0 pypi opencv-contrib-python 4.10.0.84 pypi_0 pypi opencv-python 4.10.0.84 pypi_0 pypi opencv-python-headless 4.10.0.84 pypi_0 pypi openpyxl 3.1.5 pypi_0 pypi openssl 3.3.1 hb9d3cd8_3 conda-forge opt-einsum 3.4.0 pypi_0 pypi packaging 24.1 pypi_0 pypi pandas 2.2.3 pypi_0 pypi parso 0.8.4 pypi_0 pypi peft 0.13.1.dev0 pypi_0 pypi pexpect 4.9.0 pypi_0 pypi pillow 10.2.0 pypi_0 pypi pip 24.2 py310h06a4308_0 platformdirs 4.3.6 pypi_0 pypi plotly 5.24.1 pypi_0 pypi pooch 1.8.2 pypi_0 pypi portalocker 2.10.1 pypi_0 pypi prompt-toolkit 3.0.48 pypi_0 pypi protobuf 4.25.5 pypi_0 pypi psutil 6.0.0 pypi_0 pypi ptyprocess 0.7.0 pypi_0 pypi pudb 2024.1.2 pypi_0 pypi pure-eval 0.2.3 pypi_0 pypi py-cpuinfo 9.0.0 pypi_0 pypi pyarrow 17.0.0 pypi_0 pypi pybind11 2.13.6 pypi_0 pypi pycocotools 2.0.8 pypi_0 pypi pycparser 2.22 pypi_0 pypi pydantic 2.9.2 pypi_0 pypi pydantic-core 2.23.4 pypi_0 pypi pydeck 0.9.1 pypi_0 pypi pyflakes 3.2.0 pypi_0 pypi pyfqmr 0.2.1 pypi_0 pypi pyglet 2.0.17 pypi_0 pypi pygltflib 1.16.2 pypi_0 pypi pygments 2.18.0 pypi_0 pypi pymcubes 0.1.6 pypi_0 pypi pymeshfix 0.17.0 pypi_0 pypi pymeshlab 2023.12.post2 pypi_0 pypi pyopengl 3.1.0 pypi_0 pypi pyparsing 3.1.4 pypi_0 pypi pyquaternion 0.9.9 pypi_0 pypi pyrender 0.1.45 pypi_0 pypi pyrr 0.10.3 pypi_0 pypi python 3.10.14 h955ad1f_1 python-dateutil 2.9.0.post0 pypi_0 pypi pytorch-lightning 2.1.0 pypi_0 pypi pytorch3d 0.7.8 pypi_0 pypi pytz 2024.2 pypi_0 pypi pyvista 0.44.1 pypi_0 pypi pyyaml 6.0.2 pypi_0 pypi pyzmq 26.2.0 pypi_0 pypi readline 8.2 h5eee18b_0 rectangle-packer 2.0.2 pypi_0 pypi referencing 0.35.1 pypi_0 pypi regex 2024.9.11 pypi_0 pypi replicate 0.34.1 pypi_0 pypi requests 2.32.3 pypi_0 pypi retrying 1.3.4 pypi_0 pypi rich 13.9.1 pypi_0 pypi rpds-py 0.20.0 pypi_0 pypi rtree 1.3.0 pypi_0 pypi s3transfer 0.10.2 pypi_0 pypi safetensors 0.4.5 pypi_0 pypi scikit-image 0.24.0 pypi_0 pypi scikit-learn 1.5.2 pypi_0 pypi scipy 1.14.1 pypi_0 pypi scooby 0.10.0 pypi_0 pypi seaborn 0.13.2 pypi_0 pypi segment-anything 1.0 pypi_0 pypi sentencepiece 0.2.0 pypi_0 pypi sentry-sdk 2.15.0 pypi_0 pypi setproctitle 1.3.3 pypi_0 pypi setuptools 75.1.0 py310h06a4308_0 six 1.16.0 pypi_0 pypi smmap 5.0.1 pypi_0 pypi sniffio 1.3.1 pypi_0 pypi sounddevice 0.5.0 pypi_0 pypi sqlite 3.45.3 h5eee18b_0 stack-data 0.6.3 pypi_0 pypi streamlit 1.39.0 pypi_0 pypi supervision 0.4.0 pypi_0 pypi sympy 1.12 pypi_0 pypi sysroot_linux-64 2.17 h57e8cba_10 tabulate 0.9.0 pypi_0 pypi taming-transformers 0.0.1 pypi_0 pypi tenacity 9.0.0 pypi_0 pypi tensorboard 2.18.0 pypi_0 pypi tensorboard-data-server 0.7.2 pypi_0 pypi tensorboardx 2.6.2.2 pypi_0 pypi termcolor 2.4.0 pypi_0 pypi test-tube 0.7.5 pypi_0 pypi tetgen 0.6.4 pypi_0 pypi threadpoolctl 3.5.0 pypi_0 pypi tifffile 2024.9.20 pypi_0 pypi timm 1.0.9 pypi_0 pypi tk 8.6.14 h39e8969_0 tokenizers 0.20.0 pypi_0 pypi toml 0.10.2 pypi_0 pypi tomli 2.0.2 pypi_0 pypi torch 2.4.0+cu121 pypi_0 pypi torch-ema 0.3 pypi_0 pypi torch-fidelity 0.3.0 pypi_0 pypi torchaudio 2.4.0+cu121 pypi_0 pypi torchmetrics 1.4.2 pypi_0 pypi torchvision 0.19.0+cu121 pypi_0 pypi tornado 6.4.1 pypi_0 pypi tqdm 4.66.5 pypi_0 pypi traitlets 5.14.3 pypi_0 pypi transformers 4.45.1 pypi_0 pypi trimesh 4.4.9 pypi_0 pypi triton 3.0.0 pypi_0 pypi typing-extensions 4.12.2 pypi_0 pypi typing-inspect 0.9.0 pypi_0 pypi tzdata 2024.2 pypi_0 pypi ultralytics 8.3.3 pypi_0 pypi ultralytics-thop 2.0.8 pypi_0 pypi urllib3 1.26.13 pypi_0 pypi urwid 2.6.15 pypi_0 pypi urwid-readline 0.15.1 pypi_0 pypi usd-core 24.8 pypi_0 pypi vtk 9.3.1 pypi_0 pypi wandb 0.18.3 pypi_0 pypi watchdog 5.0.3 pypi_0 pypi wcwidth 0.2.13 pypi_0 pypi werkzeug 3.0.4 pypi_0 pypi wheel 0.44.0 py310h06a4308_0 widgetsnbextension 4.0.13 pypi_0 pypi wrapt 1.16.0 pypi_0 pypi xatlas 0.0.9 pypi_0 pypi xformers 0.0.28.post1 pypi_0 pypi xxhash 3.5.0 pypi_0 pypi xz 5.4.6 h5eee18b_1 yacs 0.1.8 pypi_0 pypi yapf 0.40.2 pypi_0 pypi yarl 1.13.1 pypi_0 pypi zipp 3.20.2 pypi_0 pypi zlib 1.2.13 h5eee18b_1

CUDA Version

nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2023 NVIDIA Corporation Built on Tue_Feb__7_19:32:13_PST_2023 Cuda compilation tools, release 12.1, V12.1.66 Build cuda_12.1.r12.1/compiler.32415258_0 Thu Oct 3 13:53:40 2024 +-----------------------------------------------------------------------------------------+ | NVIDIA-SMI 565.51.01 Driver Version: 565.90 CUDA Version: 12.7 | |-----------------------------------------+------------------------+----------------------+ | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |=========================================+========================+======================| | 0 NVIDIA GeForce RTX 3090 On | 00000000:2B:00.0 On | N/A | | 0% 33C P0 104W / 350W | 1394MiB / 24576MiB | 0% Default | | | | N/A | +-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=========================================================================================| | 0 N/A N/A 33 G /Xwayland N/A | +-----------------------------------------------------------------------------------------+

PyTorch Version and CUDA Support

PyTorch Version: 2.4.0+cu121 CUDA Version: 12.1 CUDNN Version: 90100

GCC Version

gcc (conda-forge gcc 11.2.0-16) 11.2.0 Copyright (C) 2021 Free Software Foundation, Inc. This is free software; see the source for copying conditions. There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

glibc Version

ldd (Ubuntu GLIBC 2.39-0ubuntu8.3) 2.39 Copyright (C) 2024 Free Software Foundation, Inc. This is free software; see the source for copying conditions. There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. Written by Roland McGrath and Ulrich Drepper.

Installed PyTorch Packages

pytorch-lightning 2.1.0 pytorch3d 0.7.8 torch 2.4.0+cu121 torch-ema 0.3 torch-fidelity 0.3.0 torchaudio 2.4.0+cu121 torchmetrics 1.4.2 torchvision 0.19.0+cu121

pip freeze Output

absl-py==2.1.0 accelerate==0.34.2 addict==2.4.0 aiohappyeyeballs==2.4.3 aiohttp==3.10.8 aiosignal==1.3.1 albucore==0.0.17 albumentations==1.4.17 altair==5.4.1 annotated-types==0.7.0 antlr4-python3-runtime==4.9.3 anyio==4.6.0 asttokens==2.4.1 async-timeout==4.0.3 attrs==24.2.0 autocommand==2.2.2 autoflake==2.3.1 backports.tarfile==1.2.0 bitsandbytes==0.44.1 blinker==1.8.2 boto3==1.35.31 botocore==1.35.31 cachetools==5.5.0 cchardet==2.1.7 certifi==2022.12.7 cffi==1.17.1 chardet==5.2.0 charset-normalizer==2.1.1 chumpy==0.70 click==8.1.7 clip @ git+https://github.com/openai/CLIP.git@dcba3cb2e2827b402d2701e7e1c7d9fed8a20ef1 comm==0.2.2 ConfigArgParse==1.7 contourpy==1.3.0 cycler==0.12.1 Cython==3.0.11 dash==2.18.1 dash-core-components==2.0.0 dash-html-components==2.0.0 dash-table==5.0.0 dataclasses==0.6 dataclasses-json==0.6.7 datasets==3.0.1 dearpygui==1.11.1 decorator==5.1.1 deepspeed==0.15.1 defusedxml==0.7.1 Deprecated==1.2.14 diffusers==0.30.3 dill==0.3.8 distro==1.9.0 docker-pycreds==0.4.0 einops==0.8.0 entrypoints==0.4 et-xmlfile==1.1.0 eval_type_backport==0.2.0 exceptiongroup==1.2.2 executing==2.1.0 face-alignment==1.4.1 fastjsonschema==2.20.0 filelock==3.13.1 flash-attn==2.6.3 Flask==3.0.3 flatbuffers==24.3.25 fonttools==4.54.1 freetype-py==2.5.1 -e git+https://github.com/YuliangXiu/PuzzleAvatar.git@e50e4f10aa2d31103e58bf49b2568c9773a28a35#egg=freqencoder&subdirectory=cores/lib/freqencoder frozenlist==1.4.1 fsspec==2024.2.0 ftfy==6.2.3 func_timeout==4.3.5 future==1.0.0 fvcore==0.1.5.post20221221 gitdb==4.0.11 GitPython==3.1.43 -e git+https://github.com/YuliangXiu/PuzzleAvatar.git@e50e4f10aa2d31103e58bf49b2568c9773a28a35#egg=gridencoder&subdirectory=cores/lib/gridencoder -e git+https://github.com/IDEA-Research/GroundingDINO.git@57535c5a79791cb76e36fdb64975271354f10251#egg=groundingdino grpcio==1.66.2 h11==0.14.0 hjson==3.1.0 httpcore==1.0.6 httpx==0.27.2 huggingface-hub==0.25.1 idna==3.4 imageio==2.35.1 imageio-ffmpeg==0.5.1 importlib_metadata==8.5.0 importlib_resources==6.4.0 inflect==7.3.1 iopath==0.1.10 ipycanvas==0.13.3 ipyevents==2.0.2 ipython==8.28.0 ipywidgets==8.1.5 itsdangerous==2.2.0 jaraco.collections==5.1.0 jaraco.context==5.3.0 jaraco.functools==4.0.1 jaraco.text==3.12.1 jax==0.4.33 jaxlib==0.4.33 jedi==0.19.1 Jinja2==3.1.3 jiter==0.5.0 jmespath==1.0.1 joblib==1.4.2 jsonschema==4.23.0 jsonschema-specifications==2023.12.1 jupyter_client==7.4.9 jupyter_core==5.7.2 jupyterlab_widgets==3.0.13 kaolin==0.16.0 kiwisolver==1.4.7 kornia==0.7.3 kornia_rs==0.1.5 lazy_loader==0.4 lightning-utilities==0.11.7 llvmlite==0.43.0 lpips==0.1.4 Markdown==3.7 markdown-it-py==3.0.0 MarkupSafe==2.1.5 marshmallow==3.22.0 matplotlib==3.9.2 matplotlib-inline==0.1.7 mdurl==0.1.2 mediapipe==0.10.15 mesh-to-sdf @ git+https://github.com/marian42/mesh_to_sdf.git@c9f26e6399f7fd8deb40c7fba02c7e74aca6c657 ml_dtypes==0.5.0 more-itertools==10.3.0 mpmath==1.3.0 multidict==6.1.0 multipledispatch==1.0.0 multiprocess==0.70.16 mypy-extensions==1.0.0 narwhals==1.9.0 nbformat==5.10.4 nest-asyncio==1.6.0 networkx==3.2.1 ninja==1.11.1.1 numba==0.60.0 numpy==1.26.3 -e git+https://github.com/NVlabs/nvdiffrast.git@729261dc64c4241ea36efda84fbf532cc8b425b8#egg=nvdiffrast nvidia-cublas-cu12==12.1.3.1 nvidia-cuda-cupti-cu12==12.1.105 nvidia-cuda-nvrtc-cu12==12.1.105 nvidia-cuda-runtime-cu12==12.1.105 nvidia-cudnn-cu12==9.1.0.70 nvidia-cufft-cu12==11.0.2.54 nvidia-curand-cu12==10.3.2.106 nvidia-cusolver-cu12==11.4.5.107 nvidia-cusparse-cu12==12.1.0.106 nvidia-ml-py==12.560.30 nvidia-nccl-cu12==2.20.5 nvidia-nvjitlink-cu12==12.6.77 nvidia-nvtx-cu12==12.1.105 omegaconf==2.3.0 open3d==0.18.0 openai==1.51.0 opencv-contrib-python==4.10.0.84 opencv-python==4.10.0.84 opencv-python-headless==4.10.0.84 openpyxl==3.1.5 opt_einsum==3.4.0 packaging==24.1 pandas==2.2.3 parso==0.8.4 -e git+https://github.com/huggingface/peft.git@ca8462bb68b48d3cc613f3aafc81eb50634549d1#egg=peft pexpect==4.9.0 pillow==10.2.0 platformdirs==4.3.6 plotly==5.24.1 pooch==1.8.2 portalocker==2.10.1 prompt_toolkit==3.0.48 protobuf==4.25.5 psutil==6.0.0 ptyprocess==0.7.0 pudb==2024.1.2 pure_eval==0.2.3 py-cpuinfo==9.0.0 pyarrow==17.0.0 pybind11==2.13.6 pycocotools==2.0.8 pycparser==2.22 pydantic==2.9.2 pydantic_core==2.23.4 pydeck==0.9.1 pyflakes==3.2.0 pyfqmr==0.2.1 pyglet==2.0.17 pygltflib==1.16.2 Pygments==2.18.0 PyMCubes==0.1.6 pymeshfix==0.17.0 pymeshlab==2023.12.post2 PyOpenGL==3.1.0 pyparsing==3.1.4 pyquaternion==0.9.9 pyrender==0.1.45 pyrr==0.10.3 python-dateutil==2.9.0.post0 pytorch-lightning==2.1.0 pytorch3d==0.7.8 pytz==2024.2 pyvista==0.44.1 PyYAML==6.0.2 pyzmq==26.2.0 rectangle-packer==2.0.2 referencing==0.35.1 regex==2024.9.11 replicate==0.34.1 requests==2.32.3 retrying==1.3.4 rich==13.9.1 rpds-py==0.20.0 Rtree==1.3.0 s3transfer==0.10.2 safetensors==0.4.5 scikit-image==0.24.0 scikit-learn==1.5.2 scipy==1.14.1 scooby==0.10.0 seaborn==0.13.2 segment_anything @ git+https://github.com/facebookresearch/segment-anything.git@dca509fe793f601edb92606367a655c15ac00fdf sentencepiece==0.2.0 sentry-sdk==2.15.0 setproctitle==1.3.3 six==1.16.0 smmap==5.0.1 sniffio==1.3.1 sounddevice==0.5.0 stack-data==0.6.3 streamlit==1.39.0 supervision==0.4.0 sympy==1.12 tabulate==0.9.0 taming-transformers @ git+https://github.com/YuliangXiu/taming-transformers.git@97c8078024f32c86518dda684caac5b6c548140f tenacity==9.0.0 tensorboard==2.18.0 tensorboard-data-server==0.7.2 tensorboardX==2.6.2.2 termcolor==2.4.0 test_tube==0.7.5 tetgen==0.6.4 threadpoolctl==3.5.0 tifffile==2024.9.20 timm==1.0.9 tokenizers==0.20.0 toml==0.10.2 tomli==2.0.2 torch==2.4.0+cu121 torch-ema==0.3 torch-fidelity==0.3.0 torchaudio==2.4.0+cu121 torchmetrics==1.4.2 torchvision==0.19.0+cu121 tornado==6.4.1 tqdm==4.66.5 traitlets==5.14.3 transformers==4.45.1 trimesh==4.4.9 triton==3.0.0 typeguard==4.3.0 typing-inspect==0.9.0 typing_extensions==4.12.2 tzdata==2024.2 ultralytics==8.3.3 ultralytics-thop==2.0.8 urllib3==1.26.13 urwid==2.6.15 urwid_readline==0.15.1 usd-core==24.8 vtk==9.3.1 wandb==0.18.3 watchdog==5.0.3 wcwidth==0.2.13 Werkzeug==3.0.4 widgetsnbextension==4.0.13 wrapt==1.16.0 xatlas==0.0.9 xformers==0.0.28.post1 xxhash==3.5.0 yacs==0.1.8 yapf==0.40.2 yarl==1.13.1 zipp==3.20.2

NVIDIA Driver Version

Thu Oct  3 13:53:44 2024       
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 565.51.01              Driver Version: 565.90         CUDA Version: 12.7     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce RTX 3090        On  |   00000000:2B:00.0  On |                  N/A |
|  0%   32C    P8             34W /  350W |    1394MiB /  24576MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
                                                                                         
+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|    0   N/A  N/A        33      G   /Xwayland                                   N/A      |
+-----------------------------------------------------------------------------------------+

Hi,

Is there any optimizations or settings I can change to get this running on my 3090 24GB? I'm using the photos included in the demo of Yuilang to test the code.

image

Thanks!

Hi, I added --gradient_checkpointing, but the shipping reports an error, have you encountered the same problem, thanks!
File "/data8/jimlee/PuzzleAvatar/multi_concepts/ptp_utils.py", line 256, in between_steps
self.attention_store[key][i] += self.step_store[key][i]
RuntimeError: The size of tensor a (1024) must match the size of tensor b (256) at non-singleton dimension 1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants