Skip to content

[BUG] cuVS Bench is not working with Deep 100M dataset #1407

@karthik86248

Description

@karthik86248

Describe the bug
I'm following the instruction in https://docs.rapids.ai/api/cuvs/nightly/cuvs_bench/#end-to-end-large-scale-benchmarks-10m-vectors to run cuVS Bench with Deep-100M dataset.

Steps/Code to reproduce bug

  1. conda create --name cuvs_benchmarks_deep100m
  2. conda activate cuvs_benchmarks_deep100m
  3. conda install -c rapidsai -c conda-forge cuvs-bench=25.08 cuda-version=13.0*
  4. conda install -c rapidsai -c conda-forge cuvs-bench=25.08 cuda-version=12.0*
  5. python3 -m cuvs_bench.run --dataset deep-1B --algorithms cuvs_cagra --batch-size 10 -k 10

The deep dataset was downloaded manually and the groundtruth file was split into two files per the instructions in the cuVS Bench documentation.

Expected behavior
The graphs should get generated. After that, the Search should be successful. But, I find that the graph construction is stuck for a long time. I waited for > 24 hrs and step# 5 seems to be hung. From quick debugging, I found the kernel kern_sort is continuously running.

Environment details (please complete the following information):

  • Environment location: [Bare-metal, Docker, Cloud(specify cloud provider)]
  • Method of RAFT install: [conda, Docker, or from source]
    • If method of install is [Docker], provide docker pull & docker run commands used
      GPU : A100 80GB.
      CUDA ver: 13.0
      Driver Version: 580.82.07
      Installation method: Conda
      Environment: Bare Metal

Additional context
Add any other context about the problem here.
The Deep-100M dataset was working with the former RAFT Bench.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    Status

    Todo

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions