Fix ShardLevel Structure #765

AdrianGushin · 2025-09-24T15:31:02Z

This pull request fixes the ShardLevel structure and adds associated test cases. It also updates the CPU struct to support multiple distinct cores.

This reverts commit 2f356dd.

willow-ahrens

Almost there! This is looking great, only small changes requested.

willow-ahrens · 2025-09-24T15:33:29Z

Project.toml

 DataStructures = "0.18"
 Distributions = "0.25"
 HDF5 = "0.17"
+InteractiveUtils = "1.11.0"


InteractiveUtils shouldn't be a Finch dep; we need to remove it before merging.

willow-ahrens · 2025-09-24T15:34:05Z

Project.toml

 NPZ = "15e1cf62-19b3-5cfa-8e77-841668bca605"
 SparseArrays = "2f01184e-e22b-5df5-ae63-d93ebab69eaf"
-TensorMarket = "8b7d4fe7-0b45-4d0d-9dd8-5cc9b23b4b77"
+TensorMarket = "8b7d4fe7-0b45-4d0d-9dd8-5cc9b23b4b77"


TensorMarket should be a test dep, but I don't think it's an extra, right? Do we need to move it to the test Project.toml?

willow-ahrens · 2025-09-24T15:34:18Z

src/architecture.jl

@@ -1,8 +1,10 @@
+using InteractiveUtils


Let's remove this; it was just for debugging.

willow-ahrens · 2025-09-24T15:34:32Z

src/architecture.jl


 A datatype representing a device on which tasks can be executed.
 """
+


Does this added blank line break the documentation for the abstract device?
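For context, Julia's parser only attaches a docstring when the documented object starts on the very next line; an intervening blank line leaves the string as a standalone expression. A minimal sketch of that rule, using a made-up `SomeDevice` type rather than Finch's actual definition:

```julia
# Hypothetical source snippets; `SomeDevice` is illustrative only.
attached = "\"\"\"\nA device.\n\"\"\"\nabstract type SomeDevice end"
detached = "\"\"\"\nA device.\n\"\"\"\n\nabstract type SomeDevice end"

# Parse only the first top-level expression of a snippet.
first_expr(src) = Meta.parse(src, 1)[1]

# When the docstring attaches, the parser emits a doc macro call that wraps
# both the string and the definition; otherwise the first expression is
# just the string itself.
is_doc_expr(ex) = ex isa Expr && ex.head === :macrocall

println(is_doc_expr(first_expr(attached)))
println(is_doc_expr(first_expr(detached)))
```

If the added line really sits between the closing `"""` and the definition, removing it should restore the docstring.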

willow-ahrens · 2025-09-24T15:35:16Z

src/architecture.jl

        end,
    )
-    VirtualCPU(value(n, Int))
+    VirtualCPU(value(n, Int), literal(id))


It's also okay to just use id without wrapping it as a literal, as long as you wrap it when needed. Whatever's convenient.

willow-ahrens · 2025-09-24T15:36:34Z

src/architecture.jl

 FinchNotation.finch_leaf(mem::VirtualCPULocalMemory) = virtual(mem)
-function virtualize(ctx, ex, ::Type{CPULocalMemory})
-    VirtualCPULocalMemory(virtualize(ctx, :($ex.device), CPU))
+function virtualize(ctx, ex, ::Type{CPULocalMemory{id}}) where {id}


Instead of keying CPULocalMemory on id, let's put the whole CPU type in the type parameter, so that further changes to the CPU parameterization don't need to affect the local memory.

Then we can recursively virtualize the CPU.
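A minimal sketch of that suggestion, using simplified stand-ins for the types in the diff (the real Finch structs carry more detail, and `device_type` is a hypothetical helper that only illustrates the dispatch pattern, not Finch's `virtualize`):

```julia
# Simplified, hypothetical stand-ins for the types discussed above.
struct CPU{id}
    n::Int  # number of tasks
end

# Keyed on the whole device type rather than on `id` alone, so adding more
# parameters to CPU later does not change this definition.
struct CPULocalMemory{Device}
    device::Device
end

# The accessor no longer needs a `where {id}` clause.
local_memory(device::CPU) = CPULocalMemory(device)

# Dispatch recovers the device type from the memory type, so the device can
# be handled recursively by whatever method already exists for it.
device_type(::Type{CPULocalMemory{Device}}) where {Device} = Device

mem = local_memory(CPU{:a}(4))
println(typeof(mem))
println(device_type(typeof(mem)))
```

The same pattern would apply to the shared-memory type: the memory wrapper stays agnostic about how the CPU itself is parameterized.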

willow-ahrens · 2025-09-24T15:37:28Z

src/architecture.jl

-global_memory(device::CPU) = CPUSharedMemory(device)
+local_memory(device::CPU{id}) where {id} = CPULocalMemory{id}(device)
+shared_memory(device::CPU{id}) where {id} = CPUSharedMemory{id}(device)
+global_memory(device::CPU{id}) where {id} = CPUSharedMemory{id}(device)


Same thing here: I think it makes more sense to key on the CPU than on the ID. Feel free to disagree.

willow-ahrens · 2025-09-24T15:38:08Z

src/tensors/levels/shard_levels.jl

 function transfer(task::MemoryChannel, arr::MultiChannelBuffer)
    if task.device == arr.device
        temp = arr.data[task.t]
+        @assert isa(temp, Vector)


This is good for debugging, but I don't think it will always hold; we might have buffer types other than Vector.
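If a sanity check is still wanted, one option is to assert against a broader supertype instead of the concrete `Vector`. A minimal sketch, assuming every buffer is some `AbstractVector` (an assumption, not something the PR states); `check_buffer` is a hypothetical helper:

```julia
# Hypothetical helper: accepts any array-like one-dimensional buffer.
function check_buffer(buf)
    @assert buf isa AbstractVector "expected an array-like buffer"
    return buf
end

plain = check_buffer([1, 2, 3])             # a Vector passes
window = check_buffer(view([1, 2, 3], 1:2)) # a SubArray passes too
println(window isa Vector)                  # the view is not a Vector
```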

willow-ahrens · 2025-09-24T15:39:33Z

test/runtests.jl


    @testset "Finch" begin
        include("modules/checkoutput_testsetup.jl")
+        include("suites/constructors_tests.jl")


Wow, it's crazy that this wasn't already included.

willow-ahrens · 2025-09-24T15:40:15Z

test/suites/constructors_tests.jl

+        end
+
+        @test C[4,4] == 12
+    end


Good test! Let's also add a test, perhaps in representation.jl, that generates reference output for a shard level kernel.

willow-ahrens · 2025-09-24T15:41:06Z

We can merge this to main once we have some more tests.

willow-ahrens

Hi Adrian! This all looks good. The only remaining requirement is a test that calls check_output to compare the generated ShardLevel code against a reference output. See the other places in the code where this function is used for examples. You'll need to follow the instructions in the contributing guide to generate the new 64-bit reference, and run the "fixbot" action to generate the new 32-bit reference.
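The general idea of a reference-output test can be sketched as follows. This is a hypothetical stand-in, not Finch's `check_output`; the helper name `matches_reference`, its `overwrite` flag, and the sample kernel text are all assumptions for illustration, and the real signature should be taken from the existing tests.

```julia
# Hypothetical stand-in for a reference-output check.
function matches_reference(path::AbstractString, generated::AbstractString;
                           overwrite::Bool=false)
    if overwrite
        write(path, generated)  # regenerate the stored reference
        return true
    end
    # Otherwise the stored reference must exist and match exactly.
    return isfile(path) && read(path, String) == generated
end

dir = mktempdir()
ref = joinpath(dir, "shard_kernel.txt")
code = "for i = 1:n\n    y[i] += x[i]\nend\n"  # placeholder for generated code

matches_reference(ref, code; overwrite=true)          # record the reference once
println(matches_reference(ref, code))                 # unchanged output
println(matches_reference(ref, code * "# changed\n")) # drifted output
```

Keeping regeneration behind an explicit flag means a changed kernel fails the test until someone deliberately refreshes the reference.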

willow-ahrens · 2025-10-18T00:00:25Z

I've added you to the repo as a collaborator; you can re-open the PR using finch-tensor as the remote and the tests will run automatically.

agushin101 added 30 commits August 21, 2025 17:46

initial attempt

358f453

remove extent from constructor

57f17c9

update creation of ext

670c8c1

fix typo

eeaa67c

modify eltype signature to account for scheduler

6833205

remove erroneous call to subcontext

a92044c

add other params to lambda

0faeb73

debugging

7b6da14

add 4 arguments to vpr lambda

5d706c5

fix error in getnumtasks method call

15f148a

unsafe removal of assert

44e9f57

more unsafe removals

26ca5df

add safety back

71cf3d8

debugging

fb1f84d

debugging

41c3edd

debug stmts

5b134cb

check in other dist

7c85760

check in other dist

222357c

try containing

ddeb820

debug stmts

92679f8

finalize results for thurs

cdd1adc

add debugging to shims

1df5b5c

relocate print

99d2cb8

bugfix

197bc74

printing arguments ONLY

7bab33e

debugging

dc0f56e

test if smaller or if resize

c3ed869

testing API

79346f7

delete error

5509898

checking inner

915bbca

agushin101 added 19 commits August 31, 2025 18:55

debug

d94a1ae

debug

e8a22f3

bug

ddf6308

whoopsie

bd1d8af

bug

b2913cf

typeof

8e0c076

readd print

cbcb764

***Start of major changes to declare_leveladd .**

d4badfd

readd debugging lower output

7bc15bc

testing

e9633b2

woof

481457b

fix resize and io

70bc7ea

fix bugs in interacting with tensors of dim > 1

2f356dd

Revert "fix bugs in interacting with tensors of dim > 1"

7bd371e

This reverts commit 2f356dd.

fix test cases

5d50bf3

add tests, correct error in higher dimensional loops

376a360

finalize test cases

f1a9a7a

add id

7da3caf

complete tagging of cpus

1e18792

AdrianGushin changed the base branch from main to wma/shard_levels September 24, 2025 15:31

willow-ahrens requested changes Sep 24, 2025


agushin101 added 3 commits September 24, 2025 12:01

update project.toml

52ffbd9

pr updates

f0afbbc

add representation test cases

92d4419

willow-ahrens requested changes Oct 17, 2025


