[Feat] toploc2 #360

Jackmin801 · 2025-06-03T09:23:00Z

No description provided.

Jackmin801 · 2025-06-12T02:47:51Z

graphs look ok

Copilot

Pull Request Overview

This PR introduces a new Toploc2Sampler for logit sampling, propagates per-sample seeds through the pipeline into Parquet outputs, and adjusts related tests and utilities to include the seed field.

Add Toploc2Sampler and switch to it when appropriate, disabling chunked prefill for correctness
Introduce and propagate a seed field in configs, inference logic, Parquet schema, and tests
Add a model validator to convert negative logprobs config values to None

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 5 comments.

File	Description
tests/integration/inference/test_debug_pp.py	Standardize model name arguments
tests/conftest.py	Add dummy `seed` column to test tables
src/zeroband/utils/parquet.py	Extend Parquet schema with `seed` field
src/zeroband/inference/toploc2.py	Implement the new Toploc2 sampling layer
src/zeroband/inference/parquet.py	Pass `seed` values into Parquet records
src/zeroband/inference/config.py	Validate and normalize `logprobs` setting
src/zeroband/infer.py	Wire up `Toploc2Sampler` and seed logic
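
The seed plumbing described in the table can be pictured with a small sketch. This is only an illustration under assumptions: `attach_seeds` and the record layout are hypothetical names, not the code in `src/zeroband/inference/parquet.py`, and the 64-bit range check is a guess based on the "need long" commit in this PR.

```python
from typing import Any, Dict, List, Sequence

# Assumption: seeds are stored as signed 64-bit integers in the Parquet schema.
INT64_MIN, INT64_MAX = -(2**63), 2**63 - 1


def attach_seeds(samples: Sequence[Dict[str, Any]], seeds: Sequence[int]) -> List[Dict[str, Any]]:
    """Return one record per sample with its seed stored next to the output.

    Hypothetical helper: the real pipeline builds Parquet rows differently.
    """
    if len(samples) != len(seeds):
        raise ValueError("expected exactly one seed per sample")

    records = []
    for sample, seed in zip(samples, seeds):
        if not INT64_MIN <= seed <= INT64_MAX:
            raise ValueError(f"seed {seed} does not fit in a signed 64-bit column")
        # Copy the sample so the caller's dict is left untouched.
        records.append({**sample, "seed": seed})
    return records


records = attach_seeds(
    [{"output_tokens": [1, 2, 3]}, {"output_tokens": [4, 5]}],
    seeds=[42, 43],
)
```

Keeping the seed next to each sample is what would let a later step re-run sampling for a single row.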

Comments suppressed due to low confidence (2)

src/zeroband/inference/toploc2.py:124

This TODO should be resolved or removed; if rank information is required for logprob metadata, implement a clear method to retrieve it and document the approach.

# TODO: How did the original code know the rank?

src/zeroband/inference/config.py:22

[nitpick] The method name and docstring refer to "negative logprobs," but the field is a count of logprobs; consider renaming to convert_negative_logprobs_count_to_none or clarifying that logprobs < 0 disables computation.

    def convert_negative_logprobs_to_none(self):

mikasenghaas

hell yea! nice job. have we tested that the produced output tokens are identical for the default sampler and the toploc2 sampler, i.e. are we absolutely sure we are not altering model behavior? could be nice to have a simple test for this?

also, i think adding toploc 2 into the configs is also important. let's get this merged soon, so that i can rebase onto the config refactor:)
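
A simple test of the kind asked for here could look roughly like the sketch below. It is a toy under stated assumptions: the `(logits, rng) -> token` sampler signature, `baseline_sampler`, `instrumented_sampler`, and the fake logits are all hypothetical stand-ins, not the vLLM sampler or `Toploc2Sampler` API. The idea it shows is only this: feed both samplers the same logits and the same seed, then require identical token ids.

```python
import math
import random
from typing import Callable, List, Sequence

# Hypothetical sampler signature: (logits, rng) -> token id.
Sampler = Callable[[Sequence[float], random.Random], int]


def baseline_sampler(logits: Sequence[float], rng: random.Random) -> int:
    """Stand-in for the default sampler: draw one token from softmax(logits)."""
    peak = max(logits)
    weights = [math.exp(x - peak) for x in logits]  # unnormalized softmax
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


def instrumented_sampler(logits: Sequence[float], rng: random.Random) -> int:
    """Stand-in for a sampler that does extra bookkeeping around the same draw."""
    token = baseline_sampler(logits, rng)
    _ = sorted(logits, reverse=True)[:4]  # side computation, must not touch rng
    return token


def sample_sequence(sampler: Sampler, steps: int, seed: int, vocab: int = 16) -> List[int]:
    """Sample `steps` tokens from fixed fake logits using a seeded RNG."""
    logits_rng = random.Random(0)  # deterministic fake "model" outputs
    rng = random.Random(seed)      # per-sample sampling seed
    tokens = []
    for _ in range(steps):
        logits = [logits_rng.gauss(0.0, 1.0) for _ in range(vocab)]
        tokens.append(sampler(logits, rng))
    return tokens


def assert_same_tokens(a: Sampler, b: Sampler, seeds=(0, 1, 2), steps: int = 32) -> None:
    """Fail if the two samplers ever disagree for the same seed."""
    for seed in seeds:
        assert sample_sequence(a, steps, seed) == sample_sequence(b, steps, seed), (
            f"token mismatch for seed {seed}"
        )


assert_same_tokens(baseline_sampler, instrumented_sampler)
```

In the real codebase the two callables would be replaced by full inference runs with and without the new sampler, using the same prompts and seeds.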

mikasenghaas · 2025-06-12T02:56:40Z

src/zeroband/inference/config.py

+    @model_validator(mode="after")
+    def convert_negative_logprobs_to_none(self):
+        """Convert negative logprobs values to None to disable logprobs calculation."""
+        if self.logprobs is not None and self.logprobs < 0:
+            self.logprobs = None
+        return self


Feels more intuitive to err when passing negative values?

this is necessary to disable logprobs. I couldn't find a way to pass None, and since the default is 0, there didn't seem to be a way to make it None other than this
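
For readers who want to try the behavior under discussion without pydantic, here is a minimal stdlib sketch. `SamplingConfigSketch` is a hypothetical stand-in for the real config class; only the `logprobs` field, its default of 0, and the "negative becomes None" rule come from this thread.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SamplingConfigSketch:
    """Hypothetical stand-in for the pydantic config in config.py."""

    logprobs: Optional[int] = 0  # default of 0, as stated in the reply above

    def __post_init__(self) -> None:
        # Mirrors the validator in the diff: a negative count disables logprobs.
        if self.logprobs is not None and self.logprobs < 0:
            self.logprobs = None


default_cfg = SamplingConfigSketch()
disabled_cfg = SamplingConfigSketch(logprobs=-1)
explicit_cfg = SamplingConfigSketch(logprobs=5)
```

The alternative raised in the review, raising an error on negative values, would make `-1` invalid and so would need some other way to express "disabled".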

samsja

lfgtm v2

Jackmin801 · 2025-06-12T21:31:39Z

Should be ok. Let's merge!

Jackmin801 marked this pull request as ready for review June 3, 2025 09:44

Jackmin801 force-pushed the feat-toploc2 branch from c4ae93e to d031aca Compare June 11, 2025 04:14

Jackmin801 added 14 commits June 11, 2025 18:40

port toploc2 sampler

f04d9b0

add to infer

33e9b81

fix: sampler needs to be set before pipeline hooks

538ce50

revert output modification

60f8b04

fix: disable chunked prefill

ac48957

fix: deprecate pre-sharded pipeline repo

dc8f1ff

dont return logprobs

f8b7dba

dont override when not doing synthetic data gen

d33cc2a

add back annoying save format

e00d0cb

restore default

909f96b

make it possible to set none

8a20d09

save seed in parquet

952e555

need long

df087d6

add seed to conftest

3d92089

Jackmin801 force-pushed the feat-toploc2 branch from 216f51a to 3d92089 Compare June 12, 2025 01:40

Jackmin801 requested review from Copilot, mikasenghaas and samsja June 12, 2025 02:47

Copilot AI reviewed Jun 12, 2025

always use the Toploc2 sampler

97af60e

mikasenghaas approved these changes Jun 12, 2025

samsja approved these changes Jun 12, 2025

Jackmin801 added 3 commits June 12, 2025 12:29

revert fix

20cb9c7

add comment

cfc5808

add toploc2 flag

c3f1aad

Jackmin801 merged commit 757b768 into main Jun 12, 2025
9 of 10 checks passed
