-
Notifications
You must be signed in to change notification settings - Fork 110
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: add block session support to @helia/interface #398
Conversation
ca4e718
to
b66acc4
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 to combining blockBroker method types.
Could you explain how a session would become aware of providers obtained from a delegated-routing call?
packages/interface/src/blocks.ts
Outdated
* This method is optional to maintain compatibility with existing | ||
* blockstores that do not support sessions. | ||
*/ | ||
createSession?(root: CID, options?: AbortOptions & ProgressOptions<GetBlockProgressEvents>): Promise<Blockstore> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To be clear, the signal passed here is for the entire retrival session?
E.g. Your example would abort if the entire session lasted longer than 5000ms, correct?
const sessionBlockstore = await node.blockstore.createSession(rootCid, {
signal: AbortSignal.timeout(5000)
})
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also, do we want to return Blockstore
explitly and not Blocks
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To be clear, the signal passed here is for the entire retrival session?
No, the signal is just for the session creation, e.g. when we do a findProvs
and send WANTHAVE
to create the initial set of providers.
Also, do we want to return Blockstore explitly and not Blocks?
It returns Blockstore
intentionally so that you can't create a session from a session.
if (broker.createSession == null) { | ||
return broker | ||
} | ||
|
||
return broker.createSession(root, options) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not really related to this section of code but: how do these session blockstores get providers from delegated-routing to the sessions?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Similar to #398 (comment) - you accept an instance of the Helia Routing as a component and query it:
E.g.:
import { createHeliaHttp from } '@helia/http'
import { createDelegatedRoutingV1HttpApiClient } from '@helia/delegated-routing-v1-http-api-client'
import { Routing } from '@helia/interface'
interface MyBlockBrokerComponents {
routing: Routing
}
interface MyBlockBrokerInit {
// broker-specific config
}
class MyBlockBroker {
private routing: Routing
constructor (components: MyBlockBrokerComponents, init: MyBlockBrokerInit) {
this.routing = components.routing
}
createSession (root: CID): Promise<BlockBroker> {
for await (const prov of this.routing.findProviders(root)) {
if (prov.protocols.includes('my-block-broker-protocol')) {
// do block broker stuff
}
}
// ...more code here
}
}
function createMyBlockBroker(init: MyBlockBrokerInit) {
return (components) => new MyBlockBroker(components, init)
}
const node = createHeliaHttp({
blockBrokers: [
createMyBlockBroker({
// ...init stuff here
})
]
routers: [
createDelegatedRoutingV1HttpApiClient('https://example.com')
]
})
// this will call the broker's createSession method
const sessionBlockstore = await node.blockstore.createSesssion(cid)
* A session blockstore is a special blockstore that only pulls content from a | ||
* subset of network peers which respond as having the block for the initial | ||
* root CID. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Out of curiosity what's the reason for putting the session information in the blockstore rather than passing it along with say the signals?
My concern here (which might not be valid) is that it feels like we did something similar in the Go code and needing users to remember to create new block fetching stacks for every session ended up leading to the sessions not being plumbed through the stack well and therefore not having one "code session" per "logical session".
We're currently trying to alter how we do this to allow it to happen via signal (or "context" in Go). See ipfs/boxo#567 and ipfs/kubo#7198.
Every language/environment is different so maybe these concerns would be less likely to surface here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
My feeling is that if people are using the API incorrectly it's probably because the correct usage isn't obvious enough, either through basic things like argument names and positions or documentation or both.
We already have people not passing AbortSignal
s into long-lived operations so I think a session identifier there would also get missed.
The primary interaction point for people with DAGs of various shapes are the @helia/*
modules. Since these are (mostly?) stateless they are essentially disposable so I think pushing the session/no session decision to the user (e.g. do I use helia.blockstore
or helia.blockstore.createSession(rootCid)
) and having them interact with separate @helia/unixfs
instances for each session will create a smaller amount of cognitive load.
HTTP Delegated Routers are configured as a router, and when you use the routing methods it'll query them. E.g.:
|
There are no implementations yet but the usage pattern will be something like: ```javascript // unixfs cat command export async function * cat (cid: CID, blockstore: Blocks, options: Partial<CatOptions> = {}): AsyncIterable<Uint8Array> { // create a session for the CID if support is available const blocks = await (blockstore.createSession != null ? blockstore.createSession(cid) : blockstore) const opts: CatOptions = mergeOptions(defaultOptions, options) // resolve and export using the session, if created, otherwise fall back to regular blockstore access const resolved = await resolve(cid, opts.path, blocks, opts) const result = await exporter(resolved.cid, blocks, opts) if (result.type !== 'file' && result.type !== 'raw') { throw new NotAFileError() } if (result.content == null) { throw new NoContentError() } yield * result.content(opts) } ```
b1c5fe9
to
32713ab
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
approved, but left some comments for improvement
Co-authored-by: Russell Dempsey <[email protected]>
await Promise.all( | ||
this.components.blockBrokers.map(async broker => broker.announce?.(cid, block, options)) | ||
) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we want allSettled instead of all? do we want a put
to fail if a single announcement fails?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Perhaps allSettled
. Maybe we'd want to throw if all blocker provers failed to announce?
@achingbrain anything blocking the merging of this? were you able to test this successfully in helia-http-gateway or should I take a look at that? |
There are no implementations yet but the usage pattern will be something like:
Alternatively the user can control session creation:
Removes the
BlockAnnouncer
/BlockRetriever
single-method interfaceBlockBroker
split because we would have to add anotherBlockSessionFactory
interface for this which starts getting unwieldy. Instead just have all the methods be optional and filter the brokers before use.