Improve design to accommodate Osrank & Registry

PR #8 made certain types more polymorphic but it also reshuffled a bit some of the existing traits. In articular, we added two new methods to the `Node` and `Edge` traits, respectively `fn node_type(&self) -> types::NodeType` and `fn edge_type(&self) -> types::EdgeType`, which return the _concrete_ types.

This is something that @cloudhead wasn't very fond of, but that was initially justified by the fact that for `Osrank` is very convenient to be able to use the concrete types. The same concrete types are also used in the `EdgeRef` struct:

```rust
pub struct EdgeRef<'a, NodeId, EdgeId> {
    pub from: &'a NodeId,
    pub to: &'a NodeId,
    pub id: &'a EdgeId,
    pub edge_type: &'a EdgeType,
}
```

Here I am going to explain in full detail why I have done this, to leave a testament behind of my thought process so that this design can be improved.

The main reason why I did come up with those extra methods/fields in the first place is because I was under the impression the Engineering team settled on a design where we were going to use the concrete types as the "meeting point" between the Registry & Osrank; in such scenario is extremely useful for Osrank & Registry to share the same types. Not only that, but let's take as an example a real piece of code from Osrank:

```rust
impl<W, R> DynamicWeights for Network<W, R>
where
    W: Clone + Mul<Output = W> + Div<Output = W> + From<Weight>,
    R: Clone + Zero,
{
    fn dynamic_weight(
        &self,
        edge: &impl Edge<Self::Weight, <Self::Node as GraphObject>::Id, Self::EdgeData>,
        hyperparams: &types::HyperParameters<Self::Weight>,
    ) -> Self::Weight {
        let e_type = edge.edge_type();

        // Let's start by assigning this edge the stock default value, by
        // reading it from the hyperparams.
        let mut weight: Self::Weight = (hyperparams.get_param(&e_type.to_tag())).clone();

        // others can't be zero as there is at least one edge, i.e. the
        // input one.
        let others = edges_of_same_type(self, edge, Direction::Outgoing, e_type);

        let source_node = self
            .get_node(edge.source())
            .expect("dynamic_weight: source node not found.");

        // Then we need to do something different based on the type of edge.
        match e_type.to_tag() {
            types::EdgeTypeTag::ProjectToUserContribution => {
                // contrib is multiplied by the number of contributions of
                // the account to the project, divided by the total number
                // of contributions in the project.
                let total_project_contrib = source_node.node_type().total_contributions();
                let user_contribs = edge.edge_type().total_contributions();

                weight = weight * Weight::new(user_contribs, total_project_contrib).into()
            }
            types::EdgeTypeTag::UserToProjectContribution => {
                // contrib* and maintain* are multiplied by the number of
                // contributions of the account to the project, divided by
                // the total number of contributions of the account.
                let total_account_contrib = source_node.node_type().total_contributions();
                let user_contribs = edge.edge_type().total_contributions();

                weight = weight * Weight::new(user_contribs, total_account_contrib).into()
            }
            types::EdgeTypeTag::UserToProjectMembership => {
                // The weight is divided by the corresponding count of
                // outgoing edges of the same type on the node.
                weight = weight / others.into()
            }
            types::EdgeTypeTag::ProjectToUserMembership => {
                // contrib* and maintain* are multiplied by the number of
                // contributions of the account to the project, divided by
                // the total number of contributions of the account.
                let total_account_contrib = source_node.node_type().total_contributions();
                let user_contribs = edge.edge_type().total_contributions();

                weight = weight * Weight::new(user_contribs, total_account_contrib).into()
            }
            types::EdgeTypeTag::Dependency => {
                // The weight is divided by the corresponding count of
                // outgoing edges of the same type on the node.
                weight = weight / others.into()
            }
        }

        weight
    }
}
```

Here I was able to write this trait implementation in a fairly polymorphic way by the virtue of the
fact I could rely on calling `.edge_type()` and `.node_type()` and be sure they would return what I was expecting. In particular, I was able to write `impl Edge<Self::Weight, <Self::Node as GraphObject>::Id, Self::EdgeData>` and pass any type which implements that trait. If we were going to remove those `node_type/edge_type` methods (by the virtue of the fact the concrete types will probably be inside the `NodeData/EdgeData`, we would have to write something like this, at the very minimum:

```rust
impl<W, R> DynamicWeights for Network<W, R>
where
    W: Clone + Mul<Output = W> + Div<Output = W> + From<Weight>,
    R: Clone + Zero,
    Self::EdgeData: Into<types::EdgeType>,
    Self::NodeData: Into<types::NodeType>,
{
...
```

Then add proper `std::convert::Into` instances and finally in the code call:

```rust
let edge_type: types::EdgeType = edge.data().into();
```

Which is still do-able, albeit unfortunate.

As regards the `EdgeRef` type, note that we *cannot* write the following:

```
pub struct EdgeRef<'a, NodeId, EdgeId> {
    pub from: &'a NodeId,
    pub to: &'a NodeId,
    pub id: &'a EdgeId,
    pub edge_type: &'a EdgeData,
}
```

This is because an `EdgeData` exist only in the context of a `Graph`. We have two options here:

1. We make the `EdgeRef` polymorphic over the graph:

```rust
pub struct EdgeRef<'a, G> 
where
  G: Graph
{
    pub from: &'a Id<G::Node>,
    pub to: &'a Id<G::Node>,
    pub id: &'a Id<G::Edge>,
    pub edge_type: &'a G::EdgeData,
}
```

2. We make `EdgeData` (or even `EdgeType` at this point?) an extra type parameter:

```rust
pub struct EdgeRef<'a, NodeId, EdgeId, EdgeType> {
    pub from: &'a NodeId,
    pub to: &'a NodeId,
    pub id: &'a EdgeId,
    pub edge_type: &'a EdgeType, // This is not the concrete one but a free variable
}
```

I don't have an intuition on which method is better, but it looks like option 2. feels unnatural, as it makes sense to talk about a specific `EdgeRef` over a Graph `G`. Last but not least, the reason why I have added this `edge_type` to `EdgeRef` in the first place is because it's very handy to have it "pre-computed" in a situation like this:

```rust
                    for eref in network.edges_directed(&current_node_id, Direction::Outgoing) {
                        possible_edge_types.insert(eref.edge_type);
                    }
```

If we didn't have this, we would have to fetch the info from the graph, which is very inefficient:

```rust
                    for eref in network.edges_directed(&current_node_id, Direction::Outgoing) {
                        let edge_type = network.get_edge(eref.id()).unwrap().data().into();
                        possible_edge_types.insert(edge_type);
                    }
```

And that obviously also requires the `Into<types::EdgeType>` constraint.

I hope this is useful as a testament for @MeBrei and the rest of the crew :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve design to accommodate Osrank & Registry #10

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Improve design to accommodate Osrank & Registry #10

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions