Skip to content

[RMP] Support for simple ML/CF Models (like Implicit) in Merlin Models and Systems #104

@viswa-nvidia

Description

@viswa-nvidia

Problem:

Latent factor models enable discovery of the underlying structure between interactions and items. These approaches have been popular over the years to leverage implicit feedback data. Customers who are using simple models via Implicit and LightFM want to be able to deploy those models within the Merlin ecosystem.

Goal:

Constraints:

Systems

  • Serve Implicit/LightFM as a self-contained op with everything required to serve within the exported triton model directory
    Requires installing the python package in the tritonserver environment where it will run.
    Decomposing the serving of these into different operators (retrieval with nearest neighbour search through embedding space)

Blocking issues

  • Inference is blocked w/ issues on serialization of model w/ Implicit

Starting Point:

Merlin-models

wrap Implicit and lightFM in High-level model API:

NVTabular

N/A

Merlin-systems

Examples and Docs (To happen in 22.09)

Metadata

Metadata

Labels

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions