Skip to content

[Feature] Remote Inference Endpoints Support #1972

Open
2 of 4 issues completed
Open
Feature
2 of 4 issues completed
@joshuayao

Description

@joshuayao

Priority

P1-Stopper

OS type

Ubuntu

Hardware type

Xeon-GNR

Running nodes

Single Node

Description

The feature is from @louie-tsai:

  • GenAIExamples: enabled remote inference endpoints for AgentQnA, Productivity Suite, ChatQnA, DocSum, CodeSum, FinanceAgent, workflowExecAgent, AudioQnA, CodeTrans, MulitmodalQnA, VideoQnA, VisualQnA
  • GenAIComps : Enable remote endpoint for dynamic model switching/ endpoints without model name in it.
  • GenAIInfra: enabled remote inference endpoints in helm chats for ChatQnA, CodeGen and DocSum - Owner : Pramod

Sub-issues

Metadata

Metadata

Labels

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions