feat(weave): Implement integration with 🤗 inference client #2795


Closed
soumik12345 wants to merge 45 commits

Conversation

soumik12345 (Contributor) commented Oct 28, 2024

Description

Implement autopatch integration with 🤗 inference client.
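For context, here is a hedged sketch of what autopatching saves: without it, each call would need a manual weave op wrapper like the hypothetical one below; with this integration, weave.init() alone instruments the client's methods.

import os
import weave
from huggingface_hub import InferenceClient

weave.init("test-huggingface")
client = InferenceClient(api_key=os.getenv("HUGGINGFACE_API_KEY"))

# Hypothetical manual wrapper -- unnecessary once the client is autopatched.
@weave.op()
def chat(messages: list, **kwargs):
    return client.chat_completion(messages=messages, **kwargs)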

Multi-modal text completion

Sync generation

Expand to see code snippets and traces

Without streaming

import os
import weave
from huggingface_hub import InferenceClient


weave.init("test-huggingface")
client = InferenceClient(api_key=os.getenv("HUGGINGFACE_API_KEY"))

image_url = "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
client.chat_completion(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "image_url", "image_url": {"url": image_url}},
                {"type": "text", "text": "Describe this image in one sentence."},
            ],
        }
    ],
    max_tokens=500,
)

Sample Trace

With streaming

import os
import weave
from huggingface_hub import InferenceClient


weave.init("test-huggingface")
client = InferenceClient(api_key=os.getenv("HUGGINGFACE_API_KEY"))

image_url = "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
response = client.chat_completion(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "image_url", "image_url": {"url": image_url}},
                {"type": "text", "text": "Describe this image in one sentence."},
            ],
        }
    ],
    max_tokens=500,
    stream=True,
)

for r in response:
    print(r.choices[0].delta.content, end="")

Sample Trace

Note: Usage metadata is reported as None because value.usage is always None when stream=True. This might be due to a bug in huggingface_hub.InferenceClient.
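
As a sanity check for the issue above, the stream can be watched for any chunk that does carry usage metadata. A minimal sketch, assuming the chunks follow the OpenAI-compatible schema where usage is optional and, if present at all, arrives on a late chunk (drop-in replacement for the for-loop above):

for r in response:
    if r.choices:
        print(r.choices[0].delta.content or "", end="")
    # usage is None on most (possibly all) chunks; log it if any chunk carries it
    if getattr(r, "usage", None) is not None:
        print("\nusage:", r.usage)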

Async generation

Expand to see code snippets and traces

Without streaming

import asyncio
import os
import weave
from huggingface_hub import AsyncInferenceClient

weave.init("test-huggingface")
client = AsyncInferenceClient(api_key=os.getenv("HUGGINGFACE_API_KEY"))

image_url = "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"

response = asyncio.run(
    client.chat_completion(
        model="meta-llama/Llama-3.2-11B-Vision-Instruct",
        messages=[
            {
                "role": "user",
                "content": [
                    {"type": "image_url", "image_url": {"url": image_url}},
                    {"type": "text", "text": "Describe this image in one sentence."},
                ],
            }
        ],
        max_tokens=500,
    )
)
print(response.choices[0].message.content)

Sample Trace

With streaming

import asyncio
import os
import weave
from huggingface_hub import AsyncInferenceClient

weave.init("test-huggingface")
client = AsyncInferenceClient(api_key=os.getenv("HUGGINGFACE_API_KEY"))

image_url = "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"

async def generate():
    response = await client.chat_completion(
        model="meta-llama/Llama-3.2-11B-Vision-Instruct",
        messages=[
            {
                "role": "user",
                "content": [
                    {"type": "image_url", "image_url": {"url": image_url}},
                    {"type": "text", "text": "Describe this image in one sentence."},
                ],
            }
        ],
        max_tokens=500,
        stream=True,
    )

    async for r in response:
        print(r.choices[0].delta.content, end="")


asyncio.run(generate())

Sample Trace

Text-to-image generation

Expand to see code snippets and traces
import os
import weave
from huggingface_hub import InferenceClient

weave.init("test-huggingface")
client = InferenceClient(api_key=os.getenv("HUGGINGFACE_API_KEY"))

# text_to_image returns a PIL.Image.Image; bind the result so it can be reused.
image = client.text_to_image(
    prompt="A whimsical and creative image depicting a hybrid creature that is a mix of a waffle and a hippopotamus, basking in a river of melted butter amidst a breakfast-themed landscape. It features the distinctive, bulky body shape of a hippo. However, instead of the usual grey skin, the creature's body resembles a golden-brown, crispy waffle fresh off the griddle. The skin is textured with the familiar grid pattern of a waffle, each square filled with a glistening sheen of syrup. The environment combines the natural habitat of a hippo with elements of a breakfast table setting, a river of warm, melted butter, with oversized utensils or plates peeking out from the lush, pancake-like foliage in the background, a towering pepper mill standing in for a tree.  As the sun rises in this fantastical world, it casts a warm, buttery glow over the scene. The creature, content in its butter river, lets out a yawn. Nearby, a flock of birds take flight",
    model="stabilityai/stable-diffusion-3.5-large",
)

Sample Trace
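
For reference, text_to_image returns a PIL.Image.Image (Pillow must be installed), so the image bound above can be saved or inspected directly; the filename here is hypothetical:

image.save("waffle_hippo.png")
print(image.size)  # (width, height) in pixels, model-dependent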

@soumik12345 soumik12345 self-assigned this Oct 28, 2024
@soumik12345 soumik12345 requested a review from a team as a code owner October 28, 2024 18:07
@soumik12345 soumik12345 marked this pull request as draft October 28, 2024 18:07

socket-security bot commented Jan 17, 2025

Updated dependencies detected. Learn more about Socket for GitHub.

Package: pypi/[email protected] 🔁 pypi/[email protected]
New capabilities: None
Transitives: +169
Size: 1.48 GB

View full report

@soumik12345 soumik12345 marked this pull request as ready for review January 22, 2025 12:03
ayulockin (Member) commented

Hey @soumik12345, can you ensure a green CI?

Hey @wandb/weave-team, this integration will help us gain more traction with open models and projects that use HF inference. Can we get some review time on this PR?

soumik12345 (Contributor, Author) commented

> Hey @soumik12345, can you ensure a green CI?
>
> Hey @wandb/weave-team, this integration will help us gain more traction with open models and projects that use HF inference. Can we get some review time on this PR?

Made the CI green; however, the llamaindex and langchain tests keep failing (I don't think that's because of this PR).

soumik12345 (Contributor, Author) commented

Continuing at #3612

@soumik12345 soumik12345 closed this Feb 6, 2025
@github-actions github-actions bot locked and limited conversation to collaborators Feb 6, 2025