We now have more than 2,000 public AI models using ModelHubMixin 🤗
🤖🤗 Multimedia inputs and outputs to create augmented culture and better outcomes for humans everywhere. ❤️🚀
Task: reflect over your process
INPUT: unconstrained
output: the text "I 980 1-Y "BULT CO OF ROILL" "HATER. "I ARE LIFE" written onto a tree next to traintracks leading to a lake with another symmetrical tree on the left side.
from loadimg import load_img
from huggingface_hub import InferenceClient

# load_img accepts a local path, a URL, a Pillow image, or a NumPy array;
# imgPath_url_pillow_or_numpy is a placeholder for whichever one you have
my_b64_img = load_img(imgPath_url_pillow_or_numpy, output_type="base64")

client = InferenceClient(api_key="hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx")

messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "Describe this image in one sentence."
            },
            {
                "type": "image_url",
                "image_url": {
                    "url": my_b64_img  # base64 allows using images without uploading them to the web
                }
            }
        ]
    }
]

stream = client.chat.completions.create(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct",
    messages=messages,
    max_tokens=500,
    stream=True
)

# print tokens as they arrive; a chunk's content can be None, so fall back to ""
for chunk in stream:
    print(chunk.choices[0].delta.content or "", end="")
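For anyone curious what the base64 string in the snippet above might look like, here is a minimal sketch using only the standard library. It assumes the image is passed as a data URL in the image_url field. The helper name to_data_url and the sample bytes are hypothetical, and this is not necessarily how loadimg builds its output.

```python
import base64


def to_data_url(raw: bytes, mime_type: str = "image/png") -> str:
    """Encode raw image bytes as a data URL (hypothetical helper, for illustration)."""
    encoded = base64.b64encode(raw).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


# Placeholder bytes standing in for a real image file's contents
sample_bytes = b"abc"
data_url = to_data_url(sample_bytes)
print(data_url[:22])
```

With real image bytes (for example, read from a file opened in "rb" mode), the resulting string could be placed in the "url" field of the message in place of my_b64_img.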
spaces
behavior on load or launch ⚠️