data:image/s3,"s3://crabby-images/88573/88573a26a4a998a7e09e7ed66cf9010f4c9dc59b" alt=""
Stable Diffusion Community (Unofficial, Non-profit)
community
AI & ML interests
Enhance and upgrade SD-models
sd-community's activity
data:image/s3,"s3://crabby-images/959ec/959ecfa7dd3bb4676780c4779dd62552a266302c" alt=""
eienmojikiΒ
posted
an
update
14 days ago
Post
2033
πͺ LayerDiffuse - Flux Version (Demo) πͺ
LayerDiffuse - Transparent Image Layer Diffusion using Latent Transparency
Demo: eienmojiki/Flux-LayerDiffuse
LayerDiffuse - Transparent Image Layer Diffusion using Latent Transparency
Demo: eienmojiki/Flux-LayerDiffuse
data:image/s3,"s3://crabby-images/7f465/7f465d287f920c2e9e6d034ee82812f17e0e0c04" alt=""
ameerazam08Β
posted
an
update
22 days ago
Post
3870
I have just released a new blogpost about kv caching and its role in inference speedup π
π https://huggingface.co/blog/not-lain/kv-caching/
some takeaways :
π https://huggingface.co/blog/not-lain/kv-caching/
some takeaways :
Post
1608
R1 is out! And with a lot of other R1 releated models...
Post
1616
we now have more than 2000 public AI models using ModelHubMixinπ€
Post
4004
Published a new blogpost π
In this blogpost I have gone through the transformers' architecture emphasizing how shapes propagate throughout each layer.
π https://huggingface.co/blog/not-lain/tensor-dims
some interesting takeaways :
In this blogpost I have gone through the transformers' architecture emphasizing how shapes propagate throughout each layer.
π https://huggingface.co/blog/not-lain/tensor-dims
some interesting takeaways :
Post
813
Hey everyone π€!
Check out this new Virtual Try Off model (based on SD1.5): 1aurent/TryOffAnyone
This model isn't as accurate as others (e.g. xiaozaa/cat-try-off-flux based on FLUX.1) but it sure is fast!
Check out this new Virtual Try Off model (based on SD1.5): 1aurent/TryOffAnyone
This model isn't as accurate as others (e.g. xiaozaa/cat-try-off-flux based on FLUX.1) but it sure is fast!
data:image/s3,"s3://crabby-images/bdda0/bdda014ba6ac1a00d5a69236572a79ded275b12a" alt=""
ehristoforuΒ
posted
an
update
2 months ago
Post
3379
βοΈ Ultraset - all-in-one dataset for SFT training in Alpaca format.
fluently-sets/ultraset
β Ultraset is a comprehensive dataset for training Large Language Models (LLMs) using the SFT (instruction-based Fine-Tuning) method. This dataset consists of over 785 thousand entries in eight languages, including English, Russian, French, Italian, Spanish, German, Chinese, and Korean.
π€― Ultraset solves the problem faced by users when selecting an appropriate dataset for LLM training. It combines various types of data required to enhance the model's skills in areas such as text writing and editing, mathematics, coding, biology, medicine, finance, and multilingualism.
π€ For effective use of the dataset, it is recommended to utilize only the "instruction," "input," and "output" columns and train the model for 1-3 epochs. The dataset does not include DPO or Instruct data, making it suitable for training various types of LLM models.
βοΈ Ultraset is an excellent tool to improve your language model's skills in diverse knowledge areas.
fluently-sets/ultraset
β Ultraset is a comprehensive dataset for training Large Language Models (LLMs) using the SFT (instruction-based Fine-Tuning) method. This dataset consists of over 785 thousand entries in eight languages, including English, Russian, French, Italian, Spanish, German, Chinese, and Korean.
π€― Ultraset solves the problem faced by users when selecting an appropriate dataset for LLM training. It combines various types of data required to enhance the model's skills in areas such as text writing and editing, mathematics, coding, biology, medicine, finance, and multilingualism.
π€ For effective use of the dataset, it is recommended to utilize only the "instruction," "input," and "output" columns and train the model for 1-3 epochs. The dataset does not include DPO or Instruct data, making it suitable for training various types of LLM models.
βοΈ Ultraset is an excellent tool to improve your language model's skills in diverse knowledge areas.
Post
2316
ever wondered how you can make an API call to a visual-question-answering model without sending an image url π
you can do that by converting your local image to base64 and sending it to the API.
recently I made some changes to my library "loadimg" that allows you to make converting images to base64 a breeze.
π https://github.com/not-lain/loadimg
API request example π οΈ:
you can do that by converting your local image to base64 and sending it to the API.
recently I made some changes to my library "loadimg" that allows you to make converting images to base64 a breeze.
π https://github.com/not-lain/loadimg
API request example π οΈ:
from loadimg import load_img
from huggingface_hub import InferenceClient
# or load a local image
my_b64_img = load_img(imgPath_url_pillow_or_numpy ,output_type="base64" )
client = InferenceClient(api_key="hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx")
messages = [
{
"role": "user",
"content": [
{
"type": "text",
"text": "Describe this image in one sentence."
},
{
"type": "image_url",
"image_url": {
"url": my_b64_img # base64 allows using images without uploading them to the web
}
}
]
}
]
stream = client.chat.completions.create(
model="meta-llama/Llama-3.2-11B-Vision-Instruct",
messages=messages,
max_tokens=500,
stream=True
)
for chunk in stream:
print(chunk.choices[0].delta.content, end="")
data:image/s3,"s3://crabby-images/8bbd3/8bbd3af037a8bb82fb8e0bd642ae9888584403cc" alt=""
1024mΒ
authored
4
papers
4 months ago
RKadiyala at SemEval-2024 Task 8: Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts
Paper
β’
2410.16659
β’
Published
Large Language Models for Cross-lingual Emotion Detection
Paper
β’
2410.15974
β’
Published
β’
1
1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification
Paper
β’
2410.15998
β’
Published
β’
1
Augmenting Legal Decision Support Systems with LLM-based NLI for Analyzing Social Media Evidence
Paper
β’
2410.15990
β’
Published
β’
1
Post
8311
Realtime Whisper Large v3 Turbo Demo:
It transcribes audio in about 0.3 seconds.
KingNish/Realtime-whisper-large-v3-turbo
It transcribes audio in about 0.3 seconds.
KingNish/Realtime-whisper-large-v3-turbo
Post
8249
Exciting news! Introducing super-fast AI video assistant, currently in beta. With a minimum latency of under 500ms and an average latency of just 600ms.
DEMO LINK:
KingNish/Live-Video-Chat
DEMO LINK:
KingNish/Live-Video-Chat
Post
3302
A super good and fast image inpainting demo is here.
Its' super cool and realistic.
Demo by @OzzyGT (Must try):
OzzyGT/diffusers-fast-inpaint
Its' super cool and realistic.
Demo by @OzzyGT (Must try):
OzzyGT/diffusers-fast-inpaint