Noa Roggendorff

nroggendorff

AI & ML interests

None

Recent Activity

Organizations

Gradio-Blocks-Party, Blog-explorers, ZeroGPU Explorers, MLX Community, Dev Mode Explorers, Glide

nroggendorff's activity

replied to their post 2 months ago

Ah, we can work with that. Then the issue is that the Space is incomplete or misconfigured (I would recommend amending your original post to avoid confusion).

I just read your blog post: https://huggingface.co/blog/nroggendorff/train-with-llama-architecture

It provides some useful context, thanks.

From reading the Dockerfile and the image layers, it appears that CUDA was never included in the image.

You may find the following resources helpful for using docker with spaces:
https://huggingface.co/docs/hub/en/spaces-sdks-docker

If you are using CUDA, this may also help with setting it up and testing that it works with Docker:
https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html
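
As a rough way to test the point above from inside a running container, here is a minimal sketch that only reports what the process can see. It assumes that a usable CUDA setup normally exposes `CUDA_HOME` and puts `nvidia-smi` and `nvcc` on `PATH`; the function name `cuda_report` is hypothetical, and a clean report does not prove that the GPU itself is usable.

```python
import os
import shutil


def cuda_report(env=None):
    """Collect basic hints about whether CUDA is visible to this process.

    `env` defaults to the real environment; passing a dict makes the
    check easy to try against a made-up configuration.
    """
    env = os.environ if env is None else env
    path = env.get("PATH")
    return {
        # The toolkit location, if the image or Space sets it.
        "cuda_home": env.get("CUDA_HOME"),
        # Driver utility, usually provided by the host through the container runtime.
        "nvidia_smi": shutil.which("nvidia-smi", path=path),
        # Compiler from the CUDA toolkit, only present if the image ships it.
        "nvcc": shutil.which("nvcc", path=path),
    }


if __name__ == "__main__":
    for name, value in cuda_report().items():
        print(f"{name}: {value or 'not found'}")
```

Running this once at build time and once after the app has started would show whether the two environments differ.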

References
https://huggingface.co/spaces/nroggendorff/train-llama/blob/main/Dockerfile

https://hub.docker.com/layers/nroggendorff/train-llama/latest/images/sha256-8cd7859f8a7cc3b669b344e87fa342e3c464e449141e267fbb52cfb48c32310f

Hope you find this helpful.
If you have any more questions, let me know here or email me.

The base image for that Dockerfile has CUDA installed and configured.

You are welcome to open a PR with your proposed fix on https://github.com/nroggendorff/train-llama.

replied to their post 2 months ago

I am not sure that makes sense. I am under the impression that if the Space is not running (not started), no models can be actively loaded in it.

Can you share your relevant workflow (docker-compose, app code, etc.) so I can see more clearly what's happening?

I might be able to help with a solution; it's possible that there is an issue in the workflow itself.

EDIT: I looked at the Spaces. Do you mean this Space as an example? 'https://huggingface.co/spaces/nroggendorff/train-llama'
This Space shows a missing "CUDA_HOME" env var, and most of your other Spaces are either throwing errors about missing CUDA drivers or are paused. These are configuration errors.

Could you tell me the space and error message?
I might be able to help you fix it.

That’s the one.

replied to their post 2 months ago

it's pretty specific to my workflow, but spaces now don't get cuda until after they start, so you can't load models or anything until an app is running
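
One common way to cope with that kind of constraint is to defer the load until the app is actually serving. The sketch below is only an illustration of that pattern, not the workflow described above: `LazyModel` and the `loader` callable are hypothetical names, and the real loader would be whatever builds the model once CUDA is available.

```python
import threading


class LazyModel:
    """Defer an expensive load until first use, then reuse the result."""

    def __init__(self, loader):
        self._loader = loader          # hypothetical callable that builds the model
        self._model = None
        self._loaded = False
        self._lock = threading.Lock()  # guards against two requests loading at once

    def get(self):
        # Fast path: already loaded, no locking needed.
        if not self._loaded:
            with self._lock:
                # Re-check inside the lock in case another thread finished first.
                if not self._loaded:
                    self._model = self._loader()
                    self._loaded = True
        return self._model
```

Nothing is loaded when `LazyModel(loader)` is constructed at import time; the first call to `get()` inside a request handler triggers the load, and later calls return the cached object.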

reacted to their post with ❤️🤗🚀 2 months ago
I'm not really doing much on HuggingFace right now due to their new Docker space policies, so if you want to keep up with most of what I'm up to, follow my [instagram](https://sly.sh/ig)
replied to their post 3 months ago
reacted to their post with 🤗🚀 3 months ago
200
reacted to clem's post with 🤗 3 months ago
We just crossed 1,500,000 public models on Hugging Face (and 500k spaces, 330k datasets, 50k papers). One new repository is created every 15 seconds. Congratulations all!
reacted to their post with 😔🔥 3 months ago
to the nvidia employee that won't respond to my emails: hear me now.

you have made a semi-powerful to irrelevant enemy. you have been warned
  • 4 replies
posted an update 3 months ago