Before running any LLM, take into account that the required RAM is between 1.5-3 times the model size (this is an estimation, I haven't done extensive testing yet).
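The 1.5-3x rule of thumb above can be turned into a quick sanity check before downloading a model. This is just a sketch of that estimate; the helper name and the 4 GB example figure are illustrative, and the multipliers are the rough bounds stated above, not measured values:

```python
def estimated_ram_gb(model_size_gb: float) -> tuple[float, float]:
    """Return a (low, high) RAM estimate for running a model,
    using the 1.5x-3x rule of thumb from this README."""
    return (1.5 * model_size_gb, 3.0 * model_size_gb)

# Example: a hypothetical 4 GB converted model file
low, high = estimated_ram_gb(4.0)
print(f"Plan for roughly {low:.1f}-{high:.1f} GB of RAM")  # 6.0-12.0 GB
```

If the high end exceeds your board's physical RAM, plan on configuring swap, as was needed for the Phi-2 conversion mentioned below.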

Right now, I have only converted the following models:

| LLM                      | Parameters | Link                                                          |
| ------------------------ | ---------- | ------------------------------------------------------------- |
| DeepSeek R1 Distill Qwen | 1.5B       | https://huggingface.co/Pelochus/deepseek-R1-distill-qwen-1.5B |
| Qwen Chat                | 1.8B       | https://huggingface.co/Pelochus/qwen-1_8B-rk3588              |
| Gemma                    | 2B         | https://huggingface.co/Pelochus/gemma-2b-rk3588               |
| Microsoft Phi-2          | 2.7B       | https://huggingface.co/Pelochus/phi-2-rk3588                  |
| Microsoft Phi-3 Mini     | 3.8B       | https://huggingface.co/Pelochus/phi-3-mini-rk3588             |
| Llama 2 7B               | 7B         | https://huggingface.co/Pelochus/llama2-chat-7b-hf-rk3588      |
| Llama 2 13B              | 13B        | https://huggingface.co/Pelochus/llama2-chat-13b-hf-rk3588     |
| TinyLlama v1             | 1.1B       | https://huggingface.co/Pelochus/tinyllama-v1-rk3588           |
| Qwen 1.5 Chat            | 4B         | https://huggingface.co/Pelochus/qwen1.5-chat-4B-rk3588        |
| Qwen 2                   | 1.5B       | https://huggingface.co/Pelochus/qwen2-1_5B-rk3588             |

Llama 2 was converted using Azure servers.
For reference, converting Phi-2 peaked at about 15 GB of RAM + 25 GB of swap (counting the OS, which was using about 2 GB max).