**lianghsun/Llama-3.2-Taiwan-3B**: This model is built on top of meta-llama/Llama-3.2-3B with continual pretraining. The training data is a mixture of Traditional Chinese and multilingual text in specific proportions, including 20B tokens of Traditional Chinese.
**lianghsun/Llama-3.2-Taiwan-3B-Instruct**: A conversational model fine-tuned from the foundation model above.
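If you want to try the instruct model right away, a minimal sketch with the Hugging Face transformers chat pipeline looks like this. It assumes the repo ships a chat template; the example prompt and generation settings are illustrative, not official recommendations.

```python
# Minimal sketch: one chat turn with the instruct model via transformers.
# Assumes the model repo includes a chat template; settings are illustrative.
import torch
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="lianghsun/Llama-3.2-Taiwan-3B-Instruct",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    {"role": "user", "content": "請用繁體中文簡單介紹台灣的地理。"},
]
result = pipe(messages, max_new_tokens=256)
print(result[0]["generated_text"][-1]["content"])  # the assistant reply
```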
This Llama-3.2-Taiwan open-source project is currently a one-person effort (yes, I did everything myself, starting from text preparation, and it was exhausting!). If you're interested, feel free to join the Discord server for discussions.
**Benchmarking**
The evaluation was conducted with ikala/tmmluplus, though the README does not yet reflect the latest results. Performance is close to the previous versions, which suggests that further gains may require adding more domain-specific knowledge to the training data.
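For reference, here is a rough sketch of how a single tmmluplus subject could be loaded and turned into a multiple-choice prompt. The subject config name and the question/A/B/C/D/answer columns are assumptions about the dataset's MMLU-style schema, so check the ikala/tmmluplus dataset card before relying on them.

```python
# Rough sketch: load one tmmluplus subject and format an MMLU-style prompt.
# The subject config name and column names are assumptions; verify them
# against the ikala/tmmluplus dataset card.
from datasets import load_dataset

subject = "engineering_math"  # assumed subject config name
ds = load_dataset("ikala/tmmluplus", subject, split="test")

def format_question(row):
    # Build a four-option multiple-choice prompt in Traditional Chinese.
    return (
        f"{row['question']}\n"
        f"A. {row['A']}\nB. {row['B']}\nC. {row['C']}\nD. {row['D']}\n"
        "答案："
    )

print(format_question(ds[0]))
print("正確答案：", ds[0]["answer"])
```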
**A Call for Support**
If anyone is willing to provide compute resources, it would be greatly appreciated and would help this project keep going and grow. 💪
---
🏗️ Foundation model: lianghsun/Llama-3.2-Taiwan-3B
🤖 Instruction model: lianghsun/Llama-3.2-Taiwan-3B-Instruct
⚡ GGUF: lianghsun/Llama-3.2-Taiwan-3B-Instruct-GGUF (local-inference sketch below)
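For local inference, the GGUF build can be loaded with llama-cpp-python along these lines. This is a minimal sketch, and the quantization filename glob is a placeholder; check the GGUF repo for the files it actually ships.

```python
# Minimal sketch: run the GGUF build locally with llama-cpp-python.
# The filename glob is a placeholder; pick an actual quant file from the repo.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="lianghsun/Llama-3.2-Taiwan-3B-Instruct-GGUF",
    filename="*Q4_K_M.gguf",  # assumed quantization; adjust to what the repo ships
    n_ctx=4096,
)

resp = llm.create_chat_completion(
    messages=[{"role": "user", "content": "台灣最高的山是哪一座？"}],
    max_tokens=128,
)
print(resp["choices"][0]["message"]["content"])
```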