Hieu Lam PRO
lamhieu's activity
Sounds interesting but I think there will be a big breakthrough, a new "architecture/methodology/factor/rethinking" for developing large models. That's what I think, I don't know what it is yet, haha.
Unlock the Power of Ghost 8B Beta 1608: Build Your Personal AI Companion
Ghost 8B Beta 1608 empowers you to create a safe and multilingual AI assistant tailored to your needs, directly on your personal computer. 🧑‍💻 Leverage AI's capabilities within your own space! Ghost 8B Beta 1608 is ready to become your AI companion.
lamhieu/ghost-8b-beta-8k
ghost-x/ghost-8b-beta-668ead6179f93be717db4542
Key Highlights:
- Superior Performance: Outperforms Llama 3.1 8B Instruct, GPT-3.5 Turbo, Claude 3 Opus, GPT-4, and more in winrate scores.
- Expanded Language Support: Now supports 16 languages, including English, Vietnamese, Spanish, Chinese, and more.
- Enhanced Capabilities: Improved math, reasoning, and instruction-following for better task handling.
With two context options (8k and 128k), Ghost 8B Beta is perfect for complex, multilingual applications, balancing power and cost-effectiveness.
Learn More: https://ghost-x.org/docs/models/ghost-8b-beta
ghost-x/ghost-8b-beta-668ead6179f93be717db4542
thanks @danielus
Ghost 8B Beta Released: Game-Changing Language Model
Ghost 8B Beta is a groundbreaking language model developed with a clear vision: to deliver exceptional multilingual support and superior knowledge capabilities while remaining cost-effective. The model comes in two context length variations, 8k and 128k, ensuring flexibility for various tasks. Moreover, it has built-in multilingual functionality, making it a powerful tool for global communication and understanding.
- See detailed article: https://huggingface.co/blog/lamhieu/ghost-8b-beta-released-game-changing-language-mode
- Model card: https://huggingface.co/ghost-x/ghost-8b-beta
- Official website: https://ghost-x.org/docs/models/ghost-8b-beta
---
💬 Chat with the model here:
- Playground with Ghost 8B Beta (β, 8k): lamhieu/ghost-8b-beta-8k
- Playground with Ghost 8B Beta (β, 128k): lamhieu/ghost-8b-beta-128k
- Official website: https://ghost-x.org/docs/models/ghost-8b-beta/
Thank you for your dedication, it sounds great. Here I would like to share some additional information and perspectives so that everyone can better understand the issues we address:
- With language models, in practice we only need the model to understand about 80%, or to have a good overview; combining it with RAG then brings better accuracy. So what we need is a model that is good at telling the truth and that understands and works with RAG very well to be most effective.
- On Italian: I'm very happy that it speaks well. It proves that my training method and source code were correct, because this is what is actually live with the d0x5 version. Keep in mind that Italian was only added later (at the same time as German), which explains why its output can sometimes only be described as a translation.
- On reasoning, I hope you don't misunderstand. It still works well; it is just that, compared with some current superior models like GPT 4o or Claude 3, there will be some problems where it "loses". It still outperforms many other, much larger models. For example, try the question "Andrew is free from 11 am to 3 pm, Joanne is free from noon to 2 pm and then 3:30 pm to 5 pm. Hannah is available at noon for half an hour, and then 4 pm to 6 pm. What are some options for start times for a 30 minute meeting for Andrew, Hannah, and Joanne?", taken from the OpenAI GPT4 home page.
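For anyone who wants to check a model's answer to the meeting question above, here is a minimal Python sketch that works it out by intersecting the three availability lists. The helper names (`hm`, `intersect`, `start_times`) and the 30-minute step between candidate start times are my own assumptions for illustration; they are not part of Ghost 8B Beta or of the original question.

```python
from typing import List, Tuple

# An interval is (start, end) in minutes since midnight.
Interval = Tuple[int, int]


def hm(hour: int, minute: int = 0) -> int:
    """Convert a 24-hour clock time to minutes since midnight."""
    return hour * 60 + minute


def intersect(a: List[Interval], b: List[Interval]) -> List[Interval]:
    """Return the time ranges that appear in both availability lists."""
    out = []
    for s1, e1 in a:
        for s2, e2 in b:
            start, end = max(s1, s2), min(e1, e2)
            if start < end:  # keep only non-empty overlaps
                out.append((start, end))
    return sorted(out)


def start_times(windows: List[Interval], duration: int = 30, step: int = 30) -> List[str]:
    """List start times (HH:MM) at which a meeting of `duration` minutes fits."""
    result = []
    for start, end in windows:
        t = start
        while t + duration <= end:
            result.append(f"{t // 60:02d}:{t % 60:02d}")
            t += step  # hypothetical granularity for candidate start times
    return result


# Availability as stated in the question.
andrew = [(hm(11), hm(15))]
joanne = [(hm(12), hm(14)), (hm(15, 30), hm(17))]
hannah = [(hm(12), hm(12, 30)), (hm(16), hm(18))]

common = intersect(intersect(andrew, joanne), hannah)
options = start_times(common)
print(options)
```

Under these assumptions the only window shared by all three is noon to 12:30, so a 30 minute meeting can only start at noon.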
One note: reasoning tests often set the model's temperature to 0, but with Ghost 8B Beta we always set it to 0.1 at the lowest. The reason is simple: if the model still reasons well at this level, then at 0.4 (the default of the current chat) it will still often achieve the same results, and we want to aim for practical efficiency rather than scores. Try lowering the temperature on some reasoning questions to experiment.
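To make the temperature point more concrete, here is a small sketch of how temperature is commonly applied when sampling: the logits are divided by the temperature before the softmax. This is the general technique, not Ghost 8B Beta's actual sampling code, and the logits below are made-up numbers used only for illustration.

```python
import math
from typing import List


def softmax_with_temperature(logits: List[float], temperature: float) -> List[float]:
    """Convert logits to probabilities after dividing them by the temperature."""
    if temperature <= 0:
        # Temperature 0 is usually handled as greedy (argmax) decoding instead.
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical logits for three candidate tokens.
logits = [2.0, 1.0, 0.5]

for t in (0.1, 0.4, 1.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 4) for p in probs])
```

At 0.1 nearly all of the probability sits on the top token, so the output is close to greedy decoding; at 0.4 the top token still dominates but other tokens can occasionally be sampled.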
After all, you guys are great, thank you so much everyone.
An example of reasoning about time:
An example of a long context with extensive summary capabilities: given a paper, point out the highlights and identify the ideal people to apply it.
@Dihelson It's probably because you told the model to do it again. Try telling the model to change each word. Of course, it could still be because the model misunderstood.
Try the following conversation: (1) ask to write an article -> (2) ask to translate the article into the languages you want.
@AIWizard76 It hasn't gone through any real eval tests to be able to compare, but if we're just talking about Ghost 8B Beta, it has good translation capabilities for supported languages. It works well for translating long texts and also for translating into multiple languages simultaneously.
It's simple, currently the base version will not try to lengthen the text and be more "obedient". Maybe tomorrow or the next day I'll put it up for everyone to try.
Note that the current version is running everything from version "disl-0x5". The new version will improve a lot, but it may not be ready right now.
thank you for your comments and encouragement
another question, how do you feel when conversing in Italian?
@danielus let me ask, is this what you want?
@danielus I noticed the model explaining its answers. That is what the chat version (fine-tuned from the Ghost 8B Beta base) does for the chat task; the base will not try to explain and will respect the system prompt more strictly. The goal of answering with more information is to save users from having to look things up or ask follow-up questions after a single question. Of course, this can sometimes be a hassle, so we'll try to balance it out.
@Dihelson It supports Portuguese, try it and let me know what you think. 🇵🇹
@Dihelson I believe you, don't worry. Please experience it happily~
A note here: the model works well with 9 major languages, along with function tools for those languages. It is small compared to other multilingual models (which may lack, or be inferior in, things like function tools and performance).
@ZeroWw To be honest, our initial training focused more on math ability than on (abstract) reasoning. It simply has less training data there for now. Rest assured: this is a test of the training recipe, and expanding the capability domains and languages covered in training is just a matter of time and resources.
@Dihelson I understand. We value a model that has good reasoning capabilities, and that is also our goal. The early versions focused on the immediate goals of good multilingualism, safety, function tools support and good general performance, and they have achieved those goals. In the next stage, I will also tackle the things you mentioned and a few other languages. Enjoy
Ghost 8B Beta is a large language model developed with goals that include excellent multilingual support, superior knowledge capabilities, and cost-effectiveness. The model comes in two context length versions, 8k and 128k, along with multilingual function tools support by default.
The languages supported are 🇺🇸 English, 🇫🇷 French, 🇮🇹 Italian, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇩🇪 German, 🇻🇳 Vietnamese, 🇰🇷 Korean and 🇨🇳 Chinese.
Explore the Potential:
To learn more about this groundbreaking language model, visit the official website or explore the online demo platforms:
- Ghost 8B Beta (β, 8k) on Spaces: lamhieu/ghost-8b-beta-8k
- Ghost 8B Beta (β, 128k) on Spaces: lamhieu/ghost-8b-beta-128k
- Official website: https://ghost-x.org/docs/models/ghost-8b-beta
Samba is a powerful hybrid model with an unlimited context length, built by stacking Mamba, MLP, Sliding Window Attention, and MLP layers. Samba's largest version, Samba-3.8B, trained on 3.2 trillion tokens, excels on benchmarks like MMLU, GSM8K, and HumanEval, and shines in long-context tasks with minimal tuning.
---
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
GitHub: https://github.com/microsoft/Samba
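As a rough illustration of the Sliding Window Attention part, here is a minimal sketch of the kind of causal mask such a layer typically uses: each position may attend only to itself and a fixed number of preceding positions. The function name and the window size of 3 are hypothetical choices for this example; the official Samba repository may implement this differently.

```python
from typing import List


def sliding_window_causal_mask(seq_len: int, window: int) -> List[List[bool]]:
    """mask[i][j] is True when query position i may attend to key position j."""
    if window < 1:
        raise ValueError("window must be at least 1")
    return [
        # Causal (j <= i) and within the last `window` positions (i - j < window).
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


# Hypothetical sizes, chosen only to keep the printed mask readable.
mask = sliding_window_causal_mask(seq_len=6, window=3)
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))
```

Because each row has at most `window` allowed positions, the attention cost per token stays constant as the sequence grows, which is one reason this kind of layer suits long contexts.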
Supported languages: 🇺🇸 English, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇫🇷 French, 🇮🇹 Italian, 🇩🇪 German, 🇻🇳 Vietnamese, 🇰🇷 Korean, 🇨🇳 Chinese, and !?
Note that this is not a conclusion; it is just a sharing of the current state of the model. If you find it interesting, please follow the project at:
* https://x.com/ghostx_ai
* https://ghost-x.org/
* https://huggingface.co/ghost-x
Ghost X is currently very open to invitations to cooperate, share and support.
🤯
There are now more than 30 high-quality datasets available so you can start creating interesting models. The collection will be updated in the future; glad if it helps someone.
lamhieu/blackhole-66473b7feec034b4fb70818a