mlp
#145
by
Sin2pi
- opened
Did you not understand what moe is meant for? My guess is that the bad design choices are to defend the compute.
Edit: there's nothing new to see with the model architecture and it's super disappointing .. not even a talking parrot video. I think thats one of the coders over there telling us to help them.. like sos .. by writing bad code. real bad boring code