yarn scale to 122,880 context length
#41
by
nbroad
- opened
Please don't merge or close this. I'm just going to use this pr revision to run the model at 122k sequence length
Please don't merge or close this. I'm just going to use this pr revision to run the model at 122k sequence length