Ahmadzei's picture
update 1
57bdca5
raw
history blame
116 Bytes
GPT-J was followed by OPT, a family of decoder-only models, the largest of which is 175B and trained on 180B tokens.