Ahmadzei's picture
update 1
57bdca5
raw
history blame
160 Bytes
During training, both BART and T5 will make the appropriate
decoder_input_ids and decoder attention masks internally. They usually do not need to be supplied.