ZhangQiao123/medical-model-grpo-16bit

Text Generation

text-generation-inference


How was the reward function designed?

#1

by lhlhlsc - opened about 1 month ago

lhlhlsc

about 1 month ago

As the title says.
