All these strategies select the next token from the probability | |
distribution over the entire vocabulary with various strategy-specific adjustments. | |
num_return_sequences: the number of sequence candidates to return for each input. This option is only available for | |
the decoding strategies that support multiple sequence candidates, e.g. variations of beam search and sampling. |