Processes long text by splitting it into independent chunks and streams the translation in real time.
Text will be split into chunks of this size for translation.
Controls the randomness of the output. Higher values produce more varied output; lower values make it more deterministic.
Samples only from the most likely tokens whose cumulative probability mass reaches this value.
Samples only from the 'k' most likely tokens at each step.
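
As a rough illustration of how the three sampling settings above interact, here is a minimal sketch in plain Python. The function names (apply_temperature, filter_top_k_top_p, sample_token), the default values, and the order of the filters (top-k first, then cumulative probability) are assumptions made for this example; the actual model backend may apply them differently.

```python
import math
import random


def apply_temperature(logits, temperature):
    """Turn raw scores into probabilities, scaled by temperature (assumed > 0)."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def filter_top_k_top_p(probs, top_k, top_p):
    """Keep the top_k most likely tokens, then cut off once cumulative mass reaches top_p.

    Returns {token_index: renormalized_probability}.
    The filter order is an assumption of this sketch.
    """
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    ranked = ranked[:top_k]
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}


def sample_token(logits, temperature=1.0, top_k=40, top_p=0.95, rng=random):
    """Pick one token index using temperature, top-k, and top-p together."""
    probs = apply_temperature(logits, temperature)
    candidates = filter_top_k_top_p(probs, top_k, top_p)
    indices = list(candidates)
    weights = [candidates[i] for i in indices]
    return rng.choices(indices, weights=weights, k=1)[0]
```

With made-up scores such as [2.0, 1.0, 0.0, -1.0], a lower temperature concentrates probability on the first token, while smaller top-k or top-p values shrink the set of tokens that can be picked at all.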