Thumbnail 1583708
thumbnail
Large (256x256)

Articles

Batched reward model inference and Best-of-N sampling
Comments
1