Tokenization for language modeling: BPE vs. Unigram Language Modeling (2020) from Hacker News on 2025-05-30 08:59 (#6XMNE) Comments