RedPajama v2 Open Dataset with 30T Tokens for Training LLMs, from Hacker News on 2023-10-30 23:38 (#6G04R)