Efficient Streaming Language Models with Attention Sinks by from Hacker News on 2023-10-02 16:56 (#6F85G) Comments