2 comments

  • binyang_qiu 5 hours ago ago

    For me, the interesting part here isn't GPT-2, it's the memory discipline. I feel like most inference runtimes slowly leak allocations everywhere as features pile up.

  • grelikt 12 hours ago ago

    [dead]