Show HN: Run 30B model in 4GB Active Memory

(github.com)

4 points | by vkkhare 2 days ago ago

2 comments

  • nrjpoddar a day ago ago

    Link github/sparse_transformers seems to be broken

    • vkkhare a day ago ago

      updated the link