SMG: The Case for Disaggregating CPU from GPU in LLM Serving

(pytorch.org)

2 points | by gmays 8 days ago

No comments yet.