
#paper



From arxiv.org

Serving Large Language Models on Huawei CloudMatrix384


The rapid evolution of large language models (LLMs), driven by growing parameter scales, adoption of mixture-of-experts (MoE) architectures, and expanding context lengths, imposes unprecedented demands on AI infrastructure. Traditional AI clusters face limitations in compute intensity, memory...

#ai #llm #genai #paper

on Tue, 9AM
