Intel, Ampere show running LLMs on CPUs isn't as crazy as it sounds - NextComputing

Via The Register

“Popular generative AI chatbots and services like ChatGPT or Gemini mostly run on GPUs or other dedicated accelerators, but as smaller models are more widely deployed in the enterprise, CPU-makers Intel and Ampere are suggesting their wares can do the job too – and their arguments aren’t entirely without merit.”

NextComputing is proud to offer systems featuring Ampere^® Altra processors, built for high performance and energy efficiency, crucial for the latest high-volume AI workloads at the network edge. Click below to learn more.

NextComputing and Ampere CPUs