Reference and Inference

Cornelis and NextSilicon to Build Joint Reference Architectures for AI and HPC

Cornelis and NextSilicon today announced at ISC High Performance 2026 a collaboration to build and evaluate joint reference architectures for AI and high-performance computing. The work pairs the ...

11d

QumulusAl Signs More Than $124 Million in AI Inference Infrastructure Agreements

Workload-optimized Nvidia Blackwell deployments designed to reduce AI inference costs by approximately 20% compared with standard reference architectures ATLANTA, GA / ACCESS Newswire / June 11, 2026 ...

13d

d-Matrix Corsair AI Inference Platform Enters Full Production to Meet Customer Demand

Matrix, the pioneer in low-latency AI inference for data centers, today announced its Corsair™ inference accelerator platform ...

InfoWorld

AI is all about inference now

You train the model once, but you run it every day. Making sure your model has business context and guardrails to guarantee reliability is more valuable than fussing over LLMs. We’re years into the ...

Seeking Alpha

AMD: Inference Is The Future Of AI

AMD is strategically positioned to dominate the rapidly growing AI inference market, which could be 10x larger than training by 2030. The MI300X's memory advantage and ROCm's ecosystem progress make ...

VentureBeat

Nvidia triples and Intel doubles generative AI inference performance on new MLPerf benchmark

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More MLCommons is out today with its MLPerf 4.0 benchmarks for inference, once ...

Semiconductor Engineering

TOPS, Memory, Throughput And Inference Efficiency

Dozens of companies have or are developing IP and chips for Neural Network Inference. Almost every AI company gives TOPS but little other information. What is TOPS? It means Trillions or Tera ...

Forbes

How AI Inference Costs Are Reshaping The Cloud Economy

While the tech world obsesses over headlines about the $100 million price tag to train GPT-4, the real economic story is happening in inference: the ongoing cost of actually running AI models in ...

Business Insider

Nvidia might actually lose in this key part of the AI chip business

You're currently following this author! Want to unfollow? Unsubscribe via the link in your email. In AI hardware circles almost everyone is talking about inference. Nvidia CFO Colette Kress said on ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results