TL;DR - When benchmarking Windows AI PCs, we see only 1.3% of the 45 Teraops/s that Qualcomm claims for its NPU.
The first obvious thing is that the NPU, even without float conversion, runs slower than the CPU.
By contrast, the same model runs in 3.2 ms on an Nvidia GeForce RTX 4080 Laptop GPU, equivalent to 2,160 billion operations per second, almost four times the throughput.
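To make the arithmetic behind these figures explicit, here is a rough sketch in Python. It uses only the numbers quoted above. The operation count per run is not stated here, so it is back-derived from the GPU latency and throughput, and the NPU throughput is reconstructed from the rounded 1.3% figure, so treat both as approximations. It also assumes the "almost four times" comparison is against the NPU. All variable names are made up for illustration.

```python
# Figures quoted in the post.
CLAIMED_NPU_OPS_PER_S = 45e12   # Qualcomm's 45 Teraops/s claim
MEASURED_FRACTION = 0.013       # 1.3% of the claim (a rounded figure)
GPU_LATENCY_S = 3.2e-3          # 3.2 ms per run on the RTX 4080 Laptop GPU
GPU_OPS_PER_S = 2160e9          # 2,160 billion operations per second

# Assumption: NPU throughput reconstructed from the rounded percentage.
npu_ops_per_s = CLAIMED_NPU_OPS_PER_S * MEASURED_FRACTION

# Assumption: work per run back-derived as throughput * latency.
ops_per_run = GPU_OPS_PER_S * GPU_LATENCY_S

# How many times faster the GPU is than the reconstructed NPU figure.
gpu_vs_npu = GPU_OPS_PER_S / npu_ops_per_s

print(f"NPU throughput (approx): {npu_ops_per_s / 1e9:.0f} billion ops/s")
print(f"Implied work per run:    {ops_per_run / 1e9:.3f} billion ops")
print(f"GPU vs NPU throughput:   {gpu_vs_npu:.2f}x")
```

On these assumptions the GPU comes out somewhere between three and a half and four times the NPU's throughput, which lines up with the "almost four times" wording.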