this post was submitted on 23 Nov 2023

Hardware

[–] [email protected] 1 points 10 months ago (6 children)

The 4070 Ti is 294 mm^2 (full AD104) with 160 TFLOPS of FP16.

The 7900 XTX GCD is 300 mm^2 (full Navi 31, GCD only) with 122 TFLOPS of FP16.

Doubt it's that.

Where there might be a reason is that RDNA doesn't have dedicated AI cores. The tasks are accelerated on the shader cores, hence the term AI Accelerators. Now assume the tensor cores on Nvidia cards are ignored.

The 4090 can do only 82.6 TFLOPS of FP16 (non-tensor).

The 7900 XTX would still retain its 122 TFLOPS of FP16, making it faster in FP16 performance.
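
For anyone who wants to check the per-area comparison, here is a rough sketch using only the figures quoted in this comment. The numbers are taken as posted and not independently verified, and the helper name `tflops_per_mm2` is made up for illustration.

```python
# Figures exactly as quoted in the comment above (not independently verified).
cards = {
    "4070 Ti (AD104)": {"fp16_tflops": 160.0, "die_mm2": 294.0},
    "7900 XTX (Navi 31 GCD)": {"fp16_tflops": 122.0, "die_mm2": 300.0},
}


def tflops_per_mm2(fp16_tflops: float, die_mm2: float) -> float:
    """Hypothetical helper: FP16 throughput divided by die area."""
    return fp16_tflops / die_mm2


# Per-card throughput density in TFLOPS per square millimetre.
density = {name: tflops_per_mm2(**spec) for name, spec in cards.items()}

for name, value in density.items():
    print(f"{name}: {value:.3f} TFLOPS/mm^2")
```

On these figures the two chips land within roughly a third of each other per mm^2, which is the basis for doubting that density alone explains the difference.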

[–] [email protected] 1 points 10 months ago (2 children)

The actual rule has hard numbers, no need to speculate. The limit is 300 TFLOPS of FP16 (or 150 FP32, 600 FP8, etc.), so it ain't TFLOPS that are the culprit. As for performance density, it's equivalent to those figures at an 830 mm^2 die, so again not that.
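
The three figures quoted here (300 FP16, 150 FP32, 600 FP8) all share the same TFLOPS times bit-width product, so a quick check can be sketched as below. This assumes the limit scales inversely with bit width, which is inferred from those three numbers and not taken from the rule text; the function names are hypothetical.

```python
FP16_LIMIT_TFLOPS = 300.0  # FP16 limit as quoted in the comment above
FP16_BITS = 16


def equivalent_limit(bits: int) -> float:
    """TFLOPS limit at a given precision.

    Assumption: the TFLOPS x bit-width product is constant, as the
    quoted 300 / 150 / 600 figures suggest.
    """
    return FP16_LIMIT_TFLOPS * FP16_BITS / bits


def exceeds_limit(tflops: float, bits: int) -> bool:
    """True if the given throughput is above the assumed limit."""
    return tflops > equivalent_limit(bits)


# FP16 figures quoted earlier in the thread.
for name, fp16_tflops in [("4070 Ti", 160.0), ("7900 XTX", 122.0)]:
    print(name, "exceeds limit:", exceeds_limit(fp16_tflops, 16))
```

Both cards quoted earlier in the thread come in well under the FP16 figure on this reading.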

[–] [email protected] 1 points 10 months ago (1 children)

Ok I didn’t know the actual numbers that’s helpful. Maybe they’re just holding off to apply for an export license? I heard the 4090 is in a “gray area”.

[–] [email protected] 1 points 10 months ago

No gray area: at base clocks the 4090 already exceeds the limit by 10%.
