1)

6个V100,37分钟。120核 16小时。
6*37=222卡分钟
120*16*60=115200核分钟,
V100相对单核 是518倍加速效果。
单个CPU是20核的话,有25倍加速效果。
不知道是单精度还是双精度?
2)

感觉 是5倍的加速效果。
3)

按照这个说法,应该也是5倍的加速效果。
4)
Game-changing computational performance • Xeon: One run in ~9 months on 5,000 SKL cores with 10-day waits for 5-day jobs • Summit: Six runs done in 4.5 days on 3,312 GPUs
5)
New campaign runs 4-day sims on 6 billion elements using 5532 V100s • Throughput of ~2.2M Xeon cores • DES with 10 species, 19 reactions • 90 GB asynchronous I/O every 60 secs; total of ~1 petabyte per sim
6)
• NVIDIA Tesla V100 GPU outperforms Intel Xeon Skylake CPU by 4-5x
• New NVIDIA Tesla A100 GPU improves to 7-8x
• GPUs typically bundled in nodes with 4, 6, or 8 GPUs
• GPU nodes are more expensive, but still a win on performance / $