単精度

C:\ProgramData\NVIDIA Corporation\NVIDIA GPU Computing SDK 3.2\C\bin\win64\Release
>nbody.exe -benchmark
Run "nbody -benchmark [-n=<numBodies>]" to measure perfomance.
        -fullscreen (run n-body simulation in fullscreen mode)
        -fp64       (use double precision floating point values for simulation)

> Windowed mode
> Simulation data stored in video memory
> Single precision floating point simulation
> Compute 2.0 CUDA device: [GeForce GTX 580]
16384 bodies, total time for 10 iterations: 70.128 ms
= 38.278 billion interactions per second
= 765.562 single-precision GFLOP/s at 20 flops per interaction