single
[t_azu@machine]$ optirun nbody -benchmark
[nbody] starting...
Run "nbody -benchmark [-n=<numBodies>]" to measure perfomance.
-fullscreen (run n-body simulation in fullscreen mode)
-fp64 (use double precision floating point values for simulation)
-numdevices=N (use first N CUDA devices for simulation)
> Windowed mode
> Simulation data stored in video memory
> Single precision floating point simulation
> 1 Devices used for simulation
> Compute 2.1 CUDA device: [GeForce GT 520M]
1024 bodies, total time for 10 iterations: 5.223 ms
= 2.008 billion interactions per second
= 40.153 single-precision GFLOP/s at 20 flops per interaction
[nbody] test results...
PASSED
> exiting in 3 seconds: 3...2...1...done!
double
[t_azi@machine]$ optirun nbody -fp64 -benchmark
[nbody] starting...
Run "nbody -benchmark [-n=<numBodies>]" to measure perfomance.
-fullscreen (run n-body simulation in fullscreen mode)
-fp64 (use double precision floating point values for simulation)
-numdevices=N (use first N CUDA devices for simulation)
> Windowed mode
> Simulation data stored in video memory
> Double precision floating point simulation
> 1 Devices used for simulation
> Compute 2.1 CUDA device: [GeForce GT 520M]
1024 bodies, total time for 10 iterations: 42.682 ms
= 0.246 billion interactions per second
= 7.370 double-precision GFLOP/s at 30 flops per interaction
[nbody] test results...
PASSED
> exiting in 3 seconds: 3...2...1...done!