windows_parallel.qbk 4.6 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117
  1. [/===========================================================================
  2. Copyright (c) 2017 Steven Ross, Francisco Tapia, Orson Peters
  3. Distributed under the Boost Software License, Version 1.0
  4. See accompanying file LICENSE_1_0.txt or copy at
  5. http://www.boost.org/LICENSE_1_0.txt
  6. =============================================================================/]
  7. [section:windows_parallel 3.5- Windows Benchmark]
  8. [:
  9. These results are obtained from complex benchmarks, not included in the library because they use non free software.
  10. (If you are interested in the details, contact fjtapia@gmail.com)
  11. There are 3 types of benchmarks,
  12. [:
  13. *64 bits integers
  14. *strings
  15. *objects of several sizes.
  16. The objects are arrays of integers. The heavy comparison sums all the elements in each, and the light comparison uses only the first number of the array.
  17. This benchmark ran on a VirtualBox virtual machine with 8 threads and 16 GB of RAM,
  18. running over a Intel(R) Core(TM) i7-5820K CPU @ 3.30GHz with 6 cores and 2 threads by core, and 15M of cache.
  19. ]
  20. [h4[_100 000 000 NUMBERS OF 64 BITS RANDOMLY FILLED]]
  21. [*[teletype]
  22. ``
  23. | | |
  24. | Time | Maximum |
  25. | secs | Memory Used |
  26. --------------------------+---------+-------------+
  27. PPL parallel sort | 2.4016 | 786 MB |
  28. PPL parallel_buffered_sort| 2.0373 | 1567 MB |
  29. block_indirect_sort | 1.6101 | 785 MB |
  30. | | |
  31. sample sort | 2.1191 | 1565 MB |
  32. parallel stable sort | 2.4503 | 1175 MB |
  33. | | |
  34. ``
  35. ]
  36. [h4[_10 000 000 STRINGS RANDOMLY FILLED]]
  37. [*[teletype]
  38. ``
  39. | | |
  40. | Time | Maximum |
  41. | secs | Memory Used |
  42. --------------------------+---------+-------------+
  43. PPL parallel sort | 4.3241| 887 MB |
  44. PPL parallel_buffered_sort| 3.5434| 1199 MB |
  45. block_indirect_sort | 3.5732| 1601 MB |
  46. | | |
  47. sample sort | 3.8107| 1198 MB |
  48. parallel stable sort | 5.0277| 1041 MB |
  49. | | |
  50. ``
  51. ]
  52. [h4[_OBJECTS RANDOMLY FILLED]]
  53. [:
  54. The objects are arrays of 64 bits numbers
  55. They are compared in two ways :
  56. [:
  57. (H) Heavy : The comparison is the sum of all the numbers of the array.
  58. (L) Light : The comparison is using only the first element of the array,
  59. as a key
  60. ]
  61. ]
  62. [*[teletype]
  63. ``
  64. | | | | | | | |
  65. | 100000000 | 50000000 | 25000000 | 12500000 | 6250000 | 1562500 | |
  66. | objects of| objects of|objects of |objects of |objects of |objects of | Maximum |
  67. | 8 bytes | 16 bytes | 32 bytes | 64 bytes | 128 bytes | 512 bytes | Memory |
  68. | | | | | | | Used |
  69. | H L | H L | H L | H L | H L | H L | |
  70. ---------------------------+-----------+-----------+-----------+-----------+-----------+-----------+-------------+
  71. | | | | | | | |
  72. PPL parallel sort | 2.50 2.40| 1.34 1.16| 0.85 0.73| 0.70 0.57| 0.72 0.45| 0.54 0.40| 785 MB |
  73. PPL parallel_buffered_sort | 2.20 2.26| 1.34 1.24| 1.03 0.79| 1.00 0.83| 0.90 0.85| 0.78 0.87| 1567 MB |
  74. | | | | | | | |
  75. block_indirect_sort | 1.62 1.59| 0.94 0.87| 0.63 0.57| 0.50 0.44| 0.58 0.38| 0.55 0.35| 785 MB |
  76. | | | | | | | |
  77. sample sort | 2.19 2.25| 1.69 1.54| 1.12 1.14| 1.18 1.14| 1.03 1.13| 1.09 1.17| 1565 MB |
  78. parallel stable sort | 2.54 2.49| 1.69 1.52| 1.25 1.10| 1.10 1.03| 1.07 1.00| 1.05 0.97| 1175 MB |
  79. | | | | | | | |
  80. ``
  81. ]
  82. [br]
  83. [br]
  84. ]
  85. [endsect]