MP Scheduler PerformanceEach benchmark data point is generated by running a particular flowgraph.The flowgraphs have a rectangular format, described by the number of pipelines and the number of stages in each pipeline. For the purposes of the benchmark, only the fir_filter_fff blocks count as stages; the null_source, head and null_sinks get folded into the overhead. E.g., 1 <table width="100%" border="0"> Each FIR has 256 taps. Since these are implemented with a dot-product, we count 2 floating point operations (FLOP) per tap, per sample. For each run of the benchmark, we measure the user, system and real time. In addition we know the number of samples processed by the graph and the topology, thus we can compute the total number of floating point operations. We compute GFLOPS as the total FLOPs / real time / 1e9. The benchmark codeThe benchmark code and raw data are in
You can plot the raw data and fly it around in 3D using x86 and x86_64 ProcessorsOn the x86 machines the kernel of gr.fir_filter_fff is implemented with SSE. http://gnuradio.org/images/perf-data-images/dual-quad-core.png http://gnuradio.org/images/perf-data-images/dual-quad-core-2.33-clovertown.png http://gnuradio.org/images/perf-data-images/core2-duo.png http://gnuradio.org/images/perf-data-images/core-duo.png PowerPC Processors (using Altivec)Note the differences in scaling between the same machines depending on
whether we're using http://gnuradio.org/images/perf-data-images/js21-altivec.png In the next two graphs running on cell processors we're not yet using the
SPEs. http://gnuradio.org/images/perf-data-images/qs21-altivec.png http://gnuradio.org/images/perf-data-images/ps3-altivec.png PowerPC Processors (without Altivec)Please note that the benchmark does not contain any AltiVec
code. Thus these machines are http://gnuradio.org/images/perf-data-images/js21.png In the next two graphs running on cell processors we're not yet using the
SPEs. http://gnuradio.org/images/perf-data-images/qs21.png http://gnuradio.org/images/perf-data-images/ps3.png 注:MP Scheduler Performance(原文出处,翻译整理仅供参考!) |