Thursday, December 17, 2015

Re: [Discuss-gnuradio] [VOLK] GPU acceleration -> OpenCL integration?

Hi,

> RUN_VOLK_TESTS: volk_32f_x2_matrix_nxn_multiply_puppet_32f(1000000,10)
> generic completed in 28482ms
> a_opencl completed in 13364.3ms

Question is how does that number change for smaller problem sizes ?
And what would be the average problem size encountered in real env.

For SIMD optimization the result of "who's the fastest" doesn't vary
too much depending on problem size because they don't have much setup
/ teardown size.
For OpenCL I very much doubt that would be the case and if you end up
with an app making a lot of "smallish" (and given the default buffer
size of GR, I feel the calls to volk aren't processing millions of
samples at a time in a single call)


Cheers,

Sylvain

_______________________________________________
Discuss-gnuradio mailing list
Discuss-gnuradio@gnu.org
https://lists.gnu.org/mailman/listinfo/discuss-gnuradio

No comments:

Post a Comment