correlate_image.cl: an image kernel that actually works (also fast, also simple) | about 13 years ago
README: add HD 6950 numbers for GLSL | about 13 years ago
README: add 64-bit numbers for Athlon II X4 | over 13 years ago
README: update with Athlon II X4 640 figures | over 13 years ago
corr.cpp: set OpenMP num_threads to a higher number | over 13 years ago
sse.h: double2 and double4 recip and rsqrt methods, license notice, doc changes | over 13 years ago
build.sh: parallel compilation | over 13 years ago
MIT LICENSE | over 13 years ago
build.sh: set up build envvars | over 13 years ago
README update | over 13 years ago
build script | over 13 years ago
more understandable 2x2-blocking loop | over 13 years ago
fast GLSL kernel | over 13 years ago
naive GLSL implementation | over 13 years ago
GPU timing tweaks, faster 500x500 | over 13 years ago
corr_500.cpp: 500x500 optimized version | over 13 years ago
update README figures | over 13 years ago
faster GPU kernel | over 13 years ago
sse.h: swizzles, double4 two-vector shuffle, double2 x() & y() | almost 14 years ago
sse.h: add sqrt, min, max, andnot, &, |, ^ to vectors and recip and rsqrt to float4 | almost 14 years ago
README update | almost 14 years ago
corr.cpp: move parallel for to outermost in sse_optimized | almost 14 years ago
move SSE vector lib to sse.h, benchmark doubles | almost 14 years ago
corr_naive.cpp: naive OpenCL GPU kernel | almost 14 years ago
README: perf numbers for GPU 500x500 input size, new manual SSE | almost 14 years ago
corr.cpp: faster manually optimized sse | almost 14 years ago
correlate_500x500.cl: kernel for 500x500 problem size | almost 14 years ago
better CPU kernel | almost 14 years ago
use images if supported, report rest of opencl overhead | almost 14 years ago
bogus bandwidth benchmarks | almost 14 years ago