Optimizing Closure :: pouët.net

Optimizing Closure

category: code [glöplog]

you said "my framerate stays the same (240fps vs 240fps in my testCase)" so what's the other code you ran?

added on the 2012-09-18 17:27:22 by Gargaj

the same code without using the post-fx-shadercode (snippet) ! whats so hard to understand there, garg?

added on the 2012-09-18 17:28:17 by ɧ4ɾɗվ.

I used this fine device and could reproduce your measurements
BB Image

sadly only for resolutions of 1x1.

added on the 2012-09-18 17:29:37 by las

So basically the performance with a 128-times-texlookup full screen blur pass is exactly the same as without the pass. That sounds like somebody royally screwed up his measurements.

added on the 2012-09-18 17:30:06 by kb_

It's called unlimited non-detail!

added on the 2012-09-18 17:32:03 by xTr1m

when we see ms instead of fps measured on GPU, then a sensible perf argument can be made!

added on the 2012-09-18 17:32:06 by dv$

I'd rather say "come back as soon as you can present us the results gathered in PIX and/or NVPerfHud and/or AMD GPUPerfStudio".

added on the 2012-09-18 17:34:32 by kb_

Quote:

the same code without using the post-fx-shadercode (snippet) ! whats so hard to understand there, garg?

added on the 2012-09-18 17:36:30 by Gargaj

use the perf tools, luke!

added on the 2012-09-18 17:36:38 by dv$

i am not up to it! i KNOW i dont loose any fps, due to years of abusage! las, your turn! also: i said so, its true! ;)
i dont care about ms at all, if the fps stay the same!

added on the 2012-09-18 17:38:29 by ɧ4ɾɗվ.

kb: its a 256times-texture-lookup btw :p

added on the 2012-09-18 17:40:59 by ɧ4ɾɗվ.

Quote:

i dont care about ms at all

Translation: I don't care about performance at all, if you don't get enough fps, buy a better computer.

added on the 2012-09-18 17:41:01 by kb_

thats all about what intel/pcs are about!
AMIGA ftw!

added on the 2012-09-18 17:43:08 by ɧ4ɾɗվ.

Quote:

i dont care about ms at all

added on the 2012-09-18 17:46:43 by xTr1m

I'm almost ready for the next level.

Can I request some nice pony pictures about time measurements of convolution kernels?

I'm sorry - this thread is over. If you (the interested reader) want to read some nice information on kernel optimization - switch to page one and start reading until you reach posts about mysterious miracles and high performance hypnoglows.

added on the 2012-09-18 17:47:59 by las

i am doing 4ks! i know my last one was a fastshot and even won against yours, but i know why! my HypnoGlow was for free!

added on the 2012-09-18 17:48:35 by ɧ4ɾɗվ.

Our FLOPS beat your FLOPS
BB Image

added on the 2012-09-18 17:50:14 by xTr1m

i am done here aswell, only Q left is: las, what did you measure at 720p with my snippet?

added on the 2012-09-18 17:50:44 by ɧ4ɾɗվ.

no, every platform is there to be used properly and to push the boundaries of what it can do not hide lazy hacking without proper knowledge of the hardware. You learn to abuse the hardware whatever system it is and this thread is about optimisation on PC gfx hardware :)

added on the 2012-09-18 17:50:46 by dv$

added on the 2012-09-18 17:53:33 by ___

DVS: thats what i did! i answered mu6x on his usage of Blur...and i told him the performance-hit fps-wise is almost not measurable! all this went into francy here, just due to ppl not reading my comments correctly! i said ALMOST! i just told mu6x its unnecessary to downsample by 4 by now, with newest graphics cards. i never had onBoard-GPUs in mind, as i just answered his post, before reading the entire Thread!

added on the 2012-09-18 17:57:24 by ɧ4ɾɗվ.

Quote:

i told him the performance-hit fps-wise is almost not measurable

no, _you_ couldn't measure it, because your methods are inaccurate.

added on the 2012-09-18 18:00:02 by Gargaj

i dont see why its inaccurate, its working, i can watch my 4ks and see the HypnoGlow working! (except i comment out "#define POST_FX")

added on the 2012-09-18 18:03:31 by ɧ4ɾɗվ.

oh, my methids to measure are inaccurate, fraps...i see!

added on the 2012-09-18 18:04:05 by ɧ4ɾɗվ.

I know that it is troll time, but seriously, I don't see why 256 texture lookups for blurring has to be super expensive. The code that is run for each pixel is the same, and those 256 texture lookups are repeated for neighboring pixels. So if the GPU plays it smart, most of the lookups could be cached. And then it is down to 256 MADs per pixel, which should not be much of a problem, or?

added on the 2012-09-18 18:05:24 by chock

Optimizing Closure

login