Comments on: Branchless selections

By: EddieEdwards

EddieEdwards — Wed, 13 May 2009 15:21:56 +0000

I like the XOR trick. Very appropriate for SSE2.

By: Branch free Clamp() « Miles Macklin

Branch free Clamp() « Miles Macklin — Fri, 09 Jan 2009 23:05:27 +0000

[…] https://realtimecollisiondetection.net/blog/?p=90 […]

By: tony

tony — Mon, 15 Dec 2008 05:49:39 +0000

GoW3 teaser,

Neogaf has gone bizarre over the footage. Underwhelming is what they are saying, thanks to Jaffe’s comment about art coming to life.
But I hope you guys will deliver it, no doubt.

By: SolomonS

SolomonS — Thu, 11 Dec 2008 05:23:57 +0000

Sorry, it got cut off. Let’s try again.

unsigned int _max(unsigned int x, unsigned int y)
{
unsigned int r = x – ((x-y) & -(x

By: SolomonS

SolomonS — Thu, 11 Dec 2008 05:21:36 +0000

supzi, this is not my trick, but it is branchless, since the cmp instruction isn’t a branch.

unsigned int _max(unsigned int x, unsigned int y)
{
unsigned int r = x – ((x – y) & -(x

By: supzi

supzi — Wed, 10 Dec 2008 23:24:35 +0000

ooops, there’s an error my previous post, this is how it should be :

const unsigned int uint_bit_mask = unsigned int(-1)/2;
uint first_bit_sign = sign( diff( r1 >> uint_bit_shift, r2 >> uint_bit_shift ) );
uint last_bits_sign = sign( diff( r1 & uint_bit_mask, r2 & uint_bit_mask ) );

By: supzi

supzi — Wed, 10 Dec 2008 23:17:39 +0000

Solomon,

If I can’t use 64 bits ints or 33 bits, in other words there’s no way to know if a subtraction did an overflow then I would consider splitting the 32 bits, check the first bit, then check the remaining bits, something like this :

const int uint_bit_shift = sizeof( uint )*8-1;
const unsigned int uint_bit_mask = ~(1 > uint_bit_shift, r2 >> uint_bit_shift ) );
uint last_bits_sign = sign( diff( r1 & uint_bit_mask, r2 & uint_bit_mask ) );

return branchless_sel(r1, r2, first_bit_sign | last_bits_sign );

May be not the best solution ever but it is still branchless!

I presume this is not the solution you are waiting for, perhaps you have a cool trick that you want to share?

By: guardian

guardian — Tue, 09 Dec 2008 12:04:05 +0000

Hello,

In today’s applications, where would this trick find a use (apart from sprite code)?

By: SolomonS

SolomonS — Tue, 02 Dec 2008 08:31:59 +0000

supzi, can you think of a solution that doesn’t require 64 bit ints? or 33 bit ints for that matter.

By: supzi

supzi — Mon, 01 Dec 2008 22:18:33 +0000

Sorry for the very late reply Solomon,

I don’t see why it is limited to a specific range, the solution itself is just pseudocode, you can always adapt it to your needs

Here’s a c++ solution that is compatible with 32 bits unsigned int :

typedef long long int64;
typedef unsigned long long uint64;
typedef unsigned int uint;

int64 diff( uint a, uint b ) { return (int64( a ) – int64( b )); }
uint sign( int64 val ) { return uint( uint64( val ) >> ( (sizeof( uint64 ) * 8) -1 )); }
uint branchless_sel( uint r1, uint r2, uint val ) { return r1*(1-val) + r2*val; }
uint max_branchless( uint r1, uint r2 ) { return branchless_sel(r1, r2, sign(diff(r1, r2)) ); }