Dramatic performance differences: debug vs release

Question

Dramatic performance differences: debug vs release

I have a simple algorithm that converts a bayer image channel (BGGR, RGGB, GBRG, GRBG) to rgb (demosaicing, but without neighbors). In my implementation, I have predefined offset vectors that help me translate the bayer channel index into their corresponding rgb channel indices. The only problem is that I get terrible debugging performance using MSVC11. In the release section, to enter 3264X2540 size, the function ends in ~ 60 ms. For the same input in debugging, the function ends in ~ 20,000 ms. This is more than the difference in the X300, and since some developers run my application in debugging, this is unacceptable.

My code is:

void ConvertBayerToRgbImageDemosaic(int* BayerChannel, int* RgbChannel, int Width, int 

Height, ColorSpace ColorSpace)
{
    int rgbOffsets[4]; //translates color location in Bayer block to it location in RGB block. So R->0, G->1, B->2
    std::vector<int> bayerToRgbOffsets[4]; //the offsets from every color in the Bayer block to (bayer) indices it will be copied to (R,B are copied to all indices, Gr to R and Gb to B).
    //calculate offsets according to color space
    switch (ColorSpace)
    {
    case ColorSpace::BGGR:
            /*
             B G
             G R
            */ 
        rgbOffsets[0] = 2; //B->0
        rgbOffsets[1] = 1; //G->1
        rgbOffsets[2] = 1; //G->1
        rgbOffsets[3] = 0; //R->0
        //B is copied to every pixel in it block
        bayerToRgbOffsets[0].push_back(0);
        bayerToRgbOffsets[0].push_back(1);
        bayerToRgbOffsets[0].push_back(Width);
        bayerToRgbOffsets[0].push_back(Width + 1);
        //Gb is copied to it neighbouring B
        bayerToRgbOffsets[1].push_back(-1);
        bayerToRgbOffsets[1].push_back(0);
        //GR is copied to it neighbouring R
        bayerToRgbOffsets[2].push_back(0);
        bayerToRgbOffsets[2].push_back(1);
        //R is copied to every pixel in it block
        bayerToRgbOffsets[3].push_back(-Width - 1);
        bayerToRgbOffsets[3].push_back(-Width);
        bayerToRgbOffsets[3].push_back(-1);
        bayerToRgbOffsets[3].push_back(0);
        break;
    ... other color spaces
    }

    for (auto row = 0; row < Height; row++)
    {
        for (auto col = 0, bayerIndex = row * Width; col < Width; col++, bayerIndex++)
        {
            auto colorIndex = (row%2)*2 + (col%2); //0...3, For example in BGGR: 0->B, 1->Gb, 2->Gr, 3->R
            //iteration over bayerToRgbOffsets is O(1) since it is either sized 2 or 4.
            std::for_each(bayerToRgbOffsets[colorIndex].begin(), bayerToRgbOffsets[colorIndex].end(), 
                [&](int colorOffset)
                {
                    auto rgbIndex = (bayerIndex + colorOffset) * 3 + rgbOffsets[offset];
                    RgbChannel[rgbIndex] = BayerChannel[bayerIndex];
                });
        }
    }
}

, : (/O2) . for_each for, . , bayer "" rgb ( ), std::vector, debug release ( X2-X3). , std::vector ? , ?

+5

c++ performance visual-studio-2012

eladidan 11 . '13 13:06

2

VS , Disabled (/Od). ( (/O1), (/O2), (/Ox) ). , ...

0

Saqlain 11 . '13 13:43

Roger Rowland · Accepted Answer · 2013-04-11T13:08:45+0000

std::vector, .

MSDN , .

, #define, STL:

#define _HAS_ITERATOR_DEBUGGING 0

, Debug, , , .

Dramatic performance differences: debug vs release

More articles: