Difference between revisions of "Enfuse reference manual"

Revision as of 23:03, 2 March 2009


Overview

Enfuse merges overlapping images using the Mertens-Kautz-Van Reeth exposure fusion algorithm. (1) This is a quick way, for example, to blend differently exposed images into a pleasing output image without producing an intermediate high-dynamic-range (HDR) image that is then tonemapped to a viewable image. This simplified process often gives better results than tonemapping algorithms.

Enfuse can also be used to build extended depth-of-field (DOF) images by blending a focus stack.

The idea is that pixels in the input images are weighted according to qualities such as, for example, proper exposure, good contrast, or high saturation. These weights determine how much a given pixel will contribute to the final image.

A Burt-Adelson multiresolution spline blender (2) is used to combine the images according to the weights. The multiresolution blending ensures that transitions between regions where different images contribute are difficult to spot.

Enfuse uses up to four criteria to judge the quality of a pixel, which Table 1.1 briefly describes.

Exposure: The exposure criterion favors pixels with luminance close to the middle of the range. These pixels are considered better exposed than those with high or low luminance levels.
Saturation: The saturation criterion favors highly-saturated pixels. (Note that saturation is only defined for color pixels.)
Contrast: The contrast criterion favors pixels inside a high-contrast neighborhood. Enfuse can use standard deviation, Laplacian magnitude, or a blend of both as local contrast measure.
Entropy: The entropy criterion prefers pixels inside a high-entropy neighborhood. In addition, Enfuse allows the user to mitigate the problem of noisy images when using entropy weighting by setting a black threshold.

Table 1.1: Enfuse's four weighting criteria.

For the concept of pixel weighting and details on the different weighting functions see Weighting Functions.

Adjust how much importance is given to each criterion by setting the weight parameters on the command line. For example, if you set ‘--wExposure=1.0’ and ‘--wSaturation=0.5’, Enfuse will favor well-exposed pixels over highly-saturated pixels when blending the source images. The effect of these parameters on the final result will not always be clear in advance. The quality of the result is subject to your artistic interpretation. Playing with the weights may or may not give a more pleasing result. The authors encourage you to experiment, perhaps using down-sized (3) or cropped images for speed.

Enfuse expects but does not require each input image to have an alpha channel. By setting the alpha values of pixels to zero, users can manually remove those pixels from consideration when blending. If an input image lacks an alpha channel, Enfuse will issue a warning and continue assuming all pixels should contribute to the final output. Any alpha value other than zero is interpreted as “this pixel should contribute to the final image”.

Find out more about Enfuse on its webpage.

Invocation

enfuse [OPTIONS] -o OUTPUT-FILE INPUT-FILES....

Fuse the sequence of images INPUT-FILES… into OUTPUT-FILE.

Common Options

Common options control some overall features of Enfuse.

--compression=COMPRESSION: Write a compressed output file. Depending on the output file format, Enfuse accepts different values for COMPRESSION.

JPEG: COMPRESSION is a JPEG quality level ranging from 0–100.
TIFF: COMPRESSION is one of the keywords:
‘NONE’: Do not compress. This is the default.
‘DEFLATE’: Use the Deflate compression scheme, also called ZIP-in-TIFF. Deflate is a lossless data compression algorithm that uses a combination of the LZ77 algorithm and Huffman coding.
‘LZW’: Use the Lempel-Ziv-Welch (LZW) adaptive compression scheme. LZW compression is lossless.
‘PACKBITS’: Use the PackBits compression scheme. PackBits is a particular variant of run-length compression; it is lossless.
Any other format: Other formats like PNG do not accept a COMPRESSION setting.

-h
--help: Print information on the available options then exit.
-l LEVELS: Use exactly this many LEVELS for pyramid blending. This option trades off quality of results for slightly shorter execution time and lower memory usage. The default is to use as many levels as possible given the size of the overlap region. Enfuse may still use a smaller number of levels if the geometry of the images demands it.
-o OUTPUT-FILE: Required option that specifies the name of the OUTPUT-FILE.
-v
--verbose: Increase the verbosity of progress reporting. Giving one or more ‘-v’ options will make Enfuse more verbose.
-V
--version: Output information on the Enfuse version and some configuration details like extra features that have been compiled in.
-w: Blend around the ±180° boundary. This is useful for full 360° panoramas. Version 3.2-cvs of Enfuse, the one described here, blends neither zenith nor nadir, so you may still see some seams in these areas.

Extended Options

Extended options control the image cache, the color model, and the cropping of the output image.

-b BLOCKSIZE: Set the BLOCKSIZE in kilobytes (KB) of Enfuse's image cache. This is the amount of data that Enfuse will move to and from the disk in one go. The default is 2048 KB, which should be adequate for most systems. See Tuning Memory Usage for details.
-c: Use the CIECAM02 color appearance model for blending colors. The input files should have embedded ICC profiles when this option is specified. If no ICC profile is present, Enfuse will assume that the image uses the sRGB color space. The difference between this option and Enfuse's default color blending algorithm is slight, and will only be noticeable when areas of different primary colors are blended together.
-d
--depth=DEPTH: Force the number of bits per channel and the numeric format of the output image. All DEPTH specifications are valid in lowercase as well as uppercase letters.

For integer format use

8, uint8: Unsigned 8 bit; range: 0..255
int16: Signed 16 bit; range: -32768..32767
16, uint16: Unsigned 16 bit; range: 0..65535
int32: Signed 32 bit; range: -2147483648..2147483647
32, uint32: Unsigned 32 bit; range: 0..4294967295

For floating-point format use

r32, real32, float: IEEE754 single precision floating-point, 32 bits wide, 24 bit significand
- Minimal normalized value: 1.2 × 10^-38
- Epsilon: 1.2 × 10^-7
- Maximal finite value: 3.4 × 10^38
r64, real64, double: IEEE754 double precision floating-point, 64 bits wide, 53 bit significand
- Minimal normalized value: 2.2 × 10^-308
- Epsilon: 2.2 × 10^-16
- Maximal finite value: 1.8 × 10^308

If the requested DEPTH is not supported by the output file format, Enfuse warns and chooses the DEPTH that matches best.

-g: Save alpha channel as “associated”. Gimp (before version 2.0) and CinePaint (see section Helpful Additional Programs) exhibit unusual behavior when loading images with unassociated alpha channels. Use option ‘-g’ to work around this problem. With this flag Enfuse will create the output image with the associated alpha tag set, even though the image is really unassociated alpha.
-f WIDTHxHEIGHT
-f WIDTHxHEIGHT+xX-OFFSET+yY-OFFSET: Set the size of the output image manually to WIDTH×HEIGHT. Optionally specify the X-OFFSET and Y-OFFSET, too. This option is useful when the input images are cropped TIFF files, such as those produced by nona. The stitcher nona is part of Hugin. See section Helpful Additional Programs.
-m CACHESIZE: Set the CACHESIZE in megabytes (MB) of Enfuse's image cache. This is the amount of memory Enfuse will use for storing image data before swapping to disk. The default is 1024 MB, which is good for systems with 3–4 gigabytes (GB) of RAM. See Tuning Memory Usage for details.

Fusion Options

Fusion options define the proportion to which each input image's pixel contributes to the output image.

--wContrast=WEIGHT: Sets the relative WEIGHT of high-contrast pixels. Default: 0.0. Valid range: 0 ≤ WEIGHT ≤ 1. See Contrast Weighting and Option ContrastWindowSize.
--wEntropy=WEIGHT: Sets the relative WEIGHT of high entropy pixels. Default: 0.0. Valid range: 0 ≤ WEIGHT ≤ 1. See Entropy Weighting and Options EntropyWindowSize and EntropyCutoff.
--wExposure=WEIGHT: Sets the relative WEIGHT of the well-exposedness criterion. Increasing this weight relative to the others will make well-exposed pixels contribute more to the final output. Default: 1.0. Valid range: 0 ≤ WEIGHT ≤ 1. See section Exposure Weighting.
--wMu=MEAN: Set the MEAN (that is, the center) of the Gaussian exposure weight curve. Default: 0.5. Valid range: 0 ≤ MEAN ≤ 1. Use this option to fine-tune exposure weighting (see section Exposure Weighting).
--wSaturation=WEIGHT: Sets the relative WEIGHT of high-saturation pixels. Increasing this weight makes pixels with high saturation contribute more to the final output. Default: 0.2. Valid range: 0 ≤ WEIGHT ≤ 1. Saturation weighting is only defined for color images. See section Saturation Weighting.
--wSigma=STD-DEV: Standard deviation STD-DEV of the Gaussian exposure weight curve. Default: 0.2. Low numbers give less weight to pixels that are far from ‘--wMu’ and vice versa. Valid range: 0 ≤ STD-DEV ≤ 1. Use this option to fine-tune exposure weighting (see section Exposure Weighting).

Expert Options

Expert options influence the inner workings of Enfuse. Read the relevant sections of this manual before applying them.

--ContrastWindowSize=SIZE: Set the window SIZE for local contrast analysis. The window will be a square of SIZE×SIZE pixels. For contrast analysis SIZE values larger than 5 might result in a blurry composite image. Values of 3 and 5 have given good results on focus stacks. Valid range: SIZE ≥ 3. Given an even SIZE Enfuse will automatically use the next odd number. See also option ‘--wContrast’ and ‘--HardMask’ below.
--EdgeScale=EDGE-SCALE
--EdgeScale=EDGE-SCALE:LCE-SCALE:LCE-FACTOR: A non-zero value for EDGE-SCALE switches on the Laplacian-of-Gaussian (LoG) edge detection algorithm. EDGE-SCALE is the radius of the Gaussian used in the search for edges. Default: 0 pixels. A positive LCE-SCALE turns on local contrast enhancement (LCE) before the LoG edge detection. LCE-SCALE is the radius of the Gaussian used in the enhancement step, LCE-FACTOR is the weight factor (“strength”): $enhanced=(1+LCE-FACTOR)*original-LCE-FACTOR*GaussianSmooth(original,LCE-SCALE)$. LCE-SCALE defaults to 0 pixels and LCE-FACTOR defaults to 0. Append ‘%’ to LCE-SCALE to specify the radius as a percentage of EDGE-SCALE. Append ‘%’ to LCE-FACTOR to specify the weight as a percentage.
--EntropyCutoff=LOWER-CUTOFF
--EntropyCutoff=LOWER-CUTOFF:UPPER-CUTOFF: The first form defines the lower cutoff value below which pixels are treated as pure black when calculating the local entropy. The second form also defines the upper cutoff value above which pixels are treated as pure white. For color images LOWER-CUTOFF and UPPER-CUTOFF are applied separately and independently to each channel. Defaults: 0% for LOWER-CUTOFF and 100% for UPPER-CUTOFF, that is, all pixels' values are taken into account. Append a ‘%’ to specify the cutoff relative to the maximum pixel value in the source image (for example 255 or 65535). Figure 2.1 shows an example.

Figure 2.1: Linear lightness Y in comparison with an entropy-cutoff function for LOWER-CUTOFF = 5% and UPPER-CUTOFF = 90% which are rather extreme values. Note that a high LOWER-CUTOFF value lightens the resulting image, as dark (and presumably noisy) pixels are averaged with equal weights. With ‘--EntropyCutoff=0’, the default, on the other hand, “noise” might be interpreted as high entropy and the noisy pixels get a high weight, which in turn renders the resulting image darker. Analogously, a low UPPER-CUTOFF darkens the output image.
--EntropyWindowSize=SIZE: Window SIZE for local entropy analysis. The window will be a square of SIZE×SIZE pixels. In the entropy calculation SIZE values of 3 to 7 yield an acceptable compromise between the locality of the information and the significance of the local entropy value itself. Valid range: SIZE ≥ 3. Given an even SIZE Enfuse will automatically use the next odd number.
--GrayProjector=PROJECTOR: Use gray projector PROJECTOR for conversion of RGB images to grayscale masks: (R, G, B) → Y. In version 3.2-cvs of Enfuse, the option is effective for exposure weighting and contrast weighting. Default: ‘average’. Valid values for PROJECTOR are:
average: Average red, green, and blue channel with equal weights. This is the default and it often is a good projector for gamma = 1 data: $Y={(R+G+B)}/{3}$
l-star: Use the L-channel of the L*a*b*-conversion of the image as its grayscale representation. This is a useful projector for gamma = 1 data. It reveals minute contrast variations even in the shadows and the highlights. This projector is computationally expensive. See Wikipedia for a detailed description of the Lab color space.
lightness: Compute the lightness of each RGB pixel as in a Hue-Saturation-Lightness (HSL) conversion of the image: $Y={(max(R,G,B)+min(R,G,B))}/{2}$
value: Take the Value-channel of the Hue-Saturation-Value (HSV) conversion of the image: $Y=max(R,G,B)$
luminance: Use the weighted average of the RGB pixel's channels as defined by CIE (“Commission Internationale de l'Éclairage”) and the JPEG standard: $Y=0.30*R+0.59*G+0.11*B$
channel-mixer:RED-WEIGHT:GREEN-WEIGHT:BLUE-WEIGHT: Weight the channels as given: $Y=RED-WEIGHT*R+GREEN-WEIGHT*G+BLUE-WEIGHT*B$. The weights are automatically normalized to one, so

 --GrayProjector=channel-mixer:0.25:0.5:0.25
 --GrayProjector=channel-mixer:1:2:1
 --GrayProjector=channel-mixer:25:50:25

all define the same mixer configuration. The three weights RED-WEIGHT, GREEN-WEIGHT, and BLUE-WEIGHT define the relative weight of the respective color channel.
--HardMask: Force hard blend masks on the finest scale. This is the opposite flag of ‘--SoftMask’. This blending mode avoids averaging of fine details (only) at the expense of increasing the noise. However, it considerably improves the sharpness of focus stacks. Blending with hard masks has proven useful only with focus stacks. See also option ‘--wContrast’ and ‘--ContrastWindowSize’ above.
--MinCurvature=CURVATURE: Define the minimum CURVATURE for the LoG edge detection. Default: 0. Append a ‘%’ to specify the minimum curvature relative to the maximum pixel value in the source image (for example 255 or 65535). A positive value makes Enfuse use the local contrast data (controlled with ‘--ContrastWindowSize’) for curvatures less than CURVATURE and LoG data for values above it. A negative value truncates all curvatures less than -CURVATURE to zero. Values above CURVATURE are left unchanged. This effectively suppresses weak edges.
--SoftMask: Consider all masks when fusing. This is the default.
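
The projector formulas listed under ‘--GrayProjector’ above translate almost directly into code. The following Python sketch is only an illustration, not Enfuse's implementation: the function names are hypothetical, channel values are assumed to be normalized to the range 0..1, and ‘l-star’ is left out because this manual does not give its formula.

```python
def normalize_mixer(red, green, blue):
    """Scale three channel weights so that they sum to one."""
    total = red + green + blue
    if total <= 0:
        raise ValueError("channel weights must have a positive sum")
    return red / total, green / total, blue / total


def gray_project(r, g, b, projector="average", mixer=(1.0, 1.0, 1.0)):
    """Map normalized R, G, B values to a gray value Y (hypothetical helper)."""
    if projector == "average":
        return (r + g + b) / 3.0
    if projector == "lightness":
        return (max(r, g, b) + min(r, g, b)) / 2.0
    if projector == "value":
        return max(r, g, b)
    if projector == "luminance":
        return 0.30 * r + 0.59 * g + 0.11 * b
    if projector == "channel-mixer":
        wr, wg, wb = normalize_mixer(*mixer)
        return wr * r + wg * g + wb * b
    raise ValueError("unknown projector: " + projector)
```

With this sketch the mixer settings 0.25:0.5:0.25, 1:2:1, and 25:50:25 all normalize to the same triple of weights, which mirrors the equivalence stated for ‘channel-mixer’.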

Option Delimiters

Enfuse allows the arguments supplied to the program's options to be separated by different separators. The online documentation and this manual, however, exclusively use the colon ‘:’ in every syntax definition and in all examples. Valid delimiters are the comma ‘,’, the semicolon ‘;’, the colon ‘:’, and the slash ‘/’. All delimiters may be mixed within any option.

Examples:

‘--EdgeScale=0.667:6.67:3.5’: Separate all arguments with colons.
‘--EdgeScale=0.667,6.67,3.5’: Use commas.
‘--EdgeScale=0.667;6.67/3.5’: Mix semicolon and slash in weird ways.
‘--EntropyCutoff=3%/99%’: All delimiters also work in conjunction with percentages.
‘--GrayProjector=channel-mixer:3/6/1’: Separate arguments with a colon and two slashes.
‘--GrayProjector=channel-mixer/30;60,10’: Go wild and Enfuse will understand.
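
To make the delimiter rule concrete, here is a minimal Python sketch that splits an argument string at any of the four delimiters. It only illustrates the rule as stated above; the function name is hypothetical and Enfuse's own parser may treat whitespace or empty fields differently.

```python
import re


def split_option_arguments(argument_string):
    """Split an option's argument string at any of the delimiters , ; : /"""
    return re.split(r"[,;:/]", argument_string)
```

Applied to the examples above, ‘0.667;6.67/3.5’ yields three fields and ‘channel-mixer/30;60,10’ yields four.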

Weighting Functions

As noted in the Overview, Enfuse supports four different types of weighting. The following subsections describe the concept of weighting and all weighting functions in detail.

Weighting Pixels

Image fusion maps each pixel P(i, x, y) of every input image i to a single pixel Q(x, y) in the output image:
$P(i,x,y)\rightarrow Q(x,y),$ where x runs from 1 to the common width of the images, y from 1 to the common height, and i from 1 to the number of input images n.

Enfuse allows for weighting the contribution of each P(i, x, y) to the final Q(x, y):
$w(P(1,x,y))*P(1,x,y)+...+w(P(n,x,y))*P(n,x,y)\rightarrow Q(x,y),$ (W) where

each w is non-negative to yield a physical intensity and
the sum of all w is one to leave the total intensity unchanged.

The pixel weights w themselves are weighted sums with the same constraints

$w(P)=w_{exp}*f_{exp}(P)+w_{sat}*f_{sat}(P)+w_{cont}*f_{cont}(P,r_{cont})+w_{ent}*f_{ent}(P,r_{ent}),$

where we have abbreviated P(i, x, y) to P for simplicity. The user defines the constants $w_{exp}$, $w_{sat}$, $w_{cont}$, and $w_{ent}$ with the options ‘--wExposure’, ‘--wSaturation’, ‘--wContrast’, and ‘--wEntropy’ respectively. The functions $f_{exp}$, $f_{sat}$, $f_{cont}$, and $f_{ent}$ along with the window sizes $r_{cont}$ and $r_{ent}$ are explained in the next sections.

Weighted Average

By default, Enfuse uses a weighted average, where each pixel contributes as much as its weight demands. Of course the weights can be extreme, favouring only a few pixels or even only one pixel in the input stack. Extremes are untypical, however.

Equal weights are another extreme that turns (W) into an arithmetic average. This is why we sometimes speak of the “averaging property” of this weighting algorithm, which, for example, smooths out noise.
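
As an illustration of the weighted average, the following Python sketch fuses the pixel values found at one position (x, y) across all input images. It is a simplified model under stated assumptions: the function name is hypothetical, the raw weights are normalized here so that they sum to one, and the multiresolution blending that Enfuse applies is ignored.

```python
def fuse_pixel(values, weights):
    """Weighted average of one pixel position across n input images.

    values  -- pixel values P(1..n, x, y) at one position
    weights -- non-negative raw weights, one per input image
    """
    if len(values) != len(weights):
        raise ValueError("need exactly one weight per value")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    total = sum(weights)
    if total == 0:
        raise ValueError("at least one weight must be positive")
    # Normalize so that the weights sum to one and the total intensity
    # is left unchanged.
    return sum(w / total * v for v, w in zip(values, weights))
```

With equal weights the result is the arithmetic mean of the values, which is the averaging property mentioned above.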

Disabling Averaging: Option ‘--HardMask’

The weighted average computation as described above has proven to be widely successful with the exception of one special case: focus stacking (see section Focus Stacks – Depth-of-Field Increase), where the averaging noticeably softens the final image.

Use ‘--HardMask’ to switch Enfuse into a different (“Super Trouper”) weighting mode, where the pixel with the highest weight wins, that is, gets weight one, and all other pixels get the weight of zero (“The Winner Takes It All.”). With ‘--HardMask’ Equation (W) becomes $P(i,x,y)\rightarrow Q(x,y)$, where $w(P(i,x,y)) \ge w(P(j,x,y))$ for all $1 \le j \le n$.

Note that this “averaging” scheme lacks the nice noise-reduction property of the weighted average (W), because only a single input pixel contributes to the output.
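
The winner-takes-all rule can be sketched in a few lines of Python. This is an illustrative model only: the function name is hypothetical, and the manual does not say how ties between equal weights are resolved, so the sketch simply picks the first image with the maximum weight.

```python
def fuse_pixel_hard(values, weights):
    """Return the value of the input pixel with the highest weight."""
    if not values or len(values) != len(weights):
        raise ValueError("need a non-empty list with one weight per value")
    # Assumption: on ties, the first image with the maximum weight wins.
    winner = max(range(len(values)), key=lambda i: weights[i])
    return values[winner]
```

Because exactly one input value is copied to the output, no noise is averaged away.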

Exposure Weighting

Exposure weighting prefers pixels with a luminance Y close to the center of the normalized, real-valued luminance interval [0, 1].

RGB-pixels get converted to luminance using the grayscale projector given by ‘--GrayProjector’, which defaults to average. Grayscale pixels are identified with luminance.

In the normalized luminance interval 0.0 represents pure black and 1.0 represents pure white independently of the data type of the input image. That is, for a JPEG image the luminance 255 maps to 1.0 in the normalized interval and for a 32-bit TIFF picture the highest luminance value 4294967295 also maps to 1.0. The middle of the luminance interval, 0.5, is where a neutral gray tone ends up with every camera that had no exposure correction dialed in, for example the image of a gray- or white-card.

The exposure weighting algorithm only looks at a single pixel at a time; the pixel's neighborhood is not taken into account.

The weighting function is the Gaussian $w_{exp}(Y) = \exp\left(-\frac{1}{2}\left(\frac{Y - Mu}{Sigma}\right)^2\right)$ whose center Mu and width Sigma are controlled by the command line options ‘--wMu’ and ‘--wSigma’ respectively. Mu defaults to 0.5, the middle of the luminance interval. Sigma defaults to 0.2. Figure 3.1 shows a Gaussian with these parameters.

Figure 3.1: Gaussian function with Enfuse's default parameters Mu = 0.5 and Sigma = 0.2.

The options ‘--wMu’ and ‘--wSigma’ are for fine-tuning the final result without changing the set of input images. Option ‘--wMu’ sets the point Mu of optimum exposure. The default is Mu = 0.5, exactly the middle between pure black (0.0) and pure white (1.0). Increasing Mu makes Enfuse prefer lighter pixels, rendering the final image lighter and vice versa. Option ‘--wSigma’ defines the range Sigma of acceptable exposures. The default is Sigma = 0.2, which means that at Mu ± 0.2 the weight is exp(-1/2), approximately 0.61, and at Mu ± 0.1 it is approximately 0.88. Small values of Sigma penalize exposures deviant from Mu more and vice versa.
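
The Gaussian exposure weight is easy to evaluate numerically. The Python sketch below assumes a normalized luminance Y in the interval [0, 1]; the function name is hypothetical, and rejecting Sigma = 0 is a choice of this sketch to avoid a division by zero, not documented Enfuse behavior.

```python
import math


def exposure_weight(y, mu=0.5, sigma=0.2):
    """Gaussian exposure weight of a normalized luminance y."""
    if sigma <= 0:
        raise ValueError("sigma must be positive in this sketch")
    return math.exp(-0.5 * ((y - mu) / sigma) ** 2)
```

Evaluating the function at Mu gives the maximum weight of one; the weight falls off symmetrically on both sides, faster for smaller Sigma.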

Summary of influential options

‘--wExposure’: Fusion Options
‘--wMu’: Fusion Options
‘--wSigma’: Fusion Options
‘--GrayProjector’: Expert Options

Saturation Weighting

Saturation weighting prefers pixels with a high saturation.

Enfuse computes the saturation of a pixel according to the following algorithm.

max := maximum(R, G, B)
min := minimum(R, G, B)
if max = min then
    saturation := 0
else
    sum := max + min
    difference := max - min
    if sum <= 1 then
        saturation := difference / sum
    else
        saturation := difference / (2 - sum)
    end if
end if

Obviously, saturation weighting can only be defined for RGB images, not for grayscale ones! If you need something similar check out Entropy Weighting; entropy weighting works for both RGB and grayscale pictures.

The saturation weighting algorithm only looks at a single pixel at a time; the pixel's neighborhood is not taken into account.
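
For readers who want to experiment, the saturation pseudocode above can be transcribed into Python as follows. The sketch assumes channel values normalized to the range 0..1, which the term 2 - sum in the pseudocode suggests; the function name is hypothetical.

```python
def saturation(r, g, b):
    """Saturation of a pixel with normalized channels in the range 0..1."""
    maximum = max(r, g, b)
    minimum = min(r, g, b)
    if maximum == minimum:
        # Gray pixel: all channels are equal.
        return 0.0
    total = maximum + minimum
    difference = maximum - minimum
    if total <= 1.0:
        return difference / total
    return difference / (2.0 - total)
```

A pure primary such as (1, 0, 0) reaches the maximum saturation of one, whereas any gray pixel has saturation zero.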

Summary of influential options

‘--wSaturation’: Fusion Options

Contrast Weighting

Contrast weighting favors pixels inside a high contrast neighborhood. The notion of “high contrast” is defined either by two different criteria or by a blend of both:

The standard deviation (SDev) of all the pixels in the local analysis window is large. See section Standard Deviation.
The Laplacian-of-Gaussian (LoG) has a large magnitude. See section Laplacian of Gaussian.
If the LoG magnitude is below a given threshold, use SDev data, otherwise stick with LoG. See section Blend Standard Deviation and Laplacian of Gaussian.

Enfuse converts every RGB image to grayscale before it determines its contrast. Option ‘--GrayProjector’ (see section Expert Options) controls the projector function. Depending on the subject one of several grayscale projectors may yield the best black-and-white contrast for image fusion.

In the following sections we describe each algorithm in detail.

Standard Deviation

The pixel under consideration C sits exactly in the center of a square, the so-called local analysis window. It always has an odd edge length. The user sets the size with option ‘--ContrastWindowSize’. Figure 3.2 shows two windows with different sizes.

Figure 3.2: Examples of local analysis windows for the sizes 3 and 5. “C” marks the center where the pixel gets the weight. “N” are neighboring pixels, which all contribute equally to the weight.

During the analysis Enfuse scans the local analysis window across all rows and all columns (4) of each of the input images to compute the contrast weight of every pixel.

Summary of influential options

‘--wContrast’: Fusion Options
‘--HardMask’: Expert Options
‘--ContrastWindowSize’: Expert Options
‘--GrayProjector’: Expert Options

Statistical Moments

We start with the probability function w of the random variable X: $w: x \rightarrow p(\{\omega : X(\omega) = x\})$. It associates a probability p with each of the n different possible outcomes $\omega$ of the random variable X. Based on w, we define the expectation value or “First Moment” of the random variable X: $Ex\,X := \sum_{i=1}^{n} x_i \times w(x_i)$. Using the definition of the expectation value, we define the variance, or “Second Moment” as $Var\,X := Ex((X - Ex\,X)^2)$ and the standard deviation as $\sigma\,X := \sqrt{Var\,X}$. Obviously, the variance of X is the expectation value of the squared deviation from the expectation value of X itself. Note that the variance's dimension is X's dimension squared; the standard deviation rectifies the dimension to make it comparable with X itself again.

Estimators

In Enfuse, we assume that X follows a uniform probability function w(x) = const. That is, all pixel values in the local analysis window are considered to be equally probable. Thus, the expectation value and the variance can be estimated from the pixel values like this: $Ex\,X := \frac{1}{n}\sum_{i=1}^{n} x_i$. In other words: the expectation value is the arithmetic mean of the lightness of all pixels in the local analysis window. Analogously, the variance becomes $Var\,X := \frac{1}{n-1}\sum_{i=1}^{n}(x_i - Ex\,X)^2$.
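
The estimators can be tried out with a short Python sketch that computes the standard deviation inside one local analysis window. Several points are assumptions of this sketch rather than statements about Enfuse: the function name is hypothetical, the image is a plain list of rows of gray values, the variance uses the divisor n - 1, and windows that reach beyond the image border are rejected because the manual does not describe the border treatment.

```python
import math


def window_standard_deviation(image, row, col, size=5):
    """Standard deviation of the gray values in a size x size window."""
    if size < 3 or size % 2 == 0:
        raise ValueError("size must be an odd number >= 3")
    half = size // 2
    if (row - half < 0 or col - half < 0
            or row + half >= len(image) or col + half >= len(image[0])):
        raise ValueError("window must lie completely inside the image")
    samples = [image[r][c]
               for r in range(row - half, row + half + 1)
               for c in range(col - half, col + half + 1)]
    n = len(samples)
    mean = sum(samples) / n                       # estimate of Ex X
    variance = sum((x - mean) ** 2 for x in samples) / (n - 1)
    return math.sqrt(variance)
```

A flat window yields zero, while a window containing a single outlier yields a large value, which is why this measure responds to local contrast.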

Laplacian of Gaussian

The Laplacian of Gaussian (LoG) is an operator to detect edges in an image. Sometimes the LoG-operator is also called Marr-Hildreth operator. A Laplacian-of-Gaussian operator, [http://kogs-www.informatik.uni-hamburg.de/~koethe/vigra/doc/vigra/group__CommonConvolutionFilters.html vigra::laplacianOfGaussian], is part of the package Vigra that Enfuse is built upon and is used for edge detection if option ‘--EdgeScale’ is non-zero and ‘--MinCurvature’ equal to or less than zero.

Let the Gaussian function be $g(x,y) = \frac{1}{2\pi\sigma^2}\exp\left(-\frac{x^2+y^2}{2\sigma^2}\right).$ The parameter $\sigma$, the argument of option ‘--EdgeScale’, is the length scale on which edges are detected by g(x, y). We apply the Laplacian operator in Cartesian coordinates $\Delta \equiv \nabla\cdot\nabla = \frac{\partial^2}{\partial x^2}+\frac{\partial^2}{\partial y^2}$ to g(x, y), to arrive at a continuous representation of the two-dimensional filter kernel $k(x,y) = \frac{\xi^2-1}{\pi\sigma^4}\exp(-\xi^2),$ where we have used the dimensionless distance $\xi$ from the origin $\xi^2 = \frac{x^2+y^2}{2\sigma^2}.$ Enfuse uses a discrete approximation of k in the convolution with the image. The operator is radially symmetric with respect to the origin, which is why we can easily plot it in Figure 3.3, setting $R = \sqrt{x^2+y^2}$.

Figure 3.3: Laplacian-of-Gaussian function for $\sigma$ = 0.5.
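
To get a feeling for the shape of the kernel, the continuous Laplacian-of-Gaussian function can be evaluated directly. The Python sketch below takes the length scale (the argument of ‘--EdgeScale’) as the parameter scale and uses the dimensionless squared distance (x² + y²) / (2 · scale²); the function name is hypothetical and the sketch does not reproduce the discrete approximation that Enfuse actually convolves with the image.

```python
import math


def log_kernel(x, y, scale):
    """Continuous Laplacian-of-Gaussian kernel at the point (x, y)."""
    if scale <= 0:
        raise ValueError("scale must be positive")
    xi_squared = (x * x + y * y) / (2.0 * scale * scale)
    return (xi_squared - 1.0) / (math.pi * scale ** 4) * math.exp(-xi_squared)
```

The kernel is negative at the origin, crosses zero on the circle of radius √2 · scale, and is positive but rapidly decaying outside that circle.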

Blend Standard Deviation and Laplacian of Gaussian

Enfuse can team the standard deviation computation and Laplacian of Gaussian to deliver the best of both methods. Use a positive argument CURVATURE with option ‘--MinCurvature’ to combine both algorithms. In this mode of operation Enfuse computes the SDev-weight and the LoG-weight, then uses the LoG to decide whether to go with that value or prefer the SDev data. If the LoG is greater than CURVATURE Enfuse uses the weight delivered by the LoG, otherwise the SDev-weight is rescaled such that its maximum is equal to CURVATURE and the scaled SDev is used as weight.

This technique merges the two edge detection methods where they are best. The LoG excels with clear edges and cannot be fooled by strong but smooth gradients. However, it is bad at detecting faint edges and it is susceptible to noise. The SDev on the other hand shines with even the most marginal edges and resists noise quite well. Its weakness is that it is easily deceived by strong and smooth gradients. By tuning CURVATURE the user can pick the best threshold for a given set of images.
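
The selection rule for a positive CURVATURE can be sketched as follows. This Python fragment is a simplified reading of the description above, not Enfuse's code: the function name is hypothetical, both inputs are flat lists of per-pixel values, and the maximum used for rescaling the SDev data is assumed to be taken over the whole list.

```python
def blend_contrast_weights(sdev, log_magnitude, curvature):
    """Combine SDev and LoG contrast values pixel by pixel."""
    if curvature <= 0:
        raise ValueError("this mode requires a positive curvature")
    if len(sdev) != len(log_magnitude):
        raise ValueError("both inputs must have the same length")
    max_sdev = max(sdev, default=0.0)
    # Rescale the SDev data so that its maximum equals the curvature.
    scale = curvature / max_sdev if max_sdev > 0 else 0.0
    return [log if log > curvature else s * scale
            for s, log in zip(sdev, log_magnitude)]
```

With this rule the rescaled SDev values never exceed CURVATURE, so a pixel with a clear LoG response always outweighs a pixel that falls back to SDev data.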

Summary of influential options

‘--wContrast’: Fusion Options
‘--HardMask’: Expert Options
‘--ContrastWindowSize’: Expert Options
‘--GrayProjector’: Expert Options
‘--EdgeScale’: Expert Options
‘--MinCurvature’: Expert Options

Scaling and Choice of Mode

Experience has shown that neither the parameters EDGE-SCALE and CURVATURE nor the mode of operation (SDev-only, LoG-only, or a blend of both) scales to different image sizes. In practice this means that if you start with a set of reduced size images, say 2808×1872 pixels, carefully optimize EDGE-SCALE, CURVATURE and so on, moreover find LoG-only the best mode and then switch to the original resolution of 5616×3744 pixels, multiplying (or dividing) the parameters by four and sticking to LoG-only might not result in the best fused image. For best quality perform the parameter optimization and the search for the most appropriate mode at the final resolution.

Entropy Weighting

Entropy weighting prefers pixels inside a high entropy neighborhood.

Let S be an n-ary source. Watching the output of S, an observer on average gains the information $H_a(n) := \sum_{x \in S} p(x) \times \log_a(1/p(x))$ per emitted message, where we assume the knowledge of the probability function p(S). The expectation value $H_a(n)$ is called entropy of the source S. Entropy measures our uncertainty if we are to guess which message gets chosen by the source in the future. The unit of the entropy depends on the choice of the constant a > 1. Obviously $H_b(n) = H_a(n) / \log_a(b)$ holds for all b > 1. We use a = 2 for entropy weighting and set the entropy of the “impossible message” to zero according to $\lim_{p \rightarrow 0} p \times \log_a(1/p) = 0$. Figure 3.4 shows an entropy function.

Figure 3.4: Entropy function H for an experiment with exactly two outcomes.

For more on (information) entropy visit Wikipedia.

Enfuse computes a pixel's entropy by considering the pixel itself and its surrounding pixels quite similar to Contrast Weighting. The size of the window is set by ‘--EntropyWindowSize’. Choosing the right size is difficult, because there is a serious tradeoff between the locality of the data and the size of the sample used to compute H. A large window results in a large sample size and therefore in a reliable entropy, but considering pixels far away from the center degrades H into a non-local measure. For small windows the opposite holds true.

Another difficulty arises from the use of entropy as a weighting function in dark parts of an image, this is, in areas where the signal-to-noise ratio is low. Without any precautions high noise is taken to be high entropy, which might not be desired. Use option ‘--EntropyCutoff’ to control the black level when computing the entropy.

On the other extreme side of lightness, very light parts of an image, the sensor might already have overflown without the signal reaching 1.0 in the normalized luminance interval. For these pixels the entropy is zero and Enfuse can be told of the threshold by properly setting the second argument of ‘--EntropyCutoff’.
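
The interplay of local entropy and the two cutoffs can be illustrated with a small Python sketch. It rests on several assumptions that the manual does not spell out: the function names are hypothetical, pixel values are normalized to 0..1, the cutoffs are given as fractions of that range, and every distinct value in the window counts as its own message, whereas Enfuse may bin the values into a histogram.

```python
import math
from collections import Counter


def apply_cutoff(value, lower=0.0, upper=1.0):
    """Treat values below lower as pure black and above upper as pure white."""
    if value < lower:
        return 0.0
    if value > upper:
        return 1.0
    return value


def window_entropy(samples, lower=0.0, upper=1.0):
    """Entropy to base 2 of the pixel values inside one local window."""
    if not samples:
        raise ValueError("the window must contain at least one pixel")
    clipped = [apply_cutoff(v, lower, upper) for v in samples]
    n = len(clipped)
    counts = Counter(clipped)
    # Sum of p * log2(1 / p) over all values that occur in the window.
    return sum((c / n) * math.log2(n / c) for c in counts.values())
```

In this model four different dark values carry two bits of entropy without a cutoff, but a lower cutoff above all of them maps the window to uniform black with zero entropy, which is the noise suppression described above.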

Summary of influential options

‘--wEntropy’: Fusion Options
‘--EntropyWindowSize’: Expert Options
‘--EntropyCutoff’: Expert Options

Understanding Masks

A binary mask indicates for every pixel of an image if this pixel must be considered in further processing or ignored. For a weight mask, the value of the mask determines how much the pixel contributes, zero again meaning “no contribution”.

Masks arise in two places: as part of the input files and as separate files, showing the actual pixel weights prior to image blending or fusion. We shall explore both occurrences in the next sections.

Masks in Input Files

Each of the input files for Enfuse and Enblend can contain its own mask. Both applications interpret them as binary masks no matter how many bits per image pixel they contain.

Use ImageMagick's identify or, for TIFF files, tiffinfo to inquire quickly whether a file contains a mask. Helpful Additional Programs shows where to find these programs on the web.

$ identify -format "%f %m %wx%h %r %q-bit" remapped-0000.tif
remapped-0000.tif TIFF 800x533 DirectClassRGBMatte 8-bit
                                             ^^^^^ mask

$ tiffinfo remapped-0000.tif
TIFF Directory at offset 0x1a398a (1718666)
  Subfile Type: (0 = 0x0)
  Image Width: 800 Image Length: 533
  Resolution: 150, 150 pixels/inch
  Position: 0, 0
  Bits/Sample: 8
  Sample Format: unsigned integer
  Compression Scheme: PackBits
  Photometric Interpretation: RGB color
  Extra Samples: 1<unassoc-alpha>            <<<<< mask
  Orientation: row 0 top, col 0 lhs
  Samples/Pixel: 4                           <<<<< R, G, B, and mask
  Rows/Strip: 327
  Planar Configuration: single image plane

The “Matte” part of the image class and the “Extra Samples” line tell us that the file features a mask. Also, many interactive image manipulation programs show the mask as a separate channel, sometimes called “Alpha”. There, the white (high mask value) parts of the mask enable pixels and black (low mask value) parts suppress them.

The multitude of terms all describing the concept of a mask is confusing.

Mask: A mask defines a selection of pixels. A value of zero represents an unselected pixel. The maximum value (“white”) represents a selected pixel and the values between zero and the maximum are partially selected pixels. See Gimp-Savvy.
Alpha Channel: The alpha channel stores the transparency value for each pixel, typically in the range from zero to one. A value of zero means the pixel is completely transparent, thus does not contribute to the image. A value of one on the other hand means the pixel is completely opaque.
Matte: The notion “matte” as used by ImageMagick refers to an inverted alpha channel, more precisely: 1 - alpha. See ImageMagick for further explanations.

Enblend and Enfuse only consider pixels that have an associated mask value different from zero. If an input image does not have an alpha channel, Enblend warns and assumes a mask of all non-zero values, that is, it will use every pixel of the input image for fusion.

Stitchers like nona add a mask to their output images.

Sometimes it is helpful to manually modify a mask before fusion, for example to suppress unwanted objects (insects and cars come to mind) that moved across the scene during the exposures. If the masks of all input images are black at a certain position, the output image will have a hole there.
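The interplay of binary input masks and pixel weights can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not Enfuse's implementation: the function name `fuse_pixel`, the plain weighted average, and the use of `None` to stand for a hole are all made up for this example.

```python
def fuse_pixel(values, weights, masks):
    """Fuse one pixel position across a stack of images.

    values  -- pixel value of each input image at this position
    weights -- weight-mask value of each image (0 means "no contribution")
    masks   -- input-mask value of each image; any non-zero value enables the pixel
    Returns the weighted average of the enabled pixels, or None (a "hole")
    if no image contributes at this position.
    """
    total = 0.0
    weight_sum = 0.0
    for value, weight, mask in zip(values, weights, masks):
        if mask == 0:  # binary interpretation: zero suppresses the pixel
            continue
        total += weight * value
        weight_sum += weight
    if weight_sum == 0:
        return None  # every image is masked out or has zero weight
    return total / weight_sum


# The second image is masked out, so only the first and third contribute.
print(fuse_pixel([100, 250, 200], [1.0, 1.0, 3.0], [255, 0, 17]))  # 175.0
# All masks are black at this position: the output has a hole.
print(fuse_pixel([100, 250, 200], [1.0, 1.0, 3.0], [0, 0, 0]))     # None
```

Note that the mask value 17 enables its pixel just as 255 does, which mirrors the binary interpretation of input masks described above.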

Weight Mask Files

...

Tuning Memory Usage

The default configuration of Enfuse assumes a system with 3–4GB of RAM.

If Enfuse has been compiled with the C++-preprocessor symbol ENBLEND_CACHE_IMAGES it uses its own image cache, whose size is user configurable with the option ‘-m CACHE-SIZE’ (see section Extended Options). Furthermore, option ‘-b BUFFER-SIZE’ (see section Extended Options) allows for fine-tuning the size of a single buffer inside the image cache. Note that CACHE-SIZE is given in megabytes, whereas BUFFER-SIZE is given in kilobytes.

Usually the user lets the operating system take care of the memory management of all processes. However, a user of Enfuse might want to control the balance between the operating system's Virtual Memory System and Enfuse's image cache for several reasons.

Paging in or out parts of a process' image runs at kernel level and thus can make user processes appear unresponsive or “jumpy”. The caching mechanism of Enfuse of course runs as a user process, which is why it has less detrimental effects on the system's overall responsiveness.
Enfuse's image cache has been optimized for accesses to image data. And all algorithms in Enfuse have been carefully arranged to play nice with the image cache. An operating system's cache has no knowledge of these particular memory access patterns.
The disk access of the operating system to the swap device has been highly optimized. Enfuse on the other hand uses the standard IO-layer which is a slower interface.
Limiting the amount of image cache prevents Enfuse from eating up most or all RAM, thereby forcing all user applications into the swap device.

The CACHE-SIZE should be set so as to reconcile all of the above aspects even for the biggest data sets (that is, many images and large images).

Suggested cache- and buffer-sizes for different amounts of available RAM.

RAM (MB)	CACHE-SIZE (MB)	BUFFER-SIZE (KB)	Comment
4096	1024	2048	default
2048	512–1024	1024
1024	256–512	256–512

Table 5.1: Suggested cache-size settings
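As a rough sketch, the table can be turned into a small lookup that produces the ‘-m’ and ‘-b’ options. The helper name `suggest_cache_options` is hypothetical, and picking the lower end of each suggested range is a conservative assumption made here, not a recommendation of the manual.

```python
# Rows of Table 5.1 as (RAM in MB, CACHE-SIZE range in MB, BUFFER-SIZE range in KB).
SUGGESTED = [
    (4096, (1024, 1024), (2048, 2048)),
    (2048, (512, 1024), (1024, 1024)),
    (1024, (256, 512), (256, 512)),
]


def suggest_cache_options(ram_mb):
    """Return the '-m' and '-b' options for the largest table row that fits ram_mb.

    The lower end of each range is used (an assumption of this sketch).
    """
    for row_ram, cache_range, buffer_range in SUGGESTED:
        if ram_mb >= row_ram:
            return ["-m", str(cache_range[0]), "-b", str(buffer_range[0])]
    raise ValueError("Table 5.1 has no suggestion below 1024 MB of RAM")


print(suggest_cache_options(2048))  # ['-m', '512', '-b', '1024']
```

The returned list can be appended to an Enfuse command line; whether the options have any effect depends on Enfuse having been compiled with ENBLEND_CACHE_IMAGES.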

Applications of Enfuse

This section describes some of the novel possibilities that Enfuse offers the photographer. In contrast to the previous chapters it centers around the image effects.

What Makes Images Fusable?

Images should align well to be suitable for fusion. However, there is no hard mathematical rule what “well” means. The alignment requirements for 16MPixel images to yield a sharp 4"× 6" print at 300dpi (“dpi” means dots per inch) or even for web presentation are relatively low, whereas the alignment of 8MPixel images for a 12"× 18" print ought to be tight.
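A back-of-the-envelope calculation shows why the two cases differ. The sketch below computes the linear factor by which a source image is scaled to fill a print; the function name `linear_scale` is hypothetical, and the calculation ignores differences in aspect ratio and any cropping.

```python
import math


def linear_scale(megapixels, width_in, height_in, dpi=300):
    """Linear factor by which the source image is scaled to fill the print.

    Values below 1 mean the image is shrunk, values above 1 mean it is enlarged.
    """
    print_pixels = (width_in * dpi) * (height_in * dpi)
    source_pixels = megapixels * 1_000_000
    return math.sqrt(print_pixels / source_pixels)


small_print = linear_scale(16, 4, 6)    # 16 MPixel image on a 4" x 6" print
large_print = linear_scale(8, 12, 18)   # 8 MPixel image on a 12" x 18" print
print(round(small_print, 2), round(large_print, 2))  # 0.37 1.56
```

Under these assumptions a misalignment of one source pixel shrinks to about 0.37 printed dots in the first case, but grows to about 1.56 printed dots in the second.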

If the input images need to be aligned, Hugin (see section Helpful Additional Programs) is the tool of choice. It produces images exactly in the format that Enfuse expects.

Sometimes images naturally align extremely well so that no re-alignment is required. An image series with preprogrammed exposure steps, taken in rapid succession with the camera mounted on a heavy tripod and a humongous ball head, using mirror lockup and a cable release, comes to mind.

When in doubt what will work, try it, and judge for yourself.

Useful ideas for a good alignment:

Fix all camera parameters that are not explicitly varied.
Aperture

Engage full manual (<M>) or aperture-priority (<A>) mode.

Auto-focus

Disable “Auto Focus”. Be aware that the auto-focus function could be linked to shutter-release button position “half pressed” or to the shutter release in insidious ways.

Closed eyepiece

(This applies only to single lens reflex cameras.) Close the eyepiece when using a cable release to suppress variations in stray light.

Exposure time/Shutter speed

Use the shortest possible exposure time or, in other words, use the fastest shutter speed to avoid blur caused by camera shake or motion blur.

Flash power

Explicitly control the flash power of all flashes. This is sometimes called “flash exposure lock”.

Sensitivity

Disable “Auto ISO”.

White balance

Disable “Auto White Balance”. Instead use the most suitable fixed white balance or take the white balance off a white card. When in doubt use the setting “Daylight” or equivalent.
Steady the camera by any means.
- Apply your best camera bracing technique combined with controlled breathing.
- Prefer a monopod or better rigid tripod with a heavy head.
- (This applies to cameras with a moving mirror only.) Engage “mirror lockup”.
- Consider automatic bracketing when applicable.
- Activate camera- or lens-based image stabilization if you are sure that it improves the image quality in your particular case, otherwise disengage the feature. For some lens-based image stabilization systems it is known that they “lock” into different positions every time they are activated. Moreover, some stabilization systems decrease the image quality when the lens is mounted on a tripod.
Fire in rapid succession.

Repetition – Noise Reduction

Main Purpose: Reduce noise

With the default settings Enfuse computes a weighted average of the input pixels. For a series of images, repeated with identical settings this results in a reduction of (photon shot) noise. In other words, the dynamic range increases slightly, because the higher signal-to-noise ratio makes darker shades usable. Furthermore, smooth or glossy surfaces get a “cleaner” look and edges become visually sharper. The nitty-gritty reportage look that sometimes stems from a high sensitivity setting disappears.
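The noise reduction from averaging can be illustrated with a small simulation. The sketch assumes equal weights for all frames and uncorrelated Gaussian noise as a stand-in for photon shot noise; the function name and all parameter values are made up for the example.

```python
import random
from statistics import mean, pstdev


def simulate_noise_reduction(frames=16, pixels=5000, signal=100.0, sigma=10.0, seed=1):
    """Return (noise of one frame, noise of the equal-weight average of all frames)."""
    rng = random.Random(seed)
    stack = [[rng.gauss(signal, sigma) for _ in range(pixels)] for _ in range(frames)]
    # Average the stack pixel by pixel, as a weighted average with equal weights would.
    averaged = [mean(column) for column in zip(*stack)]
    return pstdev(stack[0]), pstdev(averaged)


single, fused = simulate_noise_reduction()
# For uncorrelated noise the ratio is expected to be close to 1/sqrt(16) = 0.25.
print(round(fused / single, 2))
```

In this idealized model, averaging N repeated frames lowers the noise by a factor of about the square root of N, so quadrupling the number of frames halves the noise.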

Averaged images and therefore low-noise images are the base for a multitude of techniques like for example differences. The most prominent method in this class is dark-frame subtraction.

The defaults set ‘--wExposure=1.0’ and ‘--wSaturation=0.2’. Eliminating the saturation component with ‘--wSaturation=0.0’ can be worth an extra run.

Exposure Series – Dynamic Range Increase

Main Purpose: Increase manageable dynamic range

An exposure series is a set of images taken with identical parameters except for the exposure time. Some cameras even provide special functions to automate recording exposure series. See the instruction manual of your model for details.

Enfuse's defaults, ‘--wExposure=1.0’ and ‘--wSaturation=0.2’, are well suited for fusion of color images. Remember that saturation weighting only works for RGB data. Option ‘--wSaturation’ helps to control burnt-out highlights, as these are heavily desaturated. If no image suffers from troublesome highlights, the relative saturation weight can be reduced and even be set to zero. For black and white images ‘--wEntropy’ can be an alternative to ‘--wSaturation’ because it suppresses overexposed pixels, as these contain little information. However, entropy weighting is not limited to grayscale data. It has been successfully applied to RGB images, too. Note that entropy weighting considers each color channel of an RGB image separately and chooses the channel with the minimum entropy as representative for the whole pixel.

Enfuse offers the photographer tremendous flexibility in fusing differently exposed images. Whether you combine only two pictures or a series of 21, Enfuse imposes no limits on you. Accordingly, the photographic effects achieved range from subtle to surreal, like the late 1980s “Max Headroom” TV series, to really unreal. Just as a new developer opened unseen possibilities for artists in the chemical days of photography, exposure fusion extends a photographer's expressive space in the digital age. Whether the results look good or bad, whether the images are dull or exciting is entirely up to the artist.

In the next sections we give assistance to starters and rectify several misconceptions about Enfuse.

Tips For Beginners

Here are some tips to get you in business quickly.

Include the best single exposure.: Include the exposure you would have taken if you did not use Enfuse in your series. It gives you a solid starting point. Think of the other images as augmenting this best single exposure to bring out the light and dark features you would like to see.
Begin with as few images as possible.: Pre-visualizing the results of Enfuse is difficult. The more images enter the fusion process and the wider their EV-spacing is, the more challenging thinking of the output image becomes. Therefore, start off with as few images as possible. You can take a larger series of images and only use part of it.
Start with a moderate EV-spacing.: As has been pointed out in the previous item, a wide EV-spacing makes pre-visualization harder. So set out with a spacing of 2/3EV to 1+1/3EV.

Common Misconceptions

Here are some surprisingly common misconceptions about exposure series.

A single image cannot be the source of an exposure series.: Raw-files in particular lend themselves to being converted multiple times, with the results fused together. The technique is simpler, faster, and usually even looks better than digital blending (as opposed to using a graduated neutral density filter) or blending exposures in an image manipulation program. Moreover, perfect alignment comes free of charge!
An exposure series must feature symmetric exposures.: Twice wrong! Neither do the exposures have to be “symmetric” like {0EV, -2/3EV, +2/3EV}, nor does the number of exposures have to be odd. Series like {-1-1/3EV, -1/3EV, +1/3EV} or {-1EV, 1EV} might be just right. By the way, the order in which the images were taken does not matter either.
An exposure series must cover the whole dynamic range of the scene.: If you do not want to cover the whole range, you do not have to. Some HDR programs require the user to take a light probe, (5) whereas Enfuse offers the user complete freedom of exposure.
All exposure values must be different.: You can repeat any exposure as often as you like. That way you combine an exposure series with parts of Repetition – Noise Reduction, emphasizing the multiply occurring exposures and reducing noise.
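To get a feeling for what such EV offsets mean in terms of shutter speed, the following sketch converts an offset into an exposure time. It relies on the usual photographic convention that +1EV doubles the exposure and assumes that aperture and sensitivity are held fixed; the base time of 1/60s is a hypothetical metered value.

```python
from fractions import Fraction


def exposure_time(base_time, ev_offset):
    """Exposure time in seconds for an offset of ev_offset EV relative to base_time.

    +1 EV doubles the exposure time, -1 EV halves it (aperture and ISO held fixed).
    """
    return base_time * 2 ** float(ev_offset)


base = 1 / 60  # hypothetical metered exposure time in seconds
series = [Fraction(-4, 3), Fraction(-1, 3), Fraction(1, 3)]  # {-1-1/3EV, -1/3EV, +1/3EV}
for ev in series:
    print(f"{float(ev):+.2f} EV -> 1/{1 / exposure_time(base, ev):.0f} s")
```

The asymmetric series from the list above is thus nothing more than three shutter speeds spaced one stop apart and shifted as a whole towards shorter exposures.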

Flash Exposure Series – Directed Lighting

Main Purpose: ???

...

Polarization Series – Saturation Enhancement

Main Purpose: Reflection suppression, saturation enhancement

In the current implementation of Enfuse it is not possible in general to fuse a polarization series. Naïvely abusing ‘--wSaturation’ will not work.

Focus Stacks – Depth-of-Field Increase

Main Purpose: Synthetic Depth-of-Field Increase

A focus stack is a series of images where the distance of the focal plane from the sensor varies. Loosely speaking, the images were focused at different distances. Fusing such a stack increases the depth-of-field (DOF) beyond the physical limits of diffraction.

Why create focus stacks?

Given

a fixed sensor or film size,
a lens' particular focal length, and
a notion about “sharpness”, technically speaking the size of the circle-of-confusion (CoC)

the photographer controls the depth-of-field with the aperture. Smaller apertures – that is, larger aperture numbers – increase the DOF and vice versa. However, smaller apertures increase diffraction which in turn renders the image unsharp. So, there is an optimum aperture where the photographer gets maximum DOF. Sadly, for some purposes like macro shots it is not enough. One way out is to combine the sharp parts of images focused at different distances, thereby artificially increasing the total DOF. This is exactly what Enfuse can do.

All lenses have a so-called “sweet spot” aperture, where their resolution is best. Taking pictures at this aperture the photographer squeezes the maximum quality out of the lens. But: the “sweet spot” aperture often is only one or two stops away from wide open. Wouldn't it be great to be able to combine these best-possible images to form one high-quality, sufficient-DOF image? Welcome to Enfuse's contrast selection abilities.

Preparing Focus Stacks

We are going to combine images with limited DOF to increase their in-focus parts. The whole process is about image sharpness. Therefore, the input images must align very well, not just well, but very well. For optimum results the maximum control point distance in Hugin (see section Helpful Additional Programs) should not exceed 0.3–0.5pixels to ensure perfect blending.

As in all image fusion operations it is preferable to use 16bit linear (gamma = 1) images throughout, but 8bit gamma encoded images will do. Naturally, high SNR input data always is welcome.

Contrast Based Fusing

A bare bones call to Enfuse for focus stacking could look like this.

enfuse \
    --wExposure=0 \
    --wSaturation=0 \
    --wContrast=1 \
    --HardMask \
    … \
    -o output.tif \
    input-<0000-9999>.tif

Here is what each option does.

‘--wExposure=0’: Switch off exposure based pixel selection. The default weight is 1.
‘--wSaturation=0’: Switch off saturation based pixel selection. The default weight is 0.2.
‘--wContrast=1’: Switch on pixel selection based on contrast.
‘--HardMask’: Select the best pixel from the image stack and ignore all others. Without this option Enfuse uses all pixels in the stack and weights them according to their respective quality, which in our case would be contrast. Without ‘--HardMask’ the result will always look a bit soft. See section Contrast Weighting.

If you want to see some entertaining progress messages – contrast weighting takes a while –, also pass the ‘-v’ option for a verbose progress report.
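When focus stacks are processed from a script, the call above can be assembled programmatically. The following sketch only builds the argument list from the options explained in this section; the helper name `focus_stack_command`, its parameters, and the file names are hypothetical, and any further options have to be supplied by the caller through `extra_options`.

```python
import shlex


def focus_stack_command(inputs, output="output.tif", extra_options=(), verbose=False):
    """Assemble the argument list for a contrast-based focus-stacking run."""
    command = [
        "enfuse",
        "--wExposure=0",    # switch off exposure-based selection
        "--wSaturation=0",  # switch off saturation-based selection
        "--wContrast=1",    # select pixels by local contrast
        "--HardMask",       # take only the best pixel from the stack
    ]
    if verbose:
        command.append("-v")
    command.extend(extra_options)
    command.extend(["-o", output])
    command.extend(inputs)
    return command


# Hypothetical file names; pass the list to subprocess.run() to execute it.
cmd = focus_stack_command(["input-0000.tif", "input-0001.tif"], verbose=True)
print(shlex.join(cmd))
```

Building a list instead of a single string avoids quoting problems with file names that contain spaces.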

Basic Focus Stacking

For a large class of image stacks Enfuse's default algorithm for determining sharpness, as selected in Contrast Based Fusing, produces nice results. The algorithm uses a moving square window, the so-called contrast window. It computes the standard deviation of the pixels inside of the window. The program then selects the window's center pixel of the image in the stack where the standard deviation is largest, that is, where the local contrast reaches its maximum.
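The selection rule can be sketched in plain Python for tiny grayscale images stored as nested lists. This is an illustration of the idea only, not Enfuse's code: the function names are made up, border pixels simply use the part of the window that lies inside the image (an assumption), and no multiresolution blending takes place.

```python
from statistics import pstdev


def local_contrast(image, row, col, window=3):
    """Standard deviation of the pixels in a window x window square centred on (row, col).

    Pixels outside the image are skipped; this border handling is an assumption.
    """
    half = window // 2
    values = [
        image[r][c]
        for r in range(row - half, row + half + 1)
        for c in range(col - half, col + half + 1)
        if 0 <= r < len(image) and 0 <= c < len(image[0])
    ]
    return pstdev(values)


def fuse_hard(stack, window=3):
    """For every position take the pixel of the image with the largest local contrast."""
    height, width = len(stack[0]), len(stack[0][0])
    fused = []
    for r in range(height):
        out_row = []
        for c in range(width):
            best = max(stack, key=lambda img: local_contrast(img, r, c, window))
            out_row.append(best[r][c])
        fused.append(out_row)
    return fused


# Image a has detail on the left and is flat on the right; image b is the opposite.
a = [[10, 250, 10, 128, 128, 128] for _ in range(3)]
b = [[128, 128, 128, 20, 240, 20] for _ in range(3)]
print(fuse_hard([a, b])[1])  # [10, 250, 10, 20, 240, 20]
```

The fused row takes its left half from the first image and its right half from the second, which is the behavior expected from a hard, contrast-based selection.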

However, the algorithm fails to deliver good masks for images which exhibit high contrast edges on the scale of the contrast window size. The typical artifacts that show up are

faint dark seams on the light side of the high contrast edges and
extremely soft, slightly lighter seams on the dark side of the high contrast edges,

where the distance of the seams from the middle of the edge is comparable to the contrast window size.

If your results do not show any of these artifacts, stick with the basic algorithm. Advanced focus stacking as described in the next sections delivers superior results in case of artifacts, though it requires manually tuning several parameters.

Advanced Focus Stacking

If your fused image shows any of the defects described in the previous section, you can try a more difficult to use algorithm that effectively works around the seam artifacts. It is described in the next section.

A Detailed Look at the Problem

Let us use an example to illustrate the problem of relating the sharpness with the local contrast variations. Say we use a 5× 5 contrast window. Moreover, let sharp_edge and smooth_edge be two specific configurations:

sharp_edge =  #(    0,    0,  200,    0,    0;
                    0,  225,    0,    0,    0;
                    0,  255,    0,    0,    0;
                  215,    0,    0,    0,    0;
                  200,    0,    0,    0,    0)

smooth_edge = #(    0,   62,  125,  187,  250;
                    1,   63,  126,  188,  251;
                    2,   65,  127,  190,  252;
                    3,   66,  128,  191,  253;
                    5,   67,  130,  192,  255)

Figures 6.1 and 6.2 show plots of the matrices sharp_edge and smooth_edge.

Figure 6.1: 3D plot augmented by contour plot of the matrix sharp_edge.

Figure 6.2: 3D plot augmented by contour plot of the matrix smooth_edge.

Our intuition lets us “see” an extremely sharp edge in the first matrix, whereas the second one describes an extraordinarily smooth diagonal intensity ramp. Which one will be selected? Well, sharp_edge has a standard deviation of 88.07 and smooth_edge has 88.41. Thus, smooth_edge wins, contradicting our intuition and even worse our intention!
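The two figures quoted above are easy to reproduce. The sketch below assumes that the standard deviation is taken over all 25 window pixels as a population standard deviation, which is the variant that matches the quoted numbers.

```python
from statistics import pstdev

sharp_edge = [
    [0, 0, 200, 0, 0],
    [0, 225, 0, 0, 0],
    [0, 255, 0, 0, 0],
    [215, 0, 0, 0, 0],
    [200, 0, 0, 0, 0],
]

smooth_edge = [
    [0, 62, 125, 187, 250],
    [1, 63, 126, 188, 251],
    [2, 65, 127, 190, 252],
    [3, 66, 128, 191, 253],
    [5, 67, 130, 192, 255],
]


def window_sdev(window):
    """Population standard deviation of all pixels in the contrast window."""
    return pstdev([value for row in window for value in row])


print(round(window_sdev(sharp_edge), 2))   # 88.07
print(round(window_sdev(smooth_edge), 2))  # 88.41
```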

Sadly, configurations like smooth_edge occur more often with high-quality, good bokeh lenses. In fact they are the very manifestation of “good bokeh”. Therefore, Laplacian edge detection plays an important role when working with high-quality lenses.

Laplacian Edge Detection

Enfuse provides a Laplacian-based algorithm that can help in situations where weighting based on the standard deviation fails. It is activated with a positive value for SCALE in --EdgeScale=SCALE. The Laplacian will detect two-dimensional curvature on the scale of SCALE. Here and in the following we simply speak of “curvature” where we mean “magnitude of curvature”. That is, we shall not distinguish between convex and concave edges. Enfuse always uses the magnitude of curvature for weighting.
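To see why a curvature measure separates the two example windows sharp_edge and smooth_edge where the standard deviation does not, consider the plain 4-neighbour discrete Laplacian at the window center. This is a deliberate simplification: it omits the Gaussian smoothing on the scale of SCALE that a Laplacian-of-Gaussian applies, and the function name is made up for the sketch.

```python
sharp_edge = [
    [0, 0, 200, 0, 0],
    [0, 225, 0, 0, 0],
    [0, 255, 0, 0, 0],
    [215, 0, 0, 0, 0],
    [200, 0, 0, 0, 0],
]

smooth_edge = [
    [0, 62, 125, 187, 250],
    [1, 63, 126, 188, 251],
    [2, 65, 127, 190, 252],
    [3, 66, 128, 191, 253],
    [5, 67, 130, 192, 255],
]


def laplacian_magnitude(window, row, col):
    """Magnitude of the 4-neighbour discrete Laplacian at (row, col)."""
    centre = window[row][col]
    neighbours = (window[row - 1][col] + window[row + 1][col]
                  + window[row][col - 1] + window[row][col + 1])
    return abs(neighbours - 4 * centre)


print(laplacian_magnitude(sharp_edge, 2, 2))   # 255
print(laplacian_magnitude(smooth_edge, 2, 2))  # 1
```

The smooth ramp is almost perfectly linear around the center and therefore has next to no curvature, whereas the sharp edge produces a large response.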

Typically, SCALE ranges between 0.1pixels and 0.5pixels, where 0.3pixels is a good starting point. To find the best value for SCALE though, usually some experimentation will be necessary. Use the ‘--debug’ option to get all ‘mask####.tif’ and ‘mask####_wta.tif’ files (see section Understanding Masks) and check how different scales affect the artifacts.

Local Contrast Enhancement

Sometimes Enfuse misses smoother edges with ‘--EdgeScale’ and a little local contrast enhancement (LCE) helps. Set --EdgeScale=SCALE:LCE-SCALE:LCE-FACTOR, where LCE-SCALE and LCE-FACTOR work like the unsharp mask filters in various image manipulation programs. Start with LCE-SCALE ten times the value of SCALE and an LCE-FACTOR of 2–5.

LCE-SCALE can be specified as a percentage of SCALE. LCE-FACTOR also can be specified as a percentage. Examples:

--EdgeScale=0.3:3.0:3
--EdgeScale=0.3:1000%:3.0
--EdgeScale=0.3:3:300%
--EdgeScale=0.3:1000%:300%

By default LCE is turned off.
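The four example arguments all describe the same setting. The following sketch resolves the percentage notation as described in this section; it is a reading of the text, not Enfuse's actual argument parser, and the function name as well as the use of 0.0 for omitted LCE fields are assumptions.

```python
def parse_edge_scale(argument):
    """Resolve 'SCALE[:LCE-SCALE[:LCE-FACTOR]]' into three floats.

    Missing LCE fields are returned as 0.0, standing in for "LCE off".
    """
    fields = argument.split(":")
    scale = float(fields[0])
    lce_scale = lce_factor = 0.0
    if len(fields) > 1:
        text = fields[1]
        # A percentage for LCE-SCALE is relative to SCALE.
        lce_scale = scale * float(text[:-1]) / 100 if text.endswith("%") else float(text)
    if len(fields) > 2:
        text = fields[2]
        # A percentage for LCE-FACTOR is a plain factor times 100.
        lce_factor = float(text[:-1]) / 100 if text.endswith("%") else float(text)
    return scale, lce_scale, lce_factor


for example in ("0.3:3.0:3", "0.3:1000%:3.0", "0.3:3:300%", "0.3:1000%:300%"):
    scale, lce_scale, lce_factor = parse_edge_scale(example)
    print(example, "->", scale, round(lce_scale, 6), round(lce_factor, 6))
```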

Suppressing Noise or Recognizing Faint Edges

The Laplacian-based algorithm much better resists the seam problem than the local-contrast algorithm, but it has two shortcomings:

The Laplacian is very susceptible to noise and
it fails to recognize faint edges.

The ‘--MinCurvature’ option helps to mitigate both flaws.

The argument to --MinCurvature=CURVATURE either is an absolute lightness value, for example 0..255 for 8bit data and 0..65535 for 16bit data, or, when given with a ‘%’-sign it is a relative lightness value ranging from 0% to 100%.

To suppress unreal edges or counter excessive noise use the ‘--MinCurvature’ option with a negative curvature measure CURVATURE. This forces all curvatures less than -CURVATURE to zero.

A positive curvature measure CURVATURE makes Enfuse merge the LoG data with the local-contrast data. Every curvature larger than or equal to CURVATURE is left unchanged and every curvature less than CURVATURE gets replaced by the rescaled local-contrast data such that the largest local contrast is just below CURVATURE. This combines the best parts of both techniques and ensures a precise edge detection over the whole range of edge curvatures.
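The two modes of ‘--MinCurvature’ can be summarized in a short sketch that operates on lists of per-pixel values. It follows the description above and is not taken from Enfuse's source: the function name is hypothetical, only absolute (non-percentage) values are handled, and “just below” is modelled by an arbitrary factor of 0.999.

```python
def apply_min_curvature(curvatures, local_contrasts, min_curvature):
    """Combine LoG curvature magnitudes with local-contrast (SDev) data.

    curvatures      -- curvature magnitude per pixel
    local_contrasts -- local contrast per pixel, same length as curvatures
    min_curvature   -- the CURVATURE argument as an absolute value
    """
    if min_curvature < 0:
        # Negative argument: force all curvatures below -CURVATURE to zero.
        threshold = -min_curvature
        return [c if c >= threshold else 0.0 for c in curvatures]
    if min_curvature > 0:
        # Positive argument: keep strong curvatures and fill in the rest with
        # local contrast, rescaled so that its maximum stays just below CURVATURE.
        largest = max(local_contrasts)
        factor = 0.999 * min_curvature / largest if largest > 0 else 0.0
        return [c if c >= min_curvature else factor * s
                for c, s in zip(curvatures, local_contrasts)]
    return list(curvatures)


curvatures = [1.0, 5.0, 10.0]
contrasts = [20.0, 40.0, 10.0]
print(apply_min_curvature(curvatures, contrasts, -4.0))  # [0.0, 5.0, 10.0]
print(apply_min_curvature(curvatures, contrasts, 4.0))
```

In the second call only the first pixel falls below the threshold, so it alone is replaced by rescaled local-contrast data, and the replacement stays below the threshold by construction.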

Summary

--EdgeScale=0.3: Use LoG to detect edges on a scale of 0.3 pixels. Apply the default grayscale projector: average.
--EdgeScale=0.3 --GrayProjector=l-star: Use LoG to detect edges on a scale of 0.3 pixels. Apply the L*-grayscale projector.
--EdgeScale=0.3:3:300%: Use LoG to detect edges on a scale of 0.3 pixels, pre-sharpen the input images by 300% on a scale of 3 pixels. Apply the default grayscale projector: average.
--EdgeScale=0.3 --MinCurvature=-0.5%: Use LoG to detect edges on a scale of 0.3 pixels. Apply the default grayscale projector: average, and throw away all edges with a curvature of less than 0.5%.
--EdgeScale=0.3 --MinCurvature=0.5% --ContrastWindowSize=7: Use LoG to detect edges on a scale of 0.3 pixels. Apply the default grayscale projector: average, throw away all edges with a curvature of less than 0.5%, and replace the LoG data between 0% and 0.5% with SDev data. Use a window of 7× 7 pixels to compute the SDev.

Tips For Focus Stacking Experts

We have collected some advice of which even focus stacking adepts can benefit.

Ensure that the sensor is clean. Aligning focus stacks requires varying the viewing angle, which corresponds to a changing focal length. Hence, the same pixel on the sensor gets mapped onto different positions in the final image. Dirt spots will occur not only once but as many times as there are images in the stack – something unfunny to correct in postprocessing. Along the same lines the photographer may want to consider to prepare dark frames before and possibly also after the shoot of the focus stack to subtract hot pixels before fusion.
Prefer a low-sensitivity setting on the camera to get low-noise images. Fusing with ‘--HardMask’ does not average and thus does not suppress any noise in the input images.
If the transition of in-focus to out-of-focus areas is too abrupt, record the images with closest and farthest focusing distances twice: first with the intended working aperture and a second time with a small aperture (large aperture number). The small aperture will give the fused image a more natural in-focus to out-of-focus transition and the working-aperture shots supply the detail in the in-focus regions.

Helpful Additional Programs

Several libraries and programs have proven helpful when working with Enfuse and Enblend.

Raw Image Conversion

DCRaw is a universal raw-converter.
UFRaw, a raw converter based on DCRaw, adds a GUI (ufraw), versatile batch processing (ufraw-batch), and some additional features like, for example, cropping, noise reduction with wavelets, and automatic lens error correction.

Image Alignment and Rendering

ALE, David Hilvert's anti-lamenessing engine for the real die-hard command-line users aligns, filters, and renders images.
Hugin is a GUI that aligns and stitches images.
It comes with several command line tools, like nona to stitch panorama images, align_image_stack to align overlapping images for HDR or create focus stacks, and fulla to correct lens errors.
PanoTools, the successor of Helmut Dersch's original PanoTools, offers a set of command-line driven applications to create panoramas. Most notable are PTOptimizer and PTmender.

Image Manipulation

CinePaint is a branch of an early Gimp forked off at version 1.0.4. It sports far fewer features than the current Gimp, but offers 8bit, 16bit and 32bit color channels, HDR (for example floating-point TIFF, and OpenEXR), and a tightly integrated color management system.
The Gimp is a general purpose image manipulation program. At the time of this writing it is still limited to images with only 8bits per channel.
ImageMagick and its clone GraphicsMagick are general purpose command-line driven image manipulation programs.

High Dynamic Range

OpenEXR offers libraries and some programs to work with the EXR HDR format.
PFSTools create, modify, and tonemap high-dynamic range images.

Libraries

LibJPEG is a library for handling the JPEG (JFIF) image format.
LibPNG is a library that handles the Portable Network Graphics (PNG) image format.
LibTIFF offers a library and utility programs to manipulate the ubiquitous Tagged Image File Format, TIFF. The nifty tiffinfo command quickly inquires the properties of TIFF files.

Meta-Data Handling

EXIFTool reads and writes EXIF meta data. In particular it copies meta data from one image to another.
LittleCMS is the color management library used by Hugin, DCRaw, UFRaw, Enblend, and Enfuse. It supplies some binaries, too. tifficc, an ICC color profile applier, is of particular interest.

A. Authors

Andrew Mihal (acmihal@users.sourceforge.net) has written Enfuse.
Thanks to Simon Andriot and Pablo Joubert for suggesting the Mertens-Kautz-Van Reeth technique and the name “Enfuse”.
The contrast criterion was added by Pablo d'Angelo (dangelo@users.sourceforge.net).
Dr. Christoph L. Spiel added the gray projectors, the LoG-based edge detection, an O(n)-algorithm for the calculation of local contrast, and entropy weighting.

List of Tables

Table 1.1: Weighting criteria
Table 5.1: Suggested cache-size settings

List of Figures

Figure 2.1: Entropy cutoff function
Figure 3.1: Gaussian function
Figure 3.2: Local analysis window
Figure 3.3: Laplacian-of-Gaussian
Figure 3.4: Entropy function
Figure 6.1: Sharp edge
Figure 6.2: Smooth edge

Program Index

Jump to:	A C D E F G H I L N O P T U

Index Entry	Section

A
`ALE`	Helpful Additional Programs
`align_image_stack`	Helpful Additional Programs
`Anti-Lamenessing Engine`	Helpful Additional Programs

C
`cinepaint`	Helpful Additional Programs
`CinePaint`	Extended Options
`CinePaint`	Helpful Additional Programs

D
`DCRaw`	Helpful Additional Programs
`dcraw`	Helpful Additional Programs

E
`EXIFTool`	Helpful Additional Programs
`exiftool`	Helpful Additional Programs

F
`fulla`	Helpful Additional Programs

G
`Gimp`	Extended Options
`gimp`	Helpful Additional Programs
`Gimp`	Helpful Additional Programs
`GraphicsMagick`	Helpful Additional Programs

H
`hugin`	Helpful Additional Programs
`Hugin`	Extended Options
`Hugin`	What Makes Images Fusable?
`Hugin`	Helpful Additional Programs

I
`identify`	Masks in Input Files
`ImageMagick`	Helpful Additional Programs

L
`LibJPEG`	Helpful Additional Programs
`LibPNG`	Helpful Additional Programs
`LibTIFF`	Helpful Additional Programs
`LittleCMS, TiffICC`	Helpful Additional Programs

N
`nona`	Extended Options
`nona`	Helpful Additional Programs

O
`OpenEXR`	Helpful Additional Programs

P
`PanoTools`	Helpful Additional Programs
`PFSTools`	Helpful Additional Programs
`PTmender`	Helpful Additional Programs
`PTOptimizer`	Helpful Additional Programs

T
`tifficc`	Helpful Additional Programs
`tiffinfo`	Masks in Input Files
`tiffinfo`	Helpful Additional Programs

U
`UFRaw`	Helpful Additional Programs
`ufraw`	Helpful Additional Programs
`ufraw-batch`	Helpful Additional Programs

Option Index

Jump to:	- S

Index Entry	Section

-
`-b`	Extended Options
`-b`	Tuning Memory Usage
`-c`	Extended Options
`--compression`	Common Options
`--ContrastWindowSize`	Expert Options
`-d`	Extended Options
`--depth`	Extended Options
`--EdgeScale`	Expert Options
`--EntropyCutoff`	Expert Options
`--EntropyWindowSize`	Expert Options
`-f`	Extended Options
`-g`	Extended Options
`--GrayProjector`	Expert Options
`-h`	Common Options
`--HardMask`	Expert Options
`--HardMask`	Disabling Averaging: Option ‘--HardMask’
`--HardMask`	Tips For Focus Stacking Experts
`--help`	Common Options
`-l`	Common Options
`-m`	Extended Options
`-m`	Tuning Memory Usage
`--MinCurvature`	Expert Options
`-o`	Common Options
`-V`	Common Options
`-v`	Common Options
`--verbose`	Common Options
`--version`	Common Options
`-w`	Common Options
`--wContrast`	Fusion Options
`--wEntropy`	Fusion Options
`--wExposure`	Fusion Options
`--wMu`	Fusion Options
`--wSaturation`	Fusion Options
`--wSigma`	Fusion Options

S
`SoftMask`	Expert Options

General Index

Jump to:	3 A B C D E F G H I L M N O P S T U V W

Index Entry	Section

3
360° panorama	Common Options

A
advanced focus stacking	Advanced Focus Stacking
advanced focus stacking, recognizing faint edges	Suppressing Noise or Recognizing Faint Edges
advanced focus stacking, suppressing noise	Suppressing Noise or Recognizing Faint Edges
alpha channel	Overview
alpha channels, unassociated	Extended Options
aperture, sweet spot	Why create focus stacks?
applications of enfuse	Applications of Enfuse
authors, list of	A. Authors
average, weighted	Weighted Average
averaging, disabling	Disabling Averaging: Option ‘--HardMask’

B
basic focus stacking	Basic Focus Stacking
binary mask	Understanding Masks
bits per channel	Extended Options
blending exposures	Common Misconceptions
Burt-Adelson multiresolution spline	Overview

C
channel, alpha	Overview
CIECAM02	Extended Options
circle-of-confusion	Why create focus stacks?
CoC	Why create focus stacks?
color appearance model	Extended Options
compression	Common Options
compression, deflate	Common Options
compression, LZW	Common Options
compression, packbits	Common Options
contrast-based fusing	Contrast Based Fusing
contrast enhancement, local	Local Contrast Enhancement
contrast weighting using a blend of methods	Blend Standard Deviation and Laplacian of Gaussian
contrast weighting using Laplacian-of-Gaussian	Laplacian of Gaussian
contrast weighting using standard deviation	Standard Deviation
conversion, Lab*	Expert Options

D
dark frame	Tips For Focus Stacking Experts
deflate compression	Common Options
delimiters, option	Option Delimiters
depth-of-field	Why create focus stacks?
depth-of-focus increase	Focus Stacks – Depth-of-Field Increase
digital blending	Common Misconceptions
disabling averaging	Disabling Averaging: Option ‘--HardMask’
DOF	Why create focus stacks?
dynamic range increase	Exposure Series – Dynamic Range Increase
dynamic range increase	Flash Exposure Series – Directed Lighting

E
edge detection, laplacian	Laplacian Edge Detection
`ENBLEND_CACHE_IMAGES` (preprocessor symbol)	Tuning Memory Usage
entropy	Entropy Weighting
entropy, definition	Entropy Weighting
estimators	Estimators
expectation value	Statistical Moments
expert focus stacking tips	Tips For Focus Stacking Experts
exposure series	Exposure Series – Dynamic Range Increase
exposure series, common misconceptions	Common Misconceptions
exposure series, tips for beginners	Tips For Beginners

F
flash exposure series	Flash Exposure Series – Directed Lighting
focus stacking, advanced	Advanced Focus Stacking
focus stacking, basic	Basic Focus Stacking
focus stacks	Focus Stacks – Depth-of-Field Increase
focus stacks, fusing	Contrast Based Fusing
focus stacks, preparation	Preparing Focus Stacks
focus stacks, why create them	Why create focus stacks?
fusing, contrast-based	Contrast Based Fusing

G
general index	General Index
gray projector	Expert Options
gray projector, ‘average’	Expert Options
gray projector, ‘channel-mixer’	Expert Options
gray projector, ‘l-star’	Expert Options
gray projector, ‘lightness’	Expert Options
gray projector, ‘luminance’	Expert Options
gray projector, ‘value’	Expert Options

H
helpful programs	Helpful Additional Programs
hot pixels	Tips For Focus Stacking Experts

I
image cache, block size	Extended Options
image cache, cache size	Extended Options
images, fusable	What Makes Images Fusable?
index, general	General Index
index, option	Option Index
index, program	Program Index
input masks	Masks in Input Files
invocation	Invocation

L
Lab* conversion	Expert Options
laplacian edge detection	Laplacian Edge Detection
Laplacian of Gaussian	Laplacian of Gaussian
levels, pyramid	Common Options
light probe	Common Misconceptions
local analysis window	Standard Deviation
local contrast enhancement	Local Contrast Enhancement
local contrast problem	A Detailed Look at the Problem
LoG	Laplacian of Gaussian
LZW compression	Common Options

M
mask, binary	Understanding Masks
mask, weight	Understanding Masks
masks, input files	Masks in Input Files
masks, understanding	Understanding Masks
masks, weight	Weight Mask Files
memory, tuning usage of	Tuning Memory Usage
Mertens-Kautz-Van Reeth exposure fusion	Overview
mode of operation (SDev, LoG, …)	Scaling and Choice of Mode

N
natural sharp-unsharp transition	Tips For Focus Stacking Experts
noise reduction	Repetition – Noise Reduction

O
option delimiters	Option Delimiters
option index	Option Index
options, common	Common Options
options, expert	Expert Options
options, extended	Extended Options
options, fusion	Fusion Options
output file compression	Common Options
output image, set size of	Extended Options
overview	Overview

P
packbits compression	Common Options
pixels, hot	Tips For Focus Stacking Experts
polarization series	Polarization Series – Saturation Enhancement
probability function	Statistical Moments
problem, local contrast	A Detailed Look at the Problem
program index	Program Index
programs, helpful additional	Helpful Additional Programs
pyramid levels	Common Options

S
saturation enhancement	Polarization Series – Saturation Enhancement
scaling of parameters	Scaling and Choice of Mode
sensor, use of clean	Tips For Focus Stacking Experts
series, exposure	Exposure Series – Dynamic Range Increase
series, flash exposure	Flash Exposure Series – Directed Lighting
series, polarization	Polarization Series – Saturation Enhancement
series, simple	Repetition – Noise Reduction
simple series	Repetition – Noise Reduction
standard deviation	Statistical Moments
statistical moments	Statistical Moments
subtraction of dark frame	Tips For Focus Stacking Experts
sweet spot aperture	Why create focus stacks?

T
tips, focus stacking experts	Tips For Focus Stacking Experts
transition, natural sharp-unsharp	Tips For Focus Stacking Experts

U
understanding masks	Understanding Masks

V
variance	Statistical Moments

W
weight mask	Understanding Masks
weight masks	Weight Mask Files
weighted average	Weighted Average
weighting functions	Weighting Functions
weighting, contrast	Overview
weighting, contrast	Contrast Weighting
weighting, contrast using a blend of methods	Blend Standard Deviation and Laplacian of Gaussian
weighting, contrast using laplacian-of-gaussian	Laplacian of Gaussian
weighting, contrast using standard deviation	Standard Deviation
weighting, entropy	Overview
weighting, entropy	Entropy Weighting
weighting, exposure	Overview
weighting, exposure	Exposure Weighting
weighting, general concept of	Weighting Pixels
weighting, saturation	Overview
weighting, saturation	Saturation Weighting
window, local-analysis	Standard Deviation

Jump to:	3 A B C D E F G H I L M N O P S T U V W

Footnotes

(1) Tom Mertens, Jan Kautz, and Frank van Reeth, “Exposure Fusion”, Proceedings of the 15th Pacific Conference on Computer Graphics and Applications, pages 382–390.

(2) Peter J. Burt and Edward H. Adelson, “A Multiresolution Spline With Application to Image Mosaics”, ACM Transactions on Graphics, Vol. 2, No. 4, October 1983, pages 217–236.

(3) Downsampling with a good interpolator reduces noise, which might not be desired when judging the image quality of the original-size image. Cropping might be an alternative, though.

(4) In the current implementation a floor(ContrastWindowSize / 2) wide border around the images remains unprocessed and gets a weight of zero.

(5) Paul E. Debevec defines: “A light probe image is an omnidirectional, high dynamic range image that records the incident illumination conditions at a particular point in space.”
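
Footnote (4) can be illustrated with a small sketch. It computes a local standard deviation for every pixel whose analysis window fits inside the image and leaves a floor(ContrastWindowSize / 2) wide border at weight zero. The function name `sdev_contrast_weights`, the use of the population standard deviation, and the restriction to odd window sizes are assumptions of this sketch, not a description of Enfuse's implementation.

```python
from statistics import pstdev


def sdev_contrast_weights(gray, window_size):
    """Return per-pixel local standard deviation; border pixels keep weight 0.

    gray is a list of rows of grayscale values. window_size plays the role
    of ContrastWindowSize (assumed odd here so the window is centered).
    """
    if window_size < 3 or window_size % 2 == 0:
        raise ValueError("this sketch expects an odd window_size >= 3")
    height, width = len(gray), len(gray[0])
    border = window_size // 2  # floor(ContrastWindowSize / 2), see footnote (4)
    weights = [[0.0] * width for _ in range(height)]
    for y in range(border, height - border):
        for x in range(border, width - border):
            # Collect all pixels of the window centered on (x, y).
            window = [gray[j][i]
                      for j in range(y - border, y + border + 1)
                      for i in range(x - border, x + border + 1)]
            weights[y][x] = pstdev(window)  # population estimator (assumption)
    return weights


# A 5x5 image that is flat except for one bright pixel in the center.
image = [[0.0] * 5 for _ in range(5)]
image[2][2] = 1.0
weights = sdev_contrast_weights(image, 3)
```

With a window size of 3 the border is one pixel wide, so only the inner 3x3 block of `weights` can become non-zero.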

@@ Line 188: / Line 188: @@
 ; <code>--EdgeScale=<var>EDGE-SCALE</var></code>
 ; <code>--EdgeScale=<var>EDGE-SCALE</var>:<var>LCE-SCALE</var>:<var>LCE-FACTOR</var></code>
 : {{anchor|IDX54}} A non-zero value for <var>EDGE-SCALE</var> switches on the Laplacian-of-Gaussian (LoG) edge detection algorithm. <var>EDGE-SCALE</var> is the radius of the Gaussian used in the search for edges. Default: 0 pixels. A positive <var>LCE-SCALE</var> turns on local contrast enhancement (LCE) before the LoG edge detection. <var>LCE-SCALE</var> is the radius of the Gaussian used in the enhancement step; <var>LCE-FACTOR</var> is the weight factor (“strength”).
-<math>enhanced = (1 + LCE-FACTOR) * original - LCE-FACTOR * GaussianSmooth(original, LCE-SCALE) \,</math>
+:<math>enhanced = (1 + LCE-FACTOR) * original - LCE-FACTOR * GaussianSmooth(original, LCE-SCALE)</math>
-<var>LCESCALE</var> defaults to 0 pixels and <var>LCE-FACTOR</var> defaults to 0. Append ‘%’ to <var>LCE-SCALE</var> to specify the radius as a percentage of <var>EDGE-SCALE</var>. Append ‘%’ to <var>LCE- FACTOR</var> to specify the weight as a percentage.
+:<var>LCE-SCALE</var> defaults to 0 pixels and <var>LCE-FACTOR</var> defaults to 0. Append ‘%’ to <var>LCE-SCALE</var> to specify the radius as a percentage of <var>EDGE-SCALE</var>. Append ‘%’ to <var>LCE-FACTOR</var> to specify the weight as a percentage.
 ; <code>--EntropyCutoff=<var>LOWER-CUTOFF</var></code>
 ; <code>--EntropyCutoff=<var>LOWER-CUTOFF</var>:<var>UPPER-CUTOFF</var></code>
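
The local contrast enhancement formula given for ‘--EdgeScale’ above can be tried out on a one-dimensional signal. The following is only a sketch: the helper names `gaussian_smooth` and `enhance_local_contrast` are made up for illustration, <var>LCE-SCALE</var> is interpreted as the standard deviation of the Gaussian, and samples beyond the ends are clamped. Enfuse itself works on two-dimensional images and may treat the radius and the borders differently.

```python
import math


def gaussian_smooth(signal, scale):
    """Smooth a 1-D signal with a normalized Gaussian kernel (sigma = scale)."""
    if scale <= 0:
        return list(signal)  # a scale of 0 means "no smoothing" in this sketch
    half = max(1, math.ceil(3 * scale))  # truncate the kernel at 3 sigma
    offsets = range(-half, half + 1)
    kernel = [math.exp(-0.5 * (k / scale) ** 2) for k in offsets]
    total = sum(kernel)
    kernel = [w / total for w in kernel]
    n = len(signal)
    smoothed = []
    for i in range(n):
        acc = 0.0
        for k, w in zip(offsets, kernel):
            j = min(max(i + k, 0), n - 1)  # clamp at the borders (assumption)
            acc += w * signal[j]
        smoothed.append(acc)
    return smoothed


def enhance_local_contrast(original, lce_scale, lce_factor):
    """enhanced = (1 + LCE-FACTOR) * original
                  - LCE-FACTOR * GaussianSmooth(original, LCE-SCALE)"""
    smoothed = gaussian_smooth(original, lce_scale)
    return [(1 + lce_factor) * o - lce_factor * s
            for o, s in zip(original, smoothed)]


# A step edge: the enhanced signal overshoots on both sides of the step.
step = [0.0] * 5 + [1.0] * 5
enhanced = enhance_local_contrast(step, lce_scale=1.0, lce_factor=0.5)
```

Flat regions pass through unchanged, while values next to the step are pushed below 0 and above 1, which is what makes faint edges easier for the subsequent LoG detection to pick up.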
@@ Line 198: / Line 198: @@
 ; <code>--GrayProjector=<var>PROJECTOR</var></code>
 : {{anchor|IDX57}} {{anchor|IDX58}} Use gray projector <var>PROJECTOR</var> for conversion of RGB images to grayscale masks: (R, G, B) → Y. In version 3.2-cvs of Enfuse, the option is effective for exposure weighting and contrast weighting. Default: ‘average’. Valid values for <var>PROJECTOR</var> are:
-;; <code>average</code>
+; <code>average</code>
-:: {{anchor|IDX59}} Average red, green, and blue channel with equal weights. This is the default and it often is a good projector for gamma = 1 data.<br><math>Y = {(R + G + B)}/{3} \,</math>
+: {{anchor|IDX59}} Average red, green, and blue channel with equal weights. This is the default and it often is a good projector for gamma = 1 data.
-;; <code>l-star</code>
+: <math>Y = {(R + G + B)}/{3}</math>
-:: {{anchor|IDX60}} {{anchor|IDX61}} {{anchor|IDX62}} Use the L-channel of the L*a*b*-conversion of the image as its grayscale representation. This is a useful projector for gamma = 1 data. It reveals minute contrast variations even in the shadows and the highlights. This projector is computationally expensive.See [http://en.wikipedia.org/wiki/Lab_color_space Wikipedia] for a detailed description of the Lab color space.
+; <code>l-star</code>
-;; <code>lightness</code>
+: {{anchor|IDX60}} {{anchor|IDX61}} {{anchor|IDX62}} Use the L-channel of the L*a*b*-conversion of the image as its grayscale representation. This is a useful projector for gamma = 1 data. It reveals minute contrast variations even in the shadows and the highlights. This projector is computationally expensive. See [http://en.wikipedia.org/wiki/Lab_color_space Wikipedia] for a detailed description of the Lab color space.
-:: {{anchor|IDX63}} Compute the lightness of each RGB pixel as in an Hue-Saturation-Lightness (HSL) conversion of the image.<br><math>Y = {(max(R, G, B) + min(R, G, B))}/{2} \,</math>
+; <code>lightness</code>
-;; <code>value</code>
+: {{anchor|IDX63}} Compute the lightness of each RGB pixel as in an Hue-Saturation-Lightness (HSL) conversion of the image.
-:: {{anchor|IDX64}} Take the Value-channel of the Hue-Saturation-Value (HSV) conversion of the image.<br><math>Y = max(R, G, B) \,</math>
+: <math>Y = {(max(R, G, B) + min(R, G, B))}/{2}</math>
-;; <code>luminance</code>
+; <code>value</code>
-:: {{anchor|IDX65}} Use the weighted average of the RGB pixel's channels as defined by CIE (“Commission Internationale de l'Éclairage”) and the JPEG standard.<br><math>Y = 0.30 * R + 0.59 * G + 0.11 * B \,</math>
+: {{anchor|IDX64}} Take the Value-channel of the Hue-Saturation-Value (HSV) conversion of the image.
-;; <code>channel-mixer:<var>RED-WEIGHT</var>:<var>GREEN-WEIGHT</var>:<var>BLUE-WEIGHT</var></code>
+: <math>Y = max(R, G, B)</math>
-:: {{anchor|IDX66}} Weight the channels as given.<br><math>Y = RED_{WEIGHT} * R + GREEN_{WEIGHT} * G + BLUE_{WEIGHT} * B \,</math>
+; <code>luminance</code>
-The weights are automatically normalized to one, so{|
+: {{anchor|IDX65}} Use the weighted average of the RGB pixel's channels as defined by CIE (“Commission Internationale de l'Éclairage”) and the JPEG standard.
-|
+: <math>Y = 0.30 * R + 0.59 * G + 0.11 * B</math>
-     --GrayProjector=channel-
+; <code>channel-mixer:<var>RED-WEIGHT</var>:<var>GREEN-WEIGHT</var>:<var>BLUE-WEIGHT</var></code>
+: {{anchor|IDX66}} Weight the channels as given.
- mixer:0.25:0.5:0.25
+: <math>Y = RED-WEIGHT * R + GREEN-WEIGHT * G + BLUE-WEIGHT * B</math>
-     --GrayProjector=channel-mixer:1:2:1
+: The weights are automatically normalized to one, so
-     --GrayProjector=channel-mixer:25:50:25
+<pre>
-|}all define the same mixer configuration.The three weights <var>RED-WEIGHT</var>, <var>GREEN-WEIGHT</var>, and <var>BLUE-WEIGHT</var> define the relative weight of the respective color channel. The sum of all weights is normalized to one.
+ --GrayProjector=channel-mixer:0.25:0.5:0.25
+ --GrayProjector=channel-mixer:1:2:1
+ --GrayProjector=channel-mixer:25:50:25
+</pre>
+:all define the same mixer configuration. The three weights <var>RED-WEIGHT</var>, <var>GREEN-WEIGHT</var>, and <var>BLUE-WEIGHT</var> define the relative weight of the respective color channel. The sum of all weights is normalized to one.
 ; <code>--HardMask</code>
 : {{anchor|IDX67}} Force hard blend masks on the finest scale. This is the opposite flag of ‘--SoftMask’. This blending mode avoids averaging of fine details (only) at the expense of increasing the noise. However, it considerably improves the sharpness of focus stacks. Blending with hard masks has solely proven useful with focus stacks. See also [[#SEC5|Option --wContrast]] and ‘--ContrastWindowSize’ above.
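
The gray projector formulas listed for ‘--GrayProjector’ can be written down in a few lines. The sketch below covers ‘average’, ‘lightness’, ‘value’, ‘luminance’, and ‘channel-mixer’ exactly as the formulas above state them. The function name `project_gray` is hypothetical, ‘l-star’ is left out because it needs a full L*a*b* conversion, and the error handling is an assumption rather than Enfuse's behavior.

```python
def project_gray(rgb, projector="average"):
    """Map one (R, G, B) pixel to a gray value Y using the named projector."""
    r, g, b = rgb
    if projector == "average":
        return (r + g + b) / 3
    if projector == "lightness":
        return (max(r, g, b) + min(r, g, b)) / 2
    if projector == "value":
        return max(r, g, b)
    if projector == "luminance":
        return 0.30 * r + 0.59 * g + 0.11 * b
    if projector.startswith("channel-mixer:"):
        # channel-mixer:RED-WEIGHT:GREEN-WEIGHT:BLUE-WEIGHT
        weights = [float(w) for w in projector.split(":")[1:]]
        total = sum(weights)
        if len(weights) != 3 or total <= 0:
            raise ValueError("expected three weights with a positive sum")
        # Normalize so that the weights sum to one.
        red_w, green_w, blue_w = (w / total for w in weights)
        return red_w * r + green_w * g + blue_w * b
    raise ValueError("unknown projector in this sketch: " + projector)


pixel = (0.2, 0.4, 0.8)
# These three specifications describe the same mixer after normalization.
mixed = [project_gray(pixel, spec) for spec in (
    "channel-mixer:0.25:0.5:0.25",
    "channel-mixer:1:2:1",
    "channel-mixer:25:50:25",
)]
```

Because the weights are divided by their sum before use, only their ratios matter, which is why the three example options describe one and the same configuration.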

Weighted Average	Enfuse's default weighting algorithm
Disabling Averaging: Option ‘--HardMask’	“Super Trouper” weighting for focus stacks

Common Options	General options
Extended Options	Memory control and others
Fusion Options	Image fusion control
Expert Options	Contrast and entropy selection configuration
Option Delimiters	How to separate options' arguments

Weighting Pixels	General concept of weighting pixels
Exposure Weighting	Weighting by exposure
Saturation Weighting	Weighting by saturation
Contrast Weighting	Weighting by local contrast
Entropy Weighting	Weighting by local entropy

Standard Deviation	Standard deviation (SDev)
Laplacian of Gaussian	LoG, a second derivative method
Blend Standard Deviation and Laplacian of Gaussian	Mix and match SDev and LoG
Scaling and Choice of Mode	How parameters do not scale; neither does mode

Tips For Beginners	Some hints for beginners
Common Misconceptions	What works despite the hype

Why create focus stacks?	Why take the hassle?
Preparing Focus Stacks	How to get suitable input images
Contrast Based Fusing	Fundamental command line options
Basic Focus Stacking	Simple, standard deviation method
Advanced Focus Stacking	Advanced, Laplacian technique
Tips For Focus Stacking Experts	Tips for focus stacking experts

A Detailed Look at the Problem	What is the problem Kenneth?
Laplacian Edge Detection	Using a Laplacian-of-Gaussian to detect edges
Local Contrast Enhancement	Boosting local contrast before weighting
Suppressing Noise or Recognizing Faint Edges	The best of both worlds


Overview

Invocation

Common Options

Extended Options

Fusion Options

Expert Options

Option Delimiters

Weighting Functions

Weighting Pixels

Weighted Average

Disabling Averaging: Option ‘--HardMask’

Exposure Weighting

Saturation Weighting

Contrast Weighting

Standard Deviation

Statistical Moments

Estimators

Laplacian of Gaussian

Blend Standard Deviation and Laplacian of Gaussian

Scaling and Choice of Mode

Entropy Weighting

Understanding Masks

Masks in Input Files

Weight Mask Files

Tuning Memory Usage

Applications of Enfuse

What Makes Images Fusable?

Repetition – Noise Reduction

Exposure Series – Dynamic Range Increase

Tips For Beginners

Common Misconceptions

Flash Exposure Series – Directed Lighting

Polarization Series – Saturation Enhancement

Focus Stacks – Depth-of-Field Increase

Why create focus stacks?

Preparing Focus Stacks

Contrast Based Fusing

Basic Focus Stacking

Advanced Focus Stacking

A Detailed Look at the Problem

Laplacian Edge Detection

Local Contrast Enhancement

Suppressing Noise or Recognizing Faint Edges

Tips For Focus Stacking Experts

Helpful Additional Programs

A. Authors

List of Tables

List of Figures

Program Index

Option Index

General Index

Footnotes

Table of Contents
