Difference between revisions of "Historical:SoC 2008 Masking in GUI"

Revision as of 19:01, 9 May 2008

Introduction

The objective of this project is to provide the user with an easy to use interface for quickly creating blending masks. After the images are aligned and shown in the preview window, users will have the option of creating blending masks. Currently the goal is to provide option for mask creation in the preview window. Since it already shows the aligned images, it would be easier for users to create appropriate masks when all the images aligned.

Project Outline

Implementation will be done in two phases. In the first phase, the basic framework will be created. Users will be able to mark regions as either foreground or background and specify the contribution of each segment. Based on the marked region a polygonal outline of the foreground object will be created. Since the outline may not always be accurate (eg. in the case of low contrast edge), a polygon editing option will be provided (the idea is from [2]). When the user chooses to create panorama the masks are generated as output and provided to enblend.The editing features that are going to be available in this phase are -

Option for zooming in/out
Set brush stroke size
Polygon editing mode for fine-tuning boundary regions

The second phase will focus on 3D segmentation. For instance if an object is moving in front of the camera and we want to exclude that object from the final scene then the user has to mark that object in every image. The second phase will make this simpler by extending the segmentation into 3D.

Timeline

1. Before Start of Coding Phase:

Determine input data type, format and how the user will interact
Construct a preliminary design of the software
Outline of how the algorithm will work
Finalize the scope of the project
Start porting the existing implementation of image segmentation to use wxWidget and VIGRA

2. Coding Phase:

2.1 Before Mid Term Evaluation

Stand-alone application for testing the basic framework

Implement a basic framework that can –

Take an image stack of a particular format.
Allow users to mark regions
Incorporate algorithm to learn the color model from the user defined area
Start implementing 2D multi-label image segmentation (this may not be necessary)

2.2 After Mid Term Evaluation

Fix issues with first phase(bugs, usability, etc.)
Perform image segmentation on the stack of images (3D segmentation problem where the user will only need to roughly mark the region on a small subset of the images). At the end of this stage the segmentation algorithm should be able to correctly identify similar region in successive images.
Implement custom max flow/min cut algorithm

Deliverable

The final deliverable will be –

An extensible framework that works with a subset of the problem.
An extensible interaction system that supports the primary forms of interaction.
A Graph-cut based technique for multi-label image segmentation that works on a stack of image.

Glossary of Terms

Graph Cut: An optimization technique.

Image Segmentation:

Multi-label Image Segmentation:

References

[1] Yury Boykov, Marie-Pierre Jolly, "Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images," Proc. Eighth IEEE ICCV, vol.1, no., pp.105-112 vol.1, 2001. webpage

[2] Yin Li, Jian Sun, Chi-Keung Tang and Heung-Yeung Shum, "Lazy Snapping," ACM Transaction on Graphics(Proceedings of SIGGRAPH), Vol 23, No. 3, April 2004.paper video

[3] Aseem Agarwala, Mira Dontcheva, Maneesh Agrawala, Steven Drucker, Alex Colburn, Brian Curless, David Salesin, Michael Cohen, "Interactive Digital Photomontage," ACM Transactions on Graphics (Proceedings of SIGGRAPH), 2004. webpage

@@ Line 36: / Line 36: @@
 ==Deliverable==
 The final deliverable will be –
-An extensible framework that works with a subset of the problem.
-An extensible interaction system that supports the primary forms of interaction.
+* An extensible framework that works with a subset of the problem.
-A Graph-cut based technique for multi-label image segmentation that works on a stack of image.
+* An extensible interaction system that supports the primary forms of interaction.
+* A Graph-cut based technique for multi-label image segmentation that works on a stack of image.
 ==Glossary of Terms==