Project 6 report

Each image has associated coordinates with it, camera position. I took image (8, 8) from 17x17 grid as the center and shifted all other images to that center image to get a focusing effect. shift_yx = depth_factor * (image_yx - center_image_yx). Here are the results that are gotten after varying depth parameter:

Depth -0.5

Depth 0

Depth 0.5

Aperture adjustment

By selecting a radius within which we choose to average images we can simulate aperture. If sqrt(shift_yx^2) <= a then we add average the image.

a = 1

a = 20

a = 40

Summary

This project showed me how you can get great results by collecting a lot of data and operating on it..

Project 2: Pyramid-Based Texture Analysis.

Project tasks:

Introduction

This is a simple approach to construct textures, since all textures have some structure to them we try to catch that structure with steerable pyramid - a pyramid similar to laplaccian pyramid, but using several bandpass filters.

Creating steerable pyramids

Filter 1

Filter 2

Filter 3

Filter 4

Original

Convolved with filter 1

Convolved with filter 2

Convolved with filter 3

Convolved with filter 4

Original(1000x1000)

Highpass residual(1000x1000)

Level 1 orientation 1(1000x1000)

Level 1 orientation 2(1000x1000)

Level 1 orientation 3(1000x1000)

Level 1 orientation 4(1000x1000)

Level 2 orientation 1(500x500)

Level 2 orientation 2(500x500)

Level 2 orientation 3(500x500)

Level 2 orientation 4(500x500)

Level 3 orientation 1(250x250)

Level 3 orientation 2(250x250)

Level 3 orientation 3(250x250)

Level 3 orientation 4(250x250)

Level 4 orientation 1(125x125)

Level 4 orientation 2(125x125)

Level 4 orientation 3(125x125)

Level 4 orientation 4(125x125)

Level 5 orientation 1(62x62)

Level 5 orientation 2(62x62)

Level 5 orientation 3(62x62)

Level 5 orientation 4(62x62)

Level 6 orientation 1(31x31)

Level 6 orientation 2(31x31)

Level 6 orientation 3(31x31)

Level 6 orientation 4(31x31)

Low pass residual

The pyramid looks correct, however I encountered issues when collapsing it to reconstruct the image. I realised the problem was in the borders, I used the regular "same" borders of scipy after upscaling(upscaling had to be zero padded, no interpolation). So I wrote my own padding to reconstruct image with minimal errors.

Original

Reconstructed

Difference of reconstructed and original

Histogram matching

To match histograms I decided not to use CDF matching from first paper, and used the simpler sort matching from the second. I sort the pixels by value and set the k'th lowest pixels of image 1 to k'th lowest pixel of image 2, im1[idx_of_sorted_im1_pixels] = im2[idx_of_sorted_im2_pixels]. It behaves as expected and gives good results:

Reference image

Reference histogram

Noise

Noise histogram before matching

Noise after matching

Noise histogram after matching (exactly the same because of how I match the histograms)

Generating textures

Now we can generate noise and histogram match every level of steerable pyramid, upon reconstruction we should get something similar to reference image. We iterate over histogram matching of pyramid to get better results, i.e we generate noise and WHILE NOT HAPPY: create pyramid, match pyramid to reference. Generated textures with 7 iterations, 4 filters, maximum acceptable depth of pyramid. To get colors working I had to convert to PCA color space, that decorrelated the colors.

Reference image

Synthesized

Reference image

Synthesized

Reference image

Synthesized

Reference image

Synthesized

Reference image

Synthesized

Reference image

Synthesized

Reference image

Synthesized

Reference image

Synthesized

Synthesized with 40 filters

Reference image

Synthesized

Synthesized with 360 filters

Reference image

Synthesized

Reference image

Synthesized

Reference image

Synthesized

Reference image

Synthesized

Reference image

Synthesized

Reference image

Synthesized

From all the results we can see that it works relatively well, especially if you squeeze your eyes or look from far away, this means that even though we create steerable pyraimd we cant catch all the higher frequency features, low frequency features are captured, but higher frequency structure is not captured. Even when we did 360 filters it only caught low frequency features.

BELLS AND WHISTLES:Tuning hyperparameters

We could change number of filters(orientations) in our steerable pyramid, depth of the pyramid, and number of orientations.

Tuning number of iterations, 0 iterations means we dont deconstruct out noise into pyramid

Reference image

0 iter

1 iter

2 iters

3 iters

4 iters

5 iters

6 iters

7 iters

8 iters

9 iters

10 iters

11 iters

12 iters

13 iters

14 iters

15 iters

16 iters

17 iters

18 iters

19 iters

As we can see we need several iterations to capture the underlying structure, however number of iterations has diminishing returns. I chose 7 for my images since that seems to be safe value

Reference image

Height 1

Height 2

Height 3

Height 4

Height 5

Here everything is simple: deeper the pyramid, better the results

Reference image

1 Filter

2 Filters

3 Filters

4 Filters

5 Filters

6 Filters

7 Filters

8 Filters

9 Filters

360 Filters

Here its harder to interpret the results, first 4 filters used are vertical, 2 diagonal ones, horizontal, so results make sense up to 4. However next filters are added on random, and seem to degrade image quality by picking up on some structure not seen by eye. Maybe its because I uniformly add random steered filters, but each added filter should be balanced with symmetrical one. But as we can see the difference between 360 and 4 filters is not high, indicating that 4 filters catch most of the structure in this image(since image is mostly diagonal)

CS180 Final Projects

Artem Shumay

Project 1: Lightfield camera.

Project tasks:

Depth refocusing

Aperture adjustment

Summary

Project 2: Pyramid-Based Texture Analysis.

Project tasks:

Introduction

Creating steerable pyramids

Histogram matching

Generating textures

BELLS AND WHISTLES:Tuning hyperparameters