RBE306TC Computer Vision Systems
Assignment
Before you dive into this Exercise 1 to Exercise 3, please check the following OpenCV functions in
Python Coding Platform for example: imread, shape, imshow, imwrite, imnoise, resize, calcHist, equalizeHist, etc.
Some other Python built-in functions, or functions in Scipy package may also be used. Please refer to online resources.
Hint : read the descriptions about each of the previous functions and any other function you might use. You may find descriptive sections of Algorithms(s) in some of the Python functions.
Exercise 1 (20%)
In this task, we use the monochrome image Lenna (i.e., lenna512.bmp) with the following tasks.
Let's regard this reference image Lenna as IM .
• (a). Add Gaussian white noise with 0 mean and variance 10 to the image IM and display the noisy image. We name it as IM_WN . Please write one function to generate this image instead of calling Matlab function directly (4%).
• (b). Add salt & pepper noise with noise density 10% to the image IM and display the noisy image.
We name it as IM_SP. Please write one function to generate this image instead of calling
Matlab function directly (4%).
• (c). Display the histograms of all the previous images and compare them with the histogram of the reference image, comments and briefly explain your finding (4%).
• (d). Use the command histeq to enhance the image constrast
( lenna512_low_dynamic_range.bmp ) and display the enhanced image (4%).
• (e). Moreover, display the histograms of both original image and enhanced image, and explain your finding in the assignment (4%).
Exercise 2 (25%)
Recall salt & pepper images generated in Task 1 IM_SP based on the IM .
• (a). Apply the median filter with a 3 × 3 window and a 5 × 5 window on the image IM_SPrespectively. Display and evaluate the PSNR of the obtained images. For each window size, comment on how effectively the noise is reduced while sharp edges and features in the image are preserved (8%).
• (b). Use the average filter (mean filter) 3 × 3 to filter the image IM_SP . Compute the PSNR and display the filtered image (8%).
• (c). As you experimented with the mean and median algorithms what different property did you notice? Was the average or median filter better and why (9%)?
Exercise 3 (55%)
In this exercise, you will be asked to build a VGG-16 and VGG-19 (see the following architecture) to train a classifier on cifar10 dataset. based on the python + PyTorch codes implemented in Lab 4 for LeNet.
The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images
per class. There are 50000 training images and 10000 test images. The dataset is divided into five training batches and one test batch, each with 10000 images. The test batch contains exactly 1000 randomly-selected images from each class. The training batches contain the remaining images in random order, but some training batches may contain more images from one class than another.
Between them, the training batches contain exactly 5000 images from each class.
RBE306TC Computer Vision Systems
_0206girl2024-01-01 3:07
相关推荐
qzhqbb14 分钟前
基于统计方法的语言模型冷眼看人间恩怨39 分钟前
【话题讨论】AI大模型重塑软件开发:定义、应用、优势与挑战2401_8830410840 分钟前
新锐品牌电商代运营公司都有哪些?AI极客菌2 小时前
Controlnet作者新作IC-light V2:基于FLUX训练,支持处理风格化图像,细节远高于SD1.5。阿_旭2 小时前
一文读懂| 自注意力与交叉注意力机制在计算机视觉中作用与基本原理王哈哈^_^2 小时前
【数据集】【YOLO】【目标检测】交通事故识别数据集 8939 张,YOLO道路事故目标检测实战训练教程!Power20246663 小时前
NLP论文速读|LongReward:基于AI反馈来提升长上下文大语言模型数据猎手小k3 小时前
AIDOVECL数据集:包含超过15000张AI生成的车辆图像数据集,目的解决旨在解决眼水平分类和定位问题。好奇龙猫3 小时前
【学习AI-相关路程-mnist手写数字分类-win-硬件:windows-自我学习AI-实验步骤-全连接神经网络(BPnetwork)-操作流程(3) 】沉下心来学鲁班3 小时前
复现LLM:带你从零认识语言模型