RBE306TC Computer Vision Systems

Assignment

Before you start Exercises 1 to 3, please review the following OpenCV functions on the Python coding platform, for example: imread, shape, imshow, imwrite, imnoise, resize, calcHist, equalizeHist, etc.

Other Python built-in functions, or functions from the SciPy package, may also be used. Please refer to online resources.

Hint: read the description of each of the functions above and of any other function you might use. The documentation of some Python functions includes a descriptive Algorithm(s) section.

Exercise 1 (20%)

In this exercise, we use the monochrome image Lenna (i.e., lenna512.bmp) for the following tasks.

Let's regard this reference image Lenna as IM.

• (a). Add Gaussian white noise with zero mean and variance 10 to the image IM and display the noisy image. We name it IM_WN. Please write your own function to generate this image instead of calling a built-in function (such as Matlab's imnoise) directly (4%).

• (b). Add salt & pepper noise with noise density 10% to the image IM and display the noisy image. We name it IM_SP. Please write your own function to generate this image instead of calling a built-in function (such as Matlab's imnoise) directly (4%).

• (c). Display the histograms of all the previous images and compare them with the histogram of the reference image. Comment on and briefly explain your findings (4%).

• (d). Use the command histeq (equalizeHist in OpenCV) to enhance the contrast of the image lenna512_low_dynamic_range.bmp and display the enhanced image (4%).

• (e). Moreover, display the histograms of both the original image and the enhanced image, and explain your findings in the assignment (4%).
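As a starting point for tasks (a) to (e), the following is a minimal NumPy sketch, not a reference solution. It makes several assumptions: the image is 8-bit grayscale, the variance of 10 is expressed in grey-level units on the 0 to 255 scale, salt and pepper are equally likely, and a synthetic gradient image stands in for lenna512.bmp. The function names (add_gaussian_noise, add_salt_pepper_noise, histogram, equalize_histogram) are illustrative and are not part of OpenCV.

```python
import numpy as np


def add_gaussian_noise(image, mean=0.0, var=10.0, seed=None):
    """Add white Gaussian noise to an 8-bit grayscale image.

    Assumption: `var` is given in grey-level units (0..255 scale).
    """
    rng = np.random.default_rng(seed)
    noise = rng.normal(mean, np.sqrt(var), size=image.shape)
    noisy = image.astype(np.float64) + noise
    # Round and clip back to the valid 8-bit range.
    return np.clip(np.rint(noisy), 0, 255).astype(np.uint8)


def add_salt_pepper_noise(image, density=0.10, seed=None):
    """Set a fraction `density` of pixels to 0 (pepper) or 255 (salt)."""
    rng = np.random.default_rng(seed)
    noisy = image.copy()
    r = rng.random(image.shape)
    noisy[r < density / 2] = 0                          # pepper
    noisy[(r >= density / 2) & (r < density)] = 255     # salt
    return noisy


def histogram(image):
    """Return the 256-bin histogram of an 8-bit image."""
    return np.bincount(image.ravel(), minlength=256)


def equalize_histogram(image):
    """Histogram equalization via the cumulative distribution function."""
    cdf = histogram(image).cumsum()
    cdf_min = cdf[cdf > 0][0]          # CDF value at the darkest grey level present
    total = image.size
    if total == cdf_min:               # constant image: nothing to stretch
        return image.copy()
    lut = np.rint((cdf - cdf_min) / (total - cdf_min) * 255)
    lut = np.clip(lut, 0, 255).astype(np.uint8)
    return lut[image]                  # apply the look-up table per pixel


# Synthetic low-contrast stand-in for the assignment images (assumption).
im = np.tile(np.linspace(100, 150, 512).astype(np.uint8), (512, 1))
im_wn = add_gaussian_noise(im, seed=0)
im_sp = add_salt_pepper_noise(im, seed=0)
im_eq = equalize_histogram(im)
```

In the actual exercise, IM would be loaded with imread, and the arrays returned by histogram could be plotted with any plotting tool to compare IM, IM_WN and IM_SP.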

Exercise 2 (25%)

Recall the salt & pepper noisy image IM_SP generated from IM in Exercise 1.

• (a). Apply the median filter with a 3 × 3 window and a 5 × 5 window to the image IM_SP, respectively. Display the obtained images and evaluate their PSNR. For each window size, comment on how effectively the noise is reduced while sharp edges and features in the image are preserved (8%).

• (b). Use a 3 × 3 average filter (mean filter) to filter the image IM_SP. Compute the PSNR and display the filtered image (8%).

• (c). As you experimented with the mean and median filters, what different properties did you notice? Was the average or the median filter better, and why (9%)?
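The filtering and PSNR steps above can be sketched in NumPy as follows. This is an illustration under stated assumptions, not the required implementation: PSNR uses a peak value of 255 for 8-bit images, image borders are handled by edge replication, and a small synthetic image with one vertical edge stands in for IM and IM_SP. The helper names (psnr, median_filter, mean_filter, _windows) are hypothetical.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def psnr(reference, test, peak=255.0):
    """Peak signal-to-noise ratio in dB between two same-sized images."""
    err = reference.astype(np.float64) - test.astype(np.float64)
    mse = np.mean(err ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10(peak ** 2 / mse)


def _windows(image, k):
    """Return all k x k neighbourhoods, shape (H, W, k, k), edge-padded."""
    padded = np.pad(image, k // 2, mode="edge")
    return sliding_window_view(padded, (k, k))


def median_filter(image, k=3):
    """Replace each pixel by the median of its k x k neighbourhood (k odd)."""
    return np.median(_windows(image, k), axis=(-2, -1)).astype(image.dtype)


def mean_filter(image, k=3):
    """Replace each pixel by the average of its k x k neighbourhood."""
    avg = _windows(image, k).mean(axis=(-2, -1))
    return np.clip(np.rint(avg), 0, 255).astype(np.uint8)


# Synthetic test image with one sharp vertical edge (assumption, not Lenna).
rng = np.random.default_rng(0)
clean = np.full((64, 64), 128, dtype=np.uint8)
clean[:, 32:] = 200

# Salt & pepper noise with 10% density, split evenly between 0 and 255.
noisy = clean.copy()
r = rng.random(clean.shape)
noisy[r < 0.05] = 0
noisy[r > 0.95] = 255

psnr_noisy = psnr(clean, noisy)
psnr_median3 = psnr(clean, median_filter(noisy, 3))
psnr_median5 = psnr(clean, median_filter(noisy, 5))
psnr_mean3 = psnr(clean, mean_filter(noisy, 3))
print(psnr_noisy, psnr_median3, psnr_median5, psnr_mean3)
```

The median filter discards impulse outliers outright, whereas the mean filter spreads each impulse over its neighbourhood and blurs edges, which is the property task (c) asks you to observe and explain.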

Exercise 3 (55%)

In this exercise, you will build VGG-16 and VGG-19 networks (see the following architecture) and train a classifier on the CIFAR-10 dataset, based on the Python + PyTorch code implemented in Lab 4 for LeNet.

The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test images. The dataset is divided into five training batches and one test batch, each with 10000 images. The test batch contains exactly 1000 randomly-selected images from each class. The training batches contain the remaining images in random order, but some training batches may contain more images from one class than another. Between them, the training batches contain exactly 5000 images from each class.
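Before writing the PyTorch modules, it may help to check the layer bookkeeping for the two networks on 32x32 inputs. The sketch below is plain Python and does not build a model. It assumes the standard VGG layout: 3x3 convolutions with padding 1 (which keep the spatial size), 2x2 max pooling with stride 2, and three fully connected layers. The names VGG_CONFIGS, trace_feature_shape and count_weight_layers are hypothetical helpers.

```python
# Integers are the output channels of a 3x3 convolution (padding 1);
# "M" marks a 2x2 max-pooling layer with stride 2.
VGG_CONFIGS = {
    "VGG16": [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
              512, 512, 512, "M", 512, 512, 512, "M"],
    "VGG19": [64, 64, "M", 128, 128, "M", 256, 256, 256, 256, "M",
              512, 512, 512, 512, "M", 512, 512, 512, 512, "M"],
}


def trace_feature_shape(config, in_channels=3, size=32):
    """Return (channels, height, width) after the convolutional part."""
    channels = in_channels
    for layer in config:
        if layer == "M":
            size //= 2          # pooling halves height and width
        else:
            channels = layer    # convolution changes only the channel count
    return channels, size, size


def count_weight_layers(config, num_fc=3):
    """Count layers with learnable weights: convolutions plus FC layers."""
    num_conv = sum(1 for layer in config if layer != "M")
    return num_conv + num_fc


for name, cfg in VGG_CONFIGS.items():
    print(name, trace_feature_shape(cfg), count_weight_layers(cfg))
```

With CIFAR-10 inputs the five pooling stages reduce 32x32 to 1x1, so the first fully connected layer receives 512 features rather than the 512 × 7 × 7 used for 224x224 inputs. In PyTorch, each integer would typically map to nn.Conv2d(..., kernel_size=3, padding=1) followed by nn.ReLU, and each "M" to nn.MaxPool2d(kernel_size=2, stride=2).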
