RBE306TC Computer Vision Systems

RBE306TC Computer Vision Systems
Assignment
Before you dive into this Exercise 1 to Exercise 3, please check the following OpenCV functions in
Python Coding Platform for example: imread, shape, imshow, imwrite, imnoise, resize, calcHist, equalizeHist, etc.
Some other Python built-in functions, or functions in Scipy package may also be used. Please refer to online resources.
Hint : read the descriptions about each of the previous functions and any other function you might use. You may find descriptive sections of Algorithms(s) in some of the Python functions.
Exercise 1 (20%)
In this task, we use the monochrome image Lenna (i.e., lenna512.bmp) with the following tasks.
Let's regard this reference image Lenna as IM .
• (a). Add Gaussian white noise with 0 mean and variance 10 to the image IM and display the noisy image. We name it as IM_WN . Please write one function to generate this image instead of calling Matlab function directly (4%).
• (b). Add salt & pepper noise with noise density 10% to the image IM and display the noisy image.
We name it as IM_SP. Please write one function to generate this image instead of calling
Matlab function directly (4%).
• (c). Display the histograms of all the previous images and compare them with the histogram of the reference image, comments and briefly explain your finding (4%).
• (d). Use the command histeq to enhance the image constrast
( lenna512_low_dynamic_range.bmp ) and display the enhanced image (4%).
• (e). Moreover, display the histograms of both original image and enhanced image, and explain your finding in the assignment (4%).
Exercise 2 (25%)
Recall salt & pepper images generated in Task 1 IM_SP based on the IM .
• (a). Apply the median filter with a 3 × 3 window and a 5 × 5 window on the image IM_SPrespectively. Display and evaluate the PSNR of the obtained images. For each window size, comment on how effectively the noise is reduced while sharp edges and features in the image are preserved (8%).
• (b). Use the average filter (mean filter) 3 × 3 to filter the image IM_SP . Compute the PSNR and display the filtered image (8%).
• (c). As you experimented with the mean and median algorithms what different property did you notice? Was the average or median filter better and why (9%)?
Exercise 3 (55%)
In this exercise, you will be asked to build a VGG-16 and VGG-19 (see the following architecture) to train a classifier on cifar10 dataset. based on the python + PyTorch codes implemented in Lab 4 for LeNet.
The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images
per class. There are 50000 training images and 10000 test images. The dataset is divided into five training batches and one test batch, each with 10000 images. The test batch contains exactly 1000 randomly-selected images from each class. The training batches contain the remaining images in random order, but some training batches may contain more images from one class than another.
Between them, the training batches contain exactly 5000 images from each class.

相关推荐
CodeLinghu4 分钟前
提示词链模式:一种利用LLM大语言模型处理复杂任务的强大范式
前端·人工智能·语言模型
Wilber的技术分享7 分钟前
【大模型实战笔记 8】深入理解 LangGraph:构建可持久化、多智能体的 LLM 工作流
人工智能·笔记·agent·langgraph·智能体开发
小二·11 分钟前
AI工程化实战《二》:RAG 高级优化全解——从 HyDE 到 Self-RAG,打造高精度企业问答系统
人工智能·microsoft·机器学习
yuhaiqun198913 分钟前
学AI Agent:从React模式到Plan框架,3条路径一次学透
人工智能·经验分享·笔记·react.js·机器学习·ai·aigc
zhonghua88101622 分钟前
spring ai alibab agent之ReactAgent深度解读
java·人工智能·spring
大模型教程.25 分钟前
收藏级教程:ReAct模式详解,让大模型从回答问题到解决问题
前端·人工智能·机器学习·前端框架·大模型·产品经理·react
飞凌嵌入式26 分钟前
AIoT出海背景下,嵌入式主控的国际认证之路与价值思考
大数据·人工智能·嵌入式硬件·区块链·嵌入式
Robot侠33 分钟前
多模态大语言模型(Multimodal LLM)技术实践指南
人工智能·语言模型·自然语言处理·transformer·rag·多模态大模型
roman_日积跬步-终至千里1 小时前
【计算机视觉概述】:从像素到理解的完整图景
人工智能·计算机视觉
Light601 小时前
【MCP原生时代】第7篇|治理与合规:在模型驱动自动化中把控法律、隐私与伦理风险——把“能做什么”变成可审计、可解释、可追责的企业能力
人工智能·隐私·审计·治理·合规·mcp·伦理