Computer Vision COMP90086

Introduction
Finding correspondences between keypoints is a critical step in many computer vision applications. It can be used to align images when constructing a panorama from many separate photographs, and to match keypoints detected in multiple views of a scene.
This assignment uses a dataset generated from many views of the Trevi fountain in Rome. Finding correspondences between detected keypoints is a critical step in the pipeline for reconstructing a 3D representation of the fountain from individual photographs.
The dataset in this assignment is a set of pairs of image patches centred at detected keypoints. Each patch is 64x64 pixels, and each training sample is made of two patches placed side by side to make a 128x64 image. For half the training set (10,000 examples in the '1good' subdirectory) the two patches are from two separate views of the same keypoint. For the other half (10,000 examples in the '0bad' subdirectory) the two patches are from two different keypoints. Figure 1 shows an example of each of these. The validation directory is similarly structured but contains four times as many non-matching pairs (2000 examples in '0bad') as matching pairs (500 examples in '1good').
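The side-by-side layout described above can be illustrated with a short sketch. It assumes a sample has already been loaded as a NumPy array with rows first, i.e. shape (64, 128) or (64, 128, channels); the function name `split_pair` is hypothetical and is not part of the provided notebook.

```python
import numpy as np


def split_pair(pair):
    """Split a side-by-side sample into its left and right square patches.

    Assumes `pair` has shape (H, 2*H) or (H, 2*H, C), e.g. (64, 128).
    """
    height, width = pair.shape[:2]
    if width != 2 * height:
        raise ValueError(f"expected width == 2 * height, got {width} x {height}")
    half = width // 2
    # Columns 0..half-1 are the first patch, columns half..width-1 the second.
    return pair[:, :half], pair[:, half:]


# Synthetic stand-in for one 128x64 sample (not real data from the dataset).
sample = np.arange(64 * 128).reshape(64, 128)
left, right = split_pair(sample)
print(left.shape, right.shape)
```

Splitting is not required for the tasks below, which feed the whole 128x64 image to the network, but it can be useful for inspecting individual patches.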
Figure 1: Corresponding (left) and non-corresponding (right) pairs of image patches
Your task is to create and train some neural networks that can tackle the problem of determining whether the two patches correspond or not.
1. Baseline Neural Network [2pt]
Run the baseline neural network implementation in the provided Python notebook. In your report,
include the loss and accuracy curves for the training and validation sets and
discuss what these imply about the baseline model.
The validation set contains more bad examples than good. Why might this be a sensible way of
testing for the task of finding feature correspondences? Should the training environment also reflect
this imbalance?
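When reading the accuracy curves, it may help to know what accuracy a trivial model would reach on each split. The sketch below uses only the class counts stated in the introduction; the function name `constant_predictor_accuracy` is hypothetical.

```python
def constant_predictor_accuracy(n_bad, n_good):
    """Accuracy of a model that always predicts the more common class."""
    return max(n_bad, n_good) / (n_bad + n_good)


# Class counts taken from the dataset description.
train_floor = constant_predictor_accuracy(n_bad=10_000, n_good=10_000)
val_floor = constant_predictor_accuracy(n_bad=2_000, n_good=500)

print(f"training floor:   {train_floor:.2f}")   # 0.50
print(f"validation floor: {val_floor:.2f}")     # 0.80
```

A validation accuracy close to 0.80 is therefore not evidence on its own that the model has learned anything about correspondence.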
2. Regularizing your Neural Network [2pt]
To regularize the network, you should try adding a regularization layer (see the Keras documentation for these layers). Try adding a Dropout() layer after Flatten() and try different rate values to see what effect this parameter has. Include the loss and accuracy plots in your report for three different
choices of the rate parameter. Describe the changes you see in these loss and accuracy plots in your report and suggest which of the three rate values you have reported is the best choice.
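A minimal sketch of where the Dropout() layer sits is given below. It is not the baseline from the provided notebook: the input shape (64, 128, 1), the Dense layer widths, the sigmoid output, the optimizer and the loss are all assumptions made for illustration, and any normalization layer the notebook uses is left out. The three rate values are arbitrary examples, not recommendations.

```python
from tensorflow import keras
from tensorflow.keras import layers


def build_dropout_model(rate, input_shape=(64, 128, 1)):
    """Dense classifier with a Dropout layer placed directly after Flatten.

    All sizes here are assumed for illustration, not taken from the notebook.
    """
    model = keras.Sequential([
        keras.Input(shape=input_shape),
        layers.Flatten(),
        layers.Dropout(rate),                   # fraction of inputs zeroed during training
        layers.Dense(64, activation="relu"),    # hypothetical hidden width
        layers.Dense(1, activation="sigmoid"),  # match / no-match probability
    ])
    model.compile(optimizer="adam",
                  loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model


# One model per rate value to compare (example values only).
models = {rate: build_dropout_model(rate) for rate in (0.2, 0.5, 0.8)}
```

Dropout is only active during training, so the training curves are computed with units dropped while the validation curves are not; keep this in mind when comparing the two.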
3. Convolutional Neural Network [3pt]
Design a Convolutional Neural Network to solve this challenge. If you use Conv2D() layers immediately after the LayerNormalization layer, these convolutions will apply identically to both image patches in each input sample. Try using one or two Conv2D() layers with relu activations. You should explore the value of having different numbers of filters, kernel sizes, and strides before the Flatten() layer.
Briefly describe the set of settings you tried in your report in a table (this should be around 10 settings).
For each setting, report the final training loss and accuracy as well as the validation loss and accuracy.
Include a discussion of the results of these experiments in your report. Identify your best performing
design and discuss why you think this may have been best.
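When choosing the settings to try, it can be useful to work out how each combination of filters, kernel size, and stride changes the size of the tensor that reaches Flatten(). The sketch below assumes square kernels, equal strides in both directions, and Keras's default 'valid' (no padding) behaviour for Conv2D; the helper names are hypothetical and the two settings shown are arbitrary examples.

```python
def conv_output_size(size, kernel, stride=1):
    """Output length along one axis for an unpadded ('valid') convolution."""
    if kernel > size:
        raise ValueError("kernel is larger than the input")
    return (size - kernel) // stride + 1


def flattened_units(height, width, conv_layers):
    """Return (height, width, channels, units) after a stack of conv layers.

    `conv_layers` is a list of (filters, kernel, stride) tuples.
    """
    if not conv_layers:
        raise ValueError("need at least one conv layer")
    channels = 0
    for filters, kernel, stride in conv_layers:
        height = conv_output_size(height, kernel, stride)
        width = conv_output_size(width, kernel, stride)
        channels = filters
    return height, width, channels, height * width * channels


# Input samples are 64 rows by 128 columns.
one_layer = flattened_units(64, 128, [(16, 3, 1)])
two_layers = flattened_units(64, 128, [(16, 5, 2), (32, 3, 1)])
print(one_layer)   # (62, 126, 16, 124992)
print(two_layers)  # (28, 60, 32, 53760)
```

The final number is the input width of the first Dense layer after Flatten(), which largely determines the parameter count of the model and is worth recording alongside each row of your settings table.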
