【CS4495】Computer Vision

Course Motivation:

Computer vision is a field that involves the development of computer programs to automatically analyze and understand the content of images acquired from a sensor. The image data can take many forms, such as video sequences, views from multiple cameras, depth measurements from the Xbox Kinect, or multi-dimensional data from a medical scanner. The objective is to produce some form of numerical or symbolic representation of the contents of the scene. At times, the field has been concerned with duplicating the human visual system's procedure for visual perception. Often models used involve the development of mathematical tools, borrowed from the fields of geometry, probability and statistics, physics, machine learning, and others. As a scientific discipline, computer vision is concerned with the theory behind artificial systems that extract information from images. As a technological discipline, computer vision seeks to apply its theories and models to the construction of computer vision systems. This course will cover essential topics in the field of computer vision, providing students with background on some theoretical components of the field, as well as hands-on experience through practical and fun assignments. The course would complement existing courses in computer graphics, embedded systems, artificial intelligence and signal processing. It is important to note that computer vision is currently in a phase of high growth worldwide. Hardware companies (e.g. Intel, Samsung, Qualcomm), software companies (e.g. Google, Facebook), as well as car and entertainment companies (e.g. Disney, Microsoft: Xbox) are all currently heavily investing in these domains, and aggressively recruiting in the area. Many companies with products in the telecommunication area such as Apple, Blackberry, Google, Bell, Apple, and Nokia all have significant activities in developing vision and image manipulation apps for use in cellphones. Furthermore, a large number of startups in this area have recently enjoyed tremendous success due to the maturity and availability of the algorithms in the field as well as the ubiquitous nature of cameras today. The material covered in this course is aimed at senior undergraduates, both among those seeking employment the above application domains, and students considering the field as an area for graduate research.

Learning Outcomes:

During this course, the student will acquire a broad understanding of a variety of problems addressed by researchers in the field of computer vision. These include (but are not limited to): image formation, filtering and image enhancement, image matching, image features, feature alignment and stitching, grouping and matching, stereo, motion and optical flow estimation, segmentation, scene understanding, face detection and recognition, classifiers, deep learning and medical image analysis. Students will be given an overview of designing and programming in Python and OpenCV in the context of solving practical problems in the field of computer vision. By the end of the course, the students should be able to apply, in a design context, their acquired programming skills to address a wide variety of problems in computer vision.

相关推荐
阿豪只会阿巴1 小时前
【没事学点啥】TurboBlog轻量级个人博客项目——项目介绍
javascript·python·django·html
qq_413502022 小时前
如何创建CDB公共用户_C##前缀强制规则与CONTAINER=ALL
jvm·数据库·python
yexuhgu3 小时前
CSS如何利用-checked实现纯CSS手风琴折叠_通过状态选择器控制区域高度
jvm·数据库·python
sali-tec3 小时前
C# 基于OpenCv的视觉工作流-章66-直线夹角
图像处理·人工智能·opencv·算法·计算机视觉
AC赳赳老秦3 小时前
接口测试自动化:用 OpenClaw 对接 Postman,实现批量回归测试、测试报告自动生成与推送
java·人工智能·python·算法·elasticsearch·deepseek·openclaw
PILIPALAPENG3 小时前
第4周 Day 1:智能体记忆系统——给 Agent 一个"大脑"
前端·人工智能·python
DavidTaozhe3 小时前
一文搞懂外汇接口怎么实时更新美元汇率
大数据·python
用户78937733908533 小时前
Docker 部署踩坑记录:从“构建失败”到“服务跑通”,以及为什么数据被清空了
python·docker
再玩一会儿看代码3 小时前
如何理解神经网络中的权重参数?从一张图看懂模型参数量计算
人工智能·经验分享·python·深度学习·神经网络·机器学习
2301_779622413 小时前
mysql如何通过主从备份实现读写分离_配置mysql架构模式
jvm·数据库·python