【CS4495】Computer Vision

Course Motivation:

Computer vision is the field concerned with developing computer programs that automatically analyze and understand the content of images acquired from a sensor. The image data can take many forms, such as video sequences, views from multiple cameras, depth measurements from the Xbox Kinect, or multi-dimensional data from a medical scanner. The objective is to produce a numerical or symbolic representation of the contents of the scene. At times, the field has been concerned with duplicating the way the human visual system performs visual perception. The models used often rely on mathematical tools borrowed from geometry, probability and statistics, physics, machine learning, and other fields. As a scientific discipline, computer vision is concerned with the theory behind artificial systems that extract information from images. As a technological discipline, computer vision seeks to apply its theories and models to the construction of computer vision systems.

This course covers essential topics in computer vision, giving students background on some theoretical components of the field as well as hands-on experience through practical and fun assignments. It complements existing courses in computer graphics, embedded systems, artificial intelligence, and signal processing.

Computer vision is currently in a phase of high growth worldwide. Hardware companies (e.g. Intel, Samsung, Qualcomm), software companies (e.g. Google, Facebook), and car and entertainment companies (e.g. Disney, Microsoft with the Xbox) are all investing heavily in these domains and recruiting aggressively in the area. Many companies with telecommunication products, such as Apple, Blackberry, Google, Bell, and Nokia, have significant activities in developing vision and image manipulation apps for cellphones.
Furthermore, a large number of startups in this area have recently enjoyed tremendous success, owing to the maturity and availability of the algorithms in the field and the ubiquity of cameras today. The material covered in this course is aimed at senior undergraduates, both those seeking employment in the above application domains and those considering the field as an area for graduate research.

Learning Outcomes:

During this course, students will acquire a broad understanding of a variety of problems addressed by researchers in the field of computer vision. These include (but are not limited to): image formation, filtering and image enhancement, image matching, image features, feature alignment and stitching, grouping and matching, stereo, motion and optical flow estimation, segmentation, scene understanding, face detection and recognition, classifiers, deep learning, and medical image analysis. Students will be given an overview of designing and programming in Python and OpenCV in the context of solving practical problems in computer vision. By the end of the course, students should be able to apply their acquired programming skills, in a design context, to a wide variety of problems in computer vision.
