シラバス参照

シラバス参照

講義概要/Course Information

2025/07/04 現在

科目基礎情報/General Information

授業科目名 /Course title (Japanese)	画像認識システム特論
英文授業科目名 /Course title (English)	Advanced Image Recognition Systems
科目番号 /Code
開講年度 /Academic year	2024年度	開講年次 /Year offered	全学年
開講学期 /Semester(s) offered	後学期	開講コース・課程 /Faculty offering the course	博士前期課程、博士後期課程
授業の方法 /Teaching method	講義	単位数 /Credits	2
科目区分 /Category	大学院専門教育科目 - 専門科目Ⅱ
開講類・専攻 /Cluster/Department	情報学専攻、情報・ネットワーク工学専攻
担当教員名 /Lecturer(s)	柳井　啓司
居室 /Office	西9-701
公開E-mail /e-mail	yanai@inf.uec.ac.jp
授業関連Webページ /Course website	https://mm.cs.uec.ac.jp/sys/
更新日 /Last update	2024/03/20 02:32:50	更新状況 /Update status	公開中 /now open to public

講義情報/Course Description

主題および達成目標（2,000文字以内） /Themes and goals（up to 2,000 letters）	画像情報システム論では，前半は講義で深層学習による画像認識の基礎理論について学び，後半では最新論文の輪読を通して最新研究の動向について学習する．さらに深層学習フレームワークを用いた画像認識プログラミングのレポート課題を通して，実際の深層学習を用いた画像認識プログラミングについて学ぶ． In the first half of the lecture, students learn the basic theory of image recognition by deep learning, and in the second half, they learn the trend of the latest research through reading the latest papers. In the second half, students learn the latest research trends through reading the latest papers. In addition, students learn the actual image recognition programming using deep learning through a report assignment on image recognition programming using deep learning frameworks.
前もって履修しておくべき科目（1,000文字以内） /Prerequisites（up to 1,000 letters）	線形代数確率統計物体認識論 Pythonによるプログラミング Linear Algebra Probability Statistics Object Recognition Theory Programming in Python
前もって履修しておくことが望ましい科目（1,000文字以内） /Recommended prerequisites and preparation（up to 1,000 letters）	人工知能パターン認識自然言語処理データマイニング Artificial Intelligence Pattern Recognition Natural Language Processing Data Mining
教科書等（1,000文字以内） /Course textbooks and materials（up to 1,000 letters）	IT Text 深層学習, オーム社 IT Text Deep Learning, Ohm Inc.
授業内容とその進め方（2,000文字以内） /Course outline and weekly schedule（up to 2,000 letters）	Cc:日本語で説明し、日本語の教材・資料を使う。前半は講義形式，後半は輪講形式で行う．受講者は必ず１回は，画像認識システムに関する英語の論文を読み，その内容を授業中にプレゼンテーションする．さらに，３回の深層学習に関するプログラミング課題に取り組み，レポートを提出することが必要である．プログラミング課題はPythonによるプログラミングを理解していることを前提する．前半の講義の予定は以下の通り．深層学習以外の従来手法による画像認識（主に学部「物体認識論」の内容）は，初回にざっと説明し，それ以降は本講義では扱わない．後半は，最新の深層学習による画像認識の英語論文の輪読を行う． Type（Cc）：Japanese-based course with Japanese materials The first half of the course will be given in lecture format, and the second half will be given in the form of a rotating lecture. Students are required to read an English paper on image recognition systems at least once and present the paper to the class. In addition, students are required to work on three programming assignments on deep learning and submit reports. The programming assignments assume that the students understand Python programming. The schedule for the first half of the course is as follows. Image recognition using conventional methods other than deep learning (mainly the contents of "Object Recognition" in the undergraduate course) will be briefly explained in the first lecture, and will not be covered in this course after that. In the second half, we will read English papers on the latest deep learning image recognition techniques. 1. Introduction on Deep Neural Network for image recognition 2. Elements of Deep Convolutional Neural Network: convolutional layer, pooling layer, fully connected layer, RNN, LSTM 3. Learning method: back propagation(BP), stochastic gradient descent, end-to-end training, differentiable components, multi-task learning 4. Computation of DCNN: computational complexity，computation with GPU, im2col operation and GEMM, implemetation of DCNN on mobile devices, weight compression 5. Common techniques: Dropout, Batch/weight normalization, He-init, data augmentation Representative DCNN acchitecture．AlexNet, NIN, VGG, GoogleNet, ResNet, ResNext, Densely Connected CNN usage of pre-trained model, fine-tuning. 6. Image generation network. Unpooling, fractional convolution. Generative Adversarial Network (GAN). Semi-supervised training with GAN, Adversarial training, Variational AutoEncoder (VAE), Diffusion Model 7. Image transformation network. Conv-Deconv network. Conv-Deconv with Adversarial training, AutoEncoder 8. Object detection and segmentation. Faster RCNN, SSD, Fully Convolutional Network (FCN). Conv-Deconv for semantic segmentation. 9. BP-based methods: Visualization of activations and trained weights, Google Dream, style transfer, object saliency maps, feature map operation, reverse network for image reconstruction 10. Vision and Language: image captioning, Visual Question Answering, image-word embedding 11．Student presentation (1): architecture and theory for image classification 12．Student presentation (2): object detection 13．Student presentation (3): semantic segmenatation 14．Student presentation (4): image generation & transformation network 15．Student presentation (5): vision and language
実務経験を活かした授業内容 (実務経験内容も含む) /Course content utilizing practical experience
授業時間外の学習（予習・復習等）（1,000文字以内） /Preparation and review outside class（up to 1,000 letters）	3つのプログラミング課題，輪読準備は授業時間外に行うことを前提とする． The three programming assignments and the preparation for paper reading are assumed to be done out of the class time.
成績評価方法および評価基準（最低達成基準を含む）（1,000文字以内） /Evaluation and grading （up to 1,000 letters）	論文のプレゼンテーション，課題のレポート，出席で評価する．配点は，それぞれ，30点，50点，30点とする．出席回数が全授業回数の2/3に満たない場合は0点とする（単位は出さない)． Evaluation will be based on the presentation of the paper, the report of the assignment, and attendance. Points are allocated as follows: 30, 50, and 30 points, respectively. Attendance less than 2/3 of the total number of class sessions will receive zero points (no credit will be given).
オフィスアワー：授業相談（1,000文字以内） /Office hours（up to 1,000 letters）	月曜日6限．不在の場合もあるので，できるだけ事前にメール連絡すること．質問等は電子メールでも受け付ける． Monday, 6th period. Please contact me by e-mail as far in advance as possible, as I may not be available. Questions and comments will be accepted by e-mail.
学生へのメッセージ（1,000文字以内） /Message for students（up to 1,000 letters）	深層学習技術の学習において重要なことは，「基本原理の理解」「実装と実験」「最新技術をフォロー」である．特に自習による「深層学習プログラミング」を重視し，最新の論文のネットワークをすぐにコード化できるようになることを目標に課題を出題する．なお，Pythonをマスターしている前提で課題を出題する．レポート課題および英語論文のプレゼンは作業に大変手間が掛かるので，画像認識システムを実際にプログラミングすることに対して意欲的に取り組むことができる学生の履修を希望する． In learning deep learning techniques, it is important to "understand the basic principles," "implement and experiment," and "follow the latest technologies. We will place particular emphasis on "deep learning programming" through self-study, and assignments will be given with the goal of enabling students to quickly code networks from the latest papers. The assignments will be given on the assumption that the student has mastered Python. The report assignment and the presentation of the English paper are very time-consuming, so students who are willing and able to work on the actual programming of the image recognition system are encouraged to take the course.
その他 /Others
キーワード /Keywords	Deep learning, general object recognition, image mining, data mining, machine learning