Course Information
SemesterCourse Unit CodeCourse Unit TitleT+P+LCreditNumber of ECTS CreditsLast Updated Date
1AIN424Introduction to Deep Reinforcement Learning3+0+03612.08.2025

 
Course Details
Language of Instruction English
Level of Course Unit Bachelor's Degree
Department / Program ARTIFICIAL INTELLIGENCE ENGINEERING
Type of Program Formal Education
Type of Course Unit Elective
Course Delivery Method Face To Face
Objectives of the Course To teach the basic concepts of deep reinforcement learning
Course Content Introduction to reinforcement learning (RL), Markov decision processes, Planning by Dynamic Programming, Monte Carlo methods, Temporal difference learning, RL with function approximation, Policy Gradient Methods.
Course Methods and Techniques Lecture
Prerequisites and co-requisities ( BBM102 ) and ( BBM104 ) and ( AIN426 )
Course Coordinator None
Name of Lecturers Instructor Bölüm Sorumluları
Assistants None
Work Placement(s) No

Recommended or Required Reading
Resources Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition
Course Notes Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition


Planned Learning Activities and Teaching Methods
Activities are given in detail in the section of "Assessment Methods and Criteria" and "Workload Calculation"

Assessment Methods and Criteria
In-Term Studies Quantity Percentage
Midterm Exam 1 % 30
Assignment 3 % 30
Final examination 1 % 40
Total
5
% 100

 
ECTS Allocated Based on Student Workload
Activities Quantity Duration Total Work Load
Course Duration 14 3 42
Hours for off-the-c.r.stud 14 3 42
Assignments 3 20 60
Preparation for Midterm Exam 1 20 20
General Exam Preparation 1 20 20
Total Work Load   Number of ECTS Credits 6,13333333333333 184

 
Course Learning Outcomes: Upon the successful completion of this course, students will be able to:
NoLearning Outcomes
1 Define the key features of reinforcement learning
2 Given a problem, decide if it should be formulated as a RL problem; if yes be able to define it formally and state what algorithm is best suited for addressing it.
3  
4  
5  
6  
7  
8  

 
Weekly Detailed Course Contents
WeekTopicsStudy MaterialsMaterials
1 Introduction to reinforcement learning
2 Introduction to reinforcement learning
3 Markov decision processes
4 Markov decision processes
5 Planning by Dynamic Programming
6 Planning by Dynamic Programming
7 Planning by Dynamic Programming
8 Planning by Dynamic Programming
9 Temporal difference learning
10 Temporal difference learning
11 RL with function approximation
12 RL with function approximation
13 Policy Gradient Methods
14 Policy Gradient Methods
15 Final exam preparation
16 Final exam

 
Contribution of Learning Outcomes to Programme Outcomes
P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12
All 5 5 5 3 3 1 1 2 1 1 1 1
C1
C2
C3
C4
C5
C6
C7
C8

  Contribution: 1: Very Slight 2:Slight 3:Moderate 4:Significant 5:Very Significant

  
  https://bilsis.hacettepe.edu.tr/oibs/bologna/progCourseDetails.aspx?curCourse=2733691&lang=en