Page 181 - AI Computer 10

P. 181

Actions (t)

Environment

AGENT

Reward (t) Reward (t + )

Observation (t) Observation (t + 1 )

3. Rewards often contain only partial information. A reward like a win in chess conveys that some inputs must
have been good, but it doesn’t clearly signal which inputs were good and which were not.
4. The system is learning an action policy for taking actions to maximise its receipt of cumulative rewards.
Thus, Reinforcement Learning adopts an iterative approach where the AI model performs some actions, receives
feedback on its actions, evaluates the feedback, and learns from each subsequent iteration.

Difference between Supervised, Unsupervised, and Reinforcement Learning

The main difference between the three machine learning models are:
Feature Supervised Learning Unsupervised Learning Reinforcement Learning

Data Type Labelled Data Unlabelled data Interactive Data
Objective Predict output on the basis Identify patterns or Maximise rewards
of input features groupings

Learning Process Uses a training set with Learns from the data Learn through trial and
input-output pairs itself error

Applications Image classification, Email Customer segmentation, Game Playing, Robotics
filtering Anomaly detection Navigation

TYPES OF SUPERVISED LEARNING MODELS

Supervised Learning models can be broadly categorised into several types based on their purpose and the nature
of the output. The main types of supervised learning models are:
u Classification model u Regression model

Supervised Learning Models

Classification model Regression model

47
47

176 177 178 179 180 181 182 183 184 185 186