Pavlov’s
Bell & Dog
Experiment
A neutral stimulus becomes meaningful through repeated pairing with a stimulus that already produces a response. The learner is passive — reflexes are the raw material.
CS alone → CR (after conditioning)
Stimulus → Stimulus → Response
Skinner’s
Box & Lever
Experiment
Voluntary behaviour is shaped and maintained by its consequences. The learner is active — every action produces feedback that alters the probability of that action recurring.
Behaviour → Punishment → ↓ Frequency
Response → Consequence → Learning
The Fundamental Difference: Passive Reflex vs Active Choice
Classical conditioning and operant conditioning are both forms of associative learning — both change behaviour through experience — but they operate through entirely different mechanisms, involve different types of behaviour, and assign the learner a fundamentally different role. Understanding this core distinction is the single most important concept in all of conditioning psychology.
Classical: The response happens TO the organism — it cannot prevent or control the conditioned response. Operant: The organism PRODUCES the behaviour — it has agency. Ask yourself: “Did the organism choose to do this, or did it just happen?” If just happened → Classical. If chosen → Operant.
Classical Conditioning — The Complete Picture
Ivan Pavlov (1849–1936) was a Russian physiologist studying digestion in dogs when he noticed something unexpected: his dogs began salivating not just at the sight of food, but at the sound of the experimenter’s footsteps approaching. Pavlov called this a “psychic secretion” and devoted the last 30 years of his life to understanding it. The result was one of the most influential discoveries in all of science — the conditioned reflex.
Core Vocabulary — The Building Blocks
Unconditioned Stimulus (US/UCS) — A stimulus that naturally and automatically produces a response, with no learning required. The “already meaningful” event.
Example: Food powder placed in the dog’s mouth automatically causes salivation. No learning needed — it’s a biological reflex.
Unconditioned Response (UR/UCR) — The natural, automatic response to the US.
Example: Salivation to food = UR. Completely natural. Doesn’t need to be learned.
CS (Bell) → ? (No response yet)
Conditioned Stimulus (CS) — The originally neutral stimulus that, after repeated pairing with the US, acquires the power to produce the conditioned response. The “learned signal.”
Example: The bell — initially meaningless — now predicts food and therefore causes salivation.
Conditioned Response (CR) — The learned response to the CS alone. Similar to but typically weaker than the UR.
Example: Salivation to the bell alone = CR. Requires learning. Weaker than salivation to food.
CS (Bell) alone → CR (Salivation) [AFTER]
Parallel Experiment Walkthrough — Pavlov vs Skinner
See exactly how each conditioning process unfolds, step by step, in parallel:
flowchart TD
A["BEFORE CONDITIONING
Bell (Neutral) → No Response
Food (US) → Salivation (UR)"] --> B
B["DURING CONDITIONING
Bell (CS) + Food (US) paired repeatedly
CS must PRECEDE US
Bell predicts food"] --> C
C["AFTER CONDITIONING
Bell (CS) alone → Salivation (CR)
Neutral has become meaningful
S-S association established"] --> D
D{"Further
Processes"} --> E
D --> F
D --> G
D --> H
E["EXTINCTION
CS without US repeatedly
CR gradually disappears
But NOT permanently erased"]
F["SPONTANEOUS RECOVERY
After rest period
CR briefly returns
at reduced strength"]
G["GENERALISATION
CR occurs to stimuli
SIMILAR to CS
Little Albert: rat → rabbit"]
H["DISCRIMINATION
CR only to specific CS
Not to similar stimuli
Through selective pairing"]
E --> I["HIGHER-ORDER
CONDITIONING
CS1 used to condition CS2
without original US"]
style A fill:#fdf2e8,stroke:#b85c18,color:#5c2100,stroke-width:2px
style B fill:#fde8d8,stroke:#d4822e,color:#5c2100,stroke-width:2px
style C fill:#f8ead0,stroke:#b85c18,color:#5c2100,stroke-width:2px
style D fill:#f5f5f5,stroke:#888,color:#333,stroke-width:2px
style E fill:#fde8e8,stroke:#9a1c10,color:#5c0808,stroke-width:1px
style F fill:#fff0e0,stroke:#c08010,color:#5c3000,stroke-width:1px
style G fill:#e8f0ff,stroke:#1a5276,color:#061c38,stroke-width:1px
style H fill:#e8ffe8,stroke:#2e7d32,color:#0a3010,stroke-width:1px
style I fill:#f0e8ff,stroke:#6a3a9a,color:#2a0858,stroke-width:1px
Sub-Processes of Classical Conditioning
Extinction: CS is repeatedly presented WITHOUT the US. The CR weakens gradually. Note: does NOT erase the original learning — only suppresses it.
Spontaneous Recovery: After extinction + a rest period, the CR briefly reappears at reduced strength. Proof that extinction is suppression, not erasure.
Spontaneous Recovery: Rest → CS → weak CR
Reacquisition: Much faster than original learning
Stimulus Generalisation: CR occurs to stimuli SIMILAR to the CS. The more similar, the stronger the CR. Watson’s Little Albert feared not just the rat but all white furry objects.
Stimulus Discrimination: Through selective conditioning, the organism learns to respond ONLY to the specific CS — not to similar stimuli. The opposite of generalisation.
Discrimination: CS → CR; CS-similar → no CR
(achieved by reinforcing CS but not CS-similar)
Higher-Order Conditioning
Once a CS has been established, it can be used to condition a NEW neutral stimulus — without ever presenting the original US again. The established CS acts as if it were a US.
Step 1: Bell (CS1) + Food (US) → Salivation (CR). Bell is now a conditioned stimulus. Step 2: Light (new neutral stimulus) + Bell (CS1) → Salivation (CR). Now the light alone can produce salivation — even though it was never paired with food. CS1 functions like a US for CS2. This is how chains of conditioned associations build up in real life — advertising, brand associations, and emotional conditioning in relationships all rely on higher-order conditioning.
Types of Classical Conditioning Procedures
| Procedure | Timing | Effectiveness | Example |
|---|---|---|---|
| Forward Conditioning | CS comes BEFORE US — most common | Most effective — CS predicts US | Bell rings, then food appears |
| Trace Conditioning | CS presented then gap then US | Good — CS must be held in memory | Bell rings, silence, then food |
| Delay Conditioning | CS starts before and overlaps with US | Very good | Bell starts ringing while food is being presented |
| Simultaneous Conditioning | CS and US presented at the same time | Poor — CS doesn’t predict US | Bell and food at exactly the same moment |
| Backward Conditioning | US comes BEFORE CS | Very poor or none | Food, then bell — CS cannot predict US |
John Garcia (1966) demonstrated that taste aversion can be conditioned in a single trial — even when the illness (US) occurs hours after the taste (CS). This challenges the standard view that conditioning requires repeated pairings and close CS-US proximity in time. A rat that ate a novel food and then became ill (from radiation) avoided that food even when illness followed hours later. The Garcia Effect shows that conditioning is constrained by biological preparedness — some associations are learned far more easily than others because of evolutionary relevance.
Operant Conditioning — The Complete Picture
B.F. Skinner (1904–1990) built directly on Thorndike’s Law of Effect to create the most systematic and empirically precise account of learning in the behaviourist tradition. Where Thorndike described learning in terms of “satisfying” and “annoying” aftereffects, Skinner replaced these mentalistic terms with precise, observable concepts: reinforcement (any consequence that increases a behaviour) and punishment (any consequence that decreases a behaviour).
The 4 Consequences — The Reinforcement Matrix
Every consequence in operant conditioning can be classified along two dimensions: whether a stimulus is added or removed, and whether that stimulus is pleasant (appetitive) or unpleasant (aversive). This produces four distinct types:
REINFORCEMENT (positive or negative) = ALWAYS increases behaviour. PUNISHMENT (positive or negative) = ALWAYS decreases behaviour. The +/− sign tells you whether a stimulus is ADDED or REMOVED — not whether it is good or bad. Negative reinforcement is NOT punishment. This is the single most commonly misunderstood concept in all of psychology and is tested in virtually every CTET and UGC-NET paper.
Reinforcement Schedules — When Does the Reward Arrive?
One of Skinner’s most important discoveries was that the pattern in which reinforcement is delivered — the schedule of reinforcement — profoundly shapes the pattern of responding. Different schedules produce dramatically different rates of response and resistance to extinction.
flowchart TD
B["VOLUNTARY BEHAVIOUR
Organism operates on environment"] --> C
C{"What follows
the behaviour?"} --> PR
C --> NR
C --> PP
C --> NP
C --> EX
PR["POSITIVE REINFORCEMENT
Add pleasant stimulus
Behaviour INCREASES
Example: Praise, food, money"]
NR["NEGATIVE REINFORCEMENT
Remove unpleasant stimulus
Behaviour INCREASES
Example: Pain relief, escape"]
PP["POSITIVE PUNISHMENT
Add unpleasant stimulus
Behaviour DECREASES
Example: Scolding, shock"]
NP["NEGATIVE PUNISHMENT
Remove pleasant stimulus
Behaviour DECREASES
Example: Time-out, fines"]
EX["EXTINCTION
No consequence
Behaviour DECREASES gradually
Note: extinction burst first"]
PR --> SCH
NR --> SCH
SCH{"Reinforcement
Schedule"} --> CRF
SCH --> FR
SCH --> VR
SCH --> FI
SCH --> VI
CRF["Continuous - CRF
Every response reinforced
Fast learning
Low resistance to extinction"]
FR["Fixed Ratio - FR
Every nth response reinforced
High rate, post-reinf. pause
Example: piecework"]
VR["Variable Ratio - VR
Unpredictable nth response
HIGHEST rate and resistance
Example: slot machines"]
FI["Fixed Interval - FI
First response after set time
Scallop pattern
Example: monthly salary"]
VI["Variable Interval - VI
First response after variable time
Steady moderate rate
Example: checking email"]
style B fill:#e8f0fd,stroke:#1a5276,color:#061c38,stroke-width:2px
style PR fill:#e8f8f0,stroke:#0a6640,color:#052808,stroke-width:1px
style NR fill:#e0f4fd,stroke:#1a5276,color:#061c38,stroke-width:1px
style PP fill:#fde8e8,stroke:#9a1c10,color:#4a0808,stroke-width:1px
style NP fill:#fef2e0,stroke:#7a3a00,color:#3a1000,stroke-width:1px
style EX fill:#f0f0f0,stroke:#666,color:#333,stroke-width:1px
style CRF fill:#e8f8f0,stroke:#0a6640,color:#052808,stroke-width:1px
style FR fill:#e8f8f0,stroke:#0a6640,color:#052808,stroke-width:1px
style VR fill:#e8f8f0,stroke:#0a6640,color:#052808,stroke-width:2px
style FI fill:#e8f0fd,stroke:#1a5276,color:#061c38,stroke-width:1px
style VI fill:#e8f0fd,stroke:#1a5276,color:#061c38,stroke-width:1px
Highest to lowest response rate: VR > FR > VI > FI > CRF. Highest to lowest resistance to extinction: VR > VI > FR > FI > CRF. The variable ratio schedule wins on both counts — which is precisely why social media, gambling, and video games (all designed using VR principles) are so behaviourally compelling and difficult to stop.
Shaping, Chaining & Stimulus Control
Complex behaviours rarely appear fully formed. Skinner developed techniques for building sophisticated behavioural sequences from simple starting points.
Successive approximations — reinforcing behaviours that are progressively closer to the desired target behaviour. Begin by reinforcing any response in the right direction, then gradually raise the criterion.
Example: Teaching a pigeon to bowl — first reinforce facing the ball, then touching it, then pushing it, then pushing it toward the pins.
Reinforce each step → Raise criterion
Until full behaviour is established
Behaviour chain — a sequence of behaviours where each response produces the discriminative stimulus for the next response. The chain can be built forward or backward (backward more efficient).
Example: Getting dressed — each step serves as a cue for the next step. The final step (fully dressed) is reinforced.
Each response produces the next SD
Backward chaining: teach last step first
Stimulus Control & Discrimination
A discriminative stimulus (SD) signals that reinforcement is available if the behaviour is performed. The organism learns to respond only in the presence of the SD — this is stimulus control. A green traffic light is an SD for driving forward. A red light is an S-delta (SΔ) — signal that the behaviour will NOT be reinforced.
Primary (unconditioned) reinforcers have intrinsic biological value — food, water, warmth, sex. They reinforce without any learning history. Secondary (conditioned) reinforcers acquire reinforcing value through association with primary reinforcers — money, praise, grades, tokens. Token economies work by making tokens secondary reinforcers that can be exchanged for primary ones.
David Premack (1959) proposed that a higher-frequency (more preferred) behaviour can be used to reinforce a lower-frequency (less preferred) behaviour. “You can watch TV (high frequency) after you do your homework (low frequency).” Also known as “Grandma’s Rule.” The principle means reinforcement is relative, not absolute — any behaviour can reinforce a less probable behaviour.
🔔 Bell or Box? — How to Identify Which Conditioning
One of the most common examination tasks is identifying whether a described scenario involves classical or operant conditioning. This decision guide gives you a reliable step-by-step process for any scenario.
Stimulus-stimulus association
Pavlov · S → S → R
Behaviour-consequence association
Skinner · R → C → ↑/↓R
Most real-world learning involves both types at once.
Quick Identification Table
| Scenario | Type | Reason |
|---|---|---|
| Child salivates at the smell of the school canteen | Classical | Involuntary response; smell (CS) predicts food (US) |
| Student studies harder after getting an A | Operant | Voluntary behaviour (studying) reinforced by grade |
| Person feels anxious at the sight of a dentist’s chair | Classical | Involuntary fear; chair (CS) paired with pain (US) |
| Dog sits when owner says “sit” to get a treat | Operant | Voluntary behaviour; “sit” is SD; treat is positive reinforcement |
| Child flinches at the crack of a whip (never been hit) | Classical | Involuntary startle; sound (CS) generalised from similar sounds |
| Employee works overtime because it led to a bonus before | Operant | Voluntary behaviour shaped by past positive reinforcement |
| Person takes aspirin whenever they feel a headache coming on | Operant | Voluntary behaviour; headache relief = negative reinforcement |
| Rat salivates to a light that was paired with a bell paired with food | Classical (2nd order) | Higher-order conditioning; CS2 (light) → CS1 (bell) → US (food) |
Master Comparison Table — Every Key Parameter
| Parameter | 🔔 Classical Conditioning | ⬛ Operant Conditioning |
|---|---|---|
| Also Known As | Respondent conditioning; Pavlovian conditioning; Type S | Instrumental conditioning; Skinnerian conditioning; Type R |
| Key Theorists | Ivan Pavlov (1890s); John B. Watson (1913) | E.L. Thorndike (1898); B.F. Skinner (1938) |
| Proposed | 1890s (Pavlov); formalised 1903 | 1898 (Thorndike Law of Effect); systematised 1938 (Skinner) |
| Type of Behaviour | Involuntary, reflexive, automatic responses | Voluntary, emitted, goal-directed responses |
| Role of Learner | PASSIVE — stimuli happen to the organism | ACTIVE — organism operates on the environment |
| Core Mechanism | Stimulus-Stimulus (S–S) association | Response-Consequence (R–C) association |
| What Gets Associated | CS with US (two stimuli) | Behaviour with consequence (R with C) |
| Key Formula | CS + US → UR; after learning: CS → CR | Behaviour → Reinforcement/Punishment → ↑/↓ behaviour |
| Stimulus Timing | CS must come BEFORE (or with) the US | Consequence must follow IMMEDIATELY after behaviour |
| Role of Reinforcement | Not required — pairing alone sufficient | CENTRAL — consequences determine learning entirely |
| Famous Experiment | Pavlov’s dogs (salivation to bell); Little Albert (Watson) | Thorndike’s puzzle box; Skinner Box (rat/pigeon) |
| Extinction | CS presented repeatedly without US → CR weakens | Behaviour no longer reinforced → behaviour decreases; extinction burst first |
| Spontaneous Recovery | After extinction + rest → CR briefly returns | After extinction + rest → behaviour briefly returns |
| Generalisation | CR occurs to stimuli similar to CS (gradient) | Response occurs to stimuli similar to original SD |
| Discrimination | CR only to specific CS; not to similar stimuli | Respond only to the specific SD; not to other stimuli |
| Schedules | Not applicable (pairing frequency varies; not schedules) | CRF, FR, VR, FI, VI — profoundly shape behaviour patterns |
| Real-World Examples | Phobias, food aversions, advertising, emotional responses, brand conditioning | Education rewards, parenting, therapy, salary systems, habit apps, video games |
| Therapy Applications | Systematic desensitisation; flooding; aversion therapy; counter-conditioning | Applied Behaviour Analysis (ABA); token economy; behaviour modification; biofeedback |
| Biological Preparedness | Some CS-US associations learned very easily (taste aversion — Garcia Effect) | Some behaviours more easily reinforced than others (species-specific defence reactions) |
| Cognitive Involvement | Minimal in original theory; Rescorla showed CS provides information about US | Minimal in original theory; Tolman showed cognitive expectancies operate in conditioning |
| India — CTET Relevance | Test anxiety, emotional climate, phobia treatment, advertising | Behaviour management, praise, punishment policy (RTE 2009), token economies |
flowchart TD
SCENE["REAL-WORLD SCENARIO
Student raises hand in class"] --> CL_PART
SCENE --> OP_PART
CL_PART["CLASSICAL COMPONENT
Teacher's questioning look (CS)
Previously paired with praise (US)
Now produces positive anticipation (CR)
This motivates the behaviour emotionally"] --> ACT
OP_PART["OPERANT COMPONENT
Student raises hand (voluntary behaviour)
Teacher calls on student and praises (Positive Reinforcement)
Behaviour frequency INCREASES
This strengthens the behaviour"] --> ACT
ACT["RESULT
Student raises hand MORE often
Both emotional motivation (Classical)
and behavioural reinforcement (Operant)
operate simultaneously"]
SCENE2["ANOTHER EXAMPLE
A child learns to fear maths class"] --> CL2
SCENE2 --> OP2
CL2["CLASSICAL
Harsh teacher (US) paired with maths room (CS)
Anxiety conditioned to the room and subject
Involuntary emotional response"]
OP2["OPERANT
Child avoids maths work
Avoidance reduces anxiety
Negative Reinforcement
Avoidance behaviour increases"]
CL2 & OP2 --> RES2["COMBINED RESULT
Child fears maths (Classical)
AND increasingly avoids it (Operant)
Requires intervention addressing both dimensions"]
style SCENE fill:#f8f8f8,stroke:#444,color:#222,stroke-width:2px
style CL_PART fill:#fdf2e8,stroke:#b85c18,color:#5c2100,stroke-width:2px
style OP_PART fill:#e8f4fd,stroke:#1a5276,color:#061c38,stroke-width:2px
style ACT fill:#f0f8ee,stroke:#2e7d32,color:#0a3010,stroke-width:2px
style SCENE2 fill:#f8f8f8,stroke:#444,color:#222,stroke-width:2px
style CL2 fill:#fde8e8,stroke:#9a1c10,color:#4a0808,stroke-width:2px
style OP2 fill:#fde8e8,stroke:#9a1c10,color:#4a0808,stroke-width:2px
style RES2 fill:#fde8d0,stroke:#9a4c10,color:#4a1808,stroke-width:2px
Applications Across Education, Therapy & Life
flowchart LR
subgraph CL_APP["CLASSICAL — Association-Based"]
C1["Therapy
Systematic desensitisation
Flooding, Aversion therapy"]
C2["Advertising
Brand conditioning
Emotional associations"]
C3["Education
Classroom climate
Test anxiety reduction"]
C1 --- C2 --- C3
end
subgraph OP_APP["OPERANT — Consequence-Based"]
O1["Therapy
ABA, Token economy
Behaviour modification"]
O2["Technology
Gamification
Social media design"]
O3["Education
Praise and rewards
Programmed instruction"]
O1 --- O2 --- O3
end
subgraph BOTH["BOTH — Combined Approach"]
B1["CBT (Cognitive Behaviour Therapy)
Addresses both conditioned emotions (Classical)
and behaviour patterns (Operant)"]
B2["Classroom Management
Emotional safety (Classical)
+ Reward systems (Operant)"]
end
CL_APP --> BOTH
OP_APP --> BOTH
style C1 fill:#fdf2e8,stroke:#b85c18,color:#5c2100,stroke-width:1px
style C2 fill:#fdf2e8,stroke:#b85c18,color:#5c2100,stroke-width:1px
style C3 fill:#fdf2e8,stroke:#b85c18,color:#5c2100,stroke-width:1px
style O1 fill:#e8f0fd,stroke:#1a5276,color:#061c38,stroke-width:1px
style O2 fill:#e8f0fd,stroke:#1a5276,color:#061c38,stroke-width:1px
style O3 fill:#e8f0fd,stroke:#1a5276,color:#061c38,stroke-width:1px
style B1 fill:#f0f8ee,stroke:#2e7d32,color:#0a3010,stroke-width:1px
style B2 fill:#f0f8ee,stroke:#2e7d32,color:#0a3010,stroke-width:1px
Mnemonics & Memory Tricks
US → UR → CS → CR — the four core terms in order.
Positive Reinforcement · Negative Reinforcement · Positive Punishment · Negative Punishment
Quick-Fire Comparisons
Quick Revision — Last-Hour Bullets
flowchart LR
subgraph CLASSICAL["🔔 CLASSICAL CONDITIONING"]
direction TB
C_WHO["Pavlov · Watson
1890s–1913"]
C_TYPE["Involuntary · Reflexive
Learner is PASSIVE"]
C_MECH["CS + US → UR
then CS → CR
Stimulus-Stimulus"]
C_KEY["Extinction · Generalisation
Discrimination · Spontaneous Recovery
Higher-Order Conditioning"]
C_APP["Phobia therapy
Advertising
Aversion therapy"]
C_WHO --> C_TYPE --> C_MECH --> C_KEY --> C_APP
end
subgraph OPERANT["⬛ OPERANT CONDITIONING"]
direction TB
O_WHO["Thorndike · Skinner
1898–1938"]
O_TYPE["Voluntary · Goal-directed
Learner is ACTIVE"]
O_MECH["R → Consequence → R changes
+R -R +P -P
Response-Consequence"]
O_KEY["Schedules: CRF FR VR FI VI
Shaping · Chaining
Premack Principle"]
O_APP["Behaviour modification
Token economy · ABA
Gamification · Education"]
O_WHO --> O_TYPE --> O_MECH --> O_KEY --> O_APP
end
CLASSICAL <-->|"Both can operate
simultaneously
in real-world learning"| OPERANT
style C_WHO fill:#fdf2e8,stroke:#b85c18,color:#5c2100,stroke-width:1px
style C_TYPE fill:#fde8d0,stroke:#d4822e,color:#5c2100,stroke-width:1px
style C_MECH fill:#f8e0c0,stroke:#b85c18,color:#5c2100,stroke-width:1px
style C_KEY fill:#f4d8b0,stroke:#b85c18,color:#5c2100,stroke-width:1px
style C_APP fill:#f0d0a0,stroke:#b85c18,color:#5c2100,stroke-width:1px
style O_WHO fill:#e8f0fd,stroke:#1a5276,color:#061c38,stroke-width:1px
style O_TYPE fill:#d8e8fc,stroke:#2874a6,color:#061c38,stroke-width:1px
style O_MECH fill:#c8e0fb,stroke:#1a5276,color:#061c38,stroke-width:1px
style O_KEY fill:#b8d8fa,stroke:#1a5276,color:#061c38,stroke-width:1px
style O_APP fill:#a8d0f9,stroke:#1a5276,color:#061c38,stroke-width:1px
