
How to apply reinforcement and punishment
In dog training, there are two main learning methods: operant conditioning and Pavlovian (classical) conditioning. Today, we’ll focus on operant conditioning, exploring what it is, how it works, and when to apply it. Let’s break it down.
What is operant conditioning?
Operant conditioning, a learning theory developed by B.F. Skinner, uses reinforcement and punishment to modify behaviours by applying appropriate consequences.
- Reinforcement strengthens a behaviour, making it more likely to occur.
- Punishment weakens a behaviour, making it less likely to occur.
Operant conditioning is further divided into four categories:
- Positive reinforcement: Adding something desirable (a reward) to strengthen a behaviour.
- Negative reinforcement: Removing something aversive to strengthen a behaviour.
- Positive punishment: Adding something aversive (a correction) to weaken a behaviour.
- Negative punishment: Removing something desirable to weaken a behaviour.
In this context, "positive" means adding something, while "negative" means removing something. Let’s take a closer look at how these concepts can be applied in dog training.
Examples of operant conditioning in dog training
Here’s how a handler might use operant conditioning to teach a dog to sit when greeting other people:
- Positive Reinforcement
Example: When the dog sits in front of people during a greeting, the handler gives the dog a treat, reinforcing the sitting behaviour. - Negative Reinforcement
Example: If the dog jumps on people during a greeting, the handler applies pressure on the leash. The pressure is released only when the dog sits, reinforcing the sitting behaviour. - Positive Punishment
Example: If the dog jumps on people, the handler firmly says "No," weakening the jumping behaviour. - Negative Punishment
Example: If the dog jumps on people, the handler removes the dog from the situation by placing it in a crate or a separate room, weakening the jumping behaviour.
Understanding rewards and corrections
- A rewarding stimulus (or reward) is anything the dog wants at that moment. This could include treats, praise, affection, toys, or play.
- An aversive stimulus (or correction) is anything the dog does not want at that moment. Examples include a firm "No," a leash pop, time-out, or removing attention or treats.
It’s important to note that physical abuse (e.g., kicking, smacking, or hitting the dog) is not acceptable and is considered abuse—not punishment.
Timing Is Key
During the training phase of dog training, timing is critical. Whether you’re applying a reward or a correction, it must occur within one second of the dog’s behaviour. This ensures the dog can clearly associate its action with the consequence, making the learning process more effective.
By understanding and properly applying these principles of operant conditioning, you can successfully shape and modify your dog’s behaviours.
Pets Central Veterinary Academy Puppy Class
Trainer Introduction – Vienn
Vienn is the founder of a successful dog training school in Australia, recently returned to Hong Kong, and aims to educate pet owners on canine behaviours and broaden her knowledge of dogs further. As a qualified dog trainer through NDTF (National Dog Trainers Federation) and a certified puppy instructor, Vienn has worked with and been mentored by world-class trainers, such as Boyd Hooper and Steve Mcloche. Vienn specializes in modifying canine behaviours and puppy development, as well as scent detection training. She also has years of experience with tactical working dogs, decoy work, civil training and drive development.