If you're seeing this message, it means we're having trouble loading external resources on our website. Show
If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked. | Classical Conditioning | Law of Effect | 4 Types of Operant Conditioning | Examples | Schedules of Reinforcement | What is Operant ConditioningOperant conditioning, also known as instrumental conditioning, is a learning process in which behavior is modified using rewards or punishments. By repeatedly pairing the desired behavior with a consequence, an association is formed to create new learning. E.g. a dog trainer gives his dog a treat every time the dog raises its left paw. The dog learns that raising its left paw can earn him a food reward. It will raise its paw again and again for more treats. Classical Conditioning as the Foundation of BehaviorismWe can trace back the origin of operant conditioning to its predecessor, classical conditioning. Classical conditioning, also known as Pavlovian conditioning, also involves learning a new behavior through the process of association.2 Russian physiologist Ivan Pavlov first experimented with classical conditioning in the late 1800s. He noticed that his dogs salivated whenever he entered the room to feed them. In his operant conditioning experiment, Pavlov rang a bell every time he fed his dogs. Over time, the dogs became conditioned to salivate when they heard the sound of a bell, even when food wasn’t present. Food, which was able to trigger salivation naturally, is the unconditioned stimulus. The bell’s sound, which began to trigger salivation after being paired with food, was the conditioned stimulus. When the unconditioned stimulus (food) and the conditioned stimulus (sound) became associated, the conditioned stimulus could trigger the same response. This newly learned response became a conditioned response. This is a form of learning by association. Pavlovian conditioning became the foundation of Behaviorism, a leading field within the study of psychology at the time. Behaviorists believe that behavior is a response to external stimuli, and humans only learn by association, not by thoughts, feelings, or inner mental events. Law of Effect & Operant ConditioningLater, psychologist Edward Thorndike came up with the concept of instrumental conditioning when he observed the impact of reinforcement in puzzle box experiments with cats trying to escape. He called this process “trial-and-error” learning. Thorndike proposed the Law of Effect3, which stated that if in the presence of a stimulus, a response was followed by a satisfying event (reinforcer), the bond between stimulus and response was strengthened. Conversely, if a response-stimulus event was followed by an unsatisfying event (punisher), the bond was weakened. In the early 1900s, behavioral psychologist B.F. Skinner, also known as the father of operant conditioning, built on the concepts of reinforcer and punisher to create the theory of operant conditioning (Reinforcement Theory). Skinner believed that Pavlovian conditioning was far too simple to explain complex human behavior thoroughly. He believed the best way to understand operant behavior was to observe its causes and consequences1. In Skinner’s operant conditioning paradigm, observable behavior can be manipulated when it is followed by reinforcement or punishment. To study operant conditioning, BF Skinner made a chamber, called the Skinner Box, and put a small animal inside. In the operant conditioning experiments, each time the animal pressed a lever or a bar, it received food or water as reinforcement4. Unlike classical conditioning, which involves unconscious reflexive behavior, operant behaviors are behaviors under conscious control. The major purpose of operant conditioning is to encourage desirable behaviors through rewards and reduce undesirable behaviors through punishments. Applying reinforcement and punishment creates a deliberate and conscious operant learning process. 4 Types of Operant ConditioningThe four types of operant conditioning are positive reinforcement, negative reinforcement, positive punishment, and negative punishment. Reinforcement increases target behavior, while punishment decreases it. Through his experiments, Skinner distinguished two types of consequences that could affect new learning: reinforcement vs punishment. ReinforcementThere are two types of reinforcement – positive reinforcement and negative reinforcement. In psychology, positive refers to adding a stimulus and negative removing one. Positive reinforcement adds a rewarding consequence as a positive reinforcer to behavior, therefore strengthening or increasing the likelihood that the desired behavior will appear again. Negative reinforcement removes an unpleasant stimulus to increase the desired behavior in the future. PunishmentPunishment is the opposite of reinforcement. It aims to reduce bad behavior. Like reinforcement, punishment also comes in two forms: positive punishment and negative punishment. Positive punishment adds an unpleasant stimulus to weaken or eliminate a behavior. Positive punishment is usually what we refer to as “punishment” in our everyday lives. Negative punishment removes a pleasant stimulus to stop undesired behavior. Here are the different types of operant conditioning examples. Example of positive reinforcement
Example of negative reinforcement
Example of positive punishment
Example of negative punishment
The use of operant conditioning is widespread. You can see it everywhere. It’s utilized by parents, teachers, companies, and the government. Also see: Shaping Psychology Schedules of Reinforcement Is a Key ComponentBehavior modification using reinforcers and punishers requires a continuous application to remain effective. Once the reinforcement or punishment stops, the learned behavior gradually weakens and finally disappears in a process called extinction. What is surprising, even to Skinner, is that frequency and pattern of reinforcer applications can affect how fast reinforcement works and how robust the learning remains5. The two types of reinforcement schedules are interval-based schedules and ratio schedules. Interval-based schedules: reinforcers are delivered after a period. The period can be fixed (fixed-interval schedule) or variable (variable-interval schedule). Ratio-based schedules: reinforcers are delivered after a certain number of responses. The number of responses can be fixed (fixed-ratio schedule) or variable (variable ratio schedule). Studies found that behavior learned through variable-ratio schedules is the most robust and least susceptible to extinction. This discovery is significant because now we can use reinforcement and punishment effectively in different situations. E.g. when using rewards to reinforce desired behavior, we now know that we should give them out only occasionally (variable-ratio schedule). When a toddler throws a tantrum in the market, we now know we cannot give in to buying candies no matter what. Occasionally giving in will make the habit much harder to break. Final Thoughts on Operant ConditioningOperant conditioning is something we often see around us. Sometimes we do it intentionally but sometimes not. Recognizing the pros and cons of this type of behavior modification can help us avoid pitfalls and reach the best results. Also See: Operant vs Classical Conditioning References
What type of learning involves operant behavior behavior that operates on the environment to produce rewarding or punishing stimuli?Operant conditioning, sometimes referred to as instrumental conditioning, is a method of learning that uses rewards and punishment to modify behavior. Through operant conditioning, behavior that is rewarded is likely to be repeated, and behavior that is punished will rarely occur.
Which experiment involves the use of classical conditioning?Pavlov and the salivating dogs is the notorious classical conditioning experiment. Although it seems primitive, this research has practical applications in the classroom.
What is the focus of operant conditioning?This section will focus on operant conditioning, which emphasizes reinforcement for behaviors. In operant conditioning, the motivation for a behavior happens after the behavior is demonstrated. An animal or a human receives a consequence (reinforcer or punisher) after performing a specific behavior.
Which of the following is an example of operant conditioning?The correct answer is C. When a dog plays dead she gets a treat in order to encourage her to repeat the behavior. The dog has learned that she will get a treat, positive reinforcement, for playing dead in this example. This is explained through operant conditioning of associating rewards with a behavior.
|