JOY FULL Horses: Understanding Extinction: Part 12

Mastering Micro: Building Unlikely Behaviors with Resurgence
Nothing is either all good or all bad.

We want to use positive reinforcement with our animals because we see it as being both effective and more humane.  But the associations created through positive reinforcement can create addictions to harmful behaviors.  Think about the way advertisers manipulate our behavior to encourage smoking or overeating.

Resurgence and regression can be very negative procedures, but they can also be used to produce what might otherwise be very difficult behaviors to obtain.

If you aren’t sure how you can turn what seems like a negative procedure into a positive teaching strategy, PORTL can once again help to illustrate how this works.

Here’s the setup:

The trainer sets a toy chair on the table for her learner to interact with. The goal is to get the learner to push the chair across the table the way she might push a toy car.

We’ll now observe quietly in the background while the learner begins to interact with the chair.  The trainer could get lucky.  The learner might begin offering the behavior she’s after within the first couple of clicks.  But with this learner there’s no sign of any chair pushing behavior. Why?

History matters.

The learner is going to draw on all of her previous repertoire of things she has done with chairs.  In this case we have a learner who was scolded as a child for pushing her chair over the floor, so she’s not very likely to offer this type of behavior with the toy chair.

A history of punishment has played a role in depressing chair pushing behavior for this learner, but pushing would also have been an unlikely behavior if the trainer had set down a die. The learner would have tossed the die or shaken it in her hand because that’s what you do with this kind of object. Pushing a die across the table like a toy car is not an obvious behavior to try.

Through a series of small approximations, the trainer could try shaping the behavior she wants.  Her first step would be reinforcing the learner for touching the chair.

The learner in this case is not particularly creative.  She offers simple touches, but nothing else.  Again, the trainer may be dealing with a history of punishment.  Her learner doesn’t have a lot of experience being reinforced for trying things.  In fact, quite the opposite – she may have been punished for stepping “outside the lines”.  She is like so many of our animal learners – hesitant, lacking in confidence, and not showing any outward signs of curiosity.  In her first few attempts she touches the chair, but she doesn’t try any other behaviors.  Getting her to push the chair is going to be hard.

So the trainer takes the chair away and sets out a toy car. Using an object that normally would be pushed makes it very easy to get the desired action.  The learner pushes the car over the table top. Click and treat.

This is repeated several times, and then the trainer takes the car away and sets the chair out.  The learner goes back to touching it.  The chair accidentally falls over – click and treat. The learner latches on to that, expanding her repertoire to two behaviors – touching the chair and knocking it over.

We see this so many times with our animal learners.  One click and suddenly you’ve locked in a behavior you don’t want.  With a creative learner this isn’t a problem.  You can quickly shift the behavior into something you want, but with these “one trick ponies” you have to be so very careful what you click.  In this case the learner persists in knocking the chair over even when she is no longer getting reinforced for the action.

Her trainer quickly decides to put everything but pushing the chair like a car on extinction.  Her learner is clearly becoming frustrated.  To avoid having her shut down completely, the trainer takes the chair away and sets the car out again.  The learner immediately starts pushing the car over the table top.  Click and treat.

To help with the generalization the trainer puts a third object out – a small block. The learner pushes the block.  Click and treat.  This is repeated several times, then the trainer takes the block away and sets out the car.  The car is pushed. Click and treat.

The trainer sets the chair out, and the learner pushes the chair.  Job done.

Resurgence and Dog “Yoga”
Using the car in this way is an elegant teaching strategy.  Often when we come up with these clever ways of helping our learner to be successful, we know that it works, but we don’t really have good explanations for why.   Understanding resurgence helps us with the why in this case.  And it helps us to be more deliberate in the use of this kind of teaching strategy.  Here’s another example.

One of Kay Laurence’s students taught her dog to step up with his hind legs onto a chair.  It was elegant training, a beautiful example of setting the learner up for success.  In his talk on extinction, Dr. Jesús Rosales-Ruiz helped us to see that it was also a great example of using resurgence.

Here’s the lesson: First, the dog learned to stand with one foot on each of four small plastic pods. This alone was impressive training.  The pods were the same ones physiotherapists use to help people improve their balance and proprioception. It took great coordination for the dog to stay balanced on the four pods. But that was only step 1.  Next he learned to keep his front feet on the floor while he maneuvered his hind feet up onto the brick ledge of a fireplace hearth.

Adding in the precision of the pods came next.  Now the dog wasn’t just standing with his front paws on the floor and his hind end up on the ledge.  He was also balancing on all four pods.

This was not done as a cute party trick.  The dog’s owner is a yoga teacher.  Her interest was very much the same as mine – helping her animal learner maintain a healthy spine.  In this orientation she could ask her dog for weight shifts that contribute to a flexible spine.

The last step was setting up a training session next to a chair. The handler withheld the click, putting the dog into an extinction process. With very little experimentation, the dog oriented himself so his hind end was toward the chair.  He certainly demonstrated the flexibility of his spine by stepping up onto the chair with his hind legs, so that he was standing with his hind end up on the chair and his front feet on the floor.

Generalization and Creativity
Jesús commented that if we didn’t know about resurgence we would simply be saying the dog generalized.  That’s not a sufficient explanation.  What we were seeing was a great example of resurgence. PORTL has given us a better understanding of how to encourage this kind of problem solving.  When we want to train for this type of generalization, knowing about the “why” of resurgence helps us to be more deliberate and efficient in our training.

It isn’t positive reinforcement by itself that creates a positive learning experience.  An eagerness for learning comes from being a successful puzzle solver.  That success in turn comes from the kind of efficient, clean training that the clever use of resurgence encourages.

These examples give us a great perspective on creativity.  When we’re training, we aren’t waiting and waiting for our animals to do something we can reinforce.  Instead we can “seed” the behaviors we want them to draw on.  Then we set up the conditions and let them have the pleasure of discovering for themselves new or unlikely combinations.

We have a procedure for setting up the creative process.  You give your learner the repertoire, the components that form more complex behaviors, and then you set a puzzle and let extinction be the catalyst for solving it.

Coming Next: The “Pose”

Remember, if you are new to the JOY Full Horse blog, click on the JOY Full Horses tab at the top of this page to find the full table of contents and links to each of the articles I have published so far.

I hope you will want to share these articles by sending links to this blog to your friends.  But please remember this is copyrighted material.  All rights are reserved. Please do not copy any of the “JOY Full Horses” articles without first getting written permission from Alexandra Kurland, via theclickercenter.com.

Also note: these articles are not intended as an instruction guide for introducing your horse to clicker training.  If you are new to clicker training and you are looking for how-to instructions, you will find what you need at my web sites:

theclickercenter.com                    theclickercentercourse.com

5GoToSea: Part 15: Micro Masters

Resurgence and Regression: Understanding Extinction So You Can Master It

From a presentation given by Dr. Jesús Rosales-Ruiz during the 2014 Five Go To Sea Conference cruise.

Part 1: The Elevator Question
Part 2: The Translation to Horses: Is Personality Expressed or Suppressed?
Part 3: Unraveling the Regression Mess
Part 4: Extinction and Shaping
Part 5: Extinction Reveals The Past
Part 6: Accidental Extinction
Part 7: Emotions
Part 8: Training With High Rates Of Reinforcement
Part 9: Cues and Extinction
Part 10: PORTL
Part 11: Mastering Extinction
Part 12: Creativity Explored
Part 13: Degrees of Freedom
Part 14: The Positive Side of Resurgence
Part 15: Micro Masters

If you have not read the previous installments of this series, I suggest you begin with Part 1. Part 1 was published on May 21, 2015.

Part 15: Micro Masters

The “Pose”
Jesús closed his presentation with two horse examples.  The first was Robin’s “pose”. I’ve told the story of the “pose” many times.  I’ll keep it brief here.  Robin first learned a stationary “pose”.  It originally was a by-product of cleaning up his treat taking manners when he was two years old.  During the process he started “posing”, arching his neck and looking like a very pretty dressage horse.  I liked the look so I continued to reinforce it.  It became a default behavior.  In the absence of any other active cue from me, if Robin posed, I would click and reinforce him.  I became the cue for the behavior.

Offering “the pose” meant that if Robin wanted to interact with me and engage in the clicker game, he had a surefire way of doing so.  Even if I was busy doing barn chores, if I saw him posing, I would click and reinforce him.  I never wanted him to feel like the proverbial toddler who is banging the kitchen pots and pans to get his mother’s attention. If Robin wanted attention from me, he had a behavior which he could use to satisfy his need for social interaction.

Because Robin wasn’t ignored, he didn’t go through an extinction process.  I didn’t see a regression into the unwanted behaviors that macro extinctions can cause. Instead I was able to reinforce a behavior I liked, one that was a useful warm up for our formal training sessions.  For his part Robin was confident that I would engage with him when he asked for attention.

Reinforcing him for the stationary pose went on through the winter.  I didn’t have any plans for developing the behavior.  It was simply something I liked.  It was Robin who was the creative one!

It must have been late March.  I was lunging him in the arena one evening.  He was giving me a ho hum trot.  There was nothing there I could reinforce.  Robin went once around the circle, twice, three times without reinforcement.  Normally I would have been clicking and reinforcing him at a much higher rate, but given the plow horse trot I was presented with, there was nothing there I wanted to say yes to.

At the time I would not have described it in these terms, but I was putting him into an extinction process.  I could see him searching, trying to decide what to do.  On the third time round he had the answer.  He would try his pose.  But in order to pose and still stay in the trot, he had to add energy.  Within one stride he transformed into magazine-cover magnificence.  I captured the moment with a click and the rest is history.  The “pose” has evolved into a major component of my work.  Robin showed us that we could indeed shape self carriage.  What began as a happy accident for Robin has become a deliberate and very systematically trained behavior in other horses.

Our Creative Horses
When I first told this story to Jesús, he commented that the pose came out because of resurgence.  At the time, I didn’t understand the significance of what he was saying, but I remembered what he said.  And Jesús remembered the story.  It got him thinking about the procedure and how we might use it to make deliberate use of resurgence.  The result: we now have a systematic way of creating unlikely behaviors. The end result can look like magic, but there is good science behind it.  Here are the steps:

First, you build a strong history of reinforcement for the component behaviors.

Next, you change the situation somewhat so extinction comes into play.

This generates a resurgence of previously reinforced behaviors.  The result: new combinations emerge.  That’s creativity.  The most fun for me is seeing what the horses invent.  They are often so much more creative than their human partners!

Seeing Familiar Landscapes with Fresh Eyes
Kay Laurence might say we are seeing familiar landscapes with fresh eyes.

Jesús would say you have to understand the process of extinction so you can master it. If you understand it, you won’t be frustrating your animals.  Instead, you’ll know how to use extinction to generate complex behaviors.

I would say that monitoring the level of extinction your learner is experiencing is a keys-to-the-kingdom part of good training.  I recently spent a couple of days working with a group of horses I have come to know well.  One of them is a retired performance horse.  Without going into a lot of details, I would describe him as an emotionally fragile horse.  He’s easily worried. If he thinks he has the right answer, he’s a superstar, but I always have to be careful how far I stretch him into new behaviors.  If he thinks he might get something wrong, he worries.  He’s come out of a training environment in which he had to perform correctly or his rider could get seriously hurt. I suspect he was corrected for mistakes, which would account for his worry.

Mastering Micro
This past weekend I was working, among other things, on this horse’s pose.  He’s very much got the idea that he gets reinforced for lifting up through his topline and releasing at the poll.  I was holding out for slightly better versions.  As I withheld my click, I saw him experimenting. Was it higher with his poll?  Was it more lift of his back? What did I want?

The shifts he was giving me represented micro changes.  The variations were all within a clickable range.  Clicking him for any of these variations would not have been wrong, but I was waiting fractionally to see what else would pop out.  I was using micro extinctions to create the next step.  And because I was thinking about this in terms of extinction, I was monitoring closely how this related to his emotional level. I did not want him to become macro worried.

We were always just a second or two away from a click so I could let him experiment within a micro extinction without risking the emotional fallout of a larger extinction process.

Micro is so very much the key.

Macro extinctions are painful.  Micro extinctions are part of good shaping.

Macro shaping can be frustrating.  Micro shaping is elegant.

Macro negative reinforcement is literally painful. Micro is again good shaping.

When you go micro, your learner is always just a second or two away from a reinforceable moment.  You can cue another behavior.  You can click and treat. Either way, you are saying: “Yes! Great idea!”  Micro mastery is what we should be striving for in our training.  When you say someone is a great trainer, you are saying he is a Micro Master.  In training that’s the “black belt” we should be aiming for.

(Note: this video was taken when Robin was three years old.  He was not yet started under saddle.  Also, he had never been in side reins or any of the other devices that are commonly used to lunge horses.  This beautiful self-carriage was shaped entirely through clicker training.  The dressage whips that I’m using serve as targets.  They give Robin orientation points that help him maintain his balance relative to me.)

This concludes the report on Dr. Jesús Rosales-Ruiz’s 2014 presentation on Resurgence and Regression, given at the Five Go To Sea conference cruise.

For information on the 2015 Five Go To Sea Alaska cruise visit fivegotosea.com

Alexandra Kurland
theclickercenter.com
theclickercentercourse.com
