+ Reply to Thread
Results 1 to 5 of 5

Thread: Variable Reward Schedules in Detection Work

                  
   
  1. #1
    Junior Member
    Join Date
    May 2008
    Posts
    9
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Lightbulb Variable Reward Schedules in Detection Work

    Hi to any detection enthusiasts out there. The detection thread was looking pretty lonely so thought I'd start it off with what I think to be an interesting topic.
    I am a senior detection dog trainer with a Commonwealth Government Agency (I am the female half of Intercept K9) and I'd like to know other people's experiences with variable or random reward schedules of reinforcement with their detection canines.

    I find this a pretty simple concept, in that a continuous reinforcement schedule is used to train a behaviour and then once the behaviour is established, should be put on random or variable reward to increase the likelihood of the behaviour occurring again with a corresponding increase in intensity.

    My question comes from the angle of a dog that is trained on multiple odours, as is the case with our dogs who are trained to find over 10 odours as well as generalise within those odour categories. If you attempt to put a dog on a random reward schedule, a) how do you keep track of the pattern you have rewarded for each odour and b) how do you prevent your dog from becoming preferential to one particular odour over another or others - so how do you make all odours equal in preference to seek out so that you can confidently say that the area or product etc you have screened is not going to contain any of those odours your dog is trained to find.

    From experience I have found that most dogs on a 1:1 reward ratio tend to become choosy -obviously dogs will do the least amount of work for the most gain, so if they know they will always get a reward regardless of the odour then they are most commonly going to seek out easy high odours rather than low, difficult to detect odours.

    Just the tip of the iceberg for this discussion.......!!

    Melissa

  2. #2
    Senior Member
    Join Date
    Jan 2008
    Posts
    561
    Thanks
    0
    Thanked 4294967289 Times in 3 Posts

    Re: Variable Reward Schedules in Detection Work

    Hi Melissa

    I found that really all you can do is maintain a good training/work log. Which I know you do. Also I am sure what I am saying here, is not new to you, and are most probably implimenting at work.

    As it is impossible to know if your dog is bypassing some odours whilst working, we can only really monitor our dog whilst in training, or the setting up of target odours in the dogs working environment.

    I am not sure if this continues in Quarantine, but I found (many years ago) that whilst working, some handlers were rewarding almost every find a dog made. As most finds are usually fruit, the dogs were conditioned/imprinted to focus on apples and oranges, whilst the conditioned response for other odours slowly deminished. We need to set up target odours that we know the dog is bypassing in training. When the dog hits on a fruit, we verbally/physically reward the dog (no food reward), then the dog is presented with a new target odour that the dog is bypassing and rewarded with food.

    Every dog is different, and so is every handler. Some may not even realise that their dog on a particular shift was rewarded 20 times for finding apples, even though it did indicate to say 30. But was not reward at all or maybe only twice for say plant material.

    We really do need to treat each work run as a training run and still plant target odours that are not as "popular" as say apples and oranges.

    For dogs that a continually missing certain target odours, we need to have an observer. Once our dog has say found 3 pieces of fruit, only rewarded once, then the observer presents a new not so popular target odour. The handler, if the dog passes it, then should immediately enforce the sit command and rewarded for taking interest.

    If the dog did miss this target odour, and the handler enforced the sit. This target odour should then be presented again after another 2 positive finds with apples/oranges (but no food reward). And continued in this manner until the dog responds appropriately to the training target odour.

    Once the dog has responded appropriately to the training target odour it is then again immediately rewarded if it then hits on another piece of fruit.

    I personally don't believe there is a set ratio for witholding or giving a reward, as during work the finds are too random in the type of odour and its frequency. Sure its fine in a more controlled training environment.

    Maintaining such a high positive find ratio with quaratine dogs is so much more difficult than say with customs dogs. Customs don't get the number of finds quarantine dogs do during a shift, and their target odours are only around 4 to 5. A customs dog may only get 1 or 2 positive hits in a run, but a quarantine dog may receive as many as 20 or more. But also in these 20 finds, a higher percentage is really one or 2 target odours (usually fruit). I feel many handlers may not be vigilant enough at keeping track of the type of target odours the dog hits on during each shift, and the numbers of each target odour indicated too and how often each target was rewarded for. We reward more for apples and less for plant material..we must therefore expect the response for apples to grow stronger and the response for plant material to become weaker.

    Difficult dilema

    Cheers

  3. #3
    Member Chris Flegler's Avatar
    Join Date
    Mar 2008
    Location
    Ipswich QLD Australia
    Posts
    68
    Thanks
    48
    Thanked 40 Times in 22 Posts
    If the dogs were rewarded every single time it hit on target odour why would it even think about bypassing an odour for a so called easier target odour. Is it because the dog is confused about when or if it will even be rewarded for finding the target odour? If it detects an odour it is trained to find then our expectation is the dog will indicate at the source and it is the dogs expectation that it will be rewarded for finding it. Variable reward schedules in detection are crazy, it causes too much confusion in the dogs mind. Keep it simple and reward the dog for finding what you asked it to find.

  4. The Following User Says Thank You to Chris Flegler For This Useful Post:

    Robert Santori (06-16-2011)

  5. #4
    Mark Singer Mark's Avatar
    Join Date
    Dec 2010
    Location
    Adelaide
    Posts
    220
    Thanks
    101
    Thanked 336 Times in 151 Posts
    Hi Chris.. bit difficult for detection dogs.. for example a drug dog doing a house search...you can't reward every find. Would take you forever to do an entire house search if you had to stop and reward the dog for every find.. also then you have satisfied drive, so the dog would not be so intent on continuing the search after a few finds. I am talking about active response dogs here.

    Above was talking about food reward (passive dogs).. The problem we have with quarantine dogs is the number of odours it is trained to detect. In real life quarantine dogs get rewarded every day for finding apples and oranges.. but very rarely say for eggs.... So we need to adjust our training and reward ratio so as not to make one odour more intense than another by over rewarding for it,,,
    Mark Singer
    Adelaide Canine Training

    www.caninetraining.com.au

  6. #5
    Mark Singer Mark's Avatar
    Join Date
    Dec 2010
    Location
    Adelaide
    Posts
    220
    Thanks
    101
    Thanked 336 Times in 151 Posts
    Also just a quick addon... When I was training and working drug dogs in the RAAF.. we found that by not rewarding with the stim for every find, but instead praising our dog and moving on, the intensity of the search tended to increase.. Of course you can not over do this, as the dog will eventually shut down, as the dificulty has become to great. We would regularly do house searches or baggage searches and not reward until the 3 or fourth indication.

    With the Qurarntine dogs, they were usually rewarded for every positive find.. however more than 90% of these finds were apples and oranges... rarely were other target odours found.. So a method needs to be introduced that all target odours have the same value in the dogs brain...so we really do need to adjust our reward system to accomodate ALL target odours evenly. I personally believe many of the quarantine dogs on a regular basis are missing target odours when working, due to how easy it is for a dog to find an apple or orange due to the number of finds for that target odour each day, hence creating a stronger scent picture in the dogs brain, and the likelyhood of satisfying its food drive.. If one of these dogs went to a person that had in one bag, eggs, and another bag apples.. I guarantee most dogs will hit on the apples and not the eggs. Once the dog is rewarded for the apples it is off searching again looking for apples 'easier' target odours.

    To me its no different than using variable rewards for obedience commands... We all know that variable rewards strengthen behaviour...
    Last edited by Mark; 06-18-2011 at 01:33 PM.
    Mark Singer
    Adelaide Canine Training

    www.caninetraining.com.au

+ Reply to Thread

Similar Threads

  1. The absence of reward induces inequity aversion in dogs
    By Robert Santori in forum Behaviour
    Replies: 5
    Last Post: 06-29-2010, 08:38 PM
  2. Substance Detection
    By Michael Sutherland in forum Books/DVD's/Videos
    Replies: 5
    Last Post: 09-13-2008, 02:37 PM

Visitors found this page by searching for:

food reward detection dogs

intercept k9

schedule of reinforcement detection dog

variable reward

variable reward schedule

detection k9 imprint training

how do you create a reinforcement schedule for your selected behavior

detector dogs Variable Reward Schedule

varible reward for search dogs

detector dog work ro reward ratio

interceptk9 aust

interceptk9

reinforccement schedule for canines

SEO Blog

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts