Machine learning and unintended consequences
post by lukeprog · 2011-09-23T02:31:11.431Z · LW · GW · Legacy · 24 commentsContents
24 comments
In Artificial Intelligence as a Negative and Positive Factor in Global Risk, Yudkowsky uses the following parable to illustrate the danger of using case-based learning to produce the goal systems of advanced AIs:
Once upon a time, the US Army wanted to use neural networks to automatically detect camouflaged enemy tanks. The researchers trained a neural net on 50 photos of camouflaged tanks in trees, and 50 photos of trees without tanks. Using standard techniques for supervised learning, the researchers trained the neural network to a weighting that correctly loaded the training set - output "yes" for the 50 photos of camouflaged tanks, and output "no" for the 50 photos of forest. This did not ensure, or even imply, that new examples would be classified correctly. The neural network might have "learned" 100 special cases that would not generalize to any new problem. Wisely, the researchers had originally taken 200 photos, 100 photos of tanks and 100 photos of trees. They had used only 50 of each for the training set. The researchers ran the neural network on the remaining 100 photos, and without further training the neural network classified all remaining photos correctly. Success confirmed! The researchers handed the finished work to the Pentagon, which soon handed it back, complaining that in their own tests the neural network did no better than chance at discriminating photos.
It turned out that in the researchers' data set, photos of camouflaged tanks had been taken on cloudy days, while photos of plain forest had been taken on sunny days. The neural network had learned to distinguish cloudy days from sunny days, instead of distinguishing camouflaged tanks from empty forest.
I once stumbled across the source of this parable online, but now I can't find it.
Anyway, I'm curious: Are there any well-known examples of this kind of problem actually causing serious damage — say, when a narrow AI trained via machine learning was placed into a somewhat novel environment?
24 comments
Comments sorted by top scores.
comment by lessdazed · 2011-09-23T03:48:30.175Z · LW(p) · GW(p)
I sat on the floor with my dog and taught him to roll over in a few sessions. Each session was five minutes, and he had it by the end of each one.
The third session was a bit different from the first two because I sat in a chair. I began by making a whirling motion with my hand and saying "roll over". He quickly shuffled a few feet to the right, crashed hard into the wall, and fell over. He looked confusedly at the wall, and then at me - the wall for attacking him, and me for withholding his reward of food.
He had, after all, performed the trick that I had taught him - pointing his head at my crotch and shuffling to the right, not stopping until he flipped 360 degrees.
That I intended for him to be learning to roll over didn't matter - this is reinforcement learning.
comment by Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2016-03-20T02:41:58.650Z · LW(p) · GW(p)
Ed Fredkin has since sent me a personal email:
By the way, the story about the two pictures of a field, with and without army tanks in the picture, comes from me. I attended a meeting in Los Angeles, about half a century ago where someone gave a paper showing how a random net could be trained to detect the tanks in the picture. I was in the audience. At the end of the talk I stood up and made the comment that it was obvious that the picture with the tanks was made on a sunny day while the other picture (of the same field without the tanks) was made on a cloudy day. I suggested that the "neural net" had merely trained itself to recognize the difference between a bright picture and a dim picture.
comment by Daniel_Burfoot · 2011-09-23T03:22:27.417Z · LW(p) · GW(p)
My favorite example of this is the "racist" face tracking camera.
Replies from: lukeprog, None↑ comment by lukeprog · 2011-09-23T05:11:15.417Z · LW(p) · GW(p)
That's pretty funny, though it appears that the cause may not have been that, say, the engineers trained the face tracking software using mostly white people.
comment by pedanterrific · 2011-09-23T03:09:48.738Z · LW(p) · GW(p)
It's almost certainly not the actual source of the "parable", or if it is the story was greatly exaggerated in its retelling (admittedly not unlikely), but this may well be the original study (and is probably the most commonly-reused data set in the field) and this is a useful overview of the topic.
Does that help?
Replies from: jkaufman, lukeprog↑ comment by jefftk (jkaufman) · 2015-12-24T15:27:04.606Z · LW(p) · GW(p)
Except "November Fort Carson RSTA Data Collection Final Report" was released in 1994 covering data collection from 1993, but the parable was described in 1992 in the "What Artificial Experts Can and Cannot Do" paper.
comment by Dreaded_Anomaly · 2011-09-23T03:15:37.846Z · LW(p) · GW(p)
The earliest reference to the parable that I can find is in this paper from 1992. (Paywalled, so here's the relevant page.) I also found another paper which attributes the story to this book, but the limited Google preview does not show me a specific discussion of it in the book.
Replies from: jkaufman, jkaufman, jkaufman, timtyler, lukeprog↑ comment by jefftk (jkaufman) · 2015-12-24T15:22:10.256Z · LW(p) · GW(p)
Here's the full version of "What Artificial Experts Can and Cannot Do" (1992): http://www.jefftk.com/dreyfus92.pdf It has:
... consider the legend of one of connectionism's first applications. In the early days of the perceptron ...
↑ comment by jefftk (jkaufman) · 2015-12-25T15:34:26.278Z · LW(p) · GW(p)
Expanded my comments into a post: http://www.jefftk.com/p/detecting-tanks
↑ comment by jefftk (jkaufman) · 2015-12-24T15:10:50.785Z · LW(p) · GW(p)
There's also https://neil.fraser.name/writing/tank/ from 1998 which says the "story might be apocryphal", so by that point it sounds like it had been passed around a lot.
↑ comment by timtyler · 2011-10-24T01:53:40.077Z · LW(p) · GW(p)
In the "Building Neural Networks" book, the bottom of page 199 seems to be about "classifying military tanks in SAR imagery". It goes on to say it is only interested in "tank" / "non-tank" categories.
Replies from: jkaufman↑ comment by jefftk (jkaufman) · 2015-12-24T15:09:26.310Z · LW(p) · GW(p)
But it also doesn't look like it's a version of this story. That section of the book is just a straight ahead "how to distinguish tanks" bit.
comment by [deleted] · 2011-09-23T14:33:28.675Z · LW(p) · GW(p)
Every time you've missed an important email because of a spam filter false positive.
comment by [deleted] · 2011-09-23T02:38:20.940Z · LW(p) · GW(p)
This was also discussed in Magical Categories. I don't know the source of the parable, though.
comment by Alex Flint (alexflint) · 2011-10-03T10:55:32.251Z · LW(p) · GW(p)
Dataset bias is a huge topic in computer vision at the moment. An excellent (and humorous) overview is given by Efros and Torralba: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5995347&tag=1.
comment by HonoreDB · 2011-09-23T16:27:12.482Z · LW(p) · GW(p)
Pakistan claims that U.S. predator drones routinely bomb outdoor weddings, mistaking them for Al-Qaeda rallies. I couldn't immediately find an authoritative source, but if there's any truth to it this sort of phenomenon is probably to blame.
Replies from: JoshuaZ, satt↑ comment by satt · 2011-09-23T17:13:20.184Z · LW(p) · GW(p)
US predator drones don't fire automatically, yet.
comment by Miller · 2011-09-23T04:25:23.950Z · LW(p) · GW(p)
I'm reminded of one of your early naively breathless articles here on the value of mid-80s and prior expert systems.
Replies from: Douglas_Knight, Miller↑ comment by Douglas_Knight · 2011-09-23T13:49:50.820Z · LW(p) · GW(p)
Why don't you write a post on how it is naive? Do you actually know something about practical application of these methods?
Yes, if experts say that they use quantifiable data X, Y, and Z to predict outcomes, that simple algorithms beat them on only that data might not be important if the experts really use other data. But there is lots of evidence saying that experts are terrible at non-quantifiable data, such as thinking interviews are useful in hiring. Tetlock finds that ecologically valid use of these trivial models beats experts in politics.
↑ comment by Miller · 2011-09-23T06:16:44.990Z · LW(p) · GW(p)
this one:
http://lesswrong.com/lw/3gv/statistical_prediction_rules_outperform_expert/
When based on the same evidence, the predictions of SPRs are at least as reliable as, and are typically more reliable than, the predictions of human experts for problems of social prediction.
Hmm yes, 'same evidence'.