I guess the point here is that +/-5 runs is mostly noise in UZR. I've read that, too, over there, so I'm not thinking Appelman is making something up just for this case. I think what this shows is that while maybe it is one of the best systems out there in the public domain, and an upgrade over official scorers (e.g., "if he did not touch it, he can't get an error," "no one else would have reached it, but because he reached it and made a bad throw, he gets the error"), it should be treated as being a bit fuzzier than how it is generally reported, especially over a week, month, or season.
I wish I could have the same comfort level as you do. But I don't. Here's what I'm getting:
Me: What's up with this New Coke ?
MBC: Why, doesn't it taste good ?
Me: Yeah, it's pretty good. Tastes a lot like Pepsi, actually. And I like Pepsi. But that's not the point. Why change the original formula ?
MBC: Because it's better.
Me: No it's not.
MBC: Says you !
Me: Well, yeah. I drink a lot of cola, and each has its own charm. Even Jolt and RC. But I can't figure out why New Coke is supposed to be better than Original Formula.
MBC: Because we say it is.
Me: Ummm...
MBC: Is there really that much difference ? Let me ask you this - Old Coke was good, and New Coke is good, right ?
Me: Well, yeah...
MBC: There you have it !
Me: Have what ?
MBC: There's no difference !
Me: Yes, there is.
MBC: Only in your opinion.
Me: No, it's measurable by your standards.
MBC: So what ? It still tastes like cola.