I was going through Sutton's book and, using sample-based learning for estimating the expectations, we have this formula
$$ \text{new estimate} = \text{old estimate} + \alpha(\text{target} - \text{old estimate}) $$
What I don't quite understand is why it's called the target, because since it's the sample, it’s not the actual target value, so why are we moving towards a wrong value?