The plot shows that the training error, that is the error evaluated on the points belonging to the training set, keeps on decreasing as the number of iterations of the training algorithm increases. But the generalisation error, that is the error computed on a number of solutions that don't belong to the training set (generalisation set) start increasing after a certain number of iterations.
The generalisation set can be used to perform the so called cross validation process. The network keeps lowering the training error, the accuracy is very good, but when the error computed on the generalisation set start raising, we have to stop our training procedure (avoiding over-training).