A STUDY OF REINFORCEMENT MACHINE LEARNING AND SUPERVISING THE MACHINE PERFORMANCE IN PROBABILISTIC APPROACH