Computers learning Tic-Tac-Toe Pt. 3: Optimisation
Performance and hyperparameter optimisation for the vanilla DQN algorithm for Tic-Tac-Toe
Learn how to make computers play Tic-Tac-Toe using the deep Q-learning algorithm
Learn how to make computers play Tic-Tac-Toe using the minimax and Q-learning algorithms
One extension to rule them all. How to efficiently reuse accurate SVD extensions.
Learn about the magic of high-precision extensions using SVDs.