Reinforcement learning (RL) implementation of imperfect information game Mahjong using markov decision processes to predict future game states
-
Updated
Aug 24, 2022 - JavaScript
Reinforcement learning (RL) implementation of imperfect information game Mahjong using markov decision processes to predict future game states
Softmax function, normalize a vector of real numbers into a probability distribution
The system is designed to compare and utilize multiple state-of-the-art multilingual language models to produce more accurate and context-aware translations between Indonesian and Manado language.
Add a description, image, and links to the softmax topic page so that developers can more easily learn about it.
To associate your repository with the softmax topic, visit your repo's landing page and select "manage topics."