Difference between revisions of "Monte Carlo"
(→Monte Carlo Tree Search) |
m |
||
| (10 intermediate revisions by the same user not shown) | |||
| Line 1: | Line 1: | ||
| − | [ | + | {{#seo: |
| + | |title=PRIMO.ai | ||
| + | |titlemode=append | ||
| + | |keywords=artificial, intelligence, machine, learning, models, algorithms, data, singularity, moonshot, TensorFlow, Google, Meta, Facebook, Nvidia, Microsoft, Azure, Amazon, AWS | ||
| + | |description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools | ||
| + | }} | ||
| + | [https://www.youtube.com/results?search_query=ai+monte+carlo YouTube] | ||
| + | [https://www.quora.com/search?q=ai%20monte%20carlo ... Quora] | ||
| + | [https://www.google.com/search?q=ai+monte+carlo ...Google search] | ||
| + | [https://news.google.com/search?q=ai+monte+carlo ...Google News] | ||
| + | [https://www.bing.com/news/search?q=ai+monte+carlo&qft=interval%3d%228%22 ...Bing News] | ||
| + | |||
* [http://medium.com/deep-math-machine-learning-ai/ch-12-1-model-free-reinforcement-learning-algorithms-monte-carlo-sarsa-q-learning-65267cb8d1b4 Model Free Reinforcement learning algorithms (Monte Carlo, SARSA, Q-learning) | Madhu Sanjeevi (Mady) - Medium] | * [http://medium.com/deep-math-machine-learning-ai/ch-12-1-model-free-reinforcement-learning-algorithms-monte-carlo-sarsa-q-learning-65267cb8d1b4 Model Free Reinforcement learning algorithms (Monte Carlo, SARSA, Q-learning) | Madhu Sanjeevi (Mady) - Medium] | ||
* [[Google DeepMind AlphaGo Zero]] | * [[Google DeepMind AlphaGo Zero]] | ||
* [http://modelai.gettysburg.edu/2014/mc1/index.html An Introduction to Monte Carlo Techniques in Artificial Intelligence | Todd W. Neller] | * [http://modelai.gettysburg.edu/2014/mc1/index.html An Introduction to Monte Carlo Techniques in Artificial Intelligence | Todd W. Neller] | ||
| + | * [[Cross-Validation]] | ||
| + | * [[Bayes]] | ||
* [[Reinforcement Learning (RL)]] | * [[Reinforcement Learning (RL)]] | ||
| + | ** Monte Carlo (MC) Method - Model Free Reinforcement Learning | ||
** [[Markov Decision Process (MDP)]] | ** [[Markov Decision Process (MDP)]] | ||
| + | ** [[State-Action-Reward-State-Action (SARSA)]] | ||
** [[Q Learning]] | ** [[Q Learning]] | ||
| − | ** [[ | + | *** [[Deep Q Network (DQN)]] |
** [[Deep Reinforcement Learning (DRL)]] DeepRL | ** [[Deep Reinforcement Learning (DRL)]] DeepRL | ||
** [[Distributed Deep Reinforcement Learning (DDRL)]] | ** [[Distributed Deep Reinforcement Learning (DDRL)]] | ||
| − | |||
** [[Evolutionary Computation / Genetic Algorithms]] | ** [[Evolutionary Computation / Genetic Algorithms]] | ||
** [[Actor Critic]] | ** [[Actor Critic]] | ||
| + | *** [[Asynchronous Advantage Actor Critic (A3C)]] | ||
| + | *** [[Advanced Actor Critic (A2C)]] | ||
| + | *** [[Lifelong Latent Actor-Critic (LILAC)]] | ||
** [[Hierarchical Reinforcement Learning (HRL)]] | ** [[Hierarchical Reinforcement Learning (HRL)]] | ||
| + | |||
| + | |||
A broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. Their essential idea is using randomness to solve problems that might be deterministic in principle. They are often used in physical and mathematical problems and are most useful when it is difficult or impossible to use other approaches. Monte Carlo methods are mainly used in three problem classes:[1] optimization, numerical integration, and generating draws from a probability distribution. | A broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. Their essential idea is using randomness to solve problems that might be deterministic in principle. They are often used in physical and mathematical problems and are most useful when it is difficult or impossible to use other approaches. Monte Carlo methods are mainly used in three problem classes:[1] optimization, numerical integration, and generating draws from a probability distribution. | ||
| − | |||
| − | |||
<youtube>-YpalutQCKw</youtube> | <youtube>-YpalutQCKw</youtube> | ||
| Line 26: | Line 43: | ||
| − | == <span id="Monte Carlo Tree Search"></span>Monte Carlo Tree Search == | + | == <span id="Monte Carlo Tree Search (MCTS)"></span>Monte Carlo Tree Search (MCTS) == |
[http://www.youtube.com/results?search_query=Monte+Carlo+Tree+Search+Artificial Youtube search...] | [http://www.youtube.com/results?search_query=Monte+Carlo+Tree+Search+Artificial Youtube search...] | ||
[http://www.google.com/search?q=Monte+Carlo+Tree+Search+Artificial ...Google search] | [http://www.google.com/search?q=Monte+Carlo+Tree+Search+Artificial ...Google search] | ||
| − | * [ | + | * [http://en.wikipedia.org/wiki/Monte_Carlo_tree_search Monte Carlo Tree Search (MCTS) | Wikipedia] |
| − | |||
* [[Markov Decision Process (MDP)]] | * [[Markov Decision Process (MDP)]] | ||
| Line 47: | Line 63: | ||
<youtube>3gcLRU24-w0</youtube> | <youtube>3gcLRU24-w0</youtube> | ||
<youtube>Nd3m9mY9rXo</youtube> | <youtube>Nd3m9mY9rXo</youtube> | ||
| + | |||
| + | |||
| + | http://cdn-images-1.medium.com/max/1349/1*ioDQJWtRffT7LlIiwwOHXw.jpeg | ||
Latest revision as of 13:11, 19 March 2023
YouTube ... Quora ...Google search ...Google News ...Bing News
- Model Free Reinforcement learning algorithms (Monte Carlo, SARSA, Q-learning) | Madhu Sanjeevi (Mady) - Medium
- Google DeepMind AlphaGo Zero
- An Introduction to Monte Carlo Techniques in Artificial Intelligence | Todd W. Neller
- Cross-Validation
- Bayes
- Reinforcement Learning (RL)
- Monte Carlo (MC) Method - Model Free Reinforcement Learning
- Markov Decision Process (MDP)
- State-Action-Reward-State-Action (SARSA)
- Q Learning
- Deep Reinforcement Learning (DRL) DeepRL
- Distributed Deep Reinforcement Learning (DDRL)
- Evolutionary Computation / Genetic Algorithms
- Actor Critic
- Hierarchical Reinforcement Learning (HRL)
A broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. Their essential idea is using randomness to solve problems that might be deterministic in principle. They are often used in physical and mathematical problems and are most useful when it is difficult or impossible to use other approaches. Monte Carlo methods are mainly used in three problem classes:[1] optimization, numerical integration, and generating draws from a probability distribution.
Monte Carlo Tree Search (MCTS)
Youtube search... ...Google search
Monte Carlo Simulation
Youtube search... ...Google search