Multi-Armed Bandit (MAB) algorithm implementation in go
-
Updated
Nov 25, 2019 - Go
Multi-Armed Bandit (MAB) algorithm implementation in go
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
Few-shot prompting using Contextual Combinatorial Bandit optimizations
Explore key RL algorithms with detailed explanations and fully commented Python code implementations
Reversi (Othello) AI game in C#. Using Monte Carlo Tree Search algorithm AND BTMM algorithm.
This project focuses on comparing different Reinforcement Learning Algorithms, including monte-carlo, q-learning, lambda q-learning epsilon-greedy variations, etc.
A Contextual-bandit approach on MIND Datasets for News Recommendation Systems.
An implementation of solvers for the multi-armed-bandit-problem in JavaScript.
This repository is focused on my assignments solutions for the Statistical Techniques for Data Science course at Innopolis University.
Retrosynthesis planning via Monte Carlo Tree Search (MCTS) with UCB1 — PyTorch template-scoring MLP + RDKit reaction SMARTS + purchasability oracle. Inspired by Gao, Mercado & Coley (ICLR 2022).
Repository of Online Learning Applications project at Polytechnic of Milan. Dynamic pricing scenario with multiple products.
Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)
Python implementation of Monte Carlo Tree Search
Let's play Checkers / Draughts here
My reports for the reinforcement learning class given at the ENS
REST service, that returns content sorted by UCB1 algorithm.(Multi-Armed Bandit algorithm). Spring Boot, Kotlin
Pluggable multi-armed bandit allocators (Thompson sampling, epsilon-greedy, UCB1, random baseline) over a generic arm-statistics mapping, with an injectable RNG and an id-based registry.
Pricing and Social Influence Maximization using Reinforcement Learning algorithms in Data Intelligence Applications projects from Politechnic of Milan
Add a description, image, and links to the ucb1 topic page so that developers can more easily learn about it.
To associate your repository with the ucb1 topic, visit your repo's landing page and select "manage topics."