ucb1

Here are 22 public repositories matching this topic...

alextanhongpin / go-bandit

Multi-Armed Bandit (MAB) algorithm implementation in Go

go ucb1 mulit-arm-bandit greedy-epsilon

Updated Nov 25, 2019
Go

akshaykhadse / reinforcement-learning

Implementations of basic concepts under the Reinforcement Learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

Updated May 21, 2018
Python

viswanath57 / Bandit-Algorithms

algorithms epsilon-greedy multiarm-bandit softmax-algorithm ucb1

Updated Apr 5, 2021
Jupyter Notebook

gokhanmeteerturk / adaptive-shots

Few-shot prompting using Contextual Combinatorial Bandit optimizations

python reinforcement-learning ai contextual-bandits few-shot ucb1

Updated Dec 19, 2024
Python

Pegah-Ardehkhani / Reinforcement-Learning-Algorithms-from-Scratch

Explore key RL algorithms with detailed explanations and fully commented Python code implementations

reinforcement-learning monte-carlo q-learning thompson-sampling epsilon-greedy reinforcement-learning-algorithms sarsa rl policy-iteration value-iteration deep-q-learning reinforcement-learning-agent ucb1 td-lambda reinforcement-learning-environments td-0 optimistic-inital-values iterative-policy-evaluation

Updated Dec 8, 2024
Jupyter Notebook

HoangTran0410 / Reversi-mcts

Reversi (Othello) game AI in C#, using the Monte Carlo Tree Search and BTMM algorithms.

board-game machine-learning csharp bitboard mcts monte-carlo-tree-search othello-game reversi-game ucb1 othello-ai mcts-algorithm

Updated May 31, 2021
C#

kochlisGit / Reinforcement-Learning-Algorithms

This project compares different reinforcement learning algorithms, including Monte Carlo, Q-learning, lambda Q-learning, epsilon-greedy variations, etc.

python reinforcement-learning monte-carlo openai-gym q-learning policy rl-agents epsilon-greedy dynamic-programming markov-chains approximation-algorithms ucb1 q-lambda exploration-exploitation thomson-sampling frozen-lake multi-bandit-army

Updated Feb 15, 2022
Python

lisamandro / News-Recommendation-System-using-Reinforcement-Learning

A contextual-bandit approach to news recommendation on the MIND datasets.

python thompson-sampling epsilon-greedy reinforcement-learning-algorithms multi-armed-bandits ucb1 linucb news-recommendation-system

Updated Jul 7, 2025
Python

mykeels / multi-armed-bandit-problem

An implementation of solvers for the multi-armed bandit problem in JavaScript.

thompson-sampling epsilon-greedy multi-armed-bandit ucb1

Updated Apr 25, 2019
JavaScript

leiluk1 / stat-techniques

This repository contains my assignment solutions for the Statistical Techniques for Data Science course at Innopolis University.

statistics thompson-sampling multi-armed-bandit ucb1 dna-replication mrl98-quantile-algo

Updated May 18, 2023
Jupyter Notebook

Islamomar-1 / RetroMCTS

Retrosynthesis planning via Monte Carlo Tree Search (MCTS) with UCB1 — PyTorch template-scoring MLP + RDKit reaction SMARTS + purchasability oracle. Inspired by Gao, Mercado & Coley (ICLR 2022).

python machine-learning deep-learning cheminformatics pytorch networkx computational-chemistry mcts drug-discovery rdkit monte-carlo-tree-search organic-chemistry ucb1 template-based retrosynthesis reaction-prediction iclr2022 synthesis-planning

Updated Jun 24, 2026

Digioref / OLA-Pricing

Repository of Online Learning Applications project at Polytechnic of Milan. Dynamic pricing scenario with multiple products.

gaussian-processes polimi online-learning primal-dual online-learning-algorithms ucb1 politecnico-di-milano primal-dual-algorithms online-learning-applications

Updated Sep 19, 2025
Jupyter Notebook

VladMarianCimpeanu / OLA_project

Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)

reinforcement-learning pricing thompson-sampling multi-armed-bandit montecarlo-simulation mab ucb1 online-learning-applications

Updated Oct 30, 2022
Jupyter Notebook

sanxore / py-mcts

Python implementation of Monte Carlo Tree Search

mcts uct monte-carlo-tree-search ucb1

Updated Jan 4, 2020
Python

OMerkel / Buff-and-Green

Let's play Checkers / Draughts here

game board-game entertainment uct monte-carlo-tree-search abstract-game perfect-information 2-player-strategy-game ucb1

Updated Jun 10, 2026
JavaScript

Stepan-Makarenko / Multi-armed-bandit-research

multi-armed-bandits ucb1 e-greedy

Updated Dec 17, 2023
Jupyter Notebook

Twice22 / Reinforcement-Learning

My reports for the reinforcement learning class given at the ENS

reinforcement-learning policy-gradient reinforce policy-iteration value-iteration ucb1

Updated Jan 16, 2018
Jupyter Notebook

Nikita-Kudrin / funcorp-bandit

REST service that returns content sorted by the UCB1 algorithm (a multi-armed bandit algorithm). Spring Boot, Kotlin.

kotlin spring-boot ucb1

Updated Jan 29, 2022
Kotlin
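
For readers unfamiliar with the idea of sorting content by UCB1, here is a minimal Python sketch. It is not taken from funcorp-bandit or any other repository listed here; the function names `ucb1_score` and `rank_by_ucb1` and the `(reward_sum, pulls)` layout are hypothetical, chosen only for illustration. It assumes the standard UCB1 index, the empirical mean plus `sqrt(2 * ln(N) / n_i)`, and ranks never-shown items first.

```python
import math


def ucb1_score(reward_sum, pulls, total_pulls):
    """Standard UCB1 index for one arm (hypothetical helper).

    reward_sum  -- sum of rewards observed for this arm
    pulls       -- number of times this arm was shown
    total_pulls -- number of times any arm was shown
    """
    if pulls == 0:
        # Unseen arms get an infinite score so they are tried first.
        return math.inf
    mean = reward_sum / pulls
    bonus = math.sqrt(2.0 * math.log(total_pulls) / pulls)
    return mean + bonus


def rank_by_ucb1(stats):
    """Return item ids sorted by descending UCB1 score.

    stats maps an item id to a (reward_sum, pulls) tuple; this layout
    is an assumption made for the example.
    """
    total = sum(pulls for _, pulls in stats.values())
    return sorted(
        stats,
        key=lambda item: ucb1_score(stats[item][0], stats[item][1], total),
        reverse=True,
    )


if __name__ == "__main__":
    stats = {"a": (9.0, 10), "b": (1.0, 2), "c": (0.0, 0)}
    print(rank_by_ucb1(stats))
```

In this example the unseen item `c` ranks first, and the rarely shown item `b` ranks above `a` despite its lower mean, because its exploration bonus is larger.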

skunkworks-powerweave / multi-armed-bandit-strategies

Pluggable multi-armed bandit allocators (Thompson sampling, epsilon-greedy, UCB1, random baseline) over a generic arm-statistics mapping, with an injectable RNG and an id-based registry.

python open-source ai thompson-sampling exploration epsilon-greedy ab-testing allocation multi-armed-bandit bandit skunkworks ucb1 powerweave

Updated Jun 10, 2026
Python

EmanuelAlogna / Data-Intelligence-Applications

Pricing and social influence maximization using reinforcement learning algorithms, in Data Intelligence Applications projects from the Polytechnic of Milan

reinforcement-learning social-network pricing thompson-sampling reinforcement-learning-algorithms multi-armed-bandit ucb1 social-influence

Updated Feb 12, 2020
Python
