Blog

Thoughts on mathematics, cryptography, decentralized systems, machine learning, and whatever I'm exploring at the moment.

Master's Thesis: Reinforcement Learning Applications

My master's thesis explores advanced reinforcement learning techniques and their applications. It features implementations of the Soft Actor-Critic (SAC) algorithm and a comprehensive analysis of modern RL methods on continuous control tasks.

Consensus algorithm for data labeling

Introducing a Bayesian approach to determining the label of a data example from the labels provided by multiple noisy annotators. In progress.