Apr 2026
In progress
Tetris agent trained with PPO (Stable-Baselines3) on a custom Gymnasium environment. Uses CNN with Rotary Position Embeddings (RoPE), bit-packed uint32 board representation, exhaustive placement search via BFS, and interactive Pygame visualization.
AI
Jupyter
NumPy
Pandas
Pygame
Python
PyTorch
Reinforcement Learning


.webp&w=1920&q=75)
