Julian Baldwin

CS @ Northwestern

Exploring OthelloGPT

This research, entitled “Exploring the Limits of OthelloGPT’s Emergent Representations”, was conducted over 10 weeks in Summer 2023 as part of UChicago’s XLab Research Fellowship.

I was supervised by David Reber and Victor Veitch. The project sought to expand on earlier work by Li et al. and Neel Nanda, which used the toy problem of predicting legal moves in the board game Othello to study world representations in transformer models.

[Code] [PDF] [Slides]