Portfolio

Portfolio 1

title: “NanoGPT Preference Alignment (DPO) for Arithmetic Tasks” excerpt: “Implemented Direct Preference Optimization on a NanoGPT pretrained model to solve algebraic equations.
” collection: portfolio —