Portfolio 1
title: "NanoGPT Preference Alignment (DPO) for Arithmetic Tasks"
excerpt: "Implemented Direct Preference Optimization on a NanoGPT pretrained model to solve algebraic equations."
collection: portfolio
Short description of portfolio item number 2 
Published in ACM CoNEXT 2025 Workshop, 2025
We propose a reflective co-evolutionary framework that optimizes task descriptions and routing algorithms via network state feedback using LLMs.
Recommended citation: Guo, Shuhan and Yin, Nan and Liu, Shuzhi and Liu, Zhongheng and Huangfu, Wei and Yao, Quanming. (2025). "NeRM-Net: Reflective Evolution of Routing Strategies for Dynamic Communication Networks." ACM CoNEXT Workshop.
Published:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk. Note the different value in the type field; you can put anything in this field.
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.