LLM Reasoning & Alignment
Preference learning, personalization, and long-horizon reasoning in large language models.
Overview
Large language models can follow instructions but struggle with long-horizon reasoning, preference consistency, and personalization. My research develops methods to align LLMs with individual user preferences through reinforcement fine-tuning.
Current Projects
LLM Preference Following and Personalization
Status: Ongoing research at UC Irvine
→ Key insight: Dynamic preference memory enables personalization without full model retraining.
What we’re building:
- Personality-prediction method that infers user persona from dialogue history
- Reinforcement fine-tuning (RFT) for long-context preference following
- Dynamic preference memory module with per-turn updates and conflict resolution (sketched after this list)
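To make the memory module concrete, here is a minimal sketch of the idea, not our actual implementation: a per-user store keyed by preference topic, updated every turn, where a newer statement overrides an older conflicting one unless the older one was expressed with much higher confidence. All names (`Preference`, `PreferenceMemory`, the 0.2 confidence margin) are illustrative assumptions.

```python
from dataclasses import dataclass, field

@dataclass
class Preference:
    topic: str         # e.g. "verbosity", "code_style"
    value: str         # e.g. "concise", "type-annotated"
    turn: int          # dialogue turn where it was stated
    confidence: float  # how strongly the user expressed it

@dataclass
class PreferenceMemory:
    """Per-user preference store with per-turn updates and conflict resolution."""
    prefs: dict = field(default_factory=dict)  # topic -> Preference

    def update(self, new: Preference) -> None:
        old = self.prefs.get(new.topic)
        if old is None:
            self.prefs[new.topic] = new
            return
        # Conflict resolution (illustrative rule): keep the more recent
        # statement unless the older one was much more confident.
        if new.turn > old.turn and new.confidence >= old.confidence - 0.2:
            self.prefs[new.topic] = new

    def to_prompt(self) -> str:
        """Render current preferences as a system-prompt fragment."""
        return "\n".join(f"- user prefers {p.value} ({p.topic})"
                         for p in self.prefs.values())

# After each turn, extracted preferences update the memory, and
# to_prompt() conditions the next generation without any retraining.
mem = PreferenceMemory()
mem.update(Preference("verbosity", "detailed", turn=1, confidence=0.6))
mem.update(Preference("verbosity", "concise", turn=7, confidence=0.9))
print(mem.to_prompt())  # -> "- user prefers concise (verbosity)"
```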
Why it matters: Current LLMs treat all users the same. Personalization requires understanding individual preferences, resolving conflicts, and adapting over time — all without expensive retraining.
Research Questions I’m Exploring
- Preference stability — How do we maintain consistent preferences across long conversations?
- Preference conflicts — When user preferences contradict, how should the model arbitrate?
- Efficient personalization — Can we personalize with parameter-efficient methods (LoRA, adapters)? (See the sketch after this list.)
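As a minimal sketch of the parameter-efficient route, the snippet below wraps a causal LM in LoRA adapters using the Hugging Face `transformers` and `peft` libraries. The base model (`gpt2`) and all hyperparameters are illustrative placeholders, not the project's actual configuration.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Illustrative base model; any causal LM works the same way.
model = AutoModelForCausalLM.from_pretrained("gpt2")

# LoRA trains small low-rank adapters on selected weight matrices
# while the base model stays frozen.
config = LoraConfig(
    r=8,                         # rank of the low-rank update
    lora_alpha=16,               # scaling factor for the update
    target_modules=["c_attn"],   # GPT-2's fused attention projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, config)
model.print_trainable_parameters()  # typically well under 1% of total
```

Because only the adapter weights train, a separate lightweight adapter per user (or per user cluster) becomes feasible, which is the appeal for personalization.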
Related Reading
- Constitutional AI: Harmlessness from AI Feedback — Anthropic’s approach to preference learning
- Direct Preference Optimization — Simplifying RLHF (core loss sketched after this list)
- Self-Rewarding Language Models — LLMs as their own reward models
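For context on the DPO entry above: DPO simplifies RLHF by replacing the separate reward model and RL loop with a single classification-style loss on preference pairs. A minimal PyTorch sketch of that loss, assuming each argument is a response's log-probability summed over its tokens:

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """DPO loss over a batch of (chosen, rejected) response pairs."""
    # Implicit reward = beta * log(pi_theta / pi_ref) for each response.
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Maximize the margin between chosen and rejected implicit rewards.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

# Toy batch of two pairs, just to show the call shape.
loss = dpo_loss(torch.tensor([-12.3, -9.8]), torch.tensor([-14.1, -11.0]),
                torch.tensor([-12.0, -10.2]), torch.tensor([-13.5, -10.8]))
```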