|
Research
I am interested in sequential decision making using RL. More specifically, I am interested in exploration and sample-efficient RL fine-tuning. I am also interested in building adaptive, feedback-driven models with an eye on applications such as in education and personalized learning.
(*) denotes corresponding author
-
Poly-EPO: Training Exploratory Reasoning Models.
Ifdita Hasan Orney*, Jubayer Ibn Hamid*, Shreya Ramanujam, Shirley Wu, Hengyuan Hu, Noah Goodman, Dorsa Sadigh, Chelsea Finn.
In Submission
(Paper)
-
Polychromic Objectives for Reinforcement Learning.
Jubayer Ibn Hamid*, Ifdita Hasan Orney*, Ellen Xu, Chelsea Finn, Dorsa Sadigh.
International Conference on Learning Representations (ICLR), 2026.
(Paper)
-
Can LLM-Simulated Practice and Feedback Upskill Human Counselors? A Randomized Study with 90+ Novice Counselors.
Ryan Louie*, Ifdita Hasan Orney, Juan Pablo Pacheco, Raj Sanjay Shah, Emma Brunskill, Diyi Yang.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2026.
(Website) (Paper)
(Honorable Mention Award)
|