ijamil1

Irfan Jamil ijamil1

Johns Hopkins University alumnus (MS/BS Applied Math, CS, Data Science) | Formerly Gap Inc.

Achievements

BlicketTest_CausalReasoning BlicketTest_CausalReasoning Public

Built a multi-turn RL training environment that can be used to benchmark and improve LLM causal reasoning based on the Blicket Detector Test from developmental psychology, motivated by published re…

Python
CausalReasoningEnv CausalReasoningEnv Public

Developed an environment to benchmark and train an LLM's ability to reason and identify causal effects on Directed Acyclic Graphs that represent structural causal models

Python
Unsupervised-Elicitation Unsupervised-Elicitation Public

Forked from Jiaxin-Wen/Unsupervised-Elicitation

Implementation of Internal Coherence Maximization on TruthfulQA -- results suggest unsupervised LLM alignment may be possible

Python
Persona-Elicitation Persona-Elicitation Public

Implementation of Internal Coherence Maximization for Persona Elicitation on the GlobalOpinionQA dataset

Python
Reasoning-Theater Reasoning-Theater Public

Forked from AskSid/disentangling-computation-from-cot

Python
ReinforcementLearning ReinforcementLearning Public

This repo contains scripts and notebooks in which I explore and implement common RL algorithms on agents operating in Gym envs

Jupyter Notebook