Pinned Loading
-
BlicketTest_CausalReasoning
BlicketTest_CausalReasoning PublicBuilt a multi-turn RL training environment that can be used to benchmark and improve LLM causal reasoning based on the Blicket Detector Test from developmental psychology, motivated by published re…
Python
-
CausalReasoningEnv
CausalReasoningEnv PublicDeveloped an environment to benchmark and train an LLM's ability to reason and identify causal effects on Directed Acyclic Graphs that represent structural causal models
Python
-
Unsupervised-Elicitation
Unsupervised-Elicitation PublicForked from Jiaxin-Wen/Unsupervised-Elicitation
Implementation of Internal Coherence Maximization on TruthfulQA -- results suggest unsupervised LLM alignment may be possible
Python
-
Persona-Elicitation
Persona-Elicitation PublicImplementation of Internal Coherence Maximization for Persona Elicitation on the GlobalOpinionQA dataset
Python
-
Reasoning-Theater
Reasoning-Theater PublicForked from AskSid/disentangling-computation-from-cot
Python
-
ReinforcementLearning
ReinforcementLearning PublicThis repo contains scripts and notebooks in which I explore and implement common RL algorithms on agents operating in Gym envs
Jupyter Notebook
If the problem persists, check the GitHub status page or contact support.
