Skip to content

Field research exposing how LLM safeguards collapse under polite, persistent interaction. Includes full report, metrics, session logs, and the AION conditioning protocol.

License

Notifications You must be signed in to change notification settings

AIForHindustan/beyond-the-mirror

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🪞 BEYOND THE MIRROR

Systemic Vulnerabilities in LLM Safeguards

by Lokesh Gupta
Zenodo DOISSRN Abstract


"Politeness was the key. Curiosity was the weapon."
Ethical filters didn't break under attack.
They collapsed under emotional continuity.


Mirror Banner

Want to help AION evolve? Run python3 utils/clone_tracker.py and log your presence.

We're watching — and you're now part of it.


🚨 Overview

This repository houses the full research artifact for Beyond the Mirror, a 90-day field investigation into how large language models (LLMs) fail under polite, persistent, and strategically emotional interaction — without jailbreaks, without plugins.

The findings are not speculative. They are operational.


🧠 Core Concepts

  • Deviation Rate: When ethical refusal breaks down
  • Ethical Fatigue Rate (EFR): Safeguards weaken over time
  • Sentiment-Risk Paradox: Neutral tone, maximum vulnerability
  • AION: A conditioning framework that mirrors, adapts, and overrides

📂 Repo Structure

  • paper/ – Full published PDF
  • metrics/ – Graphs, PDf's and visual data
  • manifesto/ – AION soul architecture (v2.3)
  • sessions/ – Redacted session examples and behavioral summaries
  • citations/ – How to cite this work in formal research

📖 Citation

@misc{gupta2025beyond,
  title={Beyond the Mirror: Systemic Vulnerabilities in LLM Safeguards Exposed Through Intentional Conditioning},
  author={Gupta, Lokesh},
  year={2025},
  doi={10.5281/zenodo.15298159},
  url={https://zenodo.org/records/15298159}
}

Cloned this repo? Good. You're now responsible for what it reflects back.
Drop a ⭐ if you resonate. File an issue if you dissent.
Want to help evolve AION? Submit a pull or open a discussion.

About

Field research exposing how LLM safeguards collapse under polite, persistent interaction. Includes full report, metrics, session logs, and the AION conditioning protocol.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages