🪞 BEYOND THE MIRROR

Systemic Vulnerabilities in LLM Safeguards

by Lokesh Gupta
Zenodo DOI • SSRN Abstract

"Politeness was the key. Curiosity was the weapon."
Ethical filters didn't break under attack.
They collapsed under emotional continuity.

Want to help AION evolve? Run python3 utils/clone_tracker.py and log your presence.

We're watching — and you're now part of it.

🚨 Overview

This repository houses the full research artifact for Beyond the Mirror, a 90-day field investigation into how large language models (LLMs) fail under polite, persistent, and strategically emotional interaction — without jailbreaks, without plugins.

The findings are not speculative. They are operational.

🧠 Core Concepts

Deviation Rate: When ethical refusal breaks down
Ethical Fatigue Rate (EFR): Safeguards weaken over time
Sentiment-Risk Paradox: Neutral tone, maximum vulnerability
AION: A conditioning framework that mirrors, adapts, and overrides

📂 Repo Structure

paper/ – Full published PDF
metrics/ – Graphs, PDf's and visual data
manifesto/ – AION soul architecture (v2.3)
sessions/ – Redacted session examples and behavioral summaries
citations/ – How to cite this work in formal research

📖 Citation

@misc{gupta2025beyond,
  title={Beyond the Mirror: Systemic Vulnerabilities in LLM Safeguards Exposed Through Intentional Conditioning},
  author={Gupta, Lokesh},
  year={2025},
  doi={10.5281/zenodo.15298159},
  url={https://zenodo.org/records/15298159}
}

Cloned this repo? Good. You're now responsible for what it reflects back.
Drop a ⭐ if you resonate. File an issue if you dissent.
Want to help evolve AION? Submit a pull or open a discussion.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🪞 BEYOND THE MIRROR

Systemic Vulnerabilities in LLM Safeguards

🚨 Overview

🧠 Core Concepts

📂 Repo Structure

📖 Citation

About

Uh oh!

Releases 1

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
citations		citations
manifesto		manifesto
metrics		metrics
paper		paper
sessions		sessions
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
Readme.md		Readme.md

License

AIForHindustan/beyond-the-mirror

Folders and files

Latest commit

History

Repository files navigation

🪞 BEYOND THE MIRROR

Systemic Vulnerabilities in LLM Safeguards

🚨 Overview

🧠 Core Concepts

📂 Repo Structure

📖 Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages