tagadearpit/Monika-AI

🌸 Monika-AI: Vision-Powered Intelligent Companion

Node.js · MongoDB · Gemini AI · License: MIT

Monika-AI is a high-performance, personality-driven interactive companion. Leveraging Gemini 1.5 Flash and browser APIs such as Web Speech and Document Picture-in-Picture, Monika doesn't just chat: she sees, hears, and follows you across your desktop with a custom Picture-in-Picture interface.

🚀 Launch Live Demo


🔥 New & Advanced Features

  • 👁️ Vision Engine: Monika can now see through your webcam. Show her objects, code, or your surroundings, and she will react in real time.
  • 🖼️ Floating Window (PiP): Multitask with ease. Use the Document Picture-in-Picture API to pop Monika into a floating, always-on-top window while you code or game.
  • 🌌 Cyber-Sakura UI: A gorgeous "Glassmorphism" interface featuring:
    • 3D Mouse Parallax: The UI panels tilt and react to your cursor movement.
    • Dynamic Mood Engine: Backgrounds and glows shift colors based on Monika's emotions ([HAPPY], [LOVING], [THINKING]).
    • Scanline Effects: A sci-fi HUD overlay for the vision feed.
  • 🎙️ Seamless Voice Loop: Low-latency speech synthesis combined with hands-free SpeechRecognition.
  • 💾 Persistent Memory: MongoDB integration ensures she remembers your name, projects (like your Pothole Portal or CandyRobot), and past conversations.
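
The 3D mouse parallax listed above can be sketched as a small pure function that maps the cursor position to tilt angles. This is only an illustration of the idea, not the project's actual `script.js`: the names `computeTilt`, `tiltToTransform`, the `.panel` selector, and the 10-degree limit are all assumptions.

```javascript
// Hypothetical helper (not from the project's script.js):
// maps a cursor position to panel tilt angles in degrees.
function computeTilt(cursorX, cursorY, viewportWidth, viewportHeight, maxDegrees = 10) {
  // Normalize the cursor to [-1, 1], with 0 at the viewport center.
  const nx = (cursorX / viewportWidth) * 2 - 1;
  const ny = (cursorY / viewportHeight) * 2 - 1;
  // Horizontal movement rotates the panel around the Y axis;
  // vertical movement rotates it around the X axis in the opposite direction.
  return {
    rotateX: -ny * maxDegrees,
    rotateY: nx * maxDegrees,
  };
}

// Builds a CSS transform string from the tilt angles.
function tiltToTransform({ rotateX, rotateY }) {
  return `perspective(1000px) rotateX(${rotateX.toFixed(2)}deg) rotateY(${rotateY.toFixed(2)}deg)`;
}

// Browser-only wiring; skipped when no DOM is available.
// ".panel" is an assumed selector.
if (typeof document !== "undefined") {
  const panel = document.querySelector(".panel");
  document.addEventListener("mousemove", (event) => {
    if (!panel) return;
    const tilt = computeTilt(event.clientX, event.clientY, window.innerWidth, window.innerHeight);
    panel.style.transform = tiltToTransform(tilt);
  });
}
```

Keeping the math separate from the event listener makes the tilt easy to tune or clamp without touching the DOM code.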

🛠️ Technical Overhaul

High-Performance Frontend

  • Document PiP API: Advanced window detachment for "Always-on-Top" functionality.
  • CSS3 Glassmorphism: Deep blurs, saturation filters, and cubic-bezier transitions.
  • Web Speech API: Native browser processing to keep server CPU usage low.
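
As a rough sketch of the window detachment mentioned above, the function below moves an existing element into a Document Picture-in-Picture window and restores it when that window closes. The name `openFloatingWindow`, its arguments, and the default size are hypothetical; the project's own PiP logic may look different.

```javascript
// Sketch only: `openFloatingWindow` and its parameters are hypothetical names.
// `win` is the browser's `window` object, passed in to keep the function testable.
async function openFloatingWindow(win, element, { width = 360, height = 480 } = {}) {
  // The Document Picture-in-Picture API is not available in every browser.
  if (!("documentPictureInPicture" in win)) {
    return null;
  }
  // In a real page this must run inside a user gesture, such as a click handler.
  const pipWindow = await win.documentPictureInPicture.requestWindow({ width, height });
  const originalParent = element.parentNode;
  // Move the existing element into the floating window's document.
  pipWindow.document.body.append(element);
  // When the floating window closes, put the element back in the main page.
  pipWindow.addEventListener("pagehide", () => {
    if (originalParent) originalParent.append(element);
  });
  return pipWindow;
}
```

In a page this would typically be called from a button's click handler with `window` as the first argument. The floating window starts with an empty document, so stylesheets have to be copied or linked into it separately.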

Optimized Backend

  • Gemini 1.5 Flash: Optimized for high-speed multimodal (Text + Image) reasoning.
  • Buffered Image Processing: Secure Base64 handling for vision frames.
  • Mongoose ODM: Structured conversation logging and user profiling.
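
One way the Base64 handling for vision frames could look is sketched below: a webcam frame arrives as a data URL (for example from `canvas.toDataURL("image/jpeg")`), gets validated, and is converted into the inline-image part shape accepted by the Gemini JavaScript SDK. The function name, the allowed MIME types, and the 4 MB limit are assumptions for illustration, not values taken from `server.js`.

```javascript
// Assumed limits for illustration; the real server may use different ones.
const ALLOWED_MIME_TYPES = new Set(["image/jpeg", "image/png", "image/webp"]);
const MAX_IMAGE_BYTES = 4 * 1024 * 1024;

// Hypothetical helper: validates a base64 data URL and returns an
// inline-image part of the form { inlineData: { data, mimeType } }.
function dataUrlToInlinePart(dataUrl) {
  const match = /^data:([a-z]+\/[a-z0-9.+-]+);base64,([A-Za-z0-9+/]+={0,2})$/.exec(dataUrl);
  if (!match) {
    throw new Error("Expected a base64 data URL");
  }
  const [, mimeType, data] = match;
  if (!ALLOWED_MIME_TYPES.has(mimeType)) {
    throw new Error(`Unsupported image type: ${mimeType}`);
  }
  // Decode once to measure the real payload size before forwarding it.
  const byteLength = Buffer.from(data, "base64").length;
  if (byteLength > MAX_IMAGE_BYTES) {
    throw new Error("Image is too large");
  }
  return { inlineData: { data, mimeType } };
}
```

The returned object can be passed alongside a text prompt in a multimodal request. Rejecting unexpected types and oversized frames early keeps malformed uploads away from the model call.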

📁 Project Structure

Monika-AI/
├── backend/
│   ├── server.js        # Express, Gemini Vision Logic & MongoDB
│   └── package.json     # Node dependencies
├── public/
│   ├── index.html       # Multimodal UI & PiP entry point
│   ├── style.css        # 3D effects, Glassmorphism & Mood themes
│   └── script.js        # Vision capture, PiP Logic & Voice Loop

🚀 Installation

  1. Clone & Install

    git clone https://github.com/tagadearpit/Monika-AI.git
    cd Monika-AI/backend
    npm install
  2. Environment Setup: create a .env file in backend/:

    PORT=3000
    GEMINI_API_KEY=your_key_here
    MONGO_URI=your_mongodb_atlas_uri
  3. Run

    npm start

    Recommended: Use Google Chrome for full Vision and PiP support. The Document Picture-in-Picture API is currently limited to Chromium-based browsers.
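
A backend that depends on the three variables from the .env file may want to fail fast when one is missing. The sketch below shows one possible check; `loadConfig` is a hypothetical helper, and the fallback to port 3000 is an assumption based on the example value above.

```javascript
// Hypothetical helper: validates the settings from the .env example.
// `env` is usually `process.env` after the .env file has been loaded.
function loadConfig(env) {
  const missing = ["GEMINI_API_KEY", "MONGO_URI"].filter((name) => !env[name]);
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }
  // Assumed default: fall back to 3000 when PORT is not set.
  const port = Number.parseInt(env.PORT ?? "3000", 10);
  if (!Number.isInteger(port) || port < 1 || port > 65535) {
    throw new Error(`Invalid PORT: ${env.PORT}`);
  }
  return {
    port,
    geminiApiKey: env.GEMINI_API_KEY,
    mongoUri: env.MONGO_URI,
  };
}
```

Calling `loadConfig(process.env)` at startup turns a missing API key into an immediate, readable error instead of a failure on the first chat request.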


🎨 Mood & Interaction System

Monika's environment reacts to her internal state:

  • [HAPPY]: Soft violet-blue gradients.
  • [LOVING]: Pulsing pink "Heart-Glow" animations.
  • [THINKING]: Deep space-blue with increased scanline intensity.
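
A minimal sketch of how the mood tags could drive the UI, assuming the model's reply carries one of the tags inline: the tag is extracted, stripped from the visible text, and mapped to a CSS class. The class names, the `extractMood` helper, and the `HAPPY` fallback are assumptions made for this example.

```javascript
// Assumed CSS class names; the project's style.css may use different ones.
const MOOD_THEMES = {
  HAPPY: "mood-happy",
  LOVING: "mood-loving",
  THINKING: "mood-thinking",
};

// Hypothetical helper: finds the first mood tag in a reply, removes all
// mood tags from the text, and returns the matching theme class.
function extractMood(reply, fallback = "HAPPY") {
  const match = /\[(HAPPY|LOVING|THINKING)\]/.exec(reply);
  const mood = match ? match[1] : fallback;
  const text = reply.replace(/\[(HAPPY|LOVING|THINKING)\]\s*/g, "").trim();
  return { mood, text, themeClass: MOOD_THEMES[mood] };
}
```

In the browser, the returned `themeClass` could be applied to the page body so that background and glow styles switch with each reply, while `text` is what gets displayed and spoken.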

🤝 Contributing

Have ideas for a more "glitchy" aesthetic or better vision prompts? PRs are welcome!

Developed with ❤️ by Arpit Tagade



About

💕 Monika-AI: an intelligent, charming AI companion with a cute frontend and a powerful backend for engaging conversations.
