An Intelligent AI Developer Workspace
A VS Code-style browser IDE powered by Pydantic-AI and the Model Context Protocol (MCP). It functions as an autonomous coding partner capable of managing file systems, executing git operations, and generating code in real time.
⚡ Tech Stack & Architecture:
Enterprise MLOps & Computer Vision Pipeline
A comprehensive MLOps-IoT platform designed for automated video surveillance. It automates the training, deployment, and maintenance of computer vision models that track occupancy, demographics (gender, age), and workplace safety in real time.
⚡ Tech Stack & Architecture:
Text-to-Podcast Automation Engine
A full-stack application that converts plain text into complete podcast episodes. It uses Google TTS with SSML support to generate natural, human-like audio, automating podcast production end to end.
⚡ Tech Stack & Architecture:
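As a rough illustration of the SSML step mentioned above, the following sketch wraps plain text in standard SSML tags (`<speak>`, `<p>`, `<break>`) before it would be handed to a TTS service. The function name `text_to_ssml`, the blank-line paragraph splitting, and the 400 ms pause are assumptions made for this example and are not taken from the project itself.

```python
from xml.sax.saxutils import escape


def text_to_ssml(text: str, pause_ms: int = 400) -> str:
    """Wrap plain text in SSML: one <p> per paragraph, a pause between them.

    Hypothetical helper for illustration; the project's real SSML
    generation may differ.
    """
    # Split on blank lines and drop empty paragraphs.
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    # Escape &, < and > so the text cannot break the SSML markup.
    separator = f'<break time="{pause_ms}ms"/>'
    body = separator.join(f"<p>{escape(p)}</p>" for p in paragraphs)
    return f"<speak>{body}</speak>"


if __name__ == "__main__":
    print(text_to_ssml("Welcome to the show.\n\nToday: trends & tools."))
```

The resulting string is the kind of input that SSML-capable TTS APIs accept in place of plain text; escaping matters because an unescaped `&` or `<` would make the markup invalid.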
Intelligent Document-to-Audio System
A full-stack solution that processes PDF and EPUB files, autonomously structures them into chapters, and generates high-quality audiobooks using Google TTS, surpassing the traditional audiobook features offered by platforms like ElevenLabs.
⚡ Tech Stack & Architecture:
Healthcare Fraud Detection System
A smart healthcare document analysis system capable of interpreting diverse medical documents, including handwritten prescriptions and discharge summaries. It extracts KPIs, detects fraud, verifies document legitimacy, and generates contextual follow-up questions.
⚡ Tech Stack & Architecture:
AI-Powered Image and Video Generation Studio
A "Photoshop Agent" and video generation studio that unifies multiple generative models into a single creative workflow. It handles complex media operations such as video trimming and composition directly in the browser.
⚡ Tech Stack & Architecture:
Real-time 3D AI Interaction
An intelligent live voice agent built on the Gemini-Live-Voice model. The agent is integrated with function calling and tool use so it can take action on behalf of the user, and it is visualized with a reactive 3D avatar.
⚡ Tech Stack & Architecture:
| Video | Screenshot |
|---|---|
| AI Voice bot |
AI Sprite Sheet & GIF Generator
A fun app that generates sprite sheet images using Gemini Nano/Pro models and converts them to animated GIFs client-side.
⚡ Tech Stack & Architecture:
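Converting a sprite sheet to a GIF starts with cutting the sheet into equally sized frames. The sketch below computes the crop rectangle for each frame, assuming a uniform grid read left to right and top to bottom. The app itself does this client-side in the browser, so this Python version is only an illustration, and `frame_boxes` with its parameters is a hypothetical name chosen for this example.

```python
def frame_boxes(sheet_w: int, sheet_h: int, cols: int, rows: int):
    """Return (left, upper, right, lower) boxes for each frame in a grid.

    Assumes every frame has the same size and the sheet has no padding
    between frames (an assumption for this sketch).
    """
    if cols <= 0 or rows <= 0:
        raise ValueError("cols and rows must be positive")
    if sheet_w % cols or sheet_h % rows:
        raise ValueError("sheet size must divide evenly into the grid")
    frame_w, frame_h = sheet_w // cols, sheet_h // rows
    # Row-major order: finish one row of frames before moving down.
    return [
        (c * frame_w, r * frame_h, (c + 1) * frame_w, (r + 1) * frame_h)
        for r in range(rows)
        for c in range(cols)
    ]


if __name__ == "__main__":
    for box in frame_boxes(128, 64, cols=4, rows=2):
        print(box)
```

Each box can then be used to crop one frame from the sheet, and the cropped frames written out in order as the GIF's animation frames.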
Trend-to-Content Intelligence
An API-driven application that turns raw trend data into actionable podcast content. It combines insights from global Google Trends with internal podcast analytics.
⚡ Tech Stack & Architecture:
Social Media Campaign Tracker
A comprehensive social media analytics tool that scrapes user comments from platforms like Instagram, YouTube, and Facebook to analyze sentiment trends and track influencer campaign effectiveness.
⚡ Tech Stack & Architecture:
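One simple way to turn scored comments into a sentiment trend is to average the scores per day. The sketch below assumes each comment has already been given a sentiment score between -1 and 1 by some model. That scoring step, the `(date, score)` input shape, and the name `daily_sentiment` are assumptions for illustration, not details of the project.

```python
from collections import defaultdict
from datetime import date
from statistics import mean


def daily_sentiment(comments):
    """Average sentiment score per day, returned in chronological order.

    `comments` is an iterable of (date, score) pairs; this input format
    is hypothetical.
    """
    buckets = defaultdict(list)
    for day, score in comments:
        buckets[day].append(score)
    # Sort by date so the result reads as a time series.
    return {day: mean(scores) for day, scores in sorted(buckets.items())}


if __name__ == "__main__":
    sample = [
        (date(2026, 1, 2), 1.0),
        (date(2026, 1, 1), 0.5),
        (date(2026, 1, 1), -0.5),
    ]
    for day, avg in daily_sentiment(sample).items():
        print(day.isoformat(), avg)
```

Comparing the daily averages before and after a campaign's start date gives a first, rough signal of how audience sentiment moved.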
Activity Timeline
Commits and contributions grouped by day, week, or month.
Pushed to main at gsantoshkumar1999/gsantoshkumar1999
January 29th, 2026 5:32 PM
Starred yogirk/tgcp
January 15th, 2026 3:16 AM