Experimental AI Engineering

AI Proof of Concept Projects

A collection of hands-on AI systems built to explore LLM workflows, RAG pipelines, agentic architectures, and serverless AI execution on AWS. Each PoC targets a real-world problem and demonstrates a distinct AI engineering pattern.

4
PoC Projects
6
AI Patterns
3
Cloud Services

AI Systems Built

Four independent systems, each exploring a different AI engineering discipline.

🧠
AI Meeting Notes Summarizer
A production-ready FastAPI + LangChain service that ingests meeting transcripts and automatically generates structured notes — summaries, key decisions, and action items with owners and deadlines.
Summarization Vector Store
  • Accepts .txt transcript uploads via REST API endpoint
  • Splits and embeds transcript chunks into a Chroma vector store using local HuggingFace embeddings (all-MiniLM-L6-v2)
  • Structured output via Pydantic parser — summary, decisions, and action items with owner & deadline fields
  • LLM-powered via google/gemma-3-12b-it:free through OpenRouter with automatic retry & backoff
  • Fully containerised with Docker Compose (API + Chroma services)
Python · FastAPI · LangChain · Chroma DB · HuggingFace · OpenRouter LLM · Docker · Pydantic
Corporate meeting automation · CRM action item extraction · Legal transcript processing · HR interview notes

POST /upload-transcript → TextLoader → RecursiveCharacterTextSplitter → Chroma Vector Store → PromptTemplate → ChatOpenAI (OpenRouter) → PydanticOutputParser → MeetingOutput JSON
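The structured-output step can be sketched with stdlib dataclasses — the service itself uses a Pydantic parser via LangChain, and these field names (`summary`, `decisions`, `action_items` with `owner` and `deadline`) mirror the output described above but are illustrative:

```python
import json
from dataclasses import dataclass
from typing import List

@dataclass
class ActionItem:
    task: str
    owner: str
    deadline: str  # e.g. an ISO date string like "2024-08-01"

@dataclass
class MeetingOutput:
    summary: str
    decisions: List[str]
    action_items: List[ActionItem]

def parse_llm_output(raw: str) -> MeetingOutput:
    """Validate the LLM's JSON reply against the expected schema.

    Raises KeyError/TypeError on a malformed reply, which is where a
    retry-with-backoff loop would kick in.
    """
    data = json.loads(raw)
    return MeetingOutput(
        summary=data["summary"],
        decisions=list(data["decisions"]),
        action_items=[ActionItem(**item) for item in data["action_items"]],
    )
```

Parsing into a typed schema (rather than consuming free-form text) is what makes the retry logic meaningful: a reply that fails validation is re-requested instead of silently passed downstream.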
🔍
Enterprise Policy Q&A — RAG System
A locally deployed Retrieval-Augmented Generation pipeline for querying enterprise policy PDFs. Ask natural language questions and receive grounded answers with source citations and LLM-as-judge evaluation metrics.
RAG Hybrid Retrieval
  • PDF ingestion via PyMuPDF — parsed, chunked (512 chars / 64 overlap), and embedded locally
  • Hybrid retrieval: dense vector search (Qdrant + BAAI/bge-small-en-v1.5) fused with BM25 keyword search via Reciprocal Rank Fusion (RRF)
  • Cross-encoder reranking with ms-marco-MiniLM-L-6-v2 for precision-ranked results
  • Optional LLM query rewriting to expand ambiguous questions before retrieval
  • LLM-as-judge evaluation: faithfulness, context precision, context recall, and answer relevance (scored 0–1)
Python · FastAPI · Qdrant · BM25 · Cross-Encoder · PyMuPDF · HuggingFace · OpenRouter LLM · Docker
HR policy chatbot · Legal document Q&A · Compliance assistant · Internal knowledge base

PDF Upload → PyMuPDF Parser → Text Chunker → Qdrant (dense) + BM25 (keyword) → RRF Fusion → Cross-Encoder Reranker → LLM Answer Generator → Answer + Citations
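The RRF fusion step above is small enough to show in full. A minimal sketch — function name and the conventional constant `k=60` are illustrative, not taken from the project:

```python
def rrf_fuse(dense_ranking, keyword_ranking, k=60):
    """Fuse two ranked lists of doc IDs with Reciprocal Rank Fusion.

    Each document scores sum(1 / (k + rank)) over every ranking that
    contains it, so documents that appear high in both lists rise to
    the top. k=60 is the constant from the original RRF paper.
    """
    scores = {}
    for ranking in (dense_ranking, keyword_ranking):
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; this list then feeds the cross-encoder.
    return sorted(scores, key=scores.get, reverse=True)
```

RRF needs only ranks, not scores, which is why it fuses cleanly across retrievers whose raw scores (cosine similarity vs. BM25) are not comparable.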
⚡
AI Lambda Functions
A collection of serverless AWS Lambda functions that expose AI capabilities — text generation, AI image creation, a conversational chatbot, and a RAG prototype — all powered by Amazon Bedrock foundation models.
Serverless AWS Bedrock
  • textGenerationFunction — Invokes Bedrock Titan Text Express to generate creative or business content from a prompt. Configurable temperature, topP, and token count.
  • moviePosterDesignFunction — Generates AI movie poster images via Bedrock Titan Image Generator, uploads to S3, and returns a pre-signed URL (30-min expiry).
  • chatbot — Streamlit + LangChain conversational chatbot backed by AWS Bedrock (Amazon Nova Pro / DeepSeek). Maintains conversation context via ConversationSummaryBufferMemory.
  • rag — Lightweight RAG prototype with document loading, text splitting, and retrieval-augmented Q&A using Bedrock models.
Python · AWS Lambda · AWS Bedrock · Titan Text Express · Titan Image Generator · Amazon S3 · LangChain · Streamlit · boto3
Serverless content generation · AI image creation APIs · Embedded chatbots · Event-driven AI pipelines

HTTP Event / Trigger → AWS Lambda Handler → Input Validation → Bedrock InvokeModel (boto3) → Response Parsing → S3 Upload (image) / JSON Response (text)
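The text-generation handler can be sketched as follows. The model ID and request/response body shape follow Bedrock's Titan Text Express format; the default tuning values and the `event["prompt"]` field are illustrative assumptions, not the project's actual config:

```python
import json

TITAN_MODEL_ID = "amazon.titan-text-express-v1"

def build_titan_request(prompt, temperature=0.7, top_p=0.9, max_tokens=512):
    """Build the JSON body that Bedrock InvokeModel expects for Titan Text."""
    return json.dumps({
        "inputText": prompt,
        "textGenerationConfig": {
            "temperature": temperature,
            "topP": top_p,
            "maxTokenCount": max_tokens,
        },
    })

def lambda_handler(event, context):
    # boto3 is imported lazily here so the payload builder above can be
    # exercised without AWS dependencies installed.
    import boto3
    client = boto3.client("bedrock-runtime")
    resp = client.invoke_model(
        modelId=TITAN_MODEL_ID,
        body=build_titan_request(event["prompt"]),
    )
    result = json.loads(resp["body"].read())
    return {"statusCode": 200, "body": result["results"][0]["outputText"]}
```

Separating payload construction from the `invoke_model` call keeps the Bedrock request shape unit-testable without an AWS account.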
🤖
Node.js AI Agent — ToDo Manager
A terminal-based agentic AI application that accepts natural language instructions, maps intent to safe pre-defined database operations, and manages a PostgreSQL-backed todo list — without ever executing LLM-generated SQL.
AI Agent Tool Calling
  • Natural language CRUD — create, list, update, delete, and search todos via conversational input
  • Strict tool registry: LLM can only invoke one of 6 pre-defined tools (create_todo, list_todos, update_todo, delete_todo, mark_completed, search_todos)
  • All database operations use parameterised SQL — no raw LLM-generated queries ever reach the database
  • Runtime conversation memory maintains context across the session
  • OpenRouter integration with retry logic, structured logging, and configurable model selection
Node.js · OpenRouter LLM · PostgreSQL · Tool Calling · Parameterised SQL · UUID Keys · Conversation Memory
AI-powered task managers · Natural language DB interfaces · Safe agentic CRUD systems · Internal ops automation

User Natural Language Input → Agent Service → OpenRouter LLM (tool_call) → Tool Registry Validation → Repository Function → Parameterised SQL → PostgreSQL → Conversational Response
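The registry-validation step is the core safety property here. The project is Node.js; this is a language-agnostic Python sketch of the same pattern, using the six tool names listed above (the `repository` argument and `dispatch_tool_call` name are illustrative):

```python
# Allow-list of callable tools. Anything the LLM names outside this set
# is rejected before any database code runs.
TOOL_REGISTRY = {
    "create_todo", "list_todos", "update_todo",
    "delete_todo", "mark_completed", "search_todos",
}

def dispatch_tool_call(name, args, repository):
    """Validate an LLM tool_call against the registry, then dispatch.

    `repository` maps tool names to functions that execute parameterised
    SQL; the LLM's output is treated as data (tool name + arguments) and
    never reaches the database as raw SQL.
    """
    if name not in TOOL_REGISTRY:
        raise ValueError(f"Unknown tool: {name!r}")
    return repository[name](**args)
```

Because the LLM can only select a tool and supply arguments, a prompt-injected "DROP TABLE" can at worst name a nonexistent tool and be rejected.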

AI Engineering Areas Explored

Patterns and disciplines applied across these PoC projects.

📚
Retrieval-Augmented Generation (RAG)
Hybrid dense + keyword retrieval, RRF fusion, cross-encoder reranking, and grounded answer generation with source citations.
🎯
LLM Prompt Engineering
Structured output prompting with Pydantic parsers, query rewriting, system prompt design, and few-shot instruction patterns.
🤖
Agent-Based Workflows
Tool-calling agents with strict registry validation, conversation memory, safe parameterised database execution, and retry logic.
⚡
Serverless AI Execution
AWS Lambda functions invoking Bedrock foundation models for text and image generation, with S3 storage and pre-signed URL delivery.
📝
Automated Summarization Pipelines
End-to-end transcript ingestion, vector embedding, LangChain orchestration, and structured JSON output extraction for meeting notes.
🧪
LLM Evaluation (LLM-as-Judge)
Automated quality scoring using a secondary LLM to measure faithfulness, context precision, context recall, and answer relevance.

Interested in Collaborating?

Open to consulting roles, freelance AI projects, and interesting engineering challenges. Drop me a message — I usually respond within 24 hours.