Case study · 2024 — Present
Datasprings — Data Automation & API Platform
Django + Neo4j employment platform with scraping pipelines, RAG-based search, and cloud-native deployment.
- Role
- Software Engineer — Backend / Product Owner
- Company
- NEWPRODATA
- Year
- 2024 — Present
- Status
- Shipped
01 — Problem
The employment data space is fragmented — dozens of sources, inconsistent formats, stale records, and no trustworthy verification layer. Recruiters and platform users were blocked on slow, manual reconciliation.
02 — Approach
Designed a Django monolith that orchestrates scraping, normalization, verification, and graph-backed search. Built Selenium + BeautifulSoup pipelines feeding a cleaning/validation layer before writing to PostgreSQL for relational data and Neo4j / AWS Neptune for relationship queries. Wrapped everything in a REST API with JWT + SSO. Added a Retrieval-Augmented Generation layer over Bedrock so users get semantic answers instead of keyword hits. Deployed on EC2 behind Nginx + uWSGI with S3 for blobs, Lambda for async work, CloudWatch for observability.
03 — Outcome
Shipped a platform that replaced manual workflows and is now the team's source of truth. Owned every on-call incident solo — kept uptime stable while shipping new features in parallel.
-
◆
Zero-downtime deploys across 12+ production releases
-
◆
Replaced ~70% of manual verification work with automation
-
◆
Sub-second semantic search over the graph
-
◆
End-to-end ownership of 3+ parallel systems
04 — Tech stack
More work
2024 — Present
AI-Driven Workflow Engine
RAG + LLM orchestration layer that turns repetitive ops work into automated, reviewable decisions.
Read the case2024 — 2025
Timepaq — Attendance & Employee Management
Web-based attendance and HR system with biometric-device integration. Led design, delivery, and team coordination.
Read the case