Writing
Blog
Technical deep-dives, production challenges, and architecture patterns from real AWS deployments.
How I Designed a Disaster Recovery Architecture That Achieves Sub-40-Minute RPO & RTO in Production
Disaster recovery is a business problem before it's a technical one. The right strategy starts with a single question: what can the business actually afford to lose? With a tolerance of up to one hour of data loss and downtime, a Pilot Light architecture on AWS proved to be the ideal fit — keeping non-compute infrastructure live in a secondary region at all times, while provisioning compute only at failover. Layered data replication, parallel CI/CD pipelines, and fully automated CloudFormation scripts bring the total recovery time to well under 40 minutes — validated through quarterly DR drills. The key insight: over-engineered DR is a hidden cost, and under-tested DR is a hidden risk.
April 12, 2026
Read →Key Skills for Aspiring Cloud Architects in 2025
January 10, 2025
Read →Mastering AWS Cost Optimization: Strategies for Efficiency - Part 01
February 12, 2024
Read →