Lead Software Engineer, Capital One
I am a Lead Software Engineer at Capital One with 12 years of experience building web applications and leading multiple projects. I have a Master's degree in Computer Science and feel very blessed to have witnessed the evolution of software over that time. I am also an avid tech blogger who writes on Medium.com, and I have experience speaking at various tech conferences.
Today, we all live in interconnected systems where one service depends on multiple other services. Downtime in any of these downstream services or endpoints impacts the upstream services, causing severe customer impact. What if we had an intelligent mechanism to monitor and react without developer intervention? Can we predict failures and react proactively? Solution: Intelligent & Predictive Failover is a solution that identifies errors and failures even before the actual customer request arrives. It intelligently clusters the services and applies proactive monitoring with failover predictability. I will be talking about intelligent tracking of anomalies with deep health checks and latency-based region detection for traffic failover and failback, along with alarm config management, threshold-based error handling, and more. The focus will be on how to predict failures, reduce customer impact, and learn from previous patterns to handle future failures.
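To make the idea of threshold-based failover and failback more concrete, here is a minimal sketch in Python. It is not the implementation presented in the talk. The class names, region names, window size, and thresholds are all hypothetical, and the sketch covers only reactive routing driven by synthetic health-check probes, not the predictive part.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Deque, Iterable, Tuple


@dataclass
class RegionHealth:
    """Sliding window of recent probe results for one region (hypothetical)."""
    window: int = 5
    probes: Deque[Tuple[bool, float]] = field(default_factory=deque)

    def record(self, ok: bool, latency_ms: float) -> None:
        self.probes.append((ok, latency_ms))
        if len(self.probes) > self.window:
            self.probes.popleft()

    def error_rate(self) -> float:
        if not self.probes:
            return 0.0
        failures = sum(1 for ok, _ in self.probes if not ok)
        return failures / len(self.probes)

    def avg_latency_ms(self) -> float:
        # Only successful probes contribute; with none, latency is infinite,
        # so a region that has never answered is treated as unhealthy.
        latencies = [ms for ok, ms in self.probes if ok]
        return sum(latencies) / len(latencies) if latencies else float("inf")


class FailoverRouter:
    """Routes to the primary region while healthy, else the fastest healthy one."""

    def __init__(self, regions: Iterable[str], primary: str,
                 max_error_rate: float = 0.4,      # assumed threshold
                 max_latency_ms: float = 500.0):   # assumed threshold
        self.health = {name: RegionHealth() for name in regions}
        self.primary = primary
        self.active = primary
        self.max_error_rate = max_error_rate
        self.max_latency_ms = max_latency_ms

    def record_probe(self, region: str, ok: bool, latency_ms: float) -> None:
        self.health[region].record(ok, latency_ms)
        self._reevaluate()

    def _is_healthy(self, region: str) -> bool:
        h = self.health[region]
        return (h.error_rate() <= self.max_error_rate
                and h.avg_latency_ms() <= self.max_latency_ms)

    def _reevaluate(self) -> None:
        if self._is_healthy(self.primary):
            self.active = self.primary  # failback once the primary recovers
            return
        candidates = [r for r in self.health
                      if r != self.primary and self._is_healthy(r)]
        if candidates:
            # Failover: pick the healthy region with the lowest latency.
            self.active = min(candidates,
                              key=lambda r: self.health[r].avg_latency_ms())
        # With no healthy alternative, keep the current routing unchanged.


if __name__ == "__main__":
    router = FailoverRouter(["us-east", "us-west", "eu-west"], primary="us-east")
    router.record_probe("us-east", True, 100.0)
    router.record_probe("us-west", True, 120.0)
    router.record_probe("eu-west", True, 200.0)
    router.record_probe("us-east", False, 0.0)  # primary starts failing
    print(router.active)
```

Because the probes are synthetic, a failing region can be detected and traffic moved before a real customer request reaches it. A predictive variant would replace the fixed thresholds with a model trained on earlier failure patterns.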