Hi, my name is

Narayan Kundgir

|

About Me

Narayan Kundgir

Cloud Infrastructure Engineer with 12 years of experience in cloud architecture, platform engineering, and infrastructure automation. Bilingual in English and Japanese (JLPT N2). Built and managed all AWS cloud infrastructure independently at GMO GlobalSign Japan for 3+ years — 10 production services end-to-end, covering architecture, cost optimization, disaster recovery, security, and production operations.

Beyond infrastructure, I build internal platforms and developer tooling that enable engineering teams to ship faster — self-service environments, automated cost controls, and deployment orchestration. Now growing the infrastructure team and establishing scalable practices.

12+ Years Experience
10 Production Services
6 Certifications
7 Technical Articles

Experience

Cloud Infrastructure Engineer

GMO GlobalSign Japan May 2022 — Present · Tokyo, Japan

Built and managed all cloud infrastructure independently for 3+ years — 10 services (Ruby/Go/Java, PostgreSQL) across staging, pre-production, and production. Now growing the team.

Infrastructure & Architecture

  • AWS: EC2, ECS (Fargate), Aurora PostgreSQL, ElastiCache (Redis), OpenSearch, MSK (Kafka), Amazon MQ, WAF, Route53, Lambda, EventBridge
  • Monitoring: CloudWatch, New Relic, Pandora FMS
  • All infrastructure as code in Terraform (migrated 0.12→1.3), deployed via Bitbucket Pipelines
  • Evaluated EKS for StatefulSet and persistent storage requirements

Cost Optimization — 61% non-prod, 30% production savings

  • Designed per-developer staging with modular Terraform + Lambda cost optimizer (Python/EventBridge) with developer self-service controls — 61% reduction (168→65 hrs/week)
  • Implemented Reserved Instances and Savings Plans (RDS, EC2, Fargate) — 30% production cost reduction

Reliability & DR

  • Aurora Multi-AZ failover — GMO Awards 2025 for 3-minute AWS outage recovery
  • Designing cross-region standby (Tokyo→Osaka): DMS replication, AMI sync, DNS failover

Security

  • Layered defense: AWS WAF at ALB (managed + custom rules) with Trend Micro IPS on host instances

Operations & Maintenance

  • Aurora engine upgrades, ECS task definitions, OS patching, Docker image updates, SSL/RI renewals, Terraform state management
  • Monthly cost analysis with Trusted Advisor; IAM policy audits; DNS management (Route53, Cloudflare); alert threshold tuning; yearly DR exercises

Automation

  • Release Orchestrator: YAML-driven automated deployment scheduling across service repos
  • ECS rolling deployments for zero-downtime releases

Senior Software Engineer (Lead SDET)

Fast Retailing Aug 2018 — May 2022 · Tokyo, Japan

Led test automation and CI/CD pipeline engineering, progressively building infrastructure automation capabilities.

  • Designed and owned CI/CD pipelines in Jenkins (Groovy), integrating automated testing into deployment workflows
  • Built test automation frameworks (RestAssured, TestNG, Cucumber, Selenium, Gatling) adopted across multiple teams
  • Trained engineering teams on test development, improving velocity and code quality
  • Adopted Docker, Kubernetes, AWS, Terraform, and Ansible — transitioning from quality engineering into DevOps

Software Consultant (SDET)

Rakuten Aug 2017 — Jul 2018 · Tokyo, Japan

Automated testing for large-scale e-commerce web services, with exposure to containerized and cloud-native monitoring environments.

  • Built automated test suites for web services using REST-Assured, Gatling, Postman, TestNG, and Cucumber
  • Developed CI/CD automation workflows in Jenkins, enabling faster release cycles
  • Worked with Docker, Kubernetes, and cloud monitoring tools (Datadog, New Relic) in an Agile delivery environment
  • Authored test plans, policies, and executive reports for stakeholder visibility

Software Engineer — Performance Engineering

NTT DATA Services Jul 2014 — Aug 2017 · Pune, India

Systems-level performance engineering — profiling, tuning, and optimizing infrastructure, databases, and application servers.

  • Designed and executed end-to-end performance testing strategies for web applications using JMeter, establishing NFR baselines and workload models
  • Tuned application and database infrastructure: Apache, Tomcat, Oracle, MySQL, PostgreSQL, DB2 — resolving bottlenecks across the full stack
  • Monitored and profiled applications using Dynatrace APM, jconsole, jprofiler, and jvisualVM for deep performance analysis
  • Diagnosed production latency issues through network trace analysis using browser developer tools and packet capture
  • Delivered performance reports and executive summaries translating technical findings into stakeholder decisions

Certifications

Certified Kubernetes Administrator

CKA

Certified Kubernetes Application Developer

CKAD

Terraform Associate

Terraform Associate

Docker Certified Associate

Docker Certified

DevOps Basics

DevOps Basics

Ansible

Ansible

BCP-DR

BCP-DR

Bitbucket Pipelines

Bitbucket Pipelines

AWS Solutions Architect Associate

AWS SAA-C03

JLPT N2

JLPT N2

JLPT N3

JLPT N3

JLPT N4

JLPT N4

Claude Code in Action

Claude Code in Action

Skills

Cloud (AWS)

EC2 ECS EKS Fargate Aurora ElastiCache OpenSearch MSK Lambda EventBridge WAF Route53 S3 DMS CloudWatch

IaC & CI/CD

Terraform Ansible Bitbucket Pipelines GitHub Actions Jenkins

Containers

Docker Kubernetes ECS EKS

Monitoring

CloudWatch New Relic Prometheus Grafana Loki Alertmanager

Security

AWS WAF Trend Micro IAM

Programming

Python Shell Scripting Groovy

Databases

Aurora PostgreSQL Redis

Publications

Languages

English Full Professional
Japanese JLPT N2
Hindi Professional
Marathi Native