AI & Platform Engineering Manager
MichaelTohar
Starts with building a Foundation, ends with Innovation and Productivity. Leading cross-functional teams to build resilient, scalable systems — from infrastructure to AI-powered platforms.
About Me
Platform Engineering Manager with 9+ years of experience spanning infrastructure, DevOps, and AI-driven platform engineering in high-scale e-commerce environments. I progressed from intern to Assistant Manager, building and leading cross-functional teams while delivering 30+ AI/automation projects across departments.
I built a custom high-performance Rust-based platform framework, launched an internal AI platform that centralized all AI operations, and developed multi-agent LLM/RAG systems — all while maintaining 99.9% SLA and a project portfolio exceeding $10M/yr in value.
I operate at the intersection of engineering, infrastructure, and business — bridging technology with strategy. From cloud migrations and Kubernetes at scale to intelligent automation and AI copilots, I build systems that are resilient, scalable, and aligned with organizational goals, while empowering teams to innovate faster.

Skills & Expertise
Experience
Assistant Manager — AI Strategy & Platform Engineering
Aug 2025 — Present
- Led the team's pivot into AI-driven platform engineering, establishing the AI project lifecycle from ideation to production
- Delivered 30+ AI/automation projects across 4+ departments with 90–95% accuracy, spanning compliance, dispute resolution, document extraction, and marketing intelligence
- Built multi-agent LLM systems with $0 model cost via on-prem deployment, reducing AI operational costs by up to 91%
- Developed RAG-based assistants, SQL copilots, and unified intelligence dashboards serving multiple business functions
- Conducted AI workshops across all departments, driving productivity improvements and AI adoption organization-wide
- Own engineering budgeting and deliver initiatives targeting ≥5× ROI — total portfolio value exceeds +$10M/yr
Assistant Manager — AI Strategy & Platform Engineering
Aug 2025 — Present
- Led the team's pivot into AI-driven platform engineering, establishing the AI project lifecycle from ideation to production
- Delivered 30+ AI/automation projects across 4+ departments with 90–95% accuracy, spanning compliance, dispute resolution, document extraction, and marketing intelligence
- Built multi-agent LLM systems with $0 model cost via on-prem deployment, reducing AI operational costs by up to 91%
- Developed RAG-based assistants, SQL copilots, and unified intelligence dashboards serving multiple business functions
- Conducted AI workshops across all departments, driving productivity improvements and AI adoption organization-wide
- Own engineering budgeting and deliver initiatives targeting ≥5× ROI — total portfolio value exceeds +$10M/yr
Assistant Manager — Platform Engineering
Feb 2025 — Jul 2025
- Led and mentored a 6-person cross-functional engineering team, fostering ownership, delivery, and technical excellence
- Built a custom high-performance Rust HTTP/worker framework (io_uring, async runtime) with <200ms API response time, serving as backbone for all new platform services
- Launched internal AI platform with RBAC, agent marketplace, and self-service AI tools — OKR target of 200+ DAU
- Built centralized LLM budget and routing gateway, stress-tested at 1,000 concurrent requests
- Led workflow orchestration migration from on-prem Airflow to GKE (cloud-managed Kubernetes)
- Completed legacy data access migration — retired JDBC access across 34 data services
Assistant Manager — Platform Engineering
Feb 2025 — Jul 2025
- Led and mentored a 6-person cross-functional engineering team, fostering ownership, delivery, and technical excellence
- Built a custom high-performance Rust HTTP/worker framework (io_uring, async runtime) with <200ms API response time, serving as backbone for all new platform services
- Launched internal AI platform with RBAC, agent marketplace, and self-service AI tools — OKR target of 200+ DAU
- Built centralized LLM budget and routing gateway, stress-tested at 1,000 concurrent requests
- Led workflow orchestration migration from on-prem Airflow to GKE (cloud-managed Kubernetes)
- Completed legacy data access migration — retired JDBC access across 34 data services
Assistant Manager — Data Infrastructure
Jan 2023 — Feb 2025
- Made strategic decisions regarding infrastructure changes impacting 50+ production systems
- Managed Infra Team to maintain Server SLA at 99.9%
- Established a platform solving internal problems through automation and PaaS, reducing ticket resolution time by 80%
- Led project collaboration between Local and Regional teams across 2+ countries
- Conducted Architecture Design & Review for 10+ projects
- Bridged the gap between Business and Technology — translating business needs into technical solutions
Assistant Manager — Data Infrastructure
Jan 2023 — Feb 2025
- Made strategic decisions regarding infrastructure changes impacting 50+ production systems
- Managed Infra Team to maintain Server SLA at 99.9%
- Established a platform solving internal problems through automation and PaaS, reducing ticket resolution time by 80%
- Led project collaboration between Local and Regional teams across 2+ countries
- Conducted Architecture Design & Review for 10+ projects
- Bridged the gap between Business and Technology — translating business needs into technical solutions
Team Lead & DevOps Engineer — Data Infrastructure
Oct 2021 — Feb 2023
- Managed team of 4 engineers, created Roadmap & OKR alignment, authored BRD & PRD documents
- Built infrastructure from scratch (server installation, network & DevOps tooling) supporting 50+ production workloads
- Maintained on-premises Kubernetes Cluster running 250+ pods with SLA above 99.9%
- Optimized cloud platform costs, achieving 50% reduction in monthly spend
- Deployed Platform as a Service (PaaS) for internal BI use cases, serving 100+ users
Team Lead & DevOps Engineer — Data Infrastructure
Oct 2021 — Feb 2023
- Managed team of 4 engineers, created Roadmap & OKR alignment, authored BRD & PRD documents
- Built infrastructure from scratch (server installation, network & DevOps tooling) supporting 50+ production workloads
- Maintained on-premises Kubernetes Cluster running 250+ pods with SLA above 99.9%
- Optimized cloud platform costs, achieving 50% reduction in monthly spend
- Deployed Platform as a Service (PaaS) for internal BI use cases, serving 100+ users
Senior Associate — DevOps Engineer, Data Platform
Apr 2021 — Oct 2021
- Built infrastructure using Virtualization Layer to manage local server resources across 5 nodes
- Migrated monolith architecture to microservices, improving server utilization by 85%
- Implemented GitLab as internal engineering code repository for 20+ developers
- Managed Hadoop Cluster & Presto servers for large-scale data processing
- Built alerting system to monitor servers, reducing undetected incidents by 90%
- Maintained job scheduler on Apache Airflow, orchestrating 100+ DAGs
Senior Associate — DevOps Engineer, Data Platform
Apr 2021 — Oct 2021
- Built infrastructure using Virtualization Layer to manage local server resources across 5 nodes
- Migrated monolith architecture to microservices, improving server utilization by 85%
- Implemented GitLab as internal engineering code repository for 20+ developers
- Managed Hadoop Cluster & Presto servers for large-scale data processing
- Built alerting system to monitor servers, reducing undetected incidents by 90%
- Maintained job scheduler on Apache Airflow, orchestrating 100+ DAGs
Senior DevOps Engineer → DevOps Design Engineer (Team Lead)
Sep 2019 — Apr 2021
- Set vision & managed Jakarta Team of 10+ engineers (sync-ups, 1-on-1s, coaching & mentoring); promoted to Team Lead in Mar 2021
- Managed Kubernetes Cluster on GKE with 1,000+ pods across 250+ services in production and non-production
- Served as principal for the company's Future Program for Infrastructure — defined curriculum for 15+ participants
- Managed GCP infrastructure using Terraform, created Ansible roles, and maintained Jenkins CI/CD pipelines
- Managed logging system with Splunk & Elastic Stack; created GoLang and PowerShell tooling for cloud resource management
- Part of 24/7 on-call team, served as infrastructure review representative from DevOps side
Senior DevOps Engineer → DevOps Design Engineer (Team Lead)
Sep 2019 — Apr 2021
- Set vision & managed Jakarta Team of 10+ engineers (sync-ups, 1-on-1s, coaching & mentoring); promoted to Team Lead in Mar 2021
- Managed Kubernetes Cluster on GKE with 1,000+ pods across 250+ services in production and non-production
- Served as principal for the company's Future Program for Infrastructure — defined curriculum for 15+ participants
- Managed GCP infrastructure using Terraform, created Ansible roles, and maintained Jenkins CI/CD pipelines
- Managed logging system with Splunk & Elastic Stack; created GoLang and PowerShell tooling for cloud resource management
- Part of 24/7 on-call team, served as infrastructure review representative from DevOps side
Associate DevOps Engineer
Apr 2018 — Aug 2019
- Served as Infrastructure Lead Release Manager — coordinated deployments across Scrum of Scrum meetings, PMs, Developers & QA teams
- Led migration of 100+ services from Data Center to Cloud (GCP) as part of Infrastructure as Code team
- Mentored participants in the company's Future Program for Infrastructure across multiple batches
- Standardized provisioning across 3+ environments using Terraform
Associate DevOps Engineer
Apr 2018 — Aug 2019
- Served as Infrastructure Lead Release Manager — coordinated deployments across Scrum of Scrum meetings, PMs, Developers & QA teams
- Led migration of 100+ services from Data Center to Cloud (GCP) as part of Infrastructure as Code team
- Mentored participants in the company's Future Program for Infrastructure across multiple batches
- Standardized provisioning across 3+ environments using Terraform
System Engineer — Intern
Mar 2017 — Mar 2018
- Set up & managed automation processes with Chef Automation in non-production & production environments
- Maintained internal software systems and provisioned servers for internal web properties
- Assisted developers with production deployments, monitoring & managing infrastructure usage
System Engineer — Intern
Mar 2017 — Mar 2018
- Set up & managed automation processes with Chef Automation in non-production & production environments
- Maintained internal software systems and provisioned servers for internal web properties
- Assisted developers with production deployments, monitoring & managing infrastructure usage
Education
BINUS University
Bachelor of Computer Science, Information Technology
2014 — 2018 — Major in Networking & Operating System — GPA 3.34
Notable Projects
Move to Cloud (GCP Project)
Blibli.com — Mar 2019 — Sep 2019
Handled Infrastructure as Code (Terraform) and migrated microservices to Google Kubernetes Engine (GKE) from on-premises to Cloud with fully automated CI/CD.
Performance Tuning — Blibli Friends Blog
Blibli.com — May 2018 — Jul 2018
Managed infrastructure for the Blibli Friends blog platform, optimizing web server (Nginx) and support tooling for production performance.