Address: 619 Đỗ Xuân Hợp, Phước Long B, District 9, HCM City
Objective
With 3 years of experience as a DevOps/SRE and 2 years as a Fullstack Developer, I am adept at collaborating with feature teams and maintaining a comprehensive view of system architecture. Currently seeking a role as a Site Reliability Engineer/DevOps Engineer, where I can focus on enhancing system design and establishing processes to ensure continuous operation and system improvement. For me, designing a system is just the first step; tracking and enhancing the system is where the true value lies.
Honors & Awards
2023 • Achieve the "Rising Star" (Top 5) Award of the year
Certifications
2023
• AWS Certified DevOps Engineer – Professional
• Datadog Certified: Log Management Fundament
• Certified Kubernetes Application Developer
2022
• AWS Certified Developer – Associate
• Datadog Certified: Datadog Fundamentals
Experience
Datump Consulting Company (Onsite for Tyme Group)
07/2021 - Current
Cloud DevOps & Site Reliability Engineer
Designing/Deciding/Optimizing/Maintaining/Improving the Self Managed Kafka/MSK system (achieving 99.99% High Availability SLO and 100% data reliability)
Designing/Maintaining the central backup for AWS organization (ensuring full compliance with banking backup standards for South Africa and the Philippines)
Designing/Deciding/Optimizing/Maintaining Platform Products including S3 virus scanning tools, start/stop stack, DevOps tools, Kafka tools, Internal CA, and self-hosted N8n
Implementing the Landing Zone in multi-tenant account (cross-region, cross accounts)
Deploy on-demand AWS resources: ECS, EC2, RDS, CloudFront, S3, Lambda, DynamoDB and API Gateway
Setup network cross account: DNS, Route53, Private Link/VPC Peering/Transit GW via Pipeline using CloudFormation and Terraform
Design and execute DR Automation for the system
Built platform product using Datadog enabling 100% services to be monitored without effort from service teams
Automated Datadog deployment mechanism with 95% services following common standards
Designed/implemented streaming mechanism for Logs/Trace/Event/Metrics on private network
Investigated/Optimized Datadog costs achieving more than 70% reduction in streaming log costs
Incident on-call support and training (Kafka 101, Datadog 101)
Clik Company
08/2020 - 07/2021
Java Fullstack Developer
Developed and maintained a furniture production application and website using Spring Boot, React Native, and React JS