Job Summary:
We are looking for a Senior DevOps Engineer to support and improve AWS cloud infrastructure and
production environments. The role focuses on ECS-based container operations, infrastructure
automation, monitoring, incident response, and system reliability.
Responsibilities:
Cloud Infrastructure & Operations
Manage AWS infrastructure and ECS-based container services in production environments.
Ensure system reliability, scalability, availability, and performance.
Perform monitoring, troubleshooting, performance tuning, and cost optimization.
Support database backup, upgrade, and recovery operations.
Infrastructure as Code & Automation
Design and maintain infrastructure using Terraform.
Build reusable Terraform modules and automate infrastructure provisioning.
Improve operational efficiency through automation.
Monitoring & Incident Response
Build and maintain monitoring, logging, and alerting systems.
Manage observability tools including Prometheus, Grafana, OpenSearch, and CloudWatch.
Troubleshoot production issues using logs, metrics, and system data.
Participate in on-call rotation and handle production incidents.
Perform root cause analysis (RCA) and implement preventive improvements.
Requirements:
Experience with Terraform.
Experience operating ECS-based production workloads.
Experience with monitoring tools such as Prometheus, Grafana, and OpenSearch.
Linux systems and networking fundamentals.
Experience supporting production cloud environments.
Fluent in Chinese Mandarin.
Pay: From $100,000.00 per year
Benefits:
Experience:
Language:
Ability to Commute:
Work Location: In person
Sign in to browse authentic reviews, anonymous ratings and salary data before you apply.