Senior DevOps Engineer
Applications are now closed.
We are looking for an Intermediate DevOps Engineer to support production and corporate infrastructure for a video game studio. The role focuses on designing, implementing, automating, securing, monitoring, and optimizing cloud-native infrastructure on Google Cloud Platform (GCP). You will work closely with development, security, and operations teams to ensure high availability, security, compliance, cost efficiency, and operational excellence.
Key Responsibilities:
Cloud Infrastructure (GCP)
- Design, implement, and maintain GCP infrastructure across multiple environments
- Manage:
  - VPCs, subnets, firewall rules, VPC peering
  - Application & Network Load Balancers
  - Cloud Armor (WAF, DDoS protection)
  - Cloud NAT, private networking
  - Cloud CDN and Google Cloud Storage (GCS)
- Support site-to-site VPN connectivity
- Provide network administration and troubleshooting
- Support next-generation firewalls in GCP
- Occasionally support Azure or AWS environments if required
Containerization & Kubernetes
- Administer and support Google Kubernetes Engine (GKE) and/or on-prem Kubernetes clusters
- Deploy and manage:
  - NGINX web servers
  - NGINX Ingress Controller
- Perform cluster upgrades, scaling, tuning, and troubleshooting
- Apply Linux and Kubernetes hardening standards
- Support Kubernetes across GKE / EKS / AKS when required
CI/CD, GitOps & Automation
- Design and maintain CI/CD pipelines using Docker and Kubernetes
- Implement GitOps workflows using:
  - FluxCD
  - ArgoCD
- Automate infrastructure and application deployments
- Improve deployment reliability, rollback strategies, and release governance
Observability, Logging & Monitoring
- Implement and maintain monitoring with:
  - Prometheus
  - Grafana
  - Alertmanager
  - Node Exporter
- Design alerting strategies for infrastructure and applications
- Implement centralized logging using:
  - Fluentd / Fluent Bit
  - OpenSearch & OpenSearch Dashboards
- Manage index lifecycle policies (OpenSearch ISM) and log retention
- Continuously improve observability maturity
Security & Compliance
- Implement HashiCorp Vault for secrets management
- Configure mTLS for secure service communication
- Apply Linux, Kubernetes, and network hardening practices
- Support security vulnerability remediation and patching
- Work in compliance-driven environments (regulatory standards apply)
Infrastructure as Code & Configuration Management
- Develop and maintain Terraform code for infrastructure provisioning
- Create and manage Ansible playbooks and roles
- Ensure modular, reusable, version-controlled IaC practices
Databases & Data Platforms
- Support:
  - MongoDB (replica sets, sharded clusters)
  - Redis (replication)
- Perform backups, restores, upgrades, and performance tuning
- Implement monitoring and alerting for data platforms
Disaster Recovery & Reliability
- Design and execute Disaster Recovery (DR) testing
- Conduct DR drills and failover testing
- Ensure backup strategies meet RPO / RTO requirements
- Prepare DR documentation and reports
Performance & Cost Optimization
- Conduct performance and load testing using:
  - Locust
  - Apache JMeter
- Analyze results and recommend optimizations
- Perform monthly GCP cost analysis
- Identify cost-saving opportunities across infrastructure and workloads
Operational Support & Documentation
- Provide L2/L3 operational support for DevOps platforms
- Support internal tools (e.g., Squid proxy, Postfix relay)
- Maintain documentation:
  - Architecture diagrams
  - Runbooks
  - SOPs and operational guides (Confluence)
Required Skills & Experience:
- Hands-on experience as a DevOps Engineer
- Strong experience with Google Cloud Platform (GCP) or equivalent
- Solid understanding of:
  - Linux internals
  - Networking
  - Security best practices
- Experience with:
  - Kubernetes
  - Docker
  - CI/CD pipelines
  - Infrastructure as Code (Terraform, Ansible)
- Good problem-solving skills and attention to detail
- Strong communication skills and ability to work in cross-functional teams
Nice to Have:
- Experience with Azure or AWS
- Background in regulated or compliance-driven environments
- Experience with open-source tooling and platforms