Description and Requirements
The Infrastructure and Platform Services team (IPS) is the backbone of EA's global gaming ecosystem, delivering the technology that powers EA's live games and player experiences. IPS provides a deep portfolio of digital platform gaming services and production infrastructure to EA's game studios. From scalable cloud solutions to AI-driven gameplay services, IPS ensures that EA's games are available anytime, anywhere for millions of players worldwide.
As a Site Reliability Engineer, you will architect and maintain the critical infrastructure that powers our products. You will work at the intersection of development and operations, building automated solutions that allow our engineering teams to deploy faster and more reliably.
This hybrid remote/in-office role offers the opportunity to shape our cloud-first infrastructure strategy while collaborating with teams. You will report to an Engineering Manager.
Responsibilities
You will work as a technical liaison with development teams to address issues and provide recommendations.
You will oversee and expand our cloud infrastructure, security, governance and budget reporting.
You will monitor automation systems and respond to support requests, outages and breakages.
You will evaluate new technologies and software products to determine the feasibility and desirability of incorporating their capabilities into the company's products.
Qualifications
5+ years of experience automating infrastructure provisioning, DevOps, integration, or delivery.
Experience guiding customers through complex migration projects, from discovery and technical assessment to successful execution and operational handover.
Experience with security practices like identity and access management, data protection and certificate management.
Experience in one or more of the following disciplines: software development, managing operating systems (Linux, Windows), network design and deployment, and security.
Professional development experience with Jenkins on Linux and Windows.
Experience with scripting languages, including Groovy and Python.
Proficiency with containerization and orchestration (e.g., Docker, Kubernetes).
Knowledge of infrastructure automation tools (e.g., Helm, Terraform).
Experience with monitoring tools (e.g., CloudWatch, Grafana, Prometheus, OpenTelemetry).
Experience with version control, including Git workflows and branching strategies.
System administration skills (e.g., Windows, macOS, Linux).