Job Overview
The Principal Platform Engineer will be a technical leader in shaping our platform architecture, delivering large-scale change initiatives, and optimizing the operations of our products and services. This role is critical in transforming our infrastructure and operations by fostering a culture of automation to streamline processes and drive continuous improvement.
Key Responsibilities
* Lead process optimization and technology change initiatives
* Provide technical guidance in developing and maintaining platform solutions using Python and Go, ensuring security and performance
* Foster a culture of innovation, continuous improvement, and best practice engineering
* Optimize clustered infrastructure for scalability, resilience, and cost-effectiveness
* Design, implement, and manage CI/CD pipelines using Jenkins and GitHub Actions, integrating security into the pipelines and ensuring compliance and adherence to best practices
* Manage and optimize Kubernetes clusters for high availability and scalability
* Lead Infrastructure as Code (IaC) development using tools like Terraform
* Develop and maintain monitoring and logging solutions using Prometheus and Grafana
* Collaborate with cross-functional teams, mentor engineers, and promote continuous improvement
About You
We are seeking an established, highly experienced, and technically astute Principal Platform Engineer with demonstrated success in leading the delivery and operations of large-scale, enterprise-grade, stable, scalable, and secure platforms. The ideal candidate will bring well-established expertise in platform engineering tools and practices, including advanced Linux engineering in large-scale environments, software automation using Python and/or Go, and open-source development practices.
Requirements
* Well-developed interpersonal skills, particularly the ability to influence positive outcomes across interdisciplinary areas and multiple levels of leadership
* Clear, concise, and effective communication skills, with the ability to appropriately demonstrate technical initiatives and outcomes to non-technical audiences
* Demonstrable business acumen and an understanding of how customer and market impacts affect decision-making
* Effective leadership skills with the ability to appropriately guide a highly technical team of world-class engineers
* Advanced Linux engineering in large-scale environments
* Expertise in developing software automation using Python and/or Go
* Expertise with open-source development practices
* Expertise with Ansible or Terraform for infrastructure as code, automation, and orchestration
* Experience with Kubernetes and large cluster operations
* Proven ability to design, implement, and manage secure CI/CD pipelines using GitHub Actions, focusing on automating testing, deployment, and monitoring
* Proficiency with cloud-native software components and architecture
* Proficiency in data processing and analysis, especially for monitoring system performance and handling large datasets with Prometheus and Grafana
* Proficiency in networking and systems programming, including custom networking tools, load balancers, and API gateways
* Extensive experience with cloud-native software for optimizing clustered infrastructure
* Competence in developing private cloud environments