You are passionate about leveraging software engineering practices to address platform engineering challenges. We are currently undergoing one of Australia's largest digital transformations. Together we can envisage a new future for banking for millions of customers. CommBank is acknowledged as an industry leader in IT and operations with its cutting-edge platforms and processes, agile IT infrastructure, and innovation across everything from payments to internet banking and mobile apps. Our teams apply process excellence principles to ensure timely, error-free processing, which is a vital part of the value we offer our customers. Everyone places the customer at the heart of all we do, and our performance is measured against the Group's external customer satisfaction metrics. See Yourself in the Team Join the team responsible for creating, managing, and enhancing the reliability and availability of IT Services that power our entire business. The Retail Technology Platform Engineering crew are custodians of the operational effectiveness of numerous applications across all portfolios. They work tirelessly to ensure the reliability, stability and availability of applications, setting guardrails, leading innovation and driving continuous improvement practices that ensure our tools and platforms are fit for purpose, function optimally and at scale. Your Impact and Contribution As a Principal Platform Engineer, you play a critical leadership role in ensuring the reliability, availability, and performance of our applications that support online channels. You will collaborate closely with cross-functional teams to implement site reliability principles and practices, troubleshoot complex issues, and develop robust system engineering solutions. Ultimately, you will seek to enhance our infrastructure, ensuring maximum uptime and seamless functionality. To achieve this, your role may include, but not be limited to: Designing, implementing, and maintaining reliable systems and scalable infrastructure across Linux and Windows environments. Monitoring and observability practices, utilising tools like Grafana and Prometheus to monitor systems and build dashboards, alerts, and reporting across multi-OS environments. Building and maintaining scripts using programming languages such as Bash, Perl, Python, PowerShell, or Ruby to automate repetitive tasks, enhance systems, and increase operational efficiency for both Linux and Windows. Troubleshooting network-related issues using TCP/IP and improving overall system connectivity. Managing and deploying infrastructure in cloud environments, leveraging best practices for scaling and performance. Overseeing and optimising a large fleet of Windows systems, ensuring high availability, security, and seamless integration with Linux-based infrastructure. What You'll Bring Advanced knowledge across both Linux and Windows environments, with competence in managing a large Windows fleet, configuration, troubleshooting, and security. Proficiency in scripting languages like Bash, Perl, Python, PowerShell, or Ruby. Extensive experience with Grafana, Prometheus, and other monitoring tools. Strong understanding of TCP/IP, DNS, load balancing, and network troubleshooting. Hands-on experience with cloud platforms, including infrastructure automation and orchestration. Proven experience in a Site Reliability Engineering (SRE) or similar platform support role, ideally at a senior or principal level. Strong analytical and troubleshooting skills with a proactive approach to problem-solving. Experience in Incident Management and a solid understanding of disaster recovery and failover. Clear and effective communication and stakeholder collaboration skills. If this sounds like you, we'd love to hear from you! What's in it for you? Opportunity to work with cutting-edge technologies in a collaborative environment. Professional growth in a culture that values continuous learning and development. Competitive salary, benefits, and a supportive work-life balance. #J-18808-Ljbffr