Staff Platform Engineer ( M LOps Platforms ) You are highly experienced in building customer focussed solutions We are a team of big thinkers, who love to push boundaries and create new solution Together we will build tomorrow’s bank today, using world-leading technology and innovation Do work that matters : The role of Platform Engineer is to Design, Build, Run & Evolve tools, infrastructure, templates and capabilities that our data science community and other engineers use to deliver business value, and to write code that automates running our infrastructure and environments. Collaboratively work with customer facing product owners and platform engineers to design, build and run ‘platforms’ that they can use to deliver customer value at greater quality, velocity, and safety. Help to make our platforms loved by our engineers and data science community We are keen hear from ML platform engineers who are passionate about infrastructure as code, software eating the world, LLMs, GPUs and High Performance Compute We support our people with the flexibility to balance where work is done with at least half their time each month connecting in office. We also have many other flexible working options available including changing start and finish times, part-time arrangements and job share to name a few. Talk to us about how these arrangements might work in the role you’re interested in. We’re interested in hearing from people who: As a Staff Platform Engineer is expert at the Full Cycle model, where engineers are involved in Design, Build, Cha n ge, and Run Have a passion for designing, developing, deploying and running high quality modern machine learning platforms Contributes to a culture where quality, inclusiveness and excellence are championed Have a natural drive to educate, communicate and positively influence various stakeholder groups including high level executives. Roles and Responsibilities of a Staff Platform Engineer on the ML Ops Platform: Provide s trategic technical leadership and mentorship driving best practices for ML platform architecture, deployment and scaling Oversee the design and development of scalable and resilient ML infrastructure with a focus on performance and reliability and a rchitect core components, ensuring performance, reliability, and scalability Lead the development of tools and frameworks to streamline the ML lifecycle, from data ingestion to model deployment and m onitoring, understanding DevSecOps frameworks, interact with vendors and understand their product roadmap Create a standardised set of tooling for deploying and running applications and setting them up with best practices P articipate in cross-group activities to build a culture of one team, bar-raising both our engineering capability and our technology solutions to drive our strategy Collaborate with data scientists, engineers and s takeholders to define and implement technical requirements. Translate needs into technical solutions and e nsure the platform's reliability through robust monitoring, logging, and alerting systems Drive Implementation of CI/CD pipelines to streamline ML model deployment and updates ; Troubleshoot complex technical issues to minimize disruptions Develop and maintain comprehensive documentation, including architecture blueprints and best practices as well as c onduct workshops and training sessions to educate and align the team on platform usage and best practices Stay up to date with the latest development in the field of ML, MLOps, LLMs, GPUs and related concepts Tech Skills: We use a broad range of tools, languages, and frameworks. We don’t expect you to know them all but experience or exposure with some of these (or equivalents) will set you up for success in this team; AWS Services: In depth knowledge of AWS services such as EC2, ECS, S3, Lambda, Step function, RDS, DynamoDB, IAM, VPC, Route 53, Cloudwatch, EKS ML Services: Expertise in AWS ML services like SageMaker, AWS Glue, Amazon EMR. Familiarity with AWS Bedrock, Amazon Q services, NVIDIA GPUs and related framework s, LLM s. Model Lifecycle: Experience with the end-to-end ML lifecycle, including data preprocessing, feature engineering, model training, evaluation, and deployment Scripting: Proficient in automation and Scripting ( Bash, Python ). IaC Tools: Hands-on experience with infrastructure as code tools like AWS CloudFormation, Version Control: Proficiency with version control systems like Github, Github Actions Monitoring & Observability: Expertise in tools like Grafana, Prometheus Engineering Tooling: Artifactory, Synk, Docker Working with us Whether you're passionate about customer service, driven by data, or called by creativity, a career with CommBank is for you. Our people bring their diverse backgrounds and unique perspectives to build a respectful, inclusive, and flexible workplace with flexible work locations. Here, you’ll thrive. You’ll be supported when faced with challenges and empowered to tackle new opportunities. We’re hiring engineers from across all of Australia and have opened technology hubs in Melbourne and Perth. We really love working here, and we think you will too. If this sounds like the role for you then we would love to hear from you. Apply today If you're already part of the Commonwealth Bank Group (including Bankwest, x15ventures), you'll need to apply through Sidekick to submit a valid application. We’re keen to support you with the next step in your career. We're aware of some accessibility issues on this site, particularly for screen reader users. We want to make finding your dream job as easy as possible, so if you require additional support please contact HR Direct on 1800 989 696. Advertising End Date: 29/11/2024