Global Investment/Financial Services company is looking to hire an experienced Kubernetes Site Reliability Engineer as part of their Digital Assets Technical Operations Team. You will work with various engineering teams to own the design of a new multi-region, highly available, cloud-based deployment of our applications to AWS's Kubernetes platform (EKS).

Experience
- Several years of hands-on experience with AWS in a production environment
- Production experience running Kubernetes workloads on AWS using EKS
- Experience creating and deploying Helm charts & libraries
- Specialist in AWS CloudFormation, IAM, VPC and network security
- Experience with monitoring tools, e.g. CloudWatch, Datadog, Splunk
- Proficiency with Unix operating systems and shell scripting
- Programming experience, e.g. Python, preferred
- Experience with CDN providers, e.g. Akamai, preferred
- Experience with the agile software development life cycle preferred
- Automated CI/CD pipelines, e.g. Jenkins X (Kubernetes native), Jenkins Enterprise

Skills
- Experience with Amazon Web Services (AWS), having managed services and applications in a large AWS cross-account environment using IAM and federated SSO
- Experience crafting and maintaining logging, monitoring, and alerting capabilities using tools like Datadog, Splunk, and Kibana
- See problems as opportunities to automate
- Ability to work independently with minimal direction
- Coordinate the overall design of highly available, secure, scalable microservices-based applications in AWS
- Track record of providing technical leadership to strong teams of Site Reliability Engineers
- Experience with configuring and deploying resilient infrastructure in multiple regions and multiple availability zones
- Work multi-functionally with other organizations and collaborate with our risk, product and engineering team leaders
- Ability to communicate at all levels, with a track record of strong written and verbal communication

Full job spec available. To apply and find out more, please reach out (see below).
07/05/2024
Project-based
Monitoring Engineer (SRE - Site Reliability) - Grafana/Dynatrace/Kubernetes - this is a long-term contract opportunity to join a globally operating Zurich-based company in the financial sector.

Your tasks:
- Managing the global observability platform as a member of a product team
- Being responsible for the platform life cycle from design to production
- Cooperating with application teams to collect feedback and improve the product offering
- Creating an API-first platform to drive the adoption of monitoring-as-code

Your experience/knowledge:
- Platform/Site Reliability Engineer with expertise in products like Prometheus, Grafana, Dynatrace, OpenTelemetry
- Well-versed in modern observability platforms, collecting infrastructure and application metrics
- Proficiency in defining thresholds, alerts and dashboard visualization of monitoring information
- Know-how of open-source or commercial APM
- Familiarity with cloud technologies and platforms with focus on Kubernetes and Azure

Language skills:
- English - fluent in written and spoken, German conversational

Your soft skills:
- Team player with good communication and organizational abilities
- Excellent problem-solving skills and the ability to troubleshoot complex issues

Location: Zurich, Switzerland
Sector: Finance
Start: 06/2024
Duration: 12MM+
Ref. Nr.: BH21615

Take the next step and send us your resume along with a daytime phone number where we can reach you. Due to Swiss work permit restrictions, we can only consider applications from Swiss nationals, EU citizens as well as current work-permit holders for Switzerland. Ukrainian refugees are warmly welcomed, we will support you all the way. We welcome applications from individuals of all genders, age groups, sexual orientations, personal expressions, ethnic backgrounds, and religious beliefs. Therefore, there is no requirement to provide gender information or a photo in your application.
As per client requirements, we need information about your marital status, nationality, date of birth, and a valid Swiss work permit. For applicants with disabilities, we are happy to explore potential solutions with our end client.
07/05/2024
Project-based
Job title: Senior Golang, Kotlin Developer
Location: Osterley, UK
Duration: 6 months (possible extension)
Hybrid work option: Minimum 3 days/week from office

What you'll do:
- Create and deliver large-scale software engineering tooling for the Global OTT platforms.
- Contribute to the software and infrastructure design of our team's purpose-built platforms.
- Write resilient code that will be continuously tested, deployed and run at scale in the cloud, on-premise and across a wide range of streaming devices.
- Be part of a self-organising Agile team.
- Actively improve overall software quality whilst also helping fellow team members.
- Contribute to the team's technical direction and the improvement of its tools and processes.
- Update and improve data monitoring and alerting solutions in the department while contributing to the security of our client applications.

What you'll bring:
- Strong experience working across the stack in JavaScript, TypeScript, Node.js or similar technologies such as Kotlin and Go. Someone who can help upskill the team on Kotlin and Go will be a plus.
- Good understanding of development best practices such as pair programming, TDD, continuous integration and continuous delivery.
- Good understanding of/experience with CI tools (Jenkins, Concourse) and testing frameworks.
- Good understanding of and working experience with alerting and monitoring KPIs through creation of dashboards for the applications developed.
- Industry experience working with Proof-of-Concept projects as well as with AWS or similar cloud technologies, building, deploying, and managing virtual resources.
- Ability and enthusiasm to push for new improvements across the code base and influence/learn from a large community of developers.
- Experience working closely with DevOps and SREs.
- Driven to work with new technologies and to design solutions with the team from the ground up, using effective communication skills that encourage collaboration and teamwork.
03/05/2024
Project-based
Job Title: SRE - Ignition Certified
Location: Fully Remote
Salary/Rate: $500-$600 per day
Start Date: 20/05/24
Duration: 12 months

We have an amazing opportunity to work with a sector-leading consultancy that is based in the US! They are looking for an experienced Lead SRE to join them on an innovative project.

Job Responsibilities/Objectives:
- Design, implement, and maintain Kubernetes clusters and Ignition-managed systems to ensure high availability, reliability, and scalability of infrastructure.
- Develop automation scripts and tools to streamline the deployment, configuration, and management of applications on Kubernetes clusters, leveraging Ignition for system provisioning.
- Set up monitoring and alerting systems to proactively identify and respond to issues affecting Kubernetes clusters and Ignition-managed systems, ensuring optimal performance and reliability.
- Perform capacity planning and optimization to ensure efficient resource utilization and cost-effective scaling of Kubernetes clusters and Ignition-managed systems.
- Troubleshoot and resolve incidents and outages affecting Kubernetes clusters and Ignition-managed systems, diagnosing root causes and implementing remediation measures.
- Implement security best practices and compliance standards for Kubernetes clusters and Ignition-managed systems, including access control, network security, and data encryption.
- Develop and maintain CI/CD pipelines for deploying and updating applications on Kubernetes clusters, integrating Ignition provisioning as part of the deployment process.
- Document infrastructure configurations, deployment procedures, and troubleshooting processes, and share knowledge with team members to promote collaboration and learning.

Required Skills/Experience
The ideal candidate will have the following:
- Ignition Certification is a MUST
- Certified Kubernetes Administrator (CKA) or similar certification.
- Experience with other cloud-native technologies and platforms (e.g. Docker, AWS, Azure).
- Familiarity with infrastructure as code (IaC) tools (e.g. Terraform) for automating infrastructure deployment and management.

If you are interested in this opportunity, please apply now with your updated CV in Microsoft Word/PDF format.

Disclaimer
Notwithstanding any guidelines given to level of experience sought, we will consider candidates from outside this range if they can demonstrate the necessary competencies. Square One is acting as both an employment agency and an employment business, and is an equal opportunities recruitment business. Square One embraces diversity and will treat everyone equally. Please see our website for our full diversity statement.
01/05/2024
Project-based