July 10, 2024


Infrastructure Operations Engineer

Full Time

Company Overview:

We are a leading technology company based in the US & London, specializing in secure computing solutions. We are currently seeking a talented infrastructure operations engineer to join our team and provide operational and project support across our bare-metal and cloud ecosystems.

Position Overview:  

The infrastructure operations engineer will be responsible for:

  • Provisioning OS images to bare-metal servers and VMs
  • Maintaining and enhancing the current OS image build pipeline
  • Scripting with Bash/Ansible
  • Maintaining Datadog integrations, monitoring dashboards and relevant alerts
  • Troubleshooting, triaging and resolving problems
  • Updating system documentation
  • Creating automation scripts for simple repetitive tasks

The ideal candidate will have a minimum of 4 years of relevant experience, strong analytical skills, and a proactive approach to problem-solving.

Key Responsibilities:

  • System Monitoring: Continuously monitor computer systems, network performance, and application workflows to ensure optimal operation and availability.
  • Problem Triaging: Quickly identify, diagnose, and resolve technical issues, escalating to relevant teams when necessary.
  • Documentation: Maintain and update system documentation, including operational procedures, troubleshooting guides, and configuration records.
  • Automation Scripting: Develop and maintain simple scripts to automate routine tasks and improve efficiency.
  • Application Monitoring: Integrate and configure applications with monitoring tools such as Datadog to track performance metrics and set up alerts.
  • Collaboration: Work closely with IT, development, and support teams to implement solutions and enhance system reliability.


Qualifications:

  • Education: Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent experience.
  • Experience: Minimum of 4 years of experience in computer operations or a similar role.
  • Technical Skills:
      • Proficiency in system monitoring and management tools.
      • Experience managing Linux-based operating systems.
      • Knowledge of monitoring tools like Datadog.
      • Knowledge of TCP/IP network fundamentals.
      • Knowledge of MAAS for image provisioning (desirable).
  • Soft Skills:
      • Excellent problem-solving and analytical abilities.
      • Strong written and verbal communication skills.
      • Ability to work independently and as part of a team.
      • Detail-oriented with a strong focus on accuracy and quality.

Preferred Qualifications:

  • Certifications: Relevant certifications such as CompTIA Network+, CompTIA Security+, or similar.
  • Additional Experience: Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud) and containerization tools (e.g., Docker, Kubernetes); scripting languages such as Python, Bash, or PowerShell; Ansible and Terraform; Git; and bare-metal servers, including IPMI/BMC configuration.


Benefits:

  • Salary range of £40-55k, commensurate with location & experience.
  • Flexible working hours and remote work options.
  • Comprehensive benefits package.
  • Opportunities for professional development and career growth within a dynamic and innovative organization.

How to Apply:

If you are passionate about cloud computing, automation, and infrastructure-as-code, and you have the skills and experience to succeed in this role, we would love to hear from you! Please submit your resume and a cover letter outlining your qualifications and relevant experience to stef.weiss@mpch.com.