Site Reliability Engineer

Site Reliability Engineer

Full-Time 74000 - 90000 £ / year (est.) No working from home possible
Ocient

At a Glance

  • Tasks: Support and enhance our cutting-edge data warehouse services for high availability and performance.
  • Company: Join Ocient, a leader in innovative data solutions with a remote-friendly culture.
  • Benefits: Competitive salary, flexible hours, and opportunities for professional growth.
  • Other info: Dynamic remote work environment with excellent career advancement opportunities.
  • Why this job: Tackle challenging problems and make a real impact in the tech world.
  • Qualifications: 3+ years in system administration and scripting skills required.

The predicted salary is between 74000 - 90000 £ per year.

Location: Remote (United Kingdom)

Hiring Manager: Service Delivery Engineering Manager

Estimated salary range: £74,000 to £90,000. The salary offered for this position will be based on a candidate’s experience and skill demonstrated during interviews and other evaluations.

Position Overview

Ocient is searching for an experienced Site Reliability Engineer with strong problem-solving skills and a passion for solving hard problems to help maintain and expand Ocient's "as a service" offering of its cutting-edge data warehouse.

Responsibilities

  • Support the design and operations of Ocient's hosted database and related services — including message queues and storage systems — ensuring high availability, performance, and efficiency.
  • Design and maintain monitoring, log centralization, and alerting for all services to facilitate observability and incident management.
  • Automate deployment and configuration of Linux-based servers, including the OS and the numerous applications that compose our hosted offerings.
  • Develop and maintain rigorous security practices to protect our applications and customer data.
  • Assist with automation of testing pipelines for the Ocient DB and monitoring of test infrastructure.

Ideal Qualifications

  • 3+ years of experience in system administration in production environments.
  • Scripting experience with Bash, Python, or other languages.
  • Experience with system and software monitoring and alerting tools, such as the ELK stack, Graylog, InfluxDB, Prometheus, Zabbix, Grafana, Dynatrace, or others.
  • Experience with configuration management software such as Ansible, Puppet, or Chef.
  • Experience with data archiving, backup and disaster recovery.
  • Continuous Integration / Continuous Deployment experience with Jenkins, Gitlab CI or others.
  • Experience with source control tools like Git.
  • Ability to work flexible hours and serve in on-call rotations.

An Exceptional Candidate Will Have

  • Knowledge of OWASP principles for application security.
  • Experience with server / system virtualization and containerization technologies e.g., ProxMox, KVM, VMware.
  • Experience with SQL and Database Administration.
  • Experience managing and operating cloud infrastructure (e.g., AWS, GCP, Azure).
  • Experience with SSAE18 SOC2 Compliance.
  • Experience with networking administration, including VPN, proxy, DNS, and firewall configuration.

Site Reliability Engineer employer: Ocient

Ocient is an exceptional employer that fosters a dynamic and inclusive work culture, offering remote opportunities across the United Kingdom. With a strong emphasis on employee growth, Ocient provides access to cutting-edge technology and encourages continuous learning through hands-on experience in system administration and cloud infrastructure. The company values innovation and collaboration, making it an ideal place for Site Reliability Engineers looking to tackle challenging problems while enjoying a supportive and flexible work environment.

Ocient

Contact Details:

Ocient Recruitment Team

We think you need these skills to ace Site Reliability Engineer

Problem-Solving Skills
Linux System Administration
Scripting (Bash, Python)
Monitoring and Alerting Tools (ELK stack, Prometheus, Grafana)
Configuration Management (Ansible, Puppet, Chef)
Data Archiving and Backup
Continuous Integration / Continuous Deployment (Jenkins, Gitlab CI)