Site Reliability Engineer

Full-Time, £74,000 to £90,000 per year (est.), remote (United Kingdom)

At a Glance

Tasks: Support and enhance our cutting-edge data warehouse services for high availability and performance.
Company: Join Ocient, a leader in innovative data solutions with a remote-friendly culture.
Benefits: Competitive salary, flexible hours, and opportunities for professional growth.
Other info: Dynamic remote work environment with excellent career advancement opportunities.
Why this job: Tackle challenging problems and make a real impact in the tech world.
Qualifications: 3+ years of system administration experience, plus scripting skills.

The predicted salary is between £74,000 and £90,000 per year.

Location: Remote (United Kingdom)

Hiring Manager: Service Delivery Engineering Manager

Estimated salary range: £74,000 to £90,000. The salary offered for this position will be based on a candidate’s experience and skill demonstrated during interviews and other evaluations.

Position Overview

Ocient is searching for an experienced Site Reliability Engineer with strong problem-solving skills and a passion for hard technical challenges to help maintain and expand Ocient's "as a service" offering of its cutting-edge data warehouse.

Responsibilities

Support the design and operations of Ocient's hosted database and related services — including message queues and storage systems — ensuring high availability, performance, and efficiency.
Design and maintain monitoring, log centralization, and alerting for all services to facilitate observability and incident management.
Automate deployment and configuration of Linux-based servers, including the OS and the numerous applications that compose our hosted offerings.
Develop and maintain rigorous security practices to protect our applications and customer data.
Assist with automation of testing pipelines for the Ocient DB and monitoring of test infrastructure.

Ideal Qualifications

3+ years of experience in system administration in production environments.
Scripting experience with Bash, Python, or other languages.
Experience with system and software monitoring and alerting tools, such as the ELK stack, Graylog, InfluxDB, Prometheus, Zabbix, Grafana, Dynatrace, or others.
Experience with configuration management software such as Ansible, Puppet, or Chef.
Experience with data archiving, backup and disaster recovery.
Continuous Integration / Continuous Deployment experience with Jenkins, GitLab CI, or others.
Experience with source control tools like Git.
Ability to work flexible hours and serve in on-call rotations.

An Exceptional Candidate Will Have

Knowledge of OWASP principles for application security.
Experience with server / system virtualization and containerization technologies (e.g., Proxmox, KVM, VMware).
Experience with SQL and Database Administration.
Experience managing and operating cloud infrastructure (e.g., AWS, GCP, Azure).
Experience with SSAE 18 / SOC 2 compliance.
Experience with networking administration, including VPN, proxy, DNS, and firewall configuration.

Site Reliability Engineer employer: Ocient

Ocient is an exceptional employer that fosters a dynamic and inclusive work culture, offering remote opportunities across the United Kingdom. With a strong emphasis on employee growth, Ocient provides access to cutting-edge technology and encourages continuous learning through hands-on experience in system administration and cloud infrastructure. The company values innovation and collaboration, making it an ideal place for Site Reliability Engineers looking to tackle challenging problems while enjoying a supportive and flexible work environment.

Contact Details:

Ocient Recruitment Team

We think you need these skills to ace the Site Reliability Engineer role

Problem-Solving Skills

Linux System Administration

Scripting (Bash, Python)

Monitoring and Alerting Tools (ELK stack, Prometheus, Grafana)

Configuration Management (Ansible, Puppet, Chef)

Data Archiving and Backup

Continuous Integration / Continuous Deployment (Jenkins, GitLab CI)

Source Control (Git)

Application Security (OWASP principles)

Virtualization and Containerization (Proxmox, KVM, VMware)

SQL and Database Administration

Cloud Infrastructure Management (AWS, GCP, Azure)

Networking Administration (VPN, proxy, DNS, firewall configuration)
