Naukrijobs UK
Register
London Jobs
Manchester Jobs
Liverpool Jobs
Nottingham Jobs
Birmingham Jobs
Cambridge Jobs
Glasgow Jobs
Bristol Jobs
Wales Jobs
Oil & Gas Jobs
Banking Jobs
Construction Jobs
Top Management Jobs
IT - Software Jobs
Medical Healthcare Jobs
Purchase / Logistics Jobs
Sales
Ajax Jobs
Designing Jobs
ASP .NET Jobs
Java Jobs
MySQL Jobs
Sap hr Jobs
Software Testing Jobs
Html Jobs
IT Jobs
Logistics Jobs
Customer Service Jobs
Airport Jobs
Banking Jobs
Driver Jobs
Part Time Jobs
Civil Engineering Jobs
Accountant Jobs
Safety Officer Jobs
Nursing Jobs
Civil Engineering Jobs
Hospitality Jobs
Part Time Jobs
Security Jobs
Finance Jobs
Marketing Jobs
Shipping Jobs
Real Estate Jobs
Telecom Jobs

Operations Site Reliability Engineer x2

Job LocationBristol
EducationNot Mentioned
SalaryCompetitive salary
IndustryNot Mentioned
Functional AreaNot Mentioned
Job TypePermanent , full-time

Job Description

Competitive Salary (DOE) + Company Shares + Bonus(This role is an office-based, 5 days a week in Bristol, this role will need to participate in weekends and holidays on-call support as and when required.The Company:Elevate your career as an Operations Site Reliability Engineer (SRE) with a global technology powerhouse that has dominated the industry for over 50 years. Our client, a multinational technology leader, boasts an impressive £30bn+ annual revenue, fuelledby innovative solutions and an unwavering commitment to customer satisfaction.Recognized for their global reach and dedicated customer base, they are expanding their Bristol team and seeking two Ops Site Reliability Engineers (SRE) to contribute to their continued success.What you will do:The Ops Site Reliability Engineer will help with operational support for our clients customer-facing SaaS products. You will be part of a team of engineers that demonstrates superb technical competency, operates mission-critical infrastructure and ensuresthe highest levels of availability (24x7x365), performance and security.Other responsibilities:

  • To form part of a critical operations function that is responsible for the monitoring, availability and performance of production services.
  • Responding to stakeholder requests within agreed timescales or SLO
  • Drive automation to reduce failures, manual tasks and therefore improving overall application performance and availability.
  • Perform systems administration activities to ensure the smooth operation of applications across multiple platforms
  • Coordinate and communicate with impacted stakeholders as per incident management process.
  • Demonstrate ownership of events and incidents through to restoration
  • Perform daily shift handovers to peers and management across multiple geographies.
  • Support maintenance activities which impact production applications.
  • Support critical systems that handle sensitive and proprietary data
  • Create, maintain and update work instructions for troubleshooting and supporting applications.
  • Contribute to the planning of application/infrastructure releases and configuration changes
  • Provide input to administering and maintaining all production environments
  • Patching and upgrade of existing applications
  • Provide feedback and coaching to upstream teams (both internal and vendors) to reduce escalations and to continually improve overall experience for customers.
What experience do you need
  • A degree in Systems Engineering, Computer Science or related fields with related experience preferred
  • 5+ years of experience administering Linux systems
  • Strong hands-on experience of variants of Linux distros
  • 2+ years Operational experience of working with Amazon Web Services or Google Cloud Platform
  • Experience of working with an automation platform to automate repetitive actions that reduce manual effort
  • Familiarity with deployment tools such as Ansible Tower and Jenkins
  • Experience in carrying out large deployments to global infrastructure
  • Proficient with orchestration/configuration tools such as Ansible and Terraform
  • Strong working knowledge of networking, packet tracing, understanding latency and throughput in order to pinpoint or resolve application issues.
  • Thorough knowledge of HTTP(S), SMTP, TLS/SSL, DNS, LDAP, Kubernetes and Docker containers
  • Experience of system/application administration in a distributed, customer-facing, high-availability and large-scale environments
  • Experienced and confident in at least one scripting language such as Perl, shell, Ruby or Python.
  • Experience of tuning and optimising monitoring systems
You will also require the following:
  • A strong team player with the ability to grasp new technologies, adapt to change in methodologies, with a focus on delivery
  • Extensive troubleshooting and problem-solving skills with respect to application technologies
  • Ability to remain calm and work well under pressure
  • A keen interest and desire to work within the security arena
  • Ability to communicate effectively at all levels up to senior management.
Whats in it for You:This role offers an attractive salary package, including a bonus and company shares, Pension and Private Healthcare. If you are a results-driven Operations Site Reliability Engineer (SRE) with a passion for excellence and a desire to contribute to a globaltechnology leader, we invite you to apply for this rewarding opportunity. Successful candidates will be contacted within 48 hours.

Keyskills :
LinuxNetworkingReliability EngineeringSystems EngineeringJenkinsAWStats

APPLY NOW

Operations Site Reliability Engineer x2 Related Jobs

© 2019 Naukrijobs All Rights Reserved