Outcome Logix LLC

Outcome Logix LLC is looking for a Sr. Site Reliability Engineer.

This is a long-term project (12-24 months) for a Sr. Site Reliability Engineer (SRE) who will get engaged early in the software development life cycle to build capabilities that enhance reliability and scalability. The primary goals of this model are to reduce Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR), increase system availability, and reduce overall incidents. The SRE will work within a development scrum team to build reliability features as the team builds features, collaborating with business product owners, developers, support team members and quality analysts to help them drive value delivery.

What you'll be doing (Responsibilities):
· Implement capabilities like logging, metrics, distributed tracing and chaos engineering
· Drive SRE activities within a line of business
· Develop dashboards, alerts, and monitoring for various systems hosted in the cloud or on-premises
· Drive initiatives specific to SRE and automation using SRE and DevOps concepts and monitoring tools like AppInsights and Log Analytics
· Drive engagements with development and business teams to define key business and system metrics
· Develop SRE capabilities to meet SLI/SLO/SLA requirements
· Create an error budget for each component and an availability dashboard, and set up fast-burn and slow-burn alerts
· Perform chaos engineering by artificially injecting faults into systems to simulate SLO failures
· Design, code, test, and implement automation using .NET or Java, Python, and any scripting technology, following the Git process
· Coordinate structured walkthroughs and technical reviews to ensure reliability, resiliency and scalability
· Ensure overall quality through continuous monitoring in the development cycle
· Assist in the production support and maintenance of applications as needed

Recommended Skills:
· AppInsights
· Azure Monitor
· C#
· Power BI
· PowerShell
· Site Reliability

Tagged as: Engineer

Job Overview