
Site Reliability Engineer – Triage #4993


  • US-Remote Employee Location
  • Information Technology

Company Overview

GovCIO is a team of transformers—people who are passionate about transforming government I.T. We believe in making a difference by developing digital strategies and delivering the technology-related innovation that improves governmental operations each day.

But we can’t do it alone. We welcome and nurture an inclusive and diverse work culture, because different backgrounds, experiences, abilities, and perspectives make us better decision-makers, problem solvers, and creators. We’re changing the face of I.T. – from our diverse staff to the end products we develop. And we’re excited to expand our team. Are you ready to be a transformer?

As a Site Reliability Engineer – Junior, you will apply site reliability process transformation skills to help build processes that manage and improve OIT’s response posture to system events impacting end users and Veterans. This includes working with business partners to improve communication and responsiveness to application failures, minimizing performance degradation and loss of availability, and working toward a significant reduction in application downtime and user impact. You will work with a team of junior and senior site reliability engineers, supporting a site engineer team lead in producing the required deliverables.

Responsibilities

 Areas of support include:

  • Support design and implementation of improvement planning, data analysis, assessments, and organizational strategies.
  • Support and provide guidance for tracking complex business procedures to achieve goals and overcome barriers in collecting technical information from relevant stakeholders or in developing content for white papers and other communication devices; assess and evaluate the effectiveness of executive communication to effect process improvement.
  • Support establishment, coordination, and facilitation of professional learning communities.
  • Support Triage efforts during Major Incidents by deconstructing application performance, interoperability, instrumentation, and human factors to facilitate resolution and development of resilient solutions.
  • Support coordination and ensure all High Priority Incidents (HPIs) and Critical Priority Incidents (CPIs) are triaged properly and routed to the appropriate groups for immediate resolution.
  • Provide support to Problem Management’s enterprise root cause analysis (RCA) processes in collaboration with appropriate OI&T organizations.
  • Demonstrate proficiency with DevOps tools, JIRA, ServiceNow, and MS Project, and perform tasks using these tools.

Qualifications

Bachelor’s degree with 8+ years of experience (or commensurate experience)

Required Skills and Experience

  • Should be well versed in the concepts of DevOps and have a full understanding of Site Reliability Engineering (SRE) principles.
  • IT background and ability to understand technical content with expertise across multiple technology areas and the ability to diagnose complex issues throughout many technologies.
  • Must be able to identify and mitigate risks to the product.
  • Must be able to provide oral and written discussion of analytical findings using narrative and graphic forms.
  • Must be able to use qualitative and quantitative analytical skills to assess the effectiveness of the operations, identifying symptoms for process improvement.

Preferred Skills and Experience

  • Bachelor’s Degree is preferred in Business Administration, Business Management, Computer Science, Information Systems, Information Resource Management, Industrial Engineering, Operations Research or related fields.
  • 5+ years of relevant experience
  • Certifications in relevant UX software plus 3-5 years of relevant experience
  • 8 to 10 years of relevant experience may be substituted for education (13-15 years total)
  • Analytical, investigation, and organization skills.
  • Communication skills, including the ability to craft content for executive-level presentations.
  • Experience with issue tracking tools and project management software (e.g., ServiceNow, JIRA, Microsoft Office).

 



GovCIO is a team of transformers — people who are passionate about transforming government I.T. Every day, we make a positive impact by delivering innovative IT services and solutions that improve how government agencies operate and serve our citizens.


But we can’t do it alone. We need great people to help us do great things – for our customers, our culture, and our ability to attract other great people. We are changing the face of government IT and building a workforce that fuels this mission. Are you ready to be a transformer?


GovCIO is a team of professionals who want to make a difference. And that can only happen with a diverse, happy, and cared-for team. So we prioritize your well-being and equity for all, and we look for ways to make work a better place for each of us every day.


We are an Equal Opportunity Employer.


All qualified applicants receive consideration for employment without regard to race, ethnicity, religious affiliation, gender, gender identity or expression, sexual orientation, national origin, or disability status. EOE AA M/F/Vet/Disabled


Compensation Range (in compliance with Colorado's Equal Pay for Equal Work Act, for remote positions or positions located in CO)

$110,000 - $130,000
