Site Reliability Engineer
Company: Blue Ocean Ventures
Location: Warren
Posted on: March 21, 2023
|
|
Job Description:
Site Reliability Engineer Responsibilities: # Introduce
enterprise capabilities, tools, and innovation improving
availability in a multi-cloud ecosystem by evolving observability,
monitoring, logging, CI CD integration(performance, smoke,
regression , functional, chaos and environment propagation through
automatic deployments) # Introduce continuous improvement,
standardization automation, capabilities to conduct destructive and
resiliency testing # Consistent track record of troubleshooting an
d resolving issues in live production environments and implementing
strategies to eliminate them # Driven approach to continually
improving service levels # Build and manage systems,
infrastructure, and applications through automation # Deploy, suppo
rt, and monitor new and existing services, platforms, and
application stacks # Engage in improving the whole lifecycle of
services from inception through deployment, operations, and
refinement # Provide hands-on technical expertise during service
imp acting events # Collaborate with other engineers on code
reviews, internal infrastructure improvements and process
enhancements # Use scalability testing to measure, tune and
optimize system performance # Automate key SRE metrics and IT
Service Opera tions processes including customer impact,
availability of critical business flows, SLO SLI adherence, error
budget, automate incident process for IT Service Operations through
data integrating with unified communications, alerting notification
sys tems # Participate in periodic 24x7 on-call duties # Share
support responsibilities for critical applications and customer
journeys onboarded to SRE including remediation of issues through
Agile, conduct blameless postmortems, root cause analysis and
introduce continuous improvement solving problems once and for all
with the goal of no repeats. Required Qualifications # Experience
with Observability Monitoring technologies like Splunk, Signalfx
(Cloud Obervability), Splunk-OnCall, Rigor and Azur e Monitoring #
Experience with one or more Cloud Platforms (Azure, GCP, AWS) #
Experience with Container technologies: Kubernetes, Docker, AKS #
Experience setting up monitoring in infrastructure, applications
and database # 3 years of systems s.
Keywords: Blue Ocean Ventures, Elizabeth , Site Reliability Engineer, Engineering , Warren, New Jersey
Click
here to apply!
|