Site Reliability Engineer (SRE with java development) Job at Compunnel Inc., Canada

WXFBdlFOVUdheWFvRUtueTFpMWNjbGpiTnc9PQ==
  • Compunnel Inc.
  • Canada

Job Description

Job Description

SRE

Toronto, ON

Contract: 6+ months (extendable)

Client is expecting a profile with development experience in Java or cloud with SRE.

Mandatory Skills: AWS, Cloud Watch, Lambda, Python , Monitoring tools like Dynatrace and Observability.

Responsibilities:

•Work in collaboration with Application Development, Quality, Product and Data Engineering teams to Champion SRE/ DevOps culture and practices.

•Strategic approach with clear objectives to improve service / product Availability, Performance Optimization, improve Incident MTTR, Change Success Rate and ensure feedback loop to Dev

•Build and maintain Reliable Systems and platforms using SRE and DevSecOps principles with special focus on Observability, Resiliency (proactive impact prevention), Self Healing and Reliability testing

•Work with App & Business teams to establish (SLO/SLI), SRE Dashboards that provide multiple views (LOB, business process or App) view to track value and enable effective decision making

•Innovative approach to Reliability, from Arch and feasibility phase to Operation & Continuous Improvement following product model and Agile methodologies.

•Focus on latest technology trends when it comes to Observability, Automation, Platform technology and tools including AIOps & MLOps reliability and resiliency.

•Ensure Toil is addressed from inception and addressed in Operations (self healing, self config, self Provision and optimization) by leveraging Sense & response, advanced monitoring (synthetic & RUM)

•Lead / Participate in Community of Practice (CoP) to connect and collaborate with like minded teams, set objectives, roadmaps, and implementation. SRE office hours and CoP leadership and participation.

Qualifications:

•SRE: In depth knowledge and experience in Observability, Toil Management, Monitoring tools (Dynatrace, CW, Azure Monitor), Resilient Arch, IaC, CaC, JSON, Typescript, API and Webhook development using Python, Node.js, Ruby, PowerShell, and Shell Scripting languages.

•Cloud Experience: In depth knowledge in Cloud Native tools / services: CDK, Cloud Watch, EKS, EC2, ELB, S3, Lambda, & SSM.

•In depth understanding of Dynatrace advanced features (DT Guardian, RUM, Synthetic testing and monitoring, AI event correlation)

•Experience in Logs ingestion (AWS Firehose, DT Open Pipeline), Reporting and Dashboard tools, Operational Metrics and analytics

•Automation: Leverage Ansible Tower, AWS SSM, BitBucket / GitHub to build automated workflow that eliminate Toil, improve response time and streamline deployment pipeline.

•Cloud Orchestration tools (AWS Step functions, Containers, Apache Airflow) with special focus on Data Batch Processing and Pipelines

•Deep knowledge in Data Management, Data Warehouse, Data lakes, & Database reliability (RedShift, RDS, Aurora), PostgreSQL, SQL Server, Oracle with DevOps experience.

•Exceptional Problem-Solving skills, Knowledge Management and effective communicator that can speak the language of people, process and technology.

•Decisive, energetic, focused team player who builds and leads high-performing teams / CoP and foster a culture of diversity, inclusion, recognition and growth.

Job Tags

Contract work,

Similar Jobs

Aughdem Recruitment

Senior Events & Conferences Manager Job at Aughdem Recruitment

Company Bio:Our client is a globally recognized events and conferences company with over 20 years of experience producing large-scale, high-impact experiences for Fortune 500 companies. As they expand into the U.S. market, they are establishing their first U.S. headquarters... 

Cardinal Health

Pharmacy Delivery Driver Job at Cardinal Health

 ...What Pharmacy Services & Delivery contributes to Cardinal Health Pharmacy Services & Delivery is responsible for the prompt and accurate delivery and distribution of radiopharmaceuticals or oncology pharmaceuticals to medical care providers in accordance with customer... 

Ecolab

Pest Control Technician Job at Ecolab

 ...Position requires the ability to obtain required pest certification and/or business licensing pursuant to state/local law Due to the nature and hours of work, must be 18 years of age or older Ecolab conducts a background check on all candidates who receive a job offer... 

Legrand North America

Cybersecurity Intern Job at Legrand North America

Position Description: At a Glance Legrand has an exciting opportunity for a Cybersecurity Intern to join the Building Control Systems Wattstopper Team in Carlsbad, CA. What Will You Do? Develop a deep understanding of the requirements of ioXt certifications... 

Pyramid Consulting, Inc.

Business Process Management Consultant Job at Pyramid Consulting, Inc.

Immediate need for a talented Business Process Management Consultant. This is a 12+months contract opportunity with long-term potential and...  ...include, but are not limited to, health insurance (medical, dental, vision), 401(k) plan, and paid sick leave (depending on...