We are seeking an experienced Systems Engineer with experience and a focus on incident and request management, and experience with tools such as Dynatrace, Grafana, and Splunk. The role involves monitoring setup and tool administration, as well as breakfix for medium complexity tickets.
Req.#560199502
RESPONSIBILITIES
Develop and maintain documentation that describes best practices for logging and monitoring in the company
Conduct regular audits of logging and monitoring practices to ensure compliance with company policies and industry standards
Participate in cross-functional discussions and initiatives related to logging and monitoring best practices across the company
Manage monitoring, alerting, operability, and observability for applications using tools such as Dynatrace, Splunk, and Grafana
Triage tickets and update ticket details while assessing urgency
Review all documentation to escalate tickets that are beyond Level 2 troubleshooting skills
Warm handoff notes for escalated tickets
Leverage and create documentation for standard incidents and requests
Define the average time to complete per ticket and create an SLO for each product request type
Review and present metrics and escalated tickets regularly to document and shift the support process left
Manage incidents and requests for monitoring setup and tool administration in managing tickets using JIRA as a tracking tool
Be available for monitoring and escalation during off-hours, and weekends and carry pager duty for emergencies after hours
Review and present metrics and escalated tickets regularly to document and shift support process left
REQUIREMENTS
Bachelor’s degree in computer science or related fields and/or equivalent work experience
Strong knowledge of observability (monitoring, logging, and tracing)
Experience with Dynatrace, Splunk, Grafana, and other monitoring and logging tools
Experience with Azure logging and monitoring tools (Log Analytics, Azure Monitor, App Insights)
Experience in operating high-availability, fault-tolerant, scalable, distributed software in production
Soft Skills
Ability to work independently and as part of a team
Strong analytical and problem-solving mindset combined with experience troubleshooting under pressure
Strategic thinking, complex problem-solving, and analytical capabilities
Strong organizational and interpersonal skills, with experience developing and instilling a culture of operational maturity
Ability to adjust quickly to new technologies
Excellent communication skills and fluency in English
BENEFITS
Extended Healthcare with Prescription Drugs, Dental and Vision Insurance, and Healthcare Spending Account (Company Paid)
Maternity/Parental/Adoption Leave Top-up
Life and AD&D Insurance (Company Paid)
Employee Assistance Program (Company Paid)
Unlimited access to LinkedIn learning solutions
Long-Term Disability
Registered Retirement Savings Plan (RRSP) with company match
Paid Time Off
Critical Illness Insurance
Employee Discounts
Employee Stock Purchase Program
ABOUT EPAM
EPAM Systems, Inc. is an equal opportunity employer. We recognize the value of diversity and inclusion in creating success for our customers, business partners, shareholders, employees and communities. We are committed to recruiting, hiring, developing and promoting employees without discrimination. As a global employer, this commitment includes complying with all laws in the countries in which we operate. Nevertheless, we believe equal employment practices should not be limited to what the law requires. Equal opportunity and inclusion are essential to motivate, empower and recognize the best in everyone.
At EPAM, employment actions are based on individual qualifications, without regard to race, color, religion, creed, gender, pregnancy status, sexual orientation, gender identity, gender expression, marital or familial status, national origin, ancestry, genetics, age, disability status, veteran status, citizenship status when otherwise legally able to work, or any other characteristic protected by law.
Taking care of our environment has never been more important than it is today. The Nature Conservancy of Canada (NCC)...
Apply For This JobJob Description About Us Unparalleled premium hospitality and amenities with expert care and support to deliver a personalized senior living...
Apply For This JobYou’ve got the skills, we’ve got the tech. It’s a match made in Service Advisor heaven. Let’s make it official....
Apply For This JobEngineer, Cybersecurity Engineering (OT) Hawkins, TX, US, 75765Tilton, NH, US, 03276Carefree, AZ, US, 85377Clinton, MO, US, 64735Cochrane, ON, CA, P0L...
Apply For This JobENGINEERING TECHNOLOGIST TECHNICIAN 1 Job ID: 30135 Job Category: Engineering & Technical Division & Section: Engineering & Construction Services, ECS...
Apply For This JobRole: Bilingual Backoffice Administrator Line of Business: Client Onboarding (Top 3 Bank in Canada) Employment Type: 6 month contract to...
Apply For This Job