If you are motivated to conquer bold challenges and work with industry-leading technology platforms, we have an exciting opportunity available. ServiceLink, the unrivaled mortgage industry leader, is in search of a Site Reliability Engineer to ensure that important, revenue-critical systems stay up and running despite extreme weather conditions, bandwidth outages and configuration errors. The ideal candidate will have a passion for technology, exceptional problem-solving skills, excellent verbal and written communication skills, poise under pressure and strong collaborative skills. If you are confident in your ability to engage with others to promote our Serve First culture, we invite you to apply today. This is an exciting time to join ServiceLink, where those at the forefront of technology excel in our entrepreneurial culture focused on empowerment and innovation.
A DAY IN THE LIFE
In this role, you will…
· Design, write and deliver software to improve the availability, scalability, latency and efficiency of ServiceLink’s EXOS system.
· Manage on-call rotations using a follow-the-sun model for the EXOS platform.
· Own end-to-end availability and performance of 200+ microservices and build automation to prevent problem recurrence.
· Provide advanced operational support for the EXOS system and the underlying Azure Cloud infrastructure services.
· Work with the Release Organization to develop and mature CI/CD (Continuous Integration and Continuous Deployment) pipelines.
· Develop software on the .NET Core platform and write automation scripts in Python, PowerShell or similar scripting languages.
· Develop alarms, metrics and monitors in Splunk or a similar application to proactively troubleshoot production issues before they lead to incidents.
· Own production incident management to ensure that production issues are triaged, logged and fixed.
WHO YOU ARE
You possess …
· Bachelor's degree in Computer Science or related field, and/or equivalent years of relevant work experience.
· 3-6+ years of software development experience on the .NET Core framework.
· 3-6+ years of experience in Site Reliability Engineering (SRE) or DevOps Engineering.
· 2+ years of Azure Cloud Operations experience.
· Background in System/Software Operations, with a keen interest in troubleshooting problems and proactively identifying production issues.
· 3+ years of experience building CI/CD pipelines with Jenkins or a similar product.
· 1+ years of Azure DevOps experience, with a focus on Azure Pipelines, is preferred.
· 2+ years of experience with Splunk, Datadog or a similar tool, developing operations dashboards, alarms, metrics and monitors.
· 3+ years of experience with Python, PowerShell or a similar scripting language, automating mundane, monotonous work items.
· Proficiency working with algorithms, data structures and production troubleshooting.
· Expertise in problem solving and in analyzing global-scale distributed systems.