A top global insurer is hiring a strong .NET or C# developer to join its infrastructure team as a (Senior) Site Reliability Engineer. This is a newly created headcount to build out a team, with eventual management/Team Lead opportunities.
Functional Duties:
- Set up and maintain monitoring of infrastructure and applications
- Build alerts and automated recovery for various operational issues
- Gather and analyse metrics from operating systems as well as applications
- Advise on performance tuning and fault finding
- Partner with development teams to improve services
- Assist in formulating preventive actions where possible, lead studies of potential failure scenarios, and formulate automated recovery methods
- Be comfortable working with new tools, e.g. Azure DevOps, Grafana, and Dynatrace
People Management Duties:
- Act as an advisor on applications and support application teams in establishing recovery processes
Requirements:
- Bachelor's degree in Computer Science or related field
- Databases (2 or more): MSSQL, MySQL, NoSQL, KQL
- Programming Languages (2 or more): .NET, C#, C++, Java 8, Python
- OS: Linux (RHEL or SUSE) or Windows Server
- Scripting (at least 1): Shell, Bash, PowerShell
- Knowledge of the open-source distributed version control system Git
- Sound knowledge of how REST APIs work
- Experience with Atlassian tools (Jira, Bitbucket, and Confluence)
- Familiarity with Azure Cloud services
- Working experience with ITIL in an Agile environment
Good to have:
- Experience with containerization (Docker, AKS, ACR, EKS, ECS)
- Experience in CI/CD with Azure DevOps
- Experience in dashboard development with Grafana, Elasticsearch, Azure Monitor, or Dynatrace
- Experience in infrastructure management with Terraform or Ansible
- Azure or AWS cloud certification would be an added advantage