Software Engineering Lead (Infrastructure) – Multiple Locations

  Clinical Research

Job title: Software Engineering Lead (Infrastructure) – Multiple Locations

Company: UnitedHealth Group

Job description: Combine two of the fastest-growing fields on the planet with a culture of performance, collaboration and opportunity and this is what you get. Leading edge technology in an industry that’s improving the lives of millions. Here, innovation isn’t about another gadget, it’s about making health care data available wherever and whenever people need it, safely and reliably. There’s no room for error. Join us and start doing your life’s best work.(sm)

At OptumLabs, the Infrastructure Engineering team is responsible for work in the following categories:

  • Engineer Productivity Tools – these are tools engineers use to simplify the entire SDLC process for software and data products while ensuring our quality, security, and delivery standards are being met. Examples include build tools, deployment tools, code scanning tools, work tracking, artifact repos, etc.
  • Platform Services – a set of EIS approved Infrastructure as Code templates for items such as networking, certificates, hosting, storage, database compute, secret management, messaging, eventing, etc. These are created and managed to provide a secure, scalable, reliable, HA/DR, and performant environment to run and operate software and data products. Platform will support data sovereignty requirements where required
  • SRE/Operational Tools – responsible for the overall operation of the platform services and infra. On-call rotation for resolving issues as well as measuring and meeting SLA, SLO, SLI standards. Integrate, create and/or manage tools/frameworks including dashboards which engineers use to run, operate, and support the platform services as well as the products utilizing them. Examples include logging, monitoring, incident management, and instrumentation.

Skills Required:

  • Embody company core values and reinforce them through speech and actions
  • View stability as a product
  • Believe team collaboration and empowerment are critical
  • Passion for testing / quality – planning for failure, chaos engineering
  • Strong belief in the agile manifesto principles
  • Enthusiastic about developer platforms for continuous delivery of software and infrastructure changes.
  • Driven by customer success
  • Good analytical skills
  • Clear understanding of security best practices through entire SDLC

Primary Responsibilities:

  • Actively participate in on-call rotation for incident resolution for the platform and/or any dependent components which the product engineering teams rely on for their work
  • Maintain and improve operational tooling, frameworks, perform chaos engineering activities
  • Perform root cause analysis and deliver resolution for tools and automation failures
  • Build frameworks that test the performance and resiliency of our platform services/tools
  • Build/integrate/administer systems and tools that enable engineering teams to observe their applications in production with autonomy (Dashboards, APMs)
  • Automate alerts for metrics on performance, cost, vulnerabilities, risk, compliance violations
  • Identify and measure SLOs, SLAs and SLIs
  • Improve processes/runbooks and champion automation of any manual items around support
  • Build and manage tooling as CLIs, APIs, and/or libraries to simplify the entire software development lifecycle. These tools must empower product engineering teams to be more autonomous and productive by allowing them to focus on their craft versus the plumbing required to build, deploy, and run their solutions
  • Create and manage standard CI/CD pipelines which build, test, scan, package, and deploy software and data applications. These pipelines are driven using GitOps practices and are driven automatically based on our pull request standards
  • Build, maintain, and operate the cloud hosted platform which all cloud native solutions are deployed on. This platform provides standard approved infrastructure as service templates, seamless integration with centralized logging, metrics dashboards, instrumentation, incident monitoring and management for any deployed applications
  • Ensure through automated testing/chaos engineering the IaC templates provide infrastructure which is secure, scalable, highly-available, maintainable, and performant
  • Coach and mentor other engineers on the team when required
  • Ensure all released products meet our SDLC standards including security approvals
  • Administer necessary IT systems, development tools, and productivity tools like wikis, source control, artifact repositories, logging/APM tooling, cloud subscriptions, and other external systems required for engineering teams to collaborate and effectively do their jobs. Must favor self-service and automation over manual administration where possible.
  • Required Skills/Experience:
  • Developing cloud-native applications using one or more languages (Typescript, .NET Core (C#) are preferred)
  • Deploying and operating cloud-native applications in a public cloud (Azure preferred)
  • Implementing end-to-end DevOps functions for cloud native containerized applications/APIs (Azure DevOps – preferred, Jenkins, Ansible, Gitlab)
  • Providing administrative engineering functions to an organization (i.e. implementing single-signon, automating app registrations, automating certificate management, etc.)
  • Experience supporting applications and/or infrastructure in production environments
  • Experience with Docker and Kubernetes (Azure Kubernetes Service preferred) in production
  • Strong Git skills
  • Experience using centralized logging solutions (Splunk, Elk, etc.)
  • Experience using active monitoring systems (Datadog, New Relic etc.)
  • Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so

Required Qualifications:

  • Developing cloud-native applications using one or more languages (Typescript, .NET Core (C#)
  • Deploying and operating cloud-native applications in a public cloud (Azure)
  • Supporting software and/or cloud-infrastructure in an on-call rotation basis to help with identification and remediation of technical problems at the root cause
  • In-depth and proactive communication skills around status of projects/issues
  • Experience with Docker and Kubernetes (Azure Kubernetes Service preferred) in production
  • Strong Git skills
  • Experience using centralized logging solutions (Splunk), Elk, etc.)
  • Experience using active monitoring systems (Datadog, New Relic, etc.)

Preferred Qualifications:

  • Architecting and implementing cloud infrastructure
  • Using Terraform or similar tools for creating infrastructure as code
  • Site Reliability Engineer Focus
  • Implementing dashboards to help teams visualize logs, instrumentation, and other data to ensure optimal performance of the platform services, infra, and deployed applications. (Grafana)
  • Experience creating runbooks, processes, and test plans around reliability, performance, etc. of infra/applications.
  • Experience planning and supporting +99.99% availability against critical applications in production

Careers with Optum. Here’s the idea. We built an entire organization around one giant objective; make health care work better for everyone. So when it comes to how we use the world’s large accumulation of health-related information, or guide health and lifestyle choices or manage pharmacy benefits for millions, our first goal is to leap beyond the status quo and uncover new ways to serve. Optum, part of the UnitedHealth Group family of businesses, brings together some of the greatest minds and most advanced ideas on where health care has to go in order to reach its fullest potential. For you, that means working on high performance teams against sophisticated challenges that matter. Optum, incredible ideas in one incredible company and a singular opportunity to do your life’s best work.(sm)

Expected salary:

Location: Bangalore, Karnataka

Job date: Wed, 13 Jul 2022 22:52:50 GMT

Apply for the job now!