SiriusXM is looking for a strong collaborative team player to work within our Automatic Labs (Connected Car) division on the Infrastructure DevOps Team -- specifically focusing on scalable deployments of both infrastructure and security as code. Developer Operations is responsible for the overall performance and reliability of our infrastructure and products. Our team designs and implements the tools that automate building reliable and performant systems. We provide the platform for our developers from local all the way to production.
As a member of our team, you would spend time scaling our infrastructure and improving the backbone of our infrastructure services. The qualifying candidate will be aiding in administering, maintaining and supporting applications, and hardware deployed across Automatic Lab's microservices architecture. Candidate will perform a wide variety of software/hardware configuration/maintenance and operational support tasks of many diverse back-end systems, including, but not limited to, AWS Infrastructure, Postgres/SQL databases, and various application servers. Candidate will be responsible for active monitoring of environments, working with development teams to guarantee uptime/stability. The position is considered to be on-call, and the candidate will be expected to ensure the coverage of various after-hours deployments/maintenance windows. This position is an individual contributor/hands-on role, which includes expanded responsibilities surrounding all infrastructure support including application monitoring, software build machines, and software support servers. The candidate should have a penchant for solving tough technical problems and dedication to ensuring high-availability.
Duties and Responsibilities: Automate as much as humanly possible and always configure as code Advocate and implement reliable design patterns (circuit breakers, graceful degradation, etc.) Setup, configuration, and administration of Linux Servers on Amazon Web Service (AWS) infrastructure Networking (TCP/IP) configuration including Firewall ACL's and security Collecting, monitoring and analyzing systems performance data for improved performance Perform custom application maintenance, to include debugging, installing new application releases, and patching Develop, document and maintain procedures for administering, maintaining, and supporting infrastructure Create automated tasks with shell scripts (bash, python, etc.) Work with server engineering teams on design and implementation of a large scale distributed system
Minimum Qualifications: Minimum of 5 years of full-time Software Engineering experience Experience with virtual infrastructure and support (VMWare/AWS) Degree in Computer Science/Engineering, Information Sciences Technology or related fields Equivalent experience will be evaluated and considered
Requirements and General Skills: Have worked previously in a production AWS environment Experience with configuration management tools (Chef, Puppet, Ansible, Salt) Experience with secrets and credentials management Knowledge of system basics which includes DNS, logging, cron, process management, and package management Experience with containerization platforms such as Kubernetes and Docker, and familiarity with microservices management a plus Must be able to design, implement and support solutions based on the needs of the business and based on industry professional and security best practices. Exhibit planning and coordination skills with the ability to prioritize new work as forthcoming Pay attention to details and be organized. Identify problems, recommend solutions and perform triage in a team environment. This position requires 24x7 availability for support and after hours work in order to support the availability and uptime requirements of the business. Must have legal right to work in the U.S.
Technical Skills: Linux knowledge in a virtualized environment is required. Proficient with a terminal in a Linux environment. Experience with deployment of applications/code. Experience with development / CI applications such as SVN, GitHub. Knowledge of networking concepts. Proficiency in managing systems within multiple sites including failover redundancy. Postgres, SQL, and NoSQL Database knowledge is a plus. Understanding of Internet security best practices. Cloudformation and or Terraform knowledge a plus
Equal Opportunity/Affirmative Action Employer - Minorities/Women/Protected Veterans/Disabled
The requirements and duties described above may be modified or waived by the Company in its sole discretion.