Senior DevOps Engineer (JR1963414) - Sustainable Talent, Santa Clara, California, United States

Listing Description

Job Title: Senior DevOps Engineer (JR1963414)

Location: Santa Clara, CA

Job Description

NVIDIA is looking for a world-class engineer to join its multifaceted and fast-paced Infrastructure, Planning and Processes organization, where you will be working as a Senior DevOps Engineer. The position will be part of a fast-paced crew that develops and maintains sophisticated build & test environments for a multitude of hardware platforms, both NVIDIA GPUs and Tegra processors, along with various operating systems (Windows/Linux/Android). The team works with various other business units within NVIDIA Software such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their infrastructure and systems needs.

As a DevOps Engineer, you’ll also be working in conjunction with various teams such as software engineering to deploy these new products and manage our infrastructure, associated processes and systems. Keen attention to detail, problem-solving abilities, and a solid knowledge base are essential.

What you’ll be doing:

  • Participate in the design, implementation and enhancement of automated SW testing infrastructures for the automotive & mobile Tegra platforms.

  • Enhance the test harness and develop new modules for it in Python and shell scripts.

  • Develop, improve and maintain our infrastructure codebase.

  • Drive automation of monitoring to gain more insight into applications and system health.

  • Work with software engineering teams as well as internal support groups worldwide to ensure that our software is produced, tested and delivered to the customer in a consistent and effective manner that meets our world-class standards.

  • Develop and implement test plans, modify existing test infrastructures, create new test suites as needed and perform code reviews for peers.

  • Participate in bring-up tasks for new Tegra platforms.

  • Debug & fix existing and new tests, system configurations and hardware setups of Tegra platforms.

  • Maintain systems once they are live by measuring and monitoring availability, latency and overall system health

  • Implement & support end-to-end CI/CD system

What we need to see:

  • Strong object-oriented programming background, Java, Python preferred.

  • Experience maintaining cloud infrastructure and highly available production environments.

  • Excellent debugging, problem solving and analytical skills

  • Strong understanding of architectural requirements and development processes involved in building reliable, robust, scalable data products and pipelines

  • Background in databases, both SQL (MySQL) and NoSQL (Elasticsearch/MongoDB/Cassandra).

  • Proficient with configuration management tools like Ansible, Puppet & Chef

  • Strong background with CI/CD systems

  • Experience with Kubernetes, Docker & virtualization

  • Background with source code management and binary repository systems like GitLab, GitHub, Artifactory etc.

  • Experience with data analytics/visualization tools like Kibana, Grafana, Splunk etc.

  • Knowledge of monitoring systems such as Zabbix, Prometheus and/or similar systems.

  • Advanced knowledge of standard methodologies related to security.

  • 8+ years of proven experience

  • Bachelor's or Master’s degree in Computer Science, Software Engineering, or equivalent experience

  • Knowledge of and experience with Linux, Windows, Android and embedded operating systems like QNX is a plus

  • Experience working with mobile and embedded systems

  • Ability to work with a variety of disciplines, including technologists, software and hardware

Ways to stand out from the crowd:

  • Ability to analyze situations and utilize troubleshooting skills, systems and tools, and problem solving abilities

  • Prior experience with embedded & mobile systems

  • Previous experience with DevOps teams.

  • Thrives in a multi-tasking environment with constantly evolving priorities.

  • Prior experience with a large-scale operations team.

  • Outstanding interpersonal skills and communication with all levels of management.

  • Experience with using and improving data centers.

  • Background with computer algorithms and ability to choose the best possible algorithms to meet the scaling challenge.

  • Ability to break down sophisticated problems into simple subproblems and then reuse available solutions to implement most of them.

  • Ability to design simple systems that can work efficiently without needing much support.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.
