Similar Jobs

  • Spry Info Solutions, INC

    SOFTWARE ENGINEER (SITE RELIABILITY)

    Santa Clara, CA, United States

    • Ending Soon

    We are looking for a site reliability engineer with an expertise in Splunk configuration, setup and monitoring. Responsibilities: Design, develop, document, analyze, create, test and modify the log analytics to maintain different Reports, Dashboards and interfaces to and from external systems. Implement integration to external system to develop Spl

    Job Source: Spry Info Solutions, INC
  • Wayve

    Site Reliability Engineer - Onboard Software

    Mountain View, CA, United States

    • Ending Soon

    At Wayve, we're not just another autonomous vehicle company. We stand out with our revolutionary approach to self-driving technology, embracing the power of embodied AI to redefine the boundaries of what's possible. While others depend on static maps and rigid rules, we believe in a future where vehicles perceive, understand, and navigate the world

    Job Source: Wayve
  • Zoox

    Site Reliability Engineer

    Foster City, CA, United States

    Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from designing systems that are easy to maintain and fault-tolerant throu

    Job Source: Zoox
  • Cryptoware Technologies Inc

    Site Reliability Engineer

    Santa Clara, CA, United States

    • Ending Soon

    Job Description Responsibility • Lead the effort of global expansion of Huobi globe spanning infrastructure. • Work with engineering teams to make sure new features and changes are deployed quickly and safely. • Constantly improve our system performance and reliability through better tools, process and monitoring system. • Staff

    Job Source: Cryptoware Technologies Inc
  • Apple, Inc.

    Site Reliability Engineer

    Cupertino, CA, United States

    • Ending Soon

    Summary Posted: Mar 29, 2024 Role Number: 200545395 The Apple Information Apps Engineering teams power some of the most widely used Apple applications, such as Apple News, Stocks, Weather, and Books. We do this at a massive, global scale. We meet our high expectations through dedication to best practices, which enables us to deliver a vast array

    Job Source: Apple, Inc.
  • Apple Inc.

    Site Reliability Engineer

    Cupertino, CA, United States

    • Ending Soon

    The Apple Information Apps Engineering teams power some of the most widely used Apple applications, such as Apple News, Stocks, Weather, and Books. We do this at a massive, global scale. We meet our high expectations through dedication to best practices, which enables us to deliver a vast array of information that people worldwide use daily in over

    Job Source: Apple Inc.
  • NVIDIA

    Site Reliability Engineer

    Santa Clara, CA, United States

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and outstanding people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers

    Job Source: NVIDIA
  • Wayve

    Site Reliability Engineer - Onboard Software (Senior)

    Mountain View, CA, United States

    • Ending Soon

    Job Overview At Wayve, we're not just another autonomous vehicle company. We stand out with our revolutionary approach to self-driving technology, embracing the power of embodied AI to redefine the boundaries of what's possible. While others depend on static maps and rigid rules, we believe in a future where vehicles perceive, understand, and navig

    Job Source: Wayve

Software Engineer - Site Reliability Engineer

Sunnyvale, CA, United States

Description

Software Engineer - Site Reliability Engineer

Qualifications

A love of solving hard problems

Putting your customers first, whether they be internal or external, and making them more productive, happy, and successful

Experience with Azure AKS, AWS

Experience with Kubernetes, ECS, EKS, or other container orchestration system

Experience with an infrastructure-as-code system: Ansible, Terraform, CloudFormation, CDK, etc.

Experience with logging systems: Splunk, EventHub, ELK, etc.

Bachelor's degree in Computer Science or a similar field, or equivalent experience

Experience creating automated solutions & eagerness to automate

Responsibilities

Experience monitoring services and infrastructure, log collection, analytics, and application performance monitoring (APM)

Improve metrics on our main services, and act as a subject matter expert for dev teams

Recommend and guide improved monitoring and alerting processes

Identify performance bottlenecks and provide recommendations for improvement

Proactively identify and solve problems that we didn't even know we had

Help build, deploy, and scale a load testing environment that is analogous to production

Enforce security and operational safety controls

Experience with Performance testing or Chaos testing a plus

Contribute to the architectural improvements to meet future scaling and observability requirements

Strong performance issue triaging skills: log analysis, thread dump analysis, and heap dump analysis.

Self-motivated individual who is proactive in driving tasks to completion.

Participate in the on-call rotation (the team is spread across America and Europe, so you can sleep at night!), answer developers' questions, and attend incidents

At least 5 years in a Reliability Engineering, DevOps or infrastructure focused role

Advanced experience with programming languages (GoLang, Python, Java)

Passion for designing and building reliable systems

Experience in managing and scaling distributed systems in a public, private, or hybrid cloud environment

Deep systems and infrastructure knowledge

Advanced knowledge and hands-on experience with CI/CD systems

Automation advocate: you truly believe in removing operational load with software

Education: Bachelor's degree
