

Site Reliability Engineer

Oakland, CA
Full Time

Mission to reshape science

Breakthrough discoveries once occurred when scientists like Alexander Fleming noticed a halo of dead bacteria surrounding a mold colony. But today's breakthroughs often go unnoticed because they are buried in uninterpretable spreadsheets, inaccessible data, or excessive experimental noise. Riffyn’s mission is to unlock the full potential of today’s science by delivering clean, connected, meaningful data the moment it is collected.

At Riffyn, we think beyond the conventional. We dream big. Our mission is to help accelerate the science that improves all of our lives and our planet’s quality and sustainability.

Riffyn believes there is a better way to approach research and development. And that is to empower scientists to make better decisions and faster discoveries by unlocking the power of data. Our technology and people change lives by driving advancements in biological research, medicines, sustainability, and more. If that inspires you, join us.

Life at Riffyn

Life at Riffyn is about working with teammates who are deeply passionate about what we do because we know that our work has the power to change lives. We approach our work with trust and honesty. And we’ve built a diverse team and culture fueled by collaboration and openness.

Our employees developed our five core values, and these extend beyond the work we do and into how we support each other. We are continually evolving to ensure that every employee feels appreciated and encouraged to do what matters for themselves, the company, and each other.

Riffyn’s core ethos

· Get there honestly – have the courage to take the right path

· Do what matters – own the why of the work we do

· Make it fun – humor separates us from the machines

· Work well together – trust builds the strongest teams

· Keep evolving – challenge the status quo

The role

Riffyn is looking for a Site Reliability Engineer (SRE) to work on our flagship product, Riffyn Nexus. The SRE will work with teams across the organization to build and maintain auditable, secure, performant, reliable, and highly scalable software systems.

We are focused on engineering excellence and feature development at scale, which delivers a seamless and intuitive user experience on a rock-solid platform that will continue to amaze our growing user base for years to come.

This role offers interesting data problems (scientists perform a lot of measurements), room to experiment and find the best solution, and an expectation that you will influence Riffyn's engineering practices. We operate a microservices architecture to enable independent development, scale, and testability of the different components in our infrastructure.

You won’t find egos here: our team is inspired by our mission, energized by our challenges, and hungry to learn.

Your responsibilities


  • Design, develop, and iterate on chaos engineering practices, disaster recovery protocols, and runbooks.
  • Debug production issues across all services and levels of the stack.
  • Monitor systems and proactively address early symptoms rather than only reacting to outages.
  • Implement and educate the team on best practices for monitoring, observability, reliability and security.
  • Devise and implement infrastructure auto-scaling to support both low and peak usage of the application.
  • Build tooling and automation to reduce toil.
  • Work with engineering to resolve application issues.
  • Collaborate with the team to determine which features to build and how to architect them.
  • Help maintain an infrastructure stack that is optimized from both an application and a cost perspective.
  • Be part of the on-call rotation.
  • Adhere to and support Riffyn’s Information Security Management System (ISMS) policies and procedures.

Your background and skills

  • 3+ years of professional experience with a security focus.
  • Experience with AWS (specifically around networking, security, and administration).
  • Experience with Linux system administration.
  • Experience developing, monitoring and troubleshooting mission-critical secured systems.
  • Ability to dive 2-3 levels deep into logs to investigate the root cause of an issue, and to implement or recommend solutions to engineering teams.
  • Experience scaling a microservices architecture to handle large loads.
  • Experience with infrastructure as code (IaC) and with Kubernetes, Terraform, Helm, Docker, and Ansible at scale.
  • Track record of building and maintaining performant microservice-oriented software products.
  • Knowledge of how to build resilience and durability into cloud infrastructure.
  • In-depth knowledge of MongoDB, SQL, and messaging or streaming systems.
  • Knowledge of Node.js, Java, and Python, and willingness to work in TypeScript.
  • Demonstrated ability to proactively manage tasks, projects, and issues.
  • Authorization to work in the U.S.

Bonus points:

  • Experience with AWS, DynamoDB, Kafka, encryption, and data warehouses.
  • Familiarity with distributed and parallel processing data pipelines.
  • Familiarity with building RESTful APIs.
  • Familiarity with building real-time applications.


If you thrive in a fast-paced, down-to-earth, collaborative, and mission-driven environment, Riffyn is the place for you. Join the journey of a lifetime and be at the forefront of the next revolution in scientific discovery.

Apply with your CV, cover letter, and any portfolio of materials that illustrate your skills (presentations, websites, data analysis, etc.).

Apply for this job


Mariana Maya

Sr. Digital Marketing Manager
