Yes, a good escalation policy would have a primary responder, a backup or secondary, and then one or more managers, going up the hierarchy. PagerDuty supports that and my next post will be on that topic.
Having a good device is important too though; if you sleep through or miss an alert, it may take another 10-20 minutes or so (depending on the escalation policy) before the alert escalates to the next person. This slack time could be pretty important depending on the severity of the problem.
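To make the timing concrete, here is a rough sketch of how escalation delays add up. The `EscalationLevel` class, the `notified_so_far` helper, and the timeouts are all made up for illustration; this is not PagerDuty's API or its actual behaviour.

```python
from dataclasses import dataclass


@dataclass
class EscalationLevel:
    responder: str
    # Hypothetical: minutes to wait for an acknowledgement before escalating.
    timeout_minutes: int


def notified_so_far(levels, minutes_since_alert):
    """Return the responders notified by now, assuming nobody has acknowledged."""
    notified = []
    elapsed = 0  # minutes after the alert at which the current level is paged
    for level in levels:
        if minutes_since_alert < elapsed:
            break
        notified.append(level.responder)
        elapsed += level.timeout_minutes
    return notified


# Illustrative policy: primary, then secondary, then a manager.
policy = [
    EscalationLevel("primary", 15),
    EscalationLevel("secondary", 15),
    EscalationLevel("manager", 30),
]

for minutes in (0, 10, 15, 30):
    print(minutes, notified_so_far(policy, minutes))
```

With these assumed timeouts, a primary who sleeps through the page costs a full 15 minutes before anyone else even hears about the problem.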
This is a good point... the first step is getting ahold of the right person, but after that there will probably need to be some sort of dialog or coordination as the on-call person tries to gather more data about the issue, reproduce the problem, test the fix, etc. Providing that sort of system would certainly be very useful for your customers.
Overall great idea with PagerDuty though, especially if one's business relies (survives) on their website's/system's uptime. Reducing MTBF is often very hard, especially after a certain point, and reducing MTTR is therefore very important for improving availability.
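A quick back-of-the-envelope check of the MTBF/MTTR point, using the standard steady-state formula availability = MTBF / (MTBF + MTTR). The numbers are illustrative assumptions, not measurements from any real system.

```python
def availability(mtbf_hours, mttr_hours):
    """Steady-state availability: the fraction of time the system is up."""
    return mtbf_hours / (mtbf_hours + mttr_hours)


# Assumed baseline: one failure every 30 days (720 h), 2 h to recover.
baseline = availability(720, 2)
# Option A: double the time between failures (usually the hard one).
double_mtbf = availability(1440, 2)
# Option B: halve the time to recover (e.g. by reaching the right person faster).
half_mttr = availability(720, 1)

print(f"baseline:    {baseline:.5f}")
print(f"double MTBF: {double_mtbf:.5f}")
print(f"half MTTR:   {half_mttr:.5f}")
```

Under this formula, halving MTTR gives the same availability as doubling MTBF, since 1440 / 1442 equals 720 / 721.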
Integration with a ticketing system is a great idea. We are thinking of adding support for Lighthouse, so you can coordinate, document and work with other people on resolving triggered alarms.
~About us~
OpsLevel is a very early-stage startup based in Toronto, ON. We're an experienced team of PagerDuty, Amazon, and Shopify alumni. We have paying customers and just closed our seed round with fantastic Silicon Valley investors.
We want to make the Internet better by helping companies build more reliable software. We're building a product to bring the site reliability best practices from companies like Netflix and Google to everyone else.
Our users' experience is paramount, so we relentlessly focus on all aspects of UX, from our UI to our APIs. As developers, we also take great care with code quality, maintainability, and scalability.
Work/life balance is a priority for us. We have family, friends, and hobbies that we want to attend to at the end of the day (and we suspect you do too). Elon Musk can keep his 100-hour weeks. We're happy to grow fast at 40 hours per week.
~About you~
You'll be one of our initial hires, working closely with our CEO, CTO, and other senior employees.
Your day-to-day will be writing software, but you'll ultimately touch many aspects of the business: talking to customers, defining new features, and then actually implementing those features end to end. We don't have a lot of process or structure, so you should be good at working independently and getting stuff done.
On the technical side, we care most that you have an insatiable curiosity around technology and software. You care about improving your craft and can demonstrate how you've done so.
~Stack:~
- Vue.js - Frontend
- Ruby/Rails - Backend / API
- Currently on Heroku (migrating to Terraform, Docker, and AWS)
More info: Email john@opslevel.com.