SendGrid is looking for someone to help take our mission-critical cloud email management system to the next level. This system will manage our email infrastructure cluster – it will orchestrate the provisioning, load balancing, dynamic configuration/re-configuration, monitoring and spend optimization of servers across providers.
What you’ll do
Take personal responsibility for the availability and reliability of our service
Author tools that reliably manage infrastructure. We're looking for someone to write clean, re-usable code.
Write maintainable code with extensive test coverage
Support our existing production cluster management system while you improve it
Work on our server image configurations, collaborating with core server engineers to optimize for task performance, reliability, failover and scale
About you
You have the passion to "do server management infrastructure right"
Solid linux system administration
Experience with at least two of the following: Ruby, Perl, Python (a lot of our current code is Ruby but we use whatever tool is best for the job at hand)
Strong familiarity with the SMTP protocol
Socket programming experience
Computer Science / Engineering degree
A distributed systems foundation and a service-oriented mindset
You've "carried the pager" before (ideally at both a startup and a large infrastructure provider) & have first-hand experience with what happens when infrastructure / tools fail
A minimum of 5 years of coding experience
You are a prolific coder who works well independently
You have great communication skills
You have formal training in computer science
Bonus Points
You’ve written software tools to manage 1000+ servers
You are conversant in the pros and cons of different clouds: Softlayer, Rackspace, etc
You’ve made a substantial contribution to a widely used open source project
You read up on and experiment with new technologies because it’s in your nature, not because it’s a job requirement
You don’t just learn how things work, you learn why