System Reliability Engineer
- Posted 22 August 2023
- LocationSingapore
- Job type Permanent
Company's Benefits
-
Paid Parental Leave
-
Return to Work Policy
-
Childcare Facilities
-
Flexible Working Arrangements
-
Mentorship Program
-
Breastfeeding Rooms
-
Sponsorship Program
-
Leadership Development Program
-
Coaching Program
-
Internal Women's Networking Group
Job Description
About Us
At Cloudflare, we have our eyes set on an ambitious goal: to help build a better Internet. Today the company runs one of the world’s largest networks that powers approximately 25 million Internet properties, for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine’s Top Company Cultures list and ranked among the World’s Most Innovative Companies by Fast Company.
We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us!
Production Engineering is responsible for the world’s most reliable, observable, performant, and safe network ecosystem. Our customers rely on our products and systems to safely modify, troubleshoot, and release products without external impact.
Our external customers rely on us to provide seamless and predictable incident, traffic, policy management, resulting in the fastest and safest network services in the world.
We are accountable for the overall performance of internal and external facing services, guiding our product teams to optimal configurations and maximum efficiency. From the moment that a packet enters the Cloudflare ecosystem, we know exactly what its expected purpose and behaviour is and we are capable of determining and exposing anomalous behaviour.
The Cloudflare network makes it possible to solve challenges at massive scale and efficiency which would be impossible for almost any other organization.
In this role, you can expect to:
Design, write, and deliver software that improves Cloudflare's Edge platform
Scale and evolve systems through software and automation to improve reliability and velocity
Work on highly distributed and scalable systems
Participate in the constant cycle of knowledge sharing and mentoring
Research and introduce cutting-edge technologies
Contribute to open-source
We are well-funded, growing quickly and focused on building an extraordinary company. This is a systems reliability engineering role and is a superb opportunity to be part of a high performing team and help to support Cloudflare’s mission and help build a better internet.
You will build services and APIs to constantly improve availability, performance, uptime and response times.
You may be a good fit for our team if you have:
Proficiency in distributed Linux/Unix environments
Proficiency in high-level programming (e.g., Golang)
Proficiency in configuration management (e.g., Saltstack, Chef, Puppet, Ansible)
Proficiency in networking protocols Layer 3-7 of the OSI model
Experience in performance analysis, debugging, and troubleshooting
Experience in SQL databases (e.g., Postgres, MySQL)
Experience in load balancing and reverse proxies (e.g., Nginx)
Familiarity with Key/Value stores (e.g., Redis)
Familiarity with Internetworking and BGP
Exquisite written and verbal communication skills
Strong bias for action
Bonus points if you have:
Experience with continuous integration and delivery (CI/CD)
Experience working in a 24/7/365 service environment
Experience with high-bandwidth transit Internetworking and routing
Passion for tooling and automation