 
        
        System Engineer, Messaging and Streaming Team
Would you like to help implement innovative cloud computing solutions and solve the most complex technical problems? Are you excited by the prospect of building and running the world's largest cloud computing infrastructure to provide a better world for future generations? 
Amazon Web Services (AWS) builds and operates some of the largest internet infrastructure on the planet; providing companies of all sizes with an infrastructure web services platform in the cloud. With AWS, customers provision compute power, storage, database, and other cloud resources as their business demands them. 
AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. 
If you join us you’ll be part of a world-class team in a dynamic environment that has the entrepreneurial feel of a start-up. As a member of the team providing EC2 services you will be delivering foundational capability that benefits all customers! 
This is an opportunity to operate and engineer systems on a massive scale, and to gain world class experience in cloud computing. You'll be surrounded by people who are passionate about cloud computing, believe that first class service is critical to customer success, and are committed to improvement. 
Top reasons to join our team: 
- Be a catalyst to deliver a truly disruptive products that are growing rapidly 
- Solve unique and first-order problems at massive-scale across many AWS Services 
- Learn how to build and operate distributed systems at massive scale 
- Build and influence the tools and utilities that are part of the AWS fleet running our internal services 
Key job responsibilities: 
- Work proactively to solve potential problems and inefficiencies. Communicate clearly and collaborate with others to deliver results with minimal supervision. 
- Participating in 24/7 on-call rotation to troubleshoot high severity issues 
- Analysing dashboards and investigating metrics with the vision for improvements 
- Create and maintain Standard Operating Procedures (SOPs) and runbooks for documentation 
- Discuss radical new approaches to automate operational issues, assess risks and develop creative solutions. 
- Develop strategies for resolving identified problems to prevent future occurrences 
- Assist others in the team 
About the team: AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. 
Basic Qualifications
- Experience writing scripts from scratch for automating manual tasks (BASH, Python, Perl, Ruby or similar) 
- Solid background in Linux. Familiarity with in depth troubleshooting and ability to solve complex technical problems 
- Knowledge of network fundamentals (DNS, UDP, TCP/IP, HTTP (s), routing, switching) 
- Experience owning services that are secure, scalable, reliable and efficient. Can identify multiple operational and security risks and then resolve, mitigate and/or escalate them 
Preferred Qualifications
- Bachelor’s Degree in Systems Engineering, Computer Science or related field, or relevant work experience 
- Exposure to cloud computing concepts and design considerations 
- Experience in a 24x7 production environment 
- Experience of monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar) 
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. 
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. 
#J-18808-Ljbffr