Oracle Senior Site Reliability Engineer in Chesterfield, Missouri
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Develop architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
Work with the Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate a clear understanding of automation and orchestration principles. Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Use a deep understanding of service topologies and their dependencies to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to develop a deep understanding of services and technologies.
A BS or MS in Computer Science, or equivalent. Knowledge of server hardware and software configuration, networking, standard internet services, scripting languages, cloud computing patterns, and technology security and compliance. Understanding of load balancing technologies and experience with development in programming languages, databases and big data stores, and container technologies. Work involves defining and documenting the technical architecture of complex and highly scalable products. A minimum of 5 years of experience running large-scale, customer-facing web services.
Oracle is an Affirmative Action-Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, protected veteran status, age, or any other characteristic protected by law.
Oracle Information Technology is seeking a Software Reliability Engineer with 5 to 10 years of experience to work with our Compute team. The successful candidate will use their hands-on operational experience to identify areas that can be automated and then design, build, implement, and support solutions that improve operational efficiency.
Responsibilities will include working with a global operational support team and a team of front-end UI/UX and back-end API developers to provide a complete solution. You will work with other development teams to integrate multiple applications into a cohesive whole. Scaling applications to large user counts and very large data and resource requirements will be a regular challenge.
In addition, this role is responsible for complex problem resolution, creating and improving procedures and facilitating communication. Other duties include researching, proofing, and authoring technical documentation. This is a great career opportunity for a highly motivated individual who wants to extend and utilize their solid and diverse skills.
Responsibilities
Develop new user-facing features
Develop APIs for consumption within UI frameworks
Build reusable code and libraries for future use
Create automated unit and functional tests
Ensure the technical feasibility of UI/UX designs
Optimize applications for maximum speed and scalability
Ensure that all user input is validated
Collaborate with other team members and stakeholders
Skills and Qualifications
Linux server administration
Proficient in two of three areas: front-end development, back-end development and cloud infrastructure automation
Proficient in Python for back-end development
Proficient in cloud technologies
Clear understanding of web technologies such as Oracle JET, AngularJS, Web Services, and REST
Good understanding of database languages such as SQL and PL/SQL
Basic understanding of Oracle RDBMS and MySQL
Good understanding of asynchronous request handling, multithreading and multiprocessing
Good understanding of machine learning and artificial intelligence
Basic understanding of web markup, including HTML5, CSS3
Proficient with code versioning tools, such as Git, Mercurial or Subversion
Good understanding of Agile software development principles including using common tools such as JIRA
MS Windows experience a plus
DevOps experience a plus
5 to 10 years of development experience
Experience with Development Operations or Site Reliability Engineering
The work can be demanding at times, particularly as deadlines approach, and extra hours may be required depending on the candidate's capacity to deliver.
Bachelor's degree in science or engineering (computer science preferred)
Job: Information Technology
Title: Senior Site Reliability Engineer
Location: United States
Requisition ID: 20000YTU