Missouri Information Technology Jobs

Jobs.mo.gov mobile logo

Job Information

UST Global Inc Associate III Cloud Infrastructure Services in Saint Louis, Missouri

Site Reliability Engineer - SREDescription: Company is looking for Site Reliability Engineer to manage end to end application and system stack and to work with one of the leading financial services organization in the US. Site Reliability Engineering SRE is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations.SRE is also an engineering approach to building and running production systems engineer solutions to operational problems. As SREs are responsible for overall system operation, utilizing a breadth of tools and approaches to solve a broad set of problems. Practices such as limiting time spent on operational work, blameless postmortems, proactive identification, and prevention of potential outages.Responsibilities:As a Site Reliability Engineer, Act as first responders supporting L1 and in certain cases L2 issues You will engage in and improve the software development lifecycle from inception and design, through development, deployment, operation and refinement Develop and maintain the large-scale infrastructure Partner with the development teams, to help them improve the scalability and reliability the services they own Own build tools and CI/CD automation pipeline You will influence and design infrastructure, architecture, standards and methods for large-scale systems You will support services prior to production via infrastructure design, software platform development, load testing, capacity planning and launch reviews You will maintain services during deployment and in production by measuring and monitoring key performance and service level indicators including availability, latency, and overall system health You will automate system scalability and continually work to improve system resiliency, performance and efficiency Investigate, diagnose, and resolve performance and reliability problems in a wide range of large-scale and high-throughput services Collaborate with architects and application engineers to ensure applications are maintainable, scalable, and follow appropriate disaster recovery and high availability strategies Contributions to handbook, runbooks, and general documentation You will remediate tasks within corrective action plan via sustainable, preventative, and automated measures whenever possibleRequirements: BS degree in Computer Science or related technical field, or equivalent job experience required Over 4 years of SRE/Production Support experience Experience in DevOps and CI/CD pipelines and build tools like Jenkins. Experience in software development in one or more of the following: C#, ASP.NET, MVC, Genesys, MS SQL, Oracle , Captiva, Cannon OCR Experience in Scanning and Imaging domain or Contact center applicationsIVR is a plus Experience working with cloud technologies is preferred Google Cloud, Genesys Cloud Must have great communication skills Experience operating a production environment at high scale with emphasis on availability, latency Knowledge of container orchestration tools such as Docker, Kubernetes Familiar with configuration management tools and Deployment tools such as Chef, Octopus Strong team player with a can do attitude, and the flexibility to jump in wherever needed Demonstrable cross-functional knowledge with systems, storage, networking, security and databases System administration skills, including automation and orchestration of Linux/Windows using Chef, Puppet, Ansible, Salt Stack and/or containers Docker, Kubernetes, etc. Proficiency with continuous integration and continuous delivery tooling and practices Strong analytical and troubleshooting skills Extra Points for any of the following: You have expertise designing, analyzing and troubleshooting large-scale distributed systems. You take a system problem-solving approach, coupled with strong communication skills and a sense of ownersh