Principal Site Reliability Engineer (SRE)

Sedang Trending 3 hari yang lalu

There is simply a spot for you astatine T. Rowe Price to grow, contribute, learn, and marque a difference.?? We are a premier?asset?manager?focused connected delivering planetary concern absorption excellence and status services that investors tin trust connected contiguous and successful the future. The enactment we bash matters. We invitation you to research the accidental to articulation america and turn your vocation with us.

Job Title:?Principal Site Reliability Engineer (SRE) 

Department:?CDO Technology Group 

Summary: 

We are seeking a highly motivated and experienced Principal Site Reliability Engineer (SRE) to articulation the CDO Technology enactment squad to basal up and pb the SRE relation wrong CDO Technology. In this role, you volition beryllium liable for ensuring the availability, latency, performance, efficiency, and stableness of our captious infrastructure, which supports a scope of information platforms, applications, and services. You volition collaborate intimately with improvement teams to instrumentality and support reliable and scalable systems portion adhering to manufacture champion practices and information standards. 

Responsibilities: 

Availability: 

  • Proactively show and proactively place imaginable issues that could interaction the availability of our systems.
  • Implement and support automated alerting mechanisms to notify the due parties of imaginable outages oregon show degradation.
  • Collaborate with improvement teams to plan and instrumentality solutions that heighten strategy resilience and trim downtime.

Latency:

  • Analyze show metrics to place and resoluteness latency bottlenecks successful our infrastructure.
  • Implement show optimization techniques and tools to amended the wide responsiveness of our systems.
  • Work with improvement teams to guarantee that caller features and codification changes bash not present show regressions.

Performance:

  • Develop and support metrics dashboards to way cardinal show indicators (KPIs) for our captious systems.
  • Identify show trends and anomalies that whitethorn bespeak imaginable issues oregon areas for improvement.
  • Recommend and instrumentality show optimization strategies to heighten the wide ratio of our systems.

Efficiency

  • Optimize assets utilization and minimize unnecessary expenditure connected IT infrastructure.
  • Identify and instrumentality cost-effective solutions to amended the ratio of our IT operations.

Release Management: 

  • Design and instrumentality automated deployment and rollback procedures to mitigate risks associated with bundle updates.
  • Monitor the show of caller releases and code immoderate issues that originate promptly.
  • Lead the squad that executes the merchandise management.

Monitoring:

  • Design, implement, and support a broad monitoring infrastructure to way the wellness and show of our systems.
  • Analyze monitoring information to place imaginable issues and proactively troubleshoot problems earlier they impact
  • Develop and instrumentality alerts and notifications for captious events to guarantee timely

Emergency Response: 

  • Build and pb the squad that responds promptly to incidents and works collaboratively to resoluteness them successful a timely manner.
  • Analyze basal causes of incidents to place and instrumentality preventive measures to minimize their recurrence.
  • Document incidental responses and pass lessons learned to heighten our incidental handling processes.
  • Collaborate with your peers connected the enactment squad to specify a multi-year method roadmap. Stay up to day with manufacture developments and endeavor infrastructure, and expect important risks.
  • Work with improvement teams to reappraisal architecture plan to guarantee precocious availability and due catastrophe betterment strategy
  • Collaborate with reliability and infrastructure engineering squad successful T Rowe Price to physique synergy successful tooling for the implementation of observability, tracing, and alerting

Qualifications: 

  • Bachelor's grade successful Computer Science, Information Technology, oregon a related tract preferred.
  • 10+ years of acquisition arsenic a Site Reliability Engineer oregon equivalent successful a akin role.
  • Proven acquisition successful monitoring, analyzing, and optimizing the show of large-scale distributed systems.
  • Expertise successful Linux systems administration, including managing servers, operating systems, and web configurations.
  • Strong scripting and automation skills, preferably with acquisition successful Bash, Python, oregon akin languages.
  • Familiarity with AWS.
  • Experience with DevOps tools and practices, specified arsenic GitLab CI/CD, and Docker.
  • Excellent troubleshooting and problem-solving skills with a knack for identifying and resolving analyzable method issues.
  • Ability to enactment independently and arsenic portion of a collaborative team, efficaciously communicating method concepts to some method and non-technical stakeholders.
  • A passionateness for maintaining precocious availability, performance, and reliability of captious systems successful a fast-paced fiscal environment.

Benefits: 

  • Competitive wage and broad benefits package.
  • Opportunity to enactment with cutting-edge technologies and lend to the improvement of innovative solutions.
  • Collaborative and supportive enactment situation with a absorption connected continuous learning and nonrecreational development.
  • Rowe Price operates a hybrid moving exemplary with a minimum of 2 days per week successful the London bureau expected

Commitment to Diversity, Equity, and Inclusion:

We strive for equity, equality, and accidental for each associates. When we clasp the powerfulness of diverseness and make an situation wherever radical tin bring their authentic and champion selves to work, our steadfast is stronger, and we make greater worth for our clients. Our committedness and inclusive programming purpose to assistance the acquisition for each subordinate and builds allies for our planetary subordinate community. We cognize that a consciousness of belonging is cardinal not lone to your occurrence astatine the firm, but besides to your quality to bring your champion each day.

T. Rowe Price is an adjacent accidental leader and values diverseness of thought, gender, and race. We judge our continued occurrence depends upon the adjacent attraction of each associates and applicants for employment without favoritism connected the ground of race, religion, creed, colour, nationalist origin, sex, gender, age, intelligence oregon carnal disability, marital status, intersexual orientation, sex individuality oregon expression, citizenship status, subject oregon seasoned status, pregnancy, oregon immoderate different classification protected by country, federal, state, oregon section law.

Atas