We are Azure Data , a Microsoft team that drives the future of data processing in the Microsoft Cloud. Our software development team, located in Belgrade, is building some of the most advanced and widely used data processing cloud services in the world. The services we build are based on groundbreaking technology and are global market leaders, with millions of active users.
Azure SQL Database is our flagship database service powering millions of applications, ranging from small shops to large mission critical business workloads. Benefits of instant scaling and serverless mode, as well as intelligent performance tuning and security are available to all applications, whether modernized on-premises workloads or apps natively built for the cloud. Azure SQL Managed Instance is our customer's first choice service for migrating existing SQL Server instances from on-premises data centers to the cloud. Thousands of customers have easily migrated their apps to this service and got all the benefits of automated database management (patching, backups, high-availability, security). In aggregate, our cloud services house petabytes of customer data, and manage hundreds of thousands of machines with millions of CPU cores running billions of user database queries. Keeping them run smooth and highly available round the clock, while satisfying huge scale is a daunting challenge, one that we enjoy tackling.
We are looking for a Senior Software Engineer to join our site reliability team and closely collaborate with our software engineers, support teams, and other roles that ensure a great experience to our customers. Running software as a service means more than just developing and releasing features. Ensuring reliability and serviceability is critical part of the software cycle. This is where you come into the picture. As an SRE, you will ensure the service of Azure SQL Database or Managed Instance run smoothly with required reliability and availability. You will design and implement software to automatically resolve issues. You will work closely with feature teams to design, implement and release features that are reliable and serviceable. You will be a cross-domain expert who has a holistic view of our cloud service.
This is an opportunity to work with some of the best engineers in the industry to continue to innovate and deliver Azure SQL Database and Managed Instance for the Cloud. The challenges span the entire stack of database technology - connectivity, high availability, programming language, query processing, transaction processing and data management involving 100s of nodes. You will learn what it takes to deploy and run software as a 24x7 enterprise grade cloud service!
Responsibilities
Act as subject matter expert for configuring, troubleshooting and monitoring Azure Database/Managed Instance services.
Identify opportunities and implement automation to resolve and reduce live-site incidents.
Design and implement solutions to improve service health, manageability, reliability and telemetry.
Design and implement diagnostics infrastructure to improve product quality and customer satisfaction.
Own, triage, investigate, and resolve service issues with emphasis on broad communications, learning, and teaching throughout the process.
Author and maintain functional and technical documentation. Define and maintain process and procedures to run enterprise service.
Mentor, develop and deliver training.
Ability to meet on-call responsibilities periodically.
Qualifications
Required Qualifications:
Extensive experience in software development
Deep understanding of Windows Operating System level concepts such as processes, threading, memory allocation, and the network stack; understanding of how applications are affected by the above, and ability to debug the same.
Experience in working with multiple teams, including coordinating large projects, communicating well, and exercising initiative when presented with problems.
Proficient programming skills using managed code such as C#/Java. Ability to read native C/C++ code to debug issues and find answers not documented.
Preferred Qualifications:
Troubleshooting experience in SQL Database or SQL Managed instance or a similar database product.
Previous experience and proficiency in using debugging tools such as Windbg and Xperf to debug user dumps or live applications.
Skilled with troubleshooting approaches such as a Bayesian network.
BS/MS in Computer Science, Engineering and/or equivalent industry experience is preferred.
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations (https://careers.microsoft.com/v2/global/en/accessibility.html) .