New
Senior Software Engineer
Microsoft | |
United States, Texas, Irving | |
7000 State Highway 161 (Show on map) | |
Nov 28, 2024 | |
OverviewIn Azure Specialized we work collaboratively to bring the next generation of workloads to our Public Cloud platform. We work together across Microsoft to enable end to end new scenarios for Azure customers. Our team imagines and builds differentiating customer features and fundamental building blocks at the heart of the Azure platform working collaboratively with many industry partners.As a Senior Software Engineer, you will be critical in designing and delivering the next generations of High Performance Computing (HPC) to enable a wide variety of customer workloads including weather prediction, electronic design attestation, computational fluid dynamics and more. You will be challenged across a wide spectrum of hardware architectures, network types and processor types. You will part of delivering an end-to-end vertical view, with continuous focus on customer value, quality, performance and automation. This position involves deep technical work, focusing on defining, deploying and sustaining hardware and software Azure infrastructure for HPC workloads. The work for this position focuses on hardware/software interaction, coding and playing with next-gen hardware, end-to-end systems engineering anywhere in the infrastructure - - CPU differentiation, networking, switches, rack design, cluster design and more to come up with the best offering for the customer. This position offers a unique opportunity to have a huge impact on customers and the world. It is an exciting time for the team as we are working on expanding the capacity and range of supported scenarios to support the next 100X growth.Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.#azurecore
ResponsibilitiesWe are looking for someone who is passionate about quality, wants the customer to succeed and get things done. You will join a phenomenal team of hardworking engineers with deep experience with replication systems, highly available systems, large scale algorithms, dynamic and high-performance solutions at massive scale. Your mission will be to help ensure Azure platform is consistent on performance, can scale on-demand, and engineered to withstand the unparalleled computing demand from the customer workloads. You will help building a test-driven engineering culture to reduce regressions and bugs in production and will set a higher bar for infrastructure quality. The following values drive us: Willing to dive deeply into any level or layer of a problem.Willing to learn emerging technologies, from hardware to software. Evaluate and make recommendations that advance Azure infrastructure for AI and other GPU-based workloads.Leads by example within the team by producing extensible and maintainable code. Optimizes, debugs, refactors, and reuses code to improve performance and maintainability, effectiveness, and return on investment (ROI). Applies metrics to drive the quality and stability of code, as well asappropriate codingpatterns and best practices.Maintains communication with key partners across the Microsoft ecosystem of engineers. Ensures alignment with partners' expectations. Considers partner teams across organizations and their end goals for products to drive and achieve desirable user experiences and fitting dynamic needs of partners/customers through product development.Drives identification of dependencies and the development of design documents for a product, application, service, or platform.Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI).Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate. |