Job Title: L3/Lead SME - Azure & AWS Support
Location: Bridgewater, NJ, USA
Job Type: Full-Time
Job Summary: We are seeking a highly skilled and experienced L3/Lead SME to join our team, focusing on Azure (75%) and AWS (25%) support. The ideal candidate will be responsible for managing and supporting cloud infrastructure, ensuring optimal performance, security, and availability. This role involves a mix of proactive monitoring, incident management, and performance tuning to support our critical cloud environments.
Key Responsibilities:
Azure Support:
- Perform all operations in Azure (deploy, delete, configure, update) using client-approved methods (e.g., Bicep templates, Azure DevOps pipelines).
- Manage Azure IaaS, including Virtual Machines (deployment, configuration, sizing, start/stop, snapshot management, backup/restore, disk management).
- Conduct initial OS configuration and setup.
- Monitor core health metrics (disk space utilization, CPU utilization, memory utilization).
- Configure local administrator credentials and store them in Key Vault.
- Set up relevant alerts and remediations as agreed upon by the client Azure Team.
- Manage SQL Servers on Virtual Machines (excluding SQL Server installation/configuration).
- Deploy and configure Azure PaaS services (Storage Accounts, Web Apps, Logic Apps, Key Vaults, etc.).
- Ensure resources are deployed via VNet integration or private endpoint.
- Configure Diagnostic Settings for log retention.
- Monitor resource health and configure backups/geographic replication.
- Set up load testing and availability monitoring.
- Configure and monitor Defender for Cloud and Secure Score.
- Maintain Azure Policy and Custom Roles via Azure DevOps.
- Monitor resource cost utilization and recommend adjustments.
- Create and manage Entra ID groups for RBAC purposes.
- Liaise with Directory Services team for DNS records.
- Provide daily, weekly, monthly, quarterly, and annual reports on health incidents and resource changes.
- Track individual activities using an agreed-upon method.
- Manage supplier support for incidents with MS Azure.
- Perform root cause analysis (RCA) and publish post-incident reviews.
- Leverage Azure native automation technologies (Automation Accounts, Logic Apps, Functions).
- Ensure proper closure of incident/change/service request records.
- Assist in troubleshooting infrastructure issues during application migration.
- Coordinate integration of cloud infrastructure with security tools and remediation support.
AWS Support:
- Monitor disk space utilization, memory utilization, and CPU utilization.
- Monitor event logs, backup jobs, and custom metrics.
- Perform daily health checks of infrastructure, endpoints, and critical services.
- Manage EBS volumes, EC2 instances, and S3 buckets.
- Add EC2 instances to and remove them from ELBs, and upload SSL certificates to ELBs.
- Launch and resize EC2 instances.
- Schedule EBS volume snapshots and enable detailed monitoring.
- Create RDS instances, restore DB snapshots, and configure RDS security groups.
- Upload content to S3 and manage object permissions.
- Configure strong password policies and use IAM for access control.
- Apply policies to S3 buckets and IAM users.
- Create VPCs, subnets, NACLs, internet gateways, route tables, NAT gateways, and VPN connections.
- Create DB instances and clusters.
- Create IAM users and roles, attach policies, and enable MFA.
- Create Lambda functions and CloudFormation templates.
- Assist in troubleshooting during application migration.
- Coordinate integration of cloud infrastructure with security tools.
- Remediate cloud infrastructure compliance violations reported by Dome9.
PaaS Services:
- Manage Azure App Services, Azure AD Domain Services, CDN Profiles, ExpressRoute, Application Insights, PowerShell Runbooks, StorSimple Device Manager, Azure SQL Server, RDS, Redshift, AWS Managed AD, and S3.
Desired Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Proven experience in Azure and AWS cloud environments.
- Strong knowledge of cloud infrastructure management and automation.
- Experience with performance tuning, backup and recovery, and cloud security.
- Familiarity with ITIL processes and best practices.
- Excellent problem-solving and communication skills.
- Ability to work independently and as part of a team.
Preferred Skills:
- Certification in Azure and/or AWS.
- Experience with Azure DevOps and Bicep Templates.
- Knowledge of automation tools like Logic Apps, Automation Accounts, and Functions.
Salary and Other Compensation:
Applications will be accepted until May 8, 2025.
- This position is also eligible for Cognizant's discretionary annual incentive program, based on performance and subject to the terms of Cognizant's applicable plans.
Benefits: Cognizant offers the following benefits for this position, subject to applicable eligibility requirements:
- Medical/Dental/Vision/Life Insurance
- Paid holidays plus Paid Time Off
- 401(k) plan and contributions
- Long-term/Short-term Disability
- Paid Parental Leave
- Employee Stock Purchase Plan
Disclaimer: The salary, other compensation, and benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law.