| Company Description Ingram Barge is a quality marine transporter on America's inland waterways since 1946, starting out as a small, family-owned business and growing into what we are today: the largest dry cargo carrier and one of the top chemical carriers on the river.As the leading carrier, we operate a fleet of approximately 140 towboats and 5,000 barges.We're committed to being the best at whatever we do. We're continuously growing, adapting, and responding, and we're successful because of the outstanding hard work and creative energy of our associates.Job Description Ingram Barge is seekinga Senior DevOps Engineer to join our dynamicDevSecOps team. This person will work alongsideour Systems Architect, Application Development Architect, and SecurityEngineer and focuses on operationalizing our cloud-native infrastructure, enhancing CI/CD pipelines, ensuring system reliability and resilience, and providing24x7 operational support. What you will be doing: Pipeline& Automation 
 Designing and implementing advanced CI/CD pipelinefeatures using GitLab 
 Developing and maintaining Terraform modules for infrastructure provisioning 
 Creatingand optimizingAnsible playbooks for configurationmanagement and deployment automation 
 Integratingsecurity scanning and compliance checksinto deployment pipelines Container& Kubernetes Operations 
 Building, configuring, and maintaining Azure Kubernetes Service (AKS) clusters 
 Developing and optimizingHelm charts for applicationdeployments 
 Implementing and managingGitOps workflowsMonitoringand troubleshooting containerized applications and cluster performance Infrastructure & Reliability 
 Implementing Infrastructure as Code best practices using Terraformand Ansible 
 Designing and executingdisaster recovery procedures and business continuity plans 
 Performing system patching, upgrades, and maintenance activities 
 Establishing and maintaining comprehensive monitoring, alerting, and observability solutions using Prometheus and Grafana Cost Optimization & ResourceManagement 
 Monitoring and analyzingAzurecloud spending patterns and resource utilization 
 Implementing cost optimization strategies including right-sizing, reserved instances, and auto-scaling policies 
 Developing dashboards and reports forcost tracking and forecasting 
 Collaboratingwith teams to optimize resource allocation and eliminatingwaste Monitoring & Observability 
 Designing and implementing comprehensive monitoringsolutions using Prometheus for metrics collection 
 Building and maintaining Grafana dashboards for infrastructure, application, and business metrics 
 Configuringintelligent alerting rules and escalation procedures 
 Establishing SLIs, SLOs, and errorbudgets for critical services 24x7 Support & IncidentResponse 
 Participatingin on-call rotation for 24x7 production support 
 Leading Tier 3 incident response efforts for production outages and systemissues 
 Performing root cause analysis and implementing preventive measures 
 Collaboratingwith development teams onperformance optimization and troubleshooting 
 QualificationsMaintaining runbooks and documentation foroperational procedures Knowledge, Skills, and Abilities: Technical Expertise(5+ years) 
 Strong experience with Kubernetes(AKS preferred) and container orchestration 
 Proficiency in Infrastructure as Code: Terraform and Ansible 
 Advanced GitLab CI/CDpipeline development and optimization 
 Experience with GitOps methodologies and leading toolsets like Helm, Flux and/or ArgoCD 
 Pythonscripting for automation and pipeline tasks 
 Azurecloud services and networking concepts Monitoring& Cost Management 
 Hands-on experience withPrometheus for metrics collection and alerting 
 Proficiency in Grafana for dashboard creation and data visualization 
 Experience with AzureCost Management tools and FinOps practices 
 Knowledge of resource optimization techniques and auto-scaling strategies 
 Understanding of cloud pricing models and cost allocation methods DevOps & SRE Practices 
 Incidentmanagement and post-mortem processes 
 24x7on-call experience withescalation procedures 
 Disaster recovery planningand implementation 
 Securitybest practices in CI/CD and infrastructure 
 Experience with chaosengineering and resilience testing Collaborative Skills 
 Experience working with cross-functional teams 
 Strongtroubleshooting and problem-solving abilities under pressure 
 Documentation and knowledge sharing practices 
 Comfortable with24x7 on-call rotation responsibilities Preferred Qualifications 
 Azure certifications (AZ-104, AZ-400, or AKS-related) 
 Experiencewith message bus systems (AzureService Bus) 
 Knowledge of.NET applications and Angular frontend deployments 
 Familiarity withsecret management solutions(Delinea or similar) 
 Experience with additionalmonitoring tools (Azure Monitor, Application Insights) 
 FinOps certificationor cost optimization experience 
 Additional InformationExperience with alerting tools and PagerDutyintegration Why You Should Apply: 
 Professional and financial growth opportunitiesMedical benefitsRetirement benefits All your information will be kept confidential according to EEO guidelines. If you are requesting reasonable accommodation or disability assistance in submitting your application, you may email us at Recruiting@ingrambarge.com Ingram Marine Group and its affiliates ("Company") is an Affirmative Action/Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, age, work related mental or physical disability, veteran status, sexual orientation, gender identity, or genetic information. EEO/AA Employer/Vet/Disabled We participate in EVerify http://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeopost.pdf http://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeosp.pdf |