Cloud Operations Lead
Lineage
Lineage builds state-of-the-art systems to make our employees productive, to meet and exceed our customers’ expectations, to assist in our growth, and to create an operating platform that enables Lineage to achieve its vision of transforming the global food supply chain. This is one of the most critical roles in that journey, and it will require someone who has a true passion for technology.
The role is part of the runtime organizations Datacenter Operations team (DCOps). Lineage is a cloud “only” company with all front-line applications being hosted in the cloud (loosely defined in Lineage as any AWS services). The DCOps team owns the core infrastructure and platform for Lineage globally, we build the foundation for the transformation and data science teams to build their platforms to support business operations. The DCOps team is customer focused and manages the support and operations via an MSP partnership. The MSP is responsible for the day-to-day operations including handling incidents, change and escalations. DCOps is governance of that MSP responsibilities as well as working on continuous improvement, problem resolution and new initiatives.
This role is primarily responsible for managing the runtime environment inside the Lineage AWS organization. Duties include ensuring the performance of the MSP, project development, identifying efficiencies (CI), cost management (FinOps), problem resolution, as well as to function as the SME to the runtime organization. To be successful, the cloud Operations Lead will need to be capable of overseeing all aspects of technology tied into AWS, Azure, and other cloud platforms as they are adopted. A successful candidate will need to be capable to analyze current infrastructure technology used then develop roadmaps to improve and expand upon them. Iterating designs to increase the quality, reliability and service of the infrastructure. Although the team relies on the MSP to perform the day-to-day operations, this role will be expected to be hands on during escalation incidents.
Primary Responsibilities:
Infrastructure Design and Deployment: Planning, designing, and implementing cloud infrastructure architectures, ensuring scalability, availability, and reliability.
System Administration: Managing and maintaining virtual machines, storage, databases, and other cloud resources. Performing system monitoring, optimization, and troubleshooting.
Security and Compliance: Implementing security measures to protect cloud environments, including network security, access controls, and data encryption. Ensuring compliance with industry standards and regulations.
Performance Optimization: Identifying and resolving performance bottlenecks, optimizing resource utilization, and recommending infrastructure improvements.
Automation and Scripting: Developing scripts and automation workflows to streamline administrative tasks, infrastructure provisioning, and deployments.
Incident Management: Responding to and resolving escalated incidents, performing root cause analysis, and implementing preventive measures.
Escalations: From time-to-time large outages require our best people to be pulled into calls, while this is infrequent it is expected that this position is available when needed to help on major incidents.
Backup and Disaster Recovery: Implementing and managing organization backup strategies and disaster recovery plans to ensure business continuity.
Managing access across the entire Cloud Organization to ensure standardized access levels.
Collaboration and Documentation: Collaborating with cross-functional teams, providing technical guidance and support. Documenting system configurations, processes, and procedures.
Travel: less than 10% with potential international travel required (Europe & Asia).
Migration & Optimization: Moving legacy systems to the cloud and optimizing cost/performance.
Technical Enablement: Delivering workshops, demos, and whitepapers.
Education and Work Experience:
University degree in Information Technology, or Business Administration and/or equivalent work experience.
Minimum 5 years of experience as senior engineer/architect.
Minimum 4 years of experience with AWS cloud services.
Exceptional verbal, written and interpersonal communication skills; including the ability to communicate effectively across all levels of the organization.
Experience working in a matrixed global organization where success requires broad orchestration of resources and services.
Strong understanding of the business impact of IT tools, technologies, and policies.
Required Knowledge, Skills, and Abilities:
Cloud Foundations and Services (VMware (SDDC), AWS, and Azure).
Expert knowledge of AWS, System Administration, Networking and Security.
Expert in Cloud Governance.
Containers (Docker and Kubernetes).
Automation tools (Terraform / Ansible).
Experience with one or more scripting languages.
Experience at leading teams or resources.
Excellent communication and presentation skills.
#LI-Rremote
Why Lineage?
This is an excellent position to begin your career path within Lineage! Success in this role enables greater responsibilities and promotions! A career at Lineage starts with learning about our business and how each team member plays a part each and every day to satisfy our customers’ requirements. Beyond that, you’ll help us grow and learn on our journey to be the very best employer in our industry. We’ll ask you for your opinion and ensure we do our part to keep you developing and engaged as we grow our business. Working at Lineage is energizing and enjoyable. We value respect and care about our team members.
Lineage is an Equal Employment Opportunity Employer and is committed to compliance with all federal, state, and local laws that prohibit workplace discrimination and unlawful harassment and retaliation. Lineage will not discriminate against any applicant on the basis of race, color, age, national origin, religion, physical or mental disability or any other protected status under federal, state and local law.
Benefits
Lineage provides safe, stable, reliable work environments, medical, dental, and basic life and disability insurance benefits, 401k retirement plan, paid time off, annual bonus eligibility, and a minimum of 7 holidays throughout the calendar year.