IT Operations Manager - San Francisco, United States - Dodge & Cox

    Description


    The IT Operations Manager will work with development teams to design, plan, create and implement software so that Cloud and on-premise infrastructure and applications are integrated and optimized for performance.

    This person will be responsible for oversight of systems' availability 24*7 for on premise, Cloud or Vendor hosted solutions as per the defined SLAs (service level agreements).

    The person hired will be expected to manage critical incidents to resolution working with internal teams and third party vendors.

    This person will help manage, evolve and govern our Change Management process, and validate that any changes adhere to the firm's software development lifecycle policy, do not conflict with one another and do not, in the aggregate, increase risk beyond our established tolerance levels.

    Responsibilities

    Cloud Governance

    Work with Cloud Platform Engineering, Cloud Application Development and Data Platform development teams to operationalize new capabilities as per Dodge & Cox standard.

    Partner with the Development team to design, plan, create and implement software so systems are designed for appropriate availability and performance.

    Help evolve our DevSecOps processes to follow our software development lifecycle.

    Build and maintain cloud operations procedures and policies.

    Observability & Monitoring

    Ensure our Cloud and on-premise systems have the right monitoring tools, so we can identify potential problems early on and fix them proactively.

    Manage offshore and onshore resources to monitor our systems 24*7.

    Ensure our contractors and vendors have documented, detailed procedures for resolving issues.

    Monitor proper execution of nightly job activities.

    Maintain escalation procedures and run books for systems to ensure timely resolution of incidents 24*7.

    Provide oversight on development, implementation, maintenance and monitoring of highly secured cloud infrastructure.

    Develop SLAs (service level agreements) and measure each system's availability against its benchmark.

    Cloud & On-premise Operations

    Scale the infrastructure to accommodate growing user demands while maintaining performance and reliability.

    Identify areas for improvement and advocate for and prioritize development to improve our foundation.

    Maintain the firm's asset inventory and tightly manage any changes to the asset inventory.
    Maintain and expand on our outage, incidents, service requests and change ticket metrics. Identify and prioritize projects that will reduce volume and improve user experience.

    Crisis Management

    Serve as a Crisis Manager in a disaster recovery event by facilitating failover to a secondary or tertiary location for on premise, Cloud and/or vendor hosted solutions.

    Identify business impacts and be the point person in facilitating resolution on any high severity incident including trading, Cloud, cybersecurity, and data incidents.

    Track root cause identification, conduct a lessons-learned review and share the findings after each high severity incident.

    Coordinate with Business Continuity Manager on Business Continuity events and provide timely updates to the SIRT (Strategic Incident Response Team).

    Change Management Governance

    Help evolve our Change Management process to support DevOps.

    Ensure IT teams follow our software development policies, processes and industry best practices.

    Ensure implementation of next-gen tools to build the software release cycle in the Cloud, such that they follow our policies, procedures and industry best practices.

    Ensure system reliability is not impacted by system releases.

    Required skills and qualifications

    5+ years in IT / Cloud operations management role and 15+ years in IT sector.

    3+ years of working knowledge of Azure Cloud preferable.

    10 years of financial services/asset management experience.

    Experience managing critical firm-impacting incidents.

    Experience managing incidents and service request queue and asset inventory.

    Experience managing release management lifecycle for Infrastructure as Code and Applications both on premise and in the Cloud.

    Experience implementing observability and monitoring solutions.

    Great communication skills to clearly articulate and guide resolution to problems in high stress situations.

    Technical knowledge

    Knowledge of Azure Cloud Platform including VPC, Load Balancer, and WAF.

    Microsoft 365 Platform, including Exchange, SharePoint, OneDrive and Office 365, Teams, Power Automate.

    Knowledge of Cisco Networking.

    Windows and Linux Operating Systems, SQL Server and Active Directory.

    PowerShell scripting, Terraform, Ansible.

    ServiceNow incident workflow and CMDB (configuration management database).

    Workflow automation with Power Automate, Workato or like tools.

    Familiar with Converged Infrastructure, VMware, VxBlock, VDI.

    Knowledge of Azure Virtualization Platforms.

    Enterprise Backup, Data Replication and restoration experience in Business Continuity or Disaster Recovery events.

    Experience developing DevSecOps CI/CD pipeline.

    CRM Dynamics 365 know-how.

    Requirements:
    Dodge & Cox has a 3/2 hybrid work model, and all Dodge & Cox employees are required to be in their assigned office, as noted in the job posting, Tuesday through Thursday each week, with the option to work remotely on Monday and Friday.
    The salary range for this position is $150k - $170k.

    The listed pay scale denotes only the pay range of the base salary and does not include discretionary bonus compensation, which may make up an important portion of the total remuneration.

    Dodge & Cox encourages applicants to consider the value of the many competitive benefits it offers, including coverage of 100% of all healthcare premiums for employees and their families and a fully funded retirement plan at 25% of total compensation, up to the IRS limit.

    Dodge & Cox also provides additional benefits such as commuter, health & wellness, backup care, matching gift, employee assistance, and life and disability insurance.

    The listed pay scale reflects the base salary Dodge & Cox reasonably expects to pay for this position and is not a reflection of the highest and lowest base salary of any current Dodge & Cox employee.

    Actual base salary will be based on factors such as the candidate's prior relevant experience (including within and external to Dodge & Cox, as applicable), education, skills, and knowledge.

    The job description above is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee.

    It is the Company's policy to provide equal opportunity to all persons without regard to race, color, religion, sex, pregnancy, marital or domestic partner status, sexual orientation, gender identity or expression, age, ancestry, national origin, disability, or medical condition, as defined in state and federal laws.

    This policy covers all aspects of employment including, but not limited to, recruitment, selection, training, promotion, transfer, compensation, demotion, and termination.

    By applying for a position with Dodge & Cox, you acknowledge that you have read our EEO Policy.

    All Dodge & Cox employees must adhere to the Firm's security policies and Code of Ethics.

    Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
