Technical Program Manager, Core Infra - Menlo Park
1 day ago

Job description
Meta's Core Infrastructure team seeks a Technical Program Manager (TPM) to lead complex, large-scale projects focused on advancing language model scaling.
In this key position, you will collaborate across engineering, hardware, data center, research, and product teams to design, build, and scale foundational OSS software systems, and tools that support Meta's AI innovation.
This role sits at the intersection of building Public Cloud and system software integration, requiring technical expertise across OSS systems, Inference and foundations compute systems.
You will be responsible for driving the end-to-end integration of new hardware and core infra stack, from initial design validation of our software stack through production deployment across both data centers and public cloud.
This includes developing and refining repeatable frameworks for efficient onboarding, ensuring robust and predictable execution, and proactively resolving technical and organizational challenges to maintain project momentum.
You will use your problem-solving, technical acumen, and business insight to streamline onboarding of new hardware platforms into Meta's suite of core infrastructure services.
You will communicate transparently across all levels, motivate multidisciplinary teams, and champion best practices to deliver impactful outcomes that advance Meta's infrastructure.
Technical Program Manager, Core Infra Responsibilities:
Establish and lead effective program teams to ensure alignment and achieve common objectives
Partner closely with engineering, cloud engineering, data center, hardware and business stakeholders to define program requirements, prioritize initiatives, and establish scope, including shaping the roadmap and long-term strategy for partner teams
Develop and implement communication strategies to proactively share program status, challenges, and risks with stakeholders
Drive successful outcomes by actively managing cross-functional dependencies, data center & public cloud vendor tech milestones, mitigating risks, and adjusting scope, timeline, and resources as needed
Collaborate with cross-functional teams to lead the end-to-end lifecycle of programs, including technical analysis, design, development, testing, implementation, and post-launch support
Establish and track key metrics, quality benchmarks, and performance indicators to drive accountability and ensure effective cross-functional execution of program deliverables
Anticipate and evaluate complex, long-term infrastructure challenges in close partnership with engineering leaders and key stakeholders
Drive product strategy to support and align with key company initiatives such region and public cloud turn-ups
Lead process improvements across internal and external teams, streamlining workflows and reducing manual effort through automation
Minimum Qualifications:
10+ years of experience in software engineering, hardware engineering, systems engineering, or technical product/program management for large scale programs
Demonstrated knowledge of software and hardware development for large scale data center readiness, including end-to-end product development processes
Excel at clearly communicating complex technical investments in a simple and understandable manner
Experience delivering complex technology programs and products from inception through to successful delivery
Knowledge of understanding user needs, gathering requirements, and defining project scope
Experience working under your own initiative, across multiple teams, demonstrating critical thinking and providing thought leadership in ambiguous spaces
Experience defining and optimizing engineering processes and public cloud vendor technical milestones at scale
Excel at building cross-functional relationships, thrive amid complex challenges, excel at clearly communicating complex technical investments in a simple and understandable manner
Demonstrated experience of identifying new opportunities for the larger organization and influencing the appropriate stakeholders
Preferred Qualifications:
Advanced knowledge of Open Source Software such as Kubernetes development for large-scale systems for inference
About Meta:
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world.
Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology.
People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer.We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.
We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-$168,000/year to $234,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location.
Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable.
In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.Similar jobs
· ...
2 days ago
Meta's Core Infrastructure team seeks a Technical Program Manager (TPM) to lead complex, large-scale projects focused on advancing language model scaling. In this key position, you will collaborate across engineering, hardware, data center, research, and product teams to design, ...
7 hours ago
We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward. · As a Staff Software Engineer, you will deliver the core data needs for the rapidly evolving conversat ...
1 month ago
We are looking for engineers who bring fresh ideas from all areas to develop the next-generation technologies that change how billions of users connect, explore and interact with information and one another. · ...
1 month ago
+ We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security artificial intelligence natural language processing UI design and mobile the list goes on a ...
1 month ago
We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design... · Provide technical leadership on high-impact projects. · Influence and coach a distributed team of engineers. · ...
1 month ago
Facebook's Feed and Video Ranking & Recommendation team seeks a Technical Program Manager to lead complex large-scale programs advancing the state-of-the-art in machine learning and AI systems for content ranking and recommendation. · You will collaborate across engineering resea ...
1 month ago
The Impact You'll Be Contributing to Moloco: As the Head of Machine Learning, you'll be the senior technical lead for ML modeling stack—owning core CTR/CVR and retrieval models that directly move ad spend and revenue for our commerce media customers. · You will lead a team of ind ...
4 weeks ago
The Head of Machine Learning will lead a team of industry ML experts and shape how we design, ship, and maintain production ML across multiple platforms. · ...
4 weeks ago
Monetization Infra and Ranking Production Engineering's mission is to champion reliability scalability and security that enables us to make every impression meaningful and trustworthy. · ...
1 month ago
Moloco is seeking a Head of Machine Learning to lead technical strategy and direction for MCM ML. · ...
4 weeks ago
We are seeking a Head of Machine Learning to lead our team in developing and maintaining cutting-edge AI solutions for advertising growth and monetization. · Lead technical strategy and direction for MCM ML. · Owning end‑to‑end model lifecycle (design, offline eval, deployment, r ...
4 weeks ago
· About Moloco: · Moloco builds some of the most powerful AI advertising solutions in the world. Our name—short for "machine learning company"—reflects our core mission: democratizing access to the advanced AI that has historically been reserved for tech giants. Led by machine l ...
2 days ago
+ Drive reliability and capacity foundation efforts for Monetization Infra PE + As the organization lead you will be coaching the team partnering across the broader organization and driving large initiatives in bleeding edge problem spaces + Champion reliability scalability perfo ...
1 month ago
Partner with internal stakeholders to understand business requirements. · ...
1 week ago
Monetization Infra and Ranking Production Engineering's mission is to champion reliability scalability and security that enables us to make every impression meaningful and trustworthy. · We enable people and businesses to connect in a positive and reliable way by building robust ...
1 month ago
Champion reliability scalability performance and security improvements for our core products and services Enable people and businesses to connect in a positive and reliable way by building robust infrastructure and services across our platform and products Measure and improve eff ...
4 weeks ago
The Impact You'll Be Contributing to MolocoMachine Learning Engineer focused on ML Infra, you will drive the development and optimization of scalable machine learning infrastructure directly enhancing the performance and reliability of our Ads product line. · ...
2 weeks ago
· Establish and lead effective program teams to ensure alignment and achieve common objectives · Work closely with engineering, data center, hardware and business stakeholders to define program requirements, · ...
4 weeks ago
This role is about developing the core PyTorch 2.0 technologies innovating and advancing the state-of-the-art of ML compilers accelerating PT2 adoption through direct engagements with OSS and industry users. · ...
1 week ago