Staff Software Engineer, AI Data, Multimodal - Sunnyvale, CA, USA
2 days ago

Job description
Minimum qualifications:
- Bachelor's degree or equivalent practical experience.
- 8 years of experience in software development.
- 5 years of experience testing and launching software products, and 3 years of experience with software design and architecture.
- 5 years of experience with ML design and ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning).
- 5 years of experience with full-stack development across the back end, such as Java, Python, Golang, or C++ codebases, and the front end.
Preferred qualifications:
- Master's degree or PhD in Engineering, Computer Science, or a related technical field.
- 8 years of experience with data structures and algorithms.
- 3 years of experience working in an organization involving cross-functional or cross-business projects.
- Experience with innovation of technology at scale and passion for development and the use of cross-platform shared code.
- Understanding of ML systems and infrastructure for production with technical knowledge to be credible with customers and engineers.
- Understanding of genAI model development workflows for post-training and product fine-tuning, especially multimodal and generative media.
About the job
Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.
In this role, you will build systems to secure high-quality data and also improve velocity for ML researchers and product developers to use the data efficiently for model training/fine-tuning and product adoption.
The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.
We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud's Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.
The US base salary range for this full-time position is $197,000-$291,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.
Responsibilities
- Act as a technical leadership and IC work in building and enhancing Google-wide infrastructure for multimodal and generative media use cases.
- Design and build scalable, general-purpose multimodal data tooling and infrastructure to support a wide range of research and product areas for multimodal understanding (Gemini) and multimodal generation (Nano Banana, Veo).
- Take on issues in multimodal data such as data sourcing, data sampling, evaluation automation, and loss analysis.
- Build and optimize infrastructure for developing and deploying multimodal signals and autoraters, including tools for prompting, large-scale inference, visualization, and evaluation.
- Address unique and emerging technical issues of the rapidly evolving field of multimodal AI, working to create stable and impactful solutions for a dynamic landscape.
Similar jobs
+Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for ...
1 month ago
We're looking for engineers who bring fresh ideas from all areas to develop the next-generation technologies that change how billions of users connect and interact with information and one another. · We need our engineers to be versatile display leadership qualities and be enthus ...
1 week ago
Senior Data Engineer with 6 years of experience in data modeling and scalable data pipelines using Python and SQL. Expertise in multimodal machine learning development for data curation and annotation. · ...
2 weeks ago
We are seeking a Senior Data Engineer to join our team in Sunnyvale, CA. The successful candidate will have 6 years of experience in software development on data modeling, scalable data pipeline to process image&video data, and full-stack development using Python. · ...
1 week ago
A globally leading consumer device company is looking for Data Engineer to join their team. · Data Pipeline Engineering: Building efficient and scalable data pipelines to process multimodal data on cloud platform and local clusters. · ...
3 weeks ago
We are continuously advancing the state of the art in Computer Vision and Machine Learning, touching all aspects of language and multimodal foundation models, from data collection, data curation to modeling, evaluation and deployment. · Comprehensive medical and dental coverage · ...
1 month ago
We are continuously advancing the state of the art in Computer Vision and Machine Learning, touching all aspects of language and multimodal foundation models. · ...
1 month ago
We are seeking an exceptional and highly-motivated Engineer to lead our team's data quality and curation process across a variety of core products and technologies. Our team helps develop multimodal data modeling validation, and curation process and pipeline. · ...
1 month ago
We're starting to see the incredible potential of multimodal foundation and large language models, · and many applications in the computer vision and machine learning domain that previously appeared infeasible are now within reach.Description · You'll work on ground breaking re ...
1 month ago
We are seeking an exceptional and highly-motivated Engineer to lead our team's data quality and curation process across a variety of core products and technologies. · In this role, you will be responsible for the quality and accessibility of the multimodal data generated from var ...
2 weeks ago
We are seeking a highly motivated and skilled senior Applied Research Engineer to join our team. The ideal candidate will have a strong background in developing and exploring capabilities of foundation models and multimodal large language models that integrate various types of da ...
1 week ago
+We're starting to see the incredible potential of multimodal foundation and large language models, and many applications in the computer vision and machine learning domain that previously appeared infeasible are now within reach. · +* Work on ground breaking research projects to ...
1 week ago
We are looking for individuals who thrive on collaboration and have a desire to push the boundaries of what is possible today The Video Computer Vision org is a centralized applied research and engineering organization responsible for developing real-time on-device Computer Visio ...
3 weeks ago
Do you have a passion for computer vision and solving deep learning problems? · The Video Engineering Data Analytics and Quality group is seeking an expert in evaluating machine learning and deep learning models, · including foundation models and multimodal systems.This role wil ...
1 month ago
+ Model Design & Development: Developing, implementing and enhancing multimodal foundation models · + Model Evaluation: Collaborating closely with cross-functional partners to define data and infrastructure requirements crucial for robust evaluation of model designs and developm ...
2 weeks ago
We are looking for Language Engineers to join our science and engineering team to support the development of complex, multimodal datasets using synthetic data generation, · model-supported data generation, · and human-in-the-loop data collections.Design complex data collections w ...
2 days ago
We're building the next generation of AI systems — combining classic computer vision with cutting-edge multimodal LLMs. None of this is possible without the right data. · ...
1 month ago
The Video Computer Vision organization is working on breakthrough technologies for future Apple products. In this role, you will collaborate with world-class experts in AI, ML, Software, and Hardware to tackle fundamental challenges in human-centric solutions that will impact mil ...
1 week ago
We are launching AV Labs to accelerate the autonomous technology ecosystem. · Design and implement cutting-edge techniques for autonomous vehicle systems. · Collaborate closely with engineering teams and product teams across AV Labs to integrate modeling approaches into productio ...
1 week ago
We're starting to see the incredible potential of multimodal foundation and large language models, and many applications in the computer vision and machine learning domain that previously appeared infeasible are now within reach. · ...
2 weeks ago