Bilingual Large Model Inference Acceleration Engineer - San Francisco Bay Area

Only for registered members San Francisco Bay Area, United States

1 week ago

Job summary

We are seeking an experienced AI model optimization engineer specializing in large model inference acceleration.

Responsibilities

Design and optimize large model inference pipelines for low-latency and high-throughput production deployments
Benchmark and profile deep learning models to identify performance bottlenecks

Job description

Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.

Get full access

Access all high-level positions and get the job of your dreams.

Similar jobs

Work in company

Software Engineer, Accelerators

Only for registered members

Help OpenAI evaluate and bring up new compute platforms that can support large-scale AI training and inference. · ...

San Francisco

6 days ago

Work in company

GPU Accelerated Bioinformatics Engineer

Only for registered members

+We are seeking a GPU Accelerated Bioinformatics Engineer to join our team at Prima Mente. As an engineer not an analyst you will thrive pushing the boundaries of performance and efficiency and enjoy architecting the systems and making multiple fast technical decisions. · +Design ...

San Francisco

1 month ago

Work in company

GPU Accelerated Bioinformatics Engineer

Only for registered members

We are a frontier biology AI lab that generates our own data, builds general purpose biological foundation models, and translates discoveries into research and clinical outcomes. We're looking for an engineer to architect, build, and own scalable production pipelines that process ...

San Francisco Full time

1 month ago

Work in company

Software Engineer, Research Acceleration

Only for registered members

We're looking for engineers to build the libraries and tools that accelerate research at Thinking Machines. · ...

San Francisco $350,000 - $475,000 (USD) Full time

2 weeks ago

Work in company

GPU/CPU Accelerated Bioinformatics Engineer

Only for registered members

We generate our own data, build general purpose biological foundation models, · and translate discoveries into research and clinical outcomes. · In this role you will architect, · build, and own scalable production pipelines that process multi-omics data for AI model development. ...

San Francisco

1 month ago

Work in company

Research Infrastructure Engineer, Research Acceleration

Only for registered members

We're looking for engineers to build libraries and tools that accelerate research at Thinking Machines. · Design, build, and operate research infrastructure including evaluation frameworks. · Develop high-throughput pipelines for distributed evaluation. · ...

San Francisco

1 week ago

Work in company

Software Engineer, Accelerator Build Infrastructure

Only for registered members

A systems-level engineer specializing in build infrastructure and low-level systems optimization. · Expert-level proficiency with build/packaging systems. · Nix experience in particular is a huge plus. · ...

San Francisco, CA

1 week ago

Work in company

Bilingual Large Model Inference Acceleration Engineer

Only for registered members

+Job summary · We are seeking an experienced AI model optimization engineer specializing in large model inference acceleration. · +Design and optimize large model inference pipelines for low-latency and high-throughput production deployments. · Benchmark and profile deep learning ...

San Francisco

1 week ago

Work in company

Bilingual Large Model Inference Acceleration Engineer

Only for registered members

Weare an AI platform engineering team building large-scale end-to-end AI production pipelines covering model training optimization deployment and real-world applications. · We are seeking an experienced AI model optimization engineer specializing in large model inference accelera ...

San Francisco

1 week ago

Work in company

Bilingual Large Model Training Acceleration Engineer

Only for registered members

We are an AI platform engineering group focused on large-scale model training systems and performance acceleration. · Optimize large model training pipelines for performance and scalability · Design and improve distributed training systems · ...

San Francisco

1 week ago

Work in company

Machine Learning Engineer, Payments ML Accelerator

Only for registered members

Stripe is a financial infrastructure platform for businesses. Millions of companies—from the world's largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their revenue and accelerate new business opportunities. · We are looking for ML Engineers w ...

San Francisco, CA

2 weeks ago

Work in company

Machine Learning Engineer, Payments ML Accelerator

Only for registered members

+A Machine Learning Engineer will develop advanced ML solutions that directly impact Stripe's payment products and core business metrics. · +Design and deploy deep learning architectures and foundation models to address problems across key payment entities such as merchants, issu ...

San Francisco $212,000 - $318,000 (USD)

1 week ago

Work in company

Principal Software Engineer, AI Accelerated SDLC

Only for registered members

We re a next-generation financial services company and national bank using innovative mobile-first technology to help our millions of members reach their goals. We re proud to come to work every day knowing that what we do has a direct impact on people s lives with our core value ...

San Francisco

1 month ago

Work in company

Staff Software Engineer, AI Accelerated SDLC

Only for registered members

We are looking for a seasoned Staff Software Engineer to join our Builder Tools engineering organization with a mission to enable SoFi engineers to elegantly solve problems. · Technical leadership - Provide leadership for technical architecture and design, implementation, deliver ...

San Francisco, CA

1 month ago

Work in company

Accelerator Electronics Engineer

Only for registered members

We're here for the same mission – to bring science solutions for the world. Join our team and YOU will play a supporting role in our goal to address global challenges Have a high level of impact and work for an organization associated with 17 Nobel Prizes · We invest in our emplo ...

Berkeley $114,000 - $139,000 (USD)

1 month ago

Work in company

Senior Staff Software Engineer, AI Accelerated SDLC

Only for registered members

We are looking for an experienced Senior Staff Software Engineer to join our Builder Tools engineering organization with a mission to enable SoFi engineers to elegantly solve problems. · Join us to invest in yourself, your career, and the financial world.The Role: · We are lookin ...

San Francisco, CA

1 month ago

Work in company

Component Quality Engineer, AI Accelerator

Only for registered members

Meta is seeking a Component Quality Engineer responsible for working with Meta's suppliers and manufacturing partners to develop and implement quality programs for AI accelerators used in Meta AI/ML and Compute platforms. · ...

Fremont $144,000 - $204,000 (USD) Full time

1 month ago

Work in company

Component Quality Engineer, AI Accelerator

Only for registered members

· ...

Fremont $144,000 - $204,000 (USD)

3 weeks ago

Work in company

Component Quality Engineer, AI Accelerator

Only for registered members

Responsible for managing end-to-end quality. · Working closely with suppliers and internal teams to assure new product readiness. · ...

Fremont $144,000 - $204,000 (USD)

1 month ago

Work in company

Mechanical Engineer, SSRL Accelerator Division

Only for registered members

The Stanford Synchrotron Radiation Lightsource (SSRL), a directorate of SLAC National Accelerator Laboratory, is a national Department of Energy Office of Science user facility providing synchrotron radiation for research in biology, chemistry, engineering, environmental science, ...

Menlo Park, CA

2 weeks ago

Work in company

Mechanical Engineer, SSRL Accelerator Division

Only for registered members

The Stanford Synchrotron Radiation Lightsource (SSRL), a directorate of SLAC National Accelerator Laboratory, seeks a skilled mechanical engineer to expand the group¿s capabilities and to fulfill the group¿s mission. · ...

Menlo Park, CA

2 weeks ago

Bilingual Large Model Inference Acceleration Engineer - San Francisco Bay Area

Job summary

Responsibilities

Job description

Similar jobs

Software Engineer, Accelerators

GPU Accelerated Bioinformatics Engineer

GPU Accelerated Bioinformatics Engineer

Software Engineer, Research Acceleration

GPU/CPU Accelerated Bioinformatics Engineer

Research Infrastructure Engineer, Research Acceleration

Software Engineer, Accelerator Build Infrastructure

Bilingual Large Model Inference Acceleration Engineer

Bilingual Large Model Inference Acceleration Engineer

Bilingual Large Model Training Acceleration Engineer

Machine Learning Engineer, Payments ML Accelerator

Machine Learning Engineer, Payments ML Accelerator

Principal Software Engineer, AI Accelerated SDLC

Staff Software Engineer, AI Accelerated SDLC

Accelerator Electronics Engineer

Senior Staff Software Engineer, AI Accelerated SDLC

Component Quality Engineer, AI Accelerator

Component Quality Engineer, AI Accelerator

Component Quality Engineer, AI Accelerator

Mechanical Engineer, SSRL Accelerator Division

Mechanical Engineer, SSRL Accelerator Division

Directory

for Recruiters

Information