FastAPI + GPU

Only for registered members United States

4 days ago

Default job background
We have an existing AI solution running in the cloud (Django + PyTorch + NVIDIA GPU). · We want to convert this into a local Windows-based AI service using: · FastAPI (instead of Django) · TensorRT (for NVIDIA GPU inference) · ONNX export pipeline · Windows service deployment · E ...
Lorem ipsum dolor sit amet
, consectetur adipiscing elit. Nullam tempor vestibulum ex, eget consequat quam pellentesque vel. Etiam congue sed elit nec elementum. Morbi diam metus, rutrum id eleifend ac, porta in lectus. Sed scelerisque a augue et ornare.

Donec lacinia nisi nec odio ultricies imperdiet.
Morbi a dolor dignissim, tristique enim et, semper lacus. Morbi laoreet sollicitudin justo eget eleifend. Donec felis augue, accumsan in dapibus a, mattis sed ligula.

Vestibulum at aliquet erat. Curabitur rhoncus urna vitae quam suscipit
, at pulvinar turpis lacinia. Mauris magna sem, dignissim finibus fermentum ac, placerat at ex. Pellentesque aliquet, lorem pulvinar mollis ornare, orci turpis fermentum urna, non ullamcorper ligula enim a ante. Duis dolor est, consectetur ut sapien lacinia, tempor condimentum purus.
Get full access

Access all high-level positions and get the job of your dreams.



Similar jobs

  • Work in company Remote job

    GPU Workload Execution Framework Development

    Only for registered members

    We are building a GPU-accelerated workload orchestration prototype on Rocky Linux. A senior Linux engineer is needed to implement and harden a GPU job execution framework. · ...

    $650 - $0 (USD) budget

    3 weeks ago

  • Work in company Remote job

    Deploy Llama 3.1

    Only for registered members

    I need an expert to set up our backend inference engine. · Deploy a GPU-optimized instance on RunPod, Lambda Labs or AWS. · ...

    $600 - $0 (USD) budget

    3 weeks ago

  • Work in company Remote job

    Deploy Embedding Model

    Only for registered members

    We are looking for an experienced ML / MLOps engineer to deploy a production-ready self-hosted embedding inference service. · ...

    2 weeks ago

  • Work in company Remote job

    Senior Machine Learning Engineer

    Only for registered members

    We are hiring a Senior Machine Learning Engineer to design deploy and operate production-grade AI systems that run entirely on local infrastructure.This role focuses on reliability auditability predictable latency and safe operation under real-world constraints. · ...

    $1,000 - $0 (USD) budget

    1 month ago

  • Work in company Remote job

    QA Software Intern

    Only for registered members

    We are seeking a QA Software Intern to support the testing and validation of FastAPI-based microservices powering an AI platform. · ...

    1 month ago

  • Work in company Remote job

    AI Integration Engineer

    Only for registered members

    We are hiring an AI Integration Engineer to integrate MedGemma into our on-prem Hospital Information Management System (ZypoCare by Zyposoft). · You will build/extend an AI microservice (FastAPI preferred) to provide context-aware clinical drafting and a screen-aware AI Copilot a ...

    $90,000 - $160,000 (USD) per year

    3 days ago

  • Work in company Remote job

    Long-Term AI Engineer Needed

    Only for registered members

    Long-term AI engineer needed to build and deploy machine learning systems. · Training and fine-tuning ML models (CNNs, Transformers) · Building and optimizing RAG systems (custom retrievers) · ...

    $30 - $40 (USD) per hour

    2 weeks ago

  • Work in company Remote job

    ML Engineer — LLM MVP

    Only for registered members

    We are looking for an ML Engineer to help build an MVP using a self-hosted LLM on our own GPUs in Dubai. · ...

    Part time

    1 month ago

  • Work in company Remote job

    Senior ML Engineer – Real-Time Object Detection

    Only for registered members

    We are seeking a highly experienced Senior Machine Learning Engineer to design, build, and deploy a production-grade real-time object detection system. · This role requires strong expertise in deep learning (PyTorch), computer vision, backend API development (FastAPI), containeri ...

    $500 - $0 (USD) budget

    1 week ago

  • Work in company

    Software Engineer

    Only for registered members

    GPU Compiler QA | San Diego, California | On-site | $133,600 - $200,400 · We're working with a world-class mobile GPU innovator on this exciting opportunity. · Join a fast-moving engineering team building compiler test infrastructure for cutting-edge mobile GPU hardware. You'll w ...

    San Diego $75,000 - $140,000 (USD) per year

    3 days ago

  • Work in company

    AI/ML Engineer with Python and GCP experience

    Only for registered members

    We are seeking an experienced AI/ML Engineer to join our dynamic team in Charlotte, NC. · This hybrid role offers an engaging work environment with 3 days onsite and 2 days working from home. Our projects focus on pioneering AI technologies and solutions, · providing ample opport ...

    Charlotte

    1 month ago

  • Work in company Remote job

    Full-Stack AI Engineer – Virtual Try-On

    Only for registered members

    We are seeking a skilled and motivated Full-Stack AI Engineer to design, develop and deploy a web-based AI Virtual Try-On (VTON) Platform. The successful candidate will be responsible for delivering a complete production-ready solution within two months including frontend backend ...

    $3,000 - $0 (USD) budget

    3 weeks ago

  • We are looking for an experienced AI / ML Engineer to help us build a system that generates high-quality artistic QR codes while keeping them fully scannable. · ...

    $250 - $0 (USD) budget

    1 month ago

  • I need a Principal Architect/Engineer to build the "skeleton" of a real-time Voice AI agent. · ...

    $5,000 - $0 (USD) budget

    1 week ago

  • Work in company Remote job

    Senior AI Engineer

    Only for registered members

    We are building a large-scale security and attendance system for a university campus. We need a core AI Inference Engine that can process multiple parallel IP camera streams to detect recognize verify faces against a database of 25000 to 50000 identities. · Handle 4-8 parallel RT ...

    3 weeks ago

  • Work in company Remote job

    EDI Portal

    Only for registered members

    EDI Portal that will manage EDI files with Admin panel where I can enter the EDI ISA and GS ID for my client sometime multiple per customer with their partner EDI ID and which type of files. · manage EDI files · ...

    1 month ago

  • Work in company Remote job

    Oceanographic or Lagrangian models

    Only for registered members

    We are a geospatial software company based in Ireland looking to establish contact with a specialist developer or data scientist for an upcoming environmental project. · We need someone with specific experience in visualizing the results of oceanographic models, particularly Lagr ...

    2 days ago

  • Work in company Remote job

    Object Detection Implementation to GPU Server

    Only for registered members

    We receive MP4 video files from our systems in the field and have them stored in our server. We have a website where we have people review the videos that contain boats. They are looking for plant material on the boats.Our goal is to implement object detection on our Modal GPU se ...

    1 week ago

  • Work in company

    Senior AI Engineer – Data and AI

    Only for registered members

    Overview · It is an exciting time to join Fisher Investments We are expanding our Generative AI capabilities to support our international growth. We are looking for a Senior AI Engineer who will design, build, advanced AI applications using large language models, GPU‑accelerated ...

    Plano Full time

    17 hours ago

  • Work in company

    Senior Software Engineer, Compute Platform

    Only for registered members

    Moonlite delivers high-performance AI infrastructure. We provide infrastructure deployed in our facilities or co-located in yours, delivering flexible on-demand or reserved compute that feels like an extension of your existing data center. · ...

    Chicago $165,000 - $225,000 (USD) Full time

    1 month ago