Data Ingestion Research Engineer: Scale Web Crawling - San Francisco - Y-Axis

    Y-Axis
    Y-Axis San Francisco

    1 week ago

    $320,000 - $405,000 (USD) per year
    Description

    A leading AI research organization based in San Francisco is seeking an experienced Research Engineer to develop a large-scale web crawler and ensure data quality.

    This unique role combines both engineering and data research, emphasizing the importance of building strong data processing systems for cutting-edge AI applications.

    The position offers a competitive salary between $320,000 and $405,000, along with a diverse and inclusive work culture that values communication and collaboration.

  • Work in company

    Software Engineer, Web Crawling

    Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop super high performant vector databases in Rust to search over it. We also own a $5M ...

    San Francisco, California

    1 hour ago

  • Work in company

    Research Engineer, Generalist

    Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop super high performant vector databases in Rust to search over it. We also own a $5M ...

    San Francisco, California

    1 hour ago

  • Work in company

    Member of Technical Staff, Data Curation

    The Role · We seek experienced engineers and scientists to shape how we collect, process, and curate the datasets that power our models. You'll combine engineering expertise with research insight to build scalable data pipelines, develop synthetic data generation techniques, and ...

    San Francisco Full time

    1 day ago

  • Work in company

    Member of Technical Staff, Data Infrastructure

    The Role · We seek experienced engineers to architect and scale the core infrastructure behind distributed training pipelines and petabyte-scale data catalogs. You'll work directly with researchers to accelerate experiments, develop new datasets, improve infrastructure efficiency ...

    San Francisco Full time

    1 day ago

  • Work in company Remote job

    Software Engineer

    MixRank processes petabytes of data every month from web crawling. · We have hundreds of customers using our data products, including Google, Amazon, Facebook, Intel, and Adobe, across industries: sales, marketing, finance, security. · Team is 47+ full-time, full-remote, from 20+ coun ...

    San Francisco, CA

    1 month ago

  • Work in company

    Software Engineer, Distributed Data Systems

    Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop super high performant vector databases in Rust to search over it. We also own a $5M ...

    San Francisco, California

    52 minutes ago

  • Work in company

    Software Engineer

    MixRank processes petabytes of data every month through web crawling. We are a growing, profitable, employee-owned company. Applicants from all countries and cultures are welcome. · Develop software for web applications. Design and implement ...

    San Francisco

    1 month ago

  • Work in company

    Research Scientist

    We are bringing a new web to life: it's built with, by, and for AIs. Our work spans innovations across crawling, indexing, ranking, retrieval, and reasoning systems. · You will answer the question: how do you train and scale a model that can serve a web index? · ...

    San Francisco

    1 month ago

  • Work in company

    Data Scientist

    Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform. Backed by multi-million-dollar funding and direct sponsorship from AMD, with hands-on support from AMD engineers, the team is scaling rapid ...

    San Francisco $85,000 - $150,000 (USD) per year

    1 hour ago

  • Work in company

    Founding Engineer

    We help companies get mentioned in ChatGPT by rebuilding the internet for AI agents. · Our product helps companies like NVIDIA, Rho, and Exa get cited by ChatGPT, Gemini, and Perplexity. · You'll join as a founding engineer to design experiments and systems that uncover how AI ag ...

    San Francisco $90,000 - $150,000 (USD)

    2 days ago

  • Work in company

    Full-Stack Software Engineer

    Responsibilities · Build new features on our website and mobile app. (Our stack is JavaScript (TypeScript): modern React on the web, React Native on mobile, and on the server.) · Design and decide what to build based on what would help travelers and drive growth. You won't be han ...

    San Francisco $80,000 - $120,000 (USD) per year

    6 days ago

  • Work in company

    Full-Stack Software Engineer

    Responsibilities · Build new features on our website and mobile app. (Our stack is JavaScript (TypeScript): modern React on the web, React Native on mobile, and on the server.) · Design and decide what to build based on what would help travelers and drive growth. You won't be ...

    San Francisco, California, United States $80,000 - $150,000 (USD) per year

    26 minutes ago

  • Work in company

    Member of Technical Staff

    Our Mission · Reflection's mission is to build open superintelligence and make it accessible to all. · We're developing open-weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI ...

    San Francisco, CA

    1 week ago

  • Work in company

    Member of Technical Staff, Back-end

    We are bringing a new web to life: it's built with, by, and for AIs. Our work spans innovations across crawling, indexing, ranking. · You will build and maintain our backend that already handles millions of custom requests daily. · ...

    San Francisco

    1 month ago

  • Work in company

    Software Engineer, Intern

    Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop super high performant vector databases in Rust to search over it. We also own a $5M ...

    San Francisco, California $64,000 - $96,000 (USD) per year

    29 minutes ago

  • Work in company

    Member of Technical Staff

    We are building open superintelligence and making it accessible to all. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond. · You will own the machinery that acquires, extracts, normalizes, versions, ...

    San Francisco Full time

    1 month ago

  • Work in company

    Member of Technical Staff, Front-end

    We are bringing a new web to life: it's built with, by, and for AIs. Our first product is a set of APIs for AIs to do more with web data. · ...

    San Francisco

    1 month ago

  • Work in company

    Software Engineer, Backend

    Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop super high performant vector databases in Rust to search over it. We also own a $5M ...

    San Francisco, California $90,000 - $160,000 (USD) per year

    1 hour ago

  • Work in company

    Member of Technical Staff

    Caddie is hiring on behalf of a stealth-mode AI company building the taste layer for artificial intelligence. · ...

    San Francisco

    1 month ago

  • Work in company

    Research, ML

    Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop super high performant vector databases in Rust to search over it. We also own a $5M ...

    San Francisco, California

    1 hour ago

  • Work in company

    Member of Technical Staff

    This is an early Member of Technical Staff role with a strong product engineering focus. You'll span backend systems, AI workflows, and user-facing product experiences. · Familiarity with AI- or LLM-powered systems. · Experience building or maintaining CI/CD systems and developer ...

    San Francisco, CA

    1 month ago
