
Shubham Solanki
Technology / Internet
About Shubham Solanki:
AI/ Backend Engineer with 3 years of experience specializing in production LLM systems, RAG pipelines, and model serving infrastructure.Expert in the full on-premise AI deployment stack Ollama, LangChain, vector search, embedding generation, inference optimization, and real-time retrieval built and monitored on GPU infrastructure.
Experience
AI / LLM Backend Engineer with 3+ years of experience building production-grade AI systems and scalable backend infrastructure. Specialized in on-prem LLM deployment, RAG pipelines, inference optimization, and GPU-based model serving using Ollama, LangChain, and FastAPI. Proven track record of reducing latency, cutting cloud costs, and deploying reliable AI systems supporting 100+ concurrent users in live environments.
Education
M.S. in Applied Data Science (Computer Science focus),
State University of New York at Binghamton, Dec 2025.
Bachelor of Engineering in Computer Science,
Savitribai Phule Pune University, May 2022.
Professionals in the same Technology / Internet sector as Shubham Solanki
Professionals from different sectors near San Francisco, San Francisco County
Other users who are called Shubham
Jobs near San Francisco, San Francisco County
-
This is a full-time on-site Healthcare Data Scientist position in the San Francisco Bay Area. · ...
San Francisco2 weeks ago
-
· We're looking for researchers with LLM expertise to join us on working with data at scale and to push beyond the data ceiling. · Our team contributes to data curation across all stages of LLM development (pre-training, mid-training, post-training) and all domains/modalities (e ...
Menlo Park1 month ago
-
· ...
Menlo Park, CA1 month ago