
Allen Zhu
Technology / Internet
Services offered
9 years of experiences in ML, with a focus on distributed machine learning. Proficient in the PyTorch ecosystem, with hands-on experience building enterprise-level machine learning platforms from the ground up. Led an ASR model project that achieved a 25% improvement in CER. Skilled in federated learning as well as Python and SQL.
Experience
Senior Machine Learning Engineer with over 8 years of experience in large language models, distributed systems, and privacy-preserving computation. Led the design and deployment of scalable LLM and federated learning platforms using Ray, BentoML, and Kubernetes, and post-trained ASR models with RLHF for significant performance gains. Experienced in building end-to-end machine learning systems—from big data processing and heterogeneous scheduling to intuitive drag-and-drop modeling tools—and managing cross-functional algorithm teams. Skilled in Python, PyTorch, Spark, and cloud-native ML infrastructure.
Education
Master’s degree in Neuroscience from The Chinese University of Hong Kong
Professionals in the same Technology / Internet sector as Allen Zhu
Professionals from different sectors near Santa Clara, Santa Clara
Other users who are called Allen
Jobs near Santa Clara, Santa Clara
-
· As a Machine Learning Engineer you'll work alongside the data annotation platform team dedicated to addressing our evolving annotation data needs for training and evaluating LLMs. We value expertise in data science paired with a robust data engineering foundation. Together we' ...
Santa Clara4 hours ago
-
We're seeking a Machine Learning Engineer to design and implement scalable AI/ML solutions that drive product innovation and business insights. · Design, build, and deploy ML applications and pipelines on Google Cloud Platform. · Develop and integrate ML and NLP/SLM solutions, in ...
San Jose3 weeks ago
-
We are looking for experienced Machine Learning engineers to deploy and tune both traditional ML and generative AI models to enhance our systems' capability. · Implementing high-performance machine learning models and infrastructure in concert with software engineers, designers, ...
Cupertino1 week ago