Research Intern – Multimodal Foundation Model for Vision - New York
4 hours ago

Job description
Sony Corporation of America, located in New York, NY, is the U.S. headquarters of Sony Group Corporation, based in Tokyo, Japan. Sony's principal U.S. businesses include Sony Electronics Inc., Sony Interactive Entertainment LLC, Sony Music Entertainment, Sony Music Publishing and Sony Pictures Entertainment Inc. With some 900 million Sony devices in hands and homes worldwide today, a vast array of Sony movies, television shows and music, and the PlayStation Network, Sony creates and delivers more entertainment experiences to more people than anyone else on earth.
Research Intern – Multimodal Foundation Model for Vision
Sony AI is seeking research interns. Our team conducts fundamental and applied research aimed at building next-generation foundation models for vision in a responsible manner. As a research intern, you will develop efficient and effective methodologies and prototype solutions. You will work with a productive team of world-class scientists and engineers to tackle the most challenging problems in foundation models and generative AI, including low-cost yet powerful vision foundation models (VFM), vision-language models (VLM), unified models, automatic model compression, optimization, and deployment on cloud and edge. You will see your ideas not only published in papers but also improving the experience of billions of customers.
Roles And Responsibilities
- Conduct fundamental and innovative development in low-cost yet powerful vision-language models (VLM), unified models, automatic model compression, optimization, and deployment on cloud and edge.
- Design or implement state-of-the-art techniques for model compression, inference speedup, deployment on hardware, and tool automation.
- Build proofs of concept (PoC) for various vision+text and generation-related tasks (VQA, captioning, understanding, etc.) and hardware platforms.
- Contribute to library and tool development to support the business, or publish influential research in top-tier conferences and journals.
Required Qualifications And Skills
- Currently has, or is in the process of obtaining, a master's or PhD degree in computer science or a related field.
- Be very self-motivated and capable of proposing and implementing innovative ideas.
- Solid presentation and communication skills to internal and external audiences.
- Publications or expertise in compact foundation model development and deployment. Influential open-source projects or paper publication at top conferences, e.g., CVPR, ICCV, ECCV, NeurIPS, ICML, ACL, etc.
- Front-end development experience is a plus.
- Solid coding skills in Python, PyTorch, etc.
Working Location
Location flexible (Tokyo, Europe, US)
The target hourly rate for this internship is $50.00 per hour. The individual will be paid hourly and eligible for overtime.
All qualified applicants will receive consideration for employment without regard to any basis protected by applicable federal, state, or local law, ordinance, or regulation.
Disability Accommodation for Applicants to Sony Corporation of America
Sony Corporation of America provides reasonable accommodation for qualified individuals with disabilities and disabled veterans in job application procedures. For reasonable accommodation requests, please contact us by email at or by mail to: Sony Corporation of America, Human Resources Department, 25 Madison Avenue, New York, NY. Please indicate the position you are applying for.
We are aware that unauthorized individuals or organizations may attempt to solicit personal information or payments from job applicants by impersonating our company through fraudulent job postings. We take these matters seriously but cannot control third-party websites. To protect your personal information, please verify that any job posting you respond to also appears on our official Careers page: Please also be advised that we never request personal identifying information (such as Social Security numbers, bank details, or copies of identification documents) during the initial stages of our application process. If you have any doubts about the authenticity of a job posting or communication, please contact before submitting any information.
Right to Work (English/Spanish)
E-Verify Participation (English/Spanish)