Boson AI is an early-stage startup building large language tools for interaction and entertainment. Our founders, Alex Smola and Mu Li, and a team of deep learning, optimization, NLP, AutoML, and statistics scientists and engineers are working on high-quality generative AI models for language and beyond.
We are seeking research scientists and engineers to join our team full-time in our Santa Clara office. In this role, you will work on modeling and training LLMs, understanding and interpreting model behavior, and aligning models to human values. The ideal candidate has a strong background in machine learning and is motivated to develop state-of-the-art models toward AGI.
Responsibilities
Design and verify novel model architectures and training objectives
Investigate novel model alignment algorithms
Write efficient and clean code for ML training
Conduct large-scale experiments to verify the modeling choices and identify improvement areas
Summarize results and clearly communicate the motivations and observations in your work
Experience
Proficiency in at least one deep learning framework, such as PyTorch
Participation in at least one research project related to LLMs or multimodal models, e.g. training or fine-tuning them
Experience in alignment research
Experience in large-scale distributed model training
Experience in writing GPU kernels in CUDA
Qualifications
PhD or Master's degree with solid scientific contributions
Active GitHub repository
Active scientific track record
Excellent problem-solving skills
Total compensation includes base pay, equity, and benefits. We offer a 401(k) plan, HSA, FSA, and free food (even dried mangoes).
Seniority level
Not Applicable
Employment type
Full-time
Job function
Engineering and Information Technology
Industries
Transportation, Logistics, Supply Chain and Storage