Top Benefits
A variety of fantastic health benefits (health, dental, vision insurance; life insurance etc)
A 401k plan with up to 5% match
Free tax advice on Carta
About the role
Who you are
- If you’re passionate about shaping the future of AI and creating tools that make a real difference in people’s lives, we want you on our team
- A passion for staying up to date with the latest literature in the field, as well as prior research experience is a plus
- Solid understanding of deep learning architectures and concepts such as transformers, diffusion models, attention, KV cache, and more
- Strong experience with Python, core numerical libraries (like NumPy) and deep learning frameworks (e.g., Pytorch, JAX, Tensorflow)
- Solid Python coding skills
- Good grasp of related mathematical concepts, especially linear algebra
- Kernel development experience (CUDA, TritonLang, etc.)
- Deep learning research experience
- ML API development experience
- Experience training or deploying deep learning models
- Understanding of performance tradeoffs in modern accelerators
What the job involves
- At Modular, we’re on a mission to revolutionize AI infrastructure by systematically rebuilding the AI software stack from the ground up
- Our team, made up of industry leaders and experts, is building cutting-edge, modular infrastructure that simplifies AI development and deployment
- By rethinking the complexities of AI systems, we’re empowering everyone to unlock AI’s full potential and tackle some of the world’s most pressing challenges
- The GenAI Modeling team implements the latest models on top of Modular’s next generation AI platform, whether it’s public open source models to be showcased on the MAX Builds website, or private models for our enterprise partners and customers
- As a senior modeling engineer, you will use your in-depth understanding of modern deep learning architectures, especially but not limited to LLMs, to implement, test and validate models
- You’re also expected to mentor more junior engineers in this task
- Not only will your work enrich Modular’s model library, but it will also serve as example code for the MAX developer ecosystem worldwide, and will directly inform and guide the development of our platform
- You will develop high-level ML code using APIs such as the Python MAX Graph API, as well as basic kernels using the Mojo programming language
- Develop AI models, with a focus on GenAI and LLMs, using the MAX platform's Python APIs
- Research and stay up-to-date on the latest model architectures, such as Llama and DeepSeek
- Implement novel models and architectures based on research papers
- Test and evaluate the inference accuracy of models
- Mentor less experienced engineers on model architectures, implementation details, and effective use of modern APIs like PyTorch
- Collaborate with the MAX Platform team by providing feedback on API design and developer experience
- Work with leads and product managers to estimate and plan the development of new models
Benefits
- A variety of fantastic health benefits (health, dental, vision insurance; life insurance etc) are available
- A 401k plan with up to 5% match
- Free tax advice on Carta
- Generous work-from-home stipend of $1500 to help you improve your home office
- Unlimited paid time off and flexible work hours
Top Benefits
A variety of fantastic health benefits (health, dental, vision insurance; life insurance etc)
A 401k plan with up to 5% match
Free tax advice on Carta
About the role
Who you are
- If you’re passionate about shaping the future of AI and creating tools that make a real difference in people’s lives, we want you on our team
- A passion for staying up to date with the latest literature in the field, as well as prior research experience is a plus
- Solid understanding of deep learning architectures and concepts such as transformers, diffusion models, attention, KV cache, and more
- Strong experience with Python, core numerical libraries (like NumPy) and deep learning frameworks (e.g., Pytorch, JAX, Tensorflow)
- Solid Python coding skills
- Good grasp of related mathematical concepts, especially linear algebra
- Kernel development experience (CUDA, TritonLang, etc.)
- Deep learning research experience
- ML API development experience
- Experience training or deploying deep learning models
- Understanding of performance tradeoffs in modern accelerators
What the job involves
- At Modular, we’re on a mission to revolutionize AI infrastructure by systematically rebuilding the AI software stack from the ground up
- Our team, made up of industry leaders and experts, is building cutting-edge, modular infrastructure that simplifies AI development and deployment
- By rethinking the complexities of AI systems, we’re empowering everyone to unlock AI’s full potential and tackle some of the world’s most pressing challenges
- The GenAI Modeling team implements the latest models on top of Modular’s next generation AI platform, whether it’s public open source models to be showcased on the MAX Builds website, or private models for our enterprise partners and customers
- As a senior modeling engineer, you will use your in-depth understanding of modern deep learning architectures, especially but not limited to LLMs, to implement, test and validate models
- You’re also expected to mentor more junior engineers in this task
- Not only will your work enrich Modular’s model library, but it will also serve as example code for the MAX developer ecosystem worldwide, and will directly inform and guide the development of our platform
- You will develop high-level ML code using APIs such as the Python MAX Graph API, as well as basic kernels using the Mojo programming language
- Develop AI models, with a focus on GenAI and LLMs, using the MAX platform's Python APIs
- Research and stay up-to-date on the latest model architectures, such as Llama and DeepSeek
- Implement novel models and architectures based on research papers
- Test and evaluate the inference accuracy of models
- Mentor less experienced engineers on model architectures, implementation details, and effective use of modern APIs like PyTorch
- Collaborate with the MAX Platform team by providing feedback on API design and developer experience
- Work with leads and product managers to estimate and plan the development of new models
Benefits
- A variety of fantastic health benefits (health, dental, vision insurance; life insurance etc) are available
- A 401k plan with up to 5% match
- Free tax advice on Carta
- Generous work-from-home stipend of $1500 to help you improve your home office
- Unlimited paid time off and flexible work hours