About the role
Who you are
- Do you want to join a team working on bleeding-edge technology? Have you ever wondered how we can give a voice to devices? Or do you want to join a team developing GenAI models?
- We are looking for an Applied Scientist with experience in building highly optimized Machine Learning models for speech generation
- PhD in engineering, computer science, machine learning, mathematics or equivalent quantitative field
- Experience working in Speech Science
- Experience applying theoretical models in an applied environment
- Experience in state-of-the-art deep learning model architecture design, training, optimization, and pruning
- Experience implementing algorithms using toolkits and self-developed code
- Experience with programming languages such as Python, Java, C++
- Experience with model optimization techniques (quantization, distillation, compression, inference optimization, etc.)
- Experience in professional software development
- Experience communicating across technical and non-technical audiences, including executive-level stakeholders or clients
- Experience working with agile development methodologies
What the job involves
- Our team works on all of the above; join us to see for yourself. The Text-to-Speech on Device team is responsible for developing AI-based voice models that run locally on devices. This requires a specific mix of skills spanning device integration, voice generation technologies, and machine learning
- We deliver solutions for multiple customers, including offline solutions for Alexa, automotive customers, and accessibility voices for visually impaired users. All our models are integrated into devices and run with limited hardware resources
- Work with the team on end-to-end development of ML models for speech generation, from early experimentation to building production-ready models
- Engage in state-of-the-art and innovative research in areas such as Speech Generation, Gen AI, model compression, and knowledge distillation
- Invent optimization techniques to push the boundaries of deep learning model training and inference
- Create and propose detailed theoretical specifications for novel research ideas and directions, and rigorously justify their correctness
- Train custom Speech Generation and Gen AI models that beat the state of the art and pave the way for developing production models
- Collaborate with other science teams to bring state-of-the-art Speech Generation models from cloud to devices
- The Text-to-Speech on Device team focuses on delivering low-footprint AI models for speech generation that can work locally on devices (Android, FireOS, etc.)
- These models require much less computation power than the ones hosted in the cloud. We cooperate directly with the teams developing devices and with the scientists responsible for the cloud models in order to provide our customers with the best possible experience
About Amazon
Amazon is guided by four principles: customer obsession rather than competitor focus, passion for invention, commitment to operational excellence, and long-term thinking. We are driven by the excitement of building technologies, inventing products, and providing services that change lives. We embrace new ways of doing things, make decisions quickly, and are not afraid to fail. We have the scope and capabilities of a large company, and the spirit and heart of a small one.
Together, Amazonians research and develop new technologies from Amazon Web Services to Alexa on behalf of our customers: shoppers, sellers, content creators, and developers around the world.
Our mission is to be Earth's most customer-centric company. Our actions, goals, projects, programs, and inventions begin and end with the customer top of mind.
You'll also hear us say that at Amazon, it's always "Day 1." What do we mean? That our approach remains the same as it was on Amazon's very first day - to make smart, fast decisions, stay nimble, invent, and focus on delighting our customers.