Staff Generative AI Engineer

Infojini Inc about 21 hours ago

Hybrid

Toronto, Ontario, Canada

Senior Level

Full-Time

About the role

Staff Generative AI Engineer Location: Greater Toronto Area, Canada Work Model: Hybrid (3 days onsite per week) Employment Type: Full-Time Permanent (No Contract)

Staff Generative AI Engineer Background We are looking for a Staff Generative AI Engineer to design, build, and ship production-grade Generative AI and Agentic AI applications that deliver business value across the organization. This role is focused on building AI applications and services at scale. You will be responsible for building robust, secure, and highly scalable systems that integrate with leading cloud-based AI services. As a senior individual contributor, you will bring deep technical expertise, drive high-quality engineering practices, and serve as a mentor for junior to mid-level engineers. You will work closely with software engineers, data scientists, and product teams to translate business problems into production-grade AI applications and services.

Key Responsibilities Application & Solution Engineering Design, build, and ship production-grade Generative AI and Agentic AI applications and services for internal and external users. Develop high-quality backend services in Python, with strong software engineering rigor around testing, performance, and maintainability. Champion reusability and abstraction by designing and building modular, well-abstracted components and libraries. Build multi-agent systems using frameworks such as LangChain, LangGraph, Claude Agent SDK, and Google ADK. Integrate with leading LLM and foundation model APIs, including Azure OpenAI, Google Vertex AI, and AWS Bedrock. Design and implement Retrieval-Augmented Generation (RAG) pipelines, including document ingestion, chunking strategies, embeddings, vector search, and re-ranking. Build clean, well-tested RESTful and/or gRPC APIs with a strong focus on reliability, security, and performance. Implement observability, tracing, evaluation, and guardrails for Generative and Agentic AI applications. Deploy and operate services on major cloud providers (GCP, AWS, and Azure) leveraging managed services. Contribute to platform architecture decisions and engineering best practices. Take applications from prototype through production deployment, hardening, and ongoing operation. Mentorship & Team Development Mentor and coach junior and mid-level engineers through code reviews, architecture discussions, and pair programming. Foster a culture of engineering excellence, knowledge sharing, and continuous improvement. Participate in technical design reviews and contribute to the professional growth of team members.

Required Qualifications 10–12 years of professional software engineering experience, including 3–5 years building AI/ML software products. Bachelor's degree in Computer Science or a related field (Master's degree preferred). Strong proficiency in Python with deep software engineering fundamentals including abstraction, modularity, system design, testing, and performance optimization. Hands-on experience building and shipping Generative AI and Agentic AI applications, including LLM integration, prompt engineering, and agentic workflows. Practical experience integrating cloud-hosted LLM APIs such as Azure OpenAI, Vertex AI, and/or AWS Bedrock. Experience with agent frameworks such as LangChain, LangGraph, Google ADK, and Claude Agent SDK. Experience with vector databases including Pinecone, Weaviate, pgvector, OpenSearch, and AlloyDB. Hands-on experience with Google Cloud Platform (GCP), Amazon Web Services (AWS), or Microsoft Azure. Strong understanding of API design, distributed systems, and cloud-native architecture. Proven track record of taking systems from design through production deployment and ongoing operation.

Preferred Qualifications Experience with containerization and orchestration technologies such as Docker and Kubernetes. Knowledge of Generative AI Risk Management frameworks (NIST RMF). Experience supporting developer platforms or internal tooling. Experience writing design documents and helping define engineering standards.

Not the right fit? Search for Generative AI Engineer jobs in Toronto, Ontario, Canada

About Infojini Inc

Staffing and Recruiting

501-1000

Founded in 2006, Infojini has been a trusted partner for IT services and workforce solutions, supporting Fortune 1000 companies and government agencies worldwide.

We are honored to be recognized as the Grand Winner (201–500 employees) in SIA’s 2025 Best Staffing Firms to Work For in North America! This marks our third consecutive year (2023, 2024, and 2025) receiving this prestigious recognition, a testament to our commitment to fostering a people-first culture and delivering exceptional workforce solutions.

Our CMMI Level 3 appraisal reflects our dedication to quality and innovation in staffing and IT consulting. With five consecutive years on the INC 5000 Fastest-Growing Companies list and recognition in the USPAACC Fast 100, we continue to drive success for our clients by staying ahead of industry trends.

At Infojini, we don’t just connect businesses with talent—we build sustainable workforce solutions that empower growth and innovation.

Website LinkedIn

Similar Jobs