About the role
Who you are
- If this role is Hybrid, you will be expected to reside within commutable distance of the office/location specified in the job listing. This will include, but is not limited to, weekly/bi-weekly/monthly events in the office with your specific team. This is a requirement for this role
- Are you a software engineer passionate about building intelligent systems that make infrastructure smarter, faster, and more resilient?
- 5+ years' experience in software engineering
- Experience with SRE principles
- Experience with AI/ML in production environments
- A passion for automation, intelligent systems, and operational excellence
- Strong debugging, problem-solving, and system design skills
- Experience with AIOps platforms
- Contributions to open-source or AI communities
- Familiarity with Responsible AI frameworks
- Participation in AI hackathons or conferences
- This is your opportunity to code with purpose — building systems that think, learn, and adapt. If you're excited about the fusion of software engineering and AI, let’s talk
What the job involves
- Join us as we reimagine operational engineering through AI-first principles. In this role, you’ll design and implement AI-powered solutions that drive observability, automate incident response, and optimise cloud-native platforms
- This is more than a traditional SRE role — it’s a chance to engineer the future of reliability using machine learning, generative AI, and predictive analytics
- Build ML-based anomaly detection and pattern recognition systems
- Enhance telemetry with smart tagging and metadata for better AI insights
- Intelligent Automation:
- Develop event-driven workflows and self-healing systems using AI triggers
- Automate incident response with generative AI and custom AI agent orchestration
- Use time-series forecasting and predictive modelling to anticipate failures
- Optimise infrastructure with AI-powered autoscaling and cost-aware resource allocation
- Build scalable, fault-tolerant systems in a cloud-native environment
- Participate in on-call rotations and lead incident response for critical systems
- Skilled in API integration for streamlined data exchange and system connectivity
- Run internal AIOps workshops and help teams adopt AI maturity models
- Champion responsible AI practices and ethical automation
- Tech Stack & Skills:
- Languages: Python, Java, Bash, Terraform
- Platforms: Azure, Kubernetes, Docker
- Tools: Datadog, Prometheus, AppDynamics, ELK, GitHub Actions
- ML/AI: MCP framework, AI agents, Vector store, Agent orchestration (LangChain), RAG
- CI/CD: Jenkins, ArgoCD, Spinnaker
- Databases: SQL Server, PostgreSQL, MySQL
About PointClickCare
PointClickCare is a leading healthcare technology platform enabling meaningful collaboration and access to real‐time insights at every stage of the patient healthcare journey. More than 27,000 long‐term and post‐acute care providers, over 3,100 hospitals and health systems, over 3,600 ambulatory clinics, every major U.S. health plan, and over 70 state and government agencies use PointClickCare, enabling care collaboration and value‐based care delivery for millions across North America.