About the role
About Ampliwork, Inc
Since 2017, Ampliwork has built Enterprise-Grade AI Agents that amplify Human Potential™ to drive business productivity and growth. Ampliwork’s AI Agents streamline complex multi-step workflows in highly regulated industries such as Financial Services.
The Mission
Our AI agents run complex, high-stakes workflows for enterprise customers. Your job: make them smarter, faster, and more reliable. You’ll iterate on agent behavior, design rigorous evaluation sets, and run focused AI research, shipping improvements tied to real metrics.
What You’ll Do
- Improve agent performance: Diagnose failure modes, refine routing/planning/tool-use strategies, optimize prompts, reduce latency without hurting quality.
- Own evaluation sets: Design balanced, adversarial, and scenario-based evals; build/extend an automated evaluation harness; track win-rates, pass@k, precision/recall/F1, and task success.
- Run applied AI research: Scout papers/techniques, reproduce bite-size results, prototype ideas (self-reflection, planning, retrieval augmentation, tool composition), and ship what works.
- Experiment & measure: A/B test agent changes, define guardrails, and land improvements with clear dashboards and acceptance criteria.
What You’ll Work With
- Python
- SQL
- Git
- RAG Pipelines
- Document Parsing/Extraction Pipelines
- Eval Pipelines
- Jupyter
- Experiment Tracking
- Prompt/Agent Tooling
- Retrieval & Tool Orchestration
- Metrics Dashboards
What We’re Looking For
Must-haves: Analytical mindset, attention to detail, crisp communication, comfort with spreadsheets/Docs, and a bias for rapid iteration.
Nice-to-haves: Strong Python and SQL foundations; familiarity with evaluation metrics; curiosity about LLM planning, retrieval augmentation, and agent architectures.
Bonus signals: A small repo, AI research paper summary, or write-up where you improved an agent/model or designed a solid eval.
Why you’ll love it
- Impact from day one: Your changes roll into agents real customers use.
- Mentorship: Tight feedback loops with ML, Engineering, and Product.
- Ownership & speed: Small team, meaningful problems, quick ships.
- Hybrid flexibility: Deep-work time remotely, collaboration days downtown.
Logistics & eligibility
Language: English (French an asset).
Targeted fields of study: Engineering, Computer Science/IT, Physical Sciences.
How To Apply
Deadline: August 25, 2025, 11:59 PM ET (applications reviewed on a rolling basis)
Submit: Resume + short cover note (what you’d improve in an AI agent, in ~150 words)
About Ampliwork
Our vision is to reimagine work by deploying enterprise-grade AI agents designed to manage complex tasks.
The modern enterprise stands at an inflection point. Today's organizations face unprecedented complexity - billions lost to inefficient workflows, mounting regulatory pressures, and human potential trapped in routine tasks. The promise of artificial intelligence offers a new path forward, but only if it can operate at enterprise scale while meeting the most stringent compliance demands.
At Ampliwork, we're building AI agents that don't just meet these challenges - they transform how enterprises work at their core, liberating human intelligence to drive unprecedented value creation. We do this not by replacing human expertise, but by elevating it across every level of the organization, from individual workflows to global operations.