Research Methodologist, Data Science
About the role
Staff - Non Union
Job Category
M&P - AAPS
Job Profile
AAPS Salaried - Statistical Analysis, Level B
Job Title
Research Methodologist, Data Science
Department
Regier Laboratory | School of Population and Pubic Health | Faculty of Medicine
Compensation Range
$7,622.83 - $11,886.67 CAD Monthly
The Compensation Range is the span between the minimum and maximum base salary for a position. The midpoint of the range is approximately halfway between the minimum and the maximum and represents an employee that possesses full job knowledge, qualifications and experience for the position. In the normal course, employees will be hired, transferred or promoted between the minimum and midpoint of the salary range for a job.
Posting End Date
September 15, 2025
Note: Applications will be accepted until 11:59 PM on the Posting End Date.
Job End Date
September 30, 2026
At UBC, we believe that attracting and sustaining a diverse workforce is key to the successful pursuit of excellence in research, innovation, and learning for all faculty, staff and students. Our commitment to employment equity helps achieve inclusion and fairness, brings rich diversity to UBC as a workplace, and creates the necessary conditions for a rewarding career.
Job Summary
The Research Methodologist for Data Science will be a key member of Dr. Dean Rieger’s Regulatory Science Lab research team. They will contribute to ongoing and emerging research activities augmenting health data and data systems to enable rapid evidence generation for innovative health products. The Research Methodologist will support data preparation and application of natural language processing (NLP) and other machine learning methods, including training and deploying large language models (LLMs) for extracting data from electronic health records (EHRs). They will also undertake statistical analysis for performance evaluation. Other responsibilities include supporting grant applications, writing reports and manuscripts, presenting results to multidisciplinary audiences and other forms of knowledge exchange. The working environment will require both independent research and working as a core member of project teams, with opportunities to engage with interdisciplinary researchers, health care stakeholders, and policy makers.
This is a full time 1-year term position, with the possibility of extension.
Organizational Status
The Research Methodologist for Data Science works independently under the direction of Dr. Dean Regier in his Regulatory Science Lab, contributing to both technical and project management activities. The incumbent will interact and work with other faculty, research staff, and graduate students within SPPH, the ATM, and external collaborators.
Housed within the Faculty of Medicine, the School of Population and Public Health (SPPH) is an innovative unit that encompasses many of the health-related groupings at UBC as a collaborative venture. The School is structured around four divisions: Occupational and Environmental Health; Health Services and Policy; Epidemiology, Biostatistics and Public Health Practice; and Health in Populations. The resulting mix of professions and disciplines is seen as a means of connecting individuals and learners to galvanize the relationship between health research, public health and health services and to enhance learning.
The ATM at the University of British Columbia (UBC) is established by the Faculty of Medicine. The ATM is a nucleus for translational and regulatory science research, education and training, and innovation. The ATM is a strategic imperative for the Faculty of Medicine and has a mandate to collaborate with other units in the Faculty of Medicine, the wider University, and beyond to accelerate the translational medicine continuum. The ATM will drive impactful medical and policy research to create new knowledge to improve health outcomes and benefit society. The ATM is the nucleus for the development of new academic and training programs, creating a cutting-edge ecosystem in which top educators and researchers train the next generation of health innovators.
Work Performed
- Develop, program, test, validate, and report on NLP, including LLMs, and other machine learning models for health data extraction and augmentation to support life-cycle evidence generation.
- Test model accuracy across various data sources and scenarios, comparing performance across models, and assessing safety and risks. Ensure compliance with ethics, privacy, and data security requirements.
- Conduct bias audits to ensure model predictions are fair and consistent across all relevant data sources.
- Stay up-to-date with reviewing and potentially integrating new approaches and techniques from emerging scientific literature on generative AI and NLP methods.
- Critically evaluate and appraise complex data and analytic project needs, addressing methodologic and other problems.
- Document data and analysis components of projects, including on data sources, training protocols, and compliance with data safety and ethics requirements, and ensure project completion to deadlines.
- Develop scripts and data processing pipelines using R or Python, leveraging advanced expertise for data processing, model development, and data visualization.
- Work with senior staff members to develop and modify models, and mentor junior staff and students, as needed.
- Develop, write, coordinate, and edit scientific manuscripts, reports and other knowledge translation products, including design, compilation, synthesis, dissemination and evaluation.
- Contribute to development of new research proposals and other knowledge translation activities.
Consequence of Error/Judgement
The incumbent is given wide latitude for exercising independent initiative and judgment in performing specialized duties and responsibilities. A lack of judgment could harm the research team and partner organizations' research and funding. The incumbent will interact with multiple researchers across various organizations to address their data needs and research findings, and discretion is vital.
Consequences of inappropriate judgment exercised by position include:
-
Loss of funding opportunities and collaborative partnerships.
-
Compromising the quality of research findings.
-
Missed project deadlines.
-
Damage to the reputation of any or all of the investigators and/or their affiliated organizations.
Failure to maintain a high degree of attention to detail could negatively affect the accuracy of research findings.
This position requires employees to work under strict confidentiality requirements; internal procedures and will be responsible for access, collection, use and disclosure of personal information in accordance with the BC Freedom of Information and Protection of Privacy Act (RSBC 1996) and other UBC privacy and security policies. This position requires employees to work under strict confidentiality requirements; internal procedures and policies to protect personal information must be followed and adherence to these requirements will be regularly reviewed by the employer.
Supervision Received
The incumbent will be able to work independently with minimal supervision and regularly report to the Regulatory Science Lab Principal Investigator. The incumbent will also receive support from research collaborators as needed. Performance will be reviewed periodically based on the quality and timeliness of work.
Supervision Given
The incumbent may assist the Regulatory Science Lab Principal Investigator with the supervision and mentoring of junior research trainees, such as summer students.
Minimum Qualifications
Post-graduate degree in Statistics. Minimum of three years of related experience in research analysis, or the equivalent combination of education and experience.
-
Willingness to respect diverse perspectives, including perspectives in conflict with one’s own
-
Demonstrates a commitment to enhancing one’s own awareness, knowledge, and skills related to equity, diversity, and inclusion
Preferred Qualifications
- Education in Computer Sciences, Health Economics, Epidemiology, Public Health, Statistics, or other relevant fields (emphasis on health research) preferred.
- Experience developing and applying generative AI, NLP and other machine learning models and using open-source statistical software (e.g. R and/or Python).
- Education in Computer Sciences, Health Economics, Epidemiology, Public Health, Statistics, or other relevant fields (preferred emphasis on health research).
- Demonstrated competence in machine learning, NLP, and generative AI, and ongoing curiosity about the latest technical advancements.
- Experience working with and interpreting medical/health-related natural language data in an AI or machine learning context. Experience with cleaning and analyzing complex, large scale, potentially incomplete data from varying sources. Working knowledge of different LLM prompt engineering techniques and relevant theory.
- Knowledge of relevant generative AI, NLP, machine learning and data visualization packages, libraries, and tools, as well as excellent programming and scripting abilities in either Python or R.
- Experience implementing Retrieval-Augmented Generation (RAG) pipelines for generative AI applications and using applications to locally run LLMs (e.g. LM Studio, Ollama) is preferred
- Detailed knowledge of study design, data collection, and analysis.
- Experience with health research projects in an academic setting and familiarity with writing, editing, and reviewing reports and journal manuscripts.
- Excellent communication skills including the ability to build and maintain effective working relationships both internally and externally,
- Ability to exercise tact, discretion, initiative, confidentiality and judgment.
- Demonstrated analytical and problem-solving skills including the ability to comprehend complex issues and related data/information and present information in concise and meaningful ways.
- Ability to maintain accuracy and attention to detail and handle multiple concurrent tasks.
- Demonstrated organizational skills to generate work plans to meet project deadlines within timeline, scope and budget.
- Ability to work effectively independently and in a diverse team environment.
Research Methodologist, Data Science
About the role
Staff - Non Union
Job Category
M&P - AAPS
Job Profile
AAPS Salaried - Statistical Analysis, Level B
Job Title
Research Methodologist, Data Science
Department
Regier Laboratory | School of Population and Pubic Health | Faculty of Medicine
Compensation Range
$7,622.83 - $11,886.67 CAD Monthly
The Compensation Range is the span between the minimum and maximum base salary for a position. The midpoint of the range is approximately halfway between the minimum and the maximum and represents an employee that possesses full job knowledge, qualifications and experience for the position. In the normal course, employees will be hired, transferred or promoted between the minimum and midpoint of the salary range for a job.
Posting End Date
September 15, 2025
Note: Applications will be accepted until 11:59 PM on the Posting End Date.
Job End Date
September 30, 2026
At UBC, we believe that attracting and sustaining a diverse workforce is key to the successful pursuit of excellence in research, innovation, and learning for all faculty, staff and students. Our commitment to employment equity helps achieve inclusion and fairness, brings rich diversity to UBC as a workplace, and creates the necessary conditions for a rewarding career.
Job Summary
The Research Methodologist for Data Science will be a key member of Dr. Dean Rieger’s Regulatory Science Lab research team. They will contribute to ongoing and emerging research activities augmenting health data and data systems to enable rapid evidence generation for innovative health products. The Research Methodologist will support data preparation and application of natural language processing (NLP) and other machine learning methods, including training and deploying large language models (LLMs) for extracting data from electronic health records (EHRs). They will also undertake statistical analysis for performance evaluation. Other responsibilities include supporting grant applications, writing reports and manuscripts, presenting results to multidisciplinary audiences and other forms of knowledge exchange. The working environment will require both independent research and working as a core member of project teams, with opportunities to engage with interdisciplinary researchers, health care stakeholders, and policy makers.
This is a full time 1-year term position, with the possibility of extension.
Organizational Status
The Research Methodologist for Data Science works independently under the direction of Dr. Dean Regier in his Regulatory Science Lab, contributing to both technical and project management activities. The incumbent will interact and work with other faculty, research staff, and graduate students within SPPH, the ATM, and external collaborators.
Housed within the Faculty of Medicine, the School of Population and Public Health (SPPH) is an innovative unit that encompasses many of the health-related groupings at UBC as a collaborative venture. The School is structured around four divisions: Occupational and Environmental Health; Health Services and Policy; Epidemiology, Biostatistics and Public Health Practice; and Health in Populations. The resulting mix of professions and disciplines is seen as a means of connecting individuals and learners to galvanize the relationship between health research, public health and health services and to enhance learning.
The ATM at the University of British Columbia (UBC) is established by the Faculty of Medicine. The ATM is a nucleus for translational and regulatory science research, education and training, and innovation. The ATM is a strategic imperative for the Faculty of Medicine and has a mandate to collaborate with other units in the Faculty of Medicine, the wider University, and beyond to accelerate the translational medicine continuum. The ATM will drive impactful medical and policy research to create new knowledge to improve health outcomes and benefit society. The ATM is the nucleus for the development of new academic and training programs, creating a cutting-edge ecosystem in which top educators and researchers train the next generation of health innovators.
Work Performed
- Develop, program, test, validate, and report on NLP, including LLMs, and other machine learning models for health data extraction and augmentation to support life-cycle evidence generation.
- Test model accuracy across various data sources and scenarios, comparing performance across models, and assessing safety and risks. Ensure compliance with ethics, privacy, and data security requirements.
- Conduct bias audits to ensure model predictions are fair and consistent across all relevant data sources.
- Stay up-to-date with reviewing and potentially integrating new approaches and techniques from emerging scientific literature on generative AI and NLP methods.
- Critically evaluate and appraise complex data and analytic project needs, addressing methodologic and other problems.
- Document data and analysis components of projects, including on data sources, training protocols, and compliance with data safety and ethics requirements, and ensure project completion to deadlines.
- Develop scripts and data processing pipelines using R or Python, leveraging advanced expertise for data processing, model development, and data visualization.
- Work with senior staff members to develop and modify models, and mentor junior staff and students, as needed.
- Develop, write, coordinate, and edit scientific manuscripts, reports and other knowledge translation products, including design, compilation, synthesis, dissemination and evaluation.
- Contribute to development of new research proposals and other knowledge translation activities.
Consequence of Error/Judgement
The incumbent is given wide latitude for exercising independent initiative and judgment in performing specialized duties and responsibilities. A lack of judgment could harm the research team and partner organizations' research and funding. The incumbent will interact with multiple researchers across various organizations to address their data needs and research findings, and discretion is vital.
Consequences of inappropriate judgment exercised by position include:
-
Loss of funding opportunities and collaborative partnerships.
-
Compromising the quality of research findings.
-
Missed project deadlines.
-
Damage to the reputation of any or all of the investigators and/or their affiliated organizations.
Failure to maintain a high degree of attention to detail could negatively affect the accuracy of research findings.
This position requires employees to work under strict confidentiality requirements; internal procedures and will be responsible for access, collection, use and disclosure of personal information in accordance with the BC Freedom of Information and Protection of Privacy Act (RSBC 1996) and other UBC privacy and security policies. This position requires employees to work under strict confidentiality requirements; internal procedures and policies to protect personal information must be followed and adherence to these requirements will be regularly reviewed by the employer.
Supervision Received
The incumbent will be able to work independently with minimal supervision and regularly report to the Regulatory Science Lab Principal Investigator. The incumbent will also receive support from research collaborators as needed. Performance will be reviewed periodically based on the quality and timeliness of work.
Supervision Given
The incumbent may assist the Regulatory Science Lab Principal Investigator with the supervision and mentoring of junior research trainees, such as summer students.
Minimum Qualifications
Post-graduate degree in Statistics. Minimum of three years of related experience in research analysis, or the equivalent combination of education and experience.
-
Willingness to respect diverse perspectives, including perspectives in conflict with one’s own
-
Demonstrates a commitment to enhancing one’s own awareness, knowledge, and skills related to equity, diversity, and inclusion
Preferred Qualifications
- Education in Computer Sciences, Health Economics, Epidemiology, Public Health, Statistics, or other relevant fields (emphasis on health research) preferred.
- Experience developing and applying generative AI, NLP and other machine learning models and using open-source statistical software (e.g. R and/or Python).
- Education in Computer Sciences, Health Economics, Epidemiology, Public Health, Statistics, or other relevant fields (preferred emphasis on health research).
- Demonstrated competence in machine learning, NLP, and generative AI, and ongoing curiosity about the latest technical advancements.
- Experience working with and interpreting medical/health-related natural language data in an AI or machine learning context. Experience with cleaning and analyzing complex, large scale, potentially incomplete data from varying sources. Working knowledge of different LLM prompt engineering techniques and relevant theory.
- Knowledge of relevant generative AI, NLP, machine learning and data visualization packages, libraries, and tools, as well as excellent programming and scripting abilities in either Python or R.
- Experience implementing Retrieval-Augmented Generation (RAG) pipelines for generative AI applications and using applications to locally run LLMs (e.g. LM Studio, Ollama) is preferred
- Detailed knowledge of study design, data collection, and analysis.
- Experience with health research projects in an academic setting and familiarity with writing, editing, and reviewing reports and journal manuscripts.
- Excellent communication skills including the ability to build and maintain effective working relationships both internally and externally,
- Ability to exercise tact, discretion, initiative, confidentiality and judgment.
- Demonstrated analytical and problem-solving skills including the ability to comprehend complex issues and related data/information and present information in concise and meaningful ways.
- Ability to maintain accuracy and attention to detail and handle multiple concurrent tasks.
- Demonstrated organizational skills to generate work plans to meet project deadlines within timeline, scope and budget.
- Ability to work effectively independently and in a diverse team environment.