Sorry %%REM_ADDRESS%%, your request cannot be processed.For security reasons, it was blocked and logged.%%NINJA_LOGO%%If you believe this was an error please contact thewebmaster and enclose the following incident ID:[ #%%NUM_INCIDENT%% ]

Data Scientist

Resume posted by moldach in Scientific.

Desired position type: Any
Location: Montreal Quebec, Canada

Contact moldach


I am a professional data scientist/bioinformatician whose job consists in helping companies and researchers to analyse their datasets. I am fluent in the R statistical programming language for most data-science steps: data pre-processing, application of statistical methods, data visualization and results communication. I also have a strong working knowledge in a Linux/Unix shell; having worked both on cloud infrastructure (Amazon EC2) and on multi-user high performance computing (HPC) clusters, using job schedulers (SLURM & PBS), containers (Docker), and workflow management tools (snakemake & Drake). I use version control (Git + Github/Gitlab), report building with literate programming workflows (Rmarkdown) and dashboards (Shiny + Flexdashboard).


University of Calgary, Calgary, Alberta, Canada

M.Sc.. Biological Sciences                                             January 2013 – December 2015

  • Advisor: Prof. Peter Vize
  • Qualifying exam committee: Gordon Chua, Douglas Muench, Peter Vize
  • Thesis: Transcriptome dynamics over a lunar cycle in Acropora humilis
  • Supported by Queen Elizabeth II Graduate Scholarship


Dalhousie University, Halifax, Nova Scotia                      September 2007 – April 2011

B.Sc., Marine Biology

  • Advisor: Prof. Roger Croll & Jocelyne Hellou
  • Thesis: Ecotoxicological effects of 17β-Estradiol in marine and freshwater gastropods, Ilyanassa obsoleta & Lymnaea stagnalis
  • Graduated with cum laude


I am a former Marine Biologist turned Data Scientist whose achievements include leveraging cloud-computing for coral genetic analysis, developing an image analysis pipeline for Remotely Operated Vehicles, using machine learning for spatial proteomics in oncology, working with large genotyping studies like UKBIOBANK containing TB’s of data from over 500,000 individuals in the UK. I program mostly in R and love sharing my knowledge, that’s why I started a blog and I share my posts also on R-bloggers and Rweekly.


  • R, Bash, LaTeX, Git, Markdown, Adobe Illustrator, AWS services (EC and S3), Python, Spark, Selenium, MATLAB

Spoken Languages