Hi, I'm Yan!
My job consists to help companies and researchers to analyse their datasets. I am skilled for most of the data-science steps: data pre-processing, application of statistical methods, data visualization and results communication.
I am currently working for the Queensland Brain Institute in Australia, where I develop data visualisation methods at the border between epidemiology and genetics.Publication Talks Twitter Github
From Data to Viz - A classification of graphics based on input data format
July 2018 - 6 minutes read
I’m delighted to announce a new dataviz project called Data to Viz. It is a classification of chart types based on input data format. It comes in the form of a decision tree leading to a set of potentially appropriate visualizations to represent the dataset. - Read more
Pimp my RMD - A collection of tips for R Markdown
July 2018 - 2 minutes read
R markdown creates interactive reports from R code. I’ve created a document that provides a few tips I use on a daily basis to improve the appearance of my html outputs (my memory aid). This document is built using R Markdown and hosted on Github. - Read more
I've developped several websites dedicated to data visualization. These resources are visited thousands of time every day.
Visualize how all the co-authors of my previous supervisor are inter connected.
A dataviz challenge on water pollution in France. Third price at the GreenTech Challenge.
Online tool to estimate standard error of allele frequency in pool-sequencing experiments. Published in Molecular Ecology
"[...] Yan did an absolutely brilliant job. He has multiple skills and qualities making him a great data science project member, and even leader : excellent knowledge and learning ability of diverse statistical and data management tools obviously, and a real knack for data viz."
"Yan always performed with outstanding skills and impressive capacity for adaptation and efficiency. His job was crucial for the publication of important results. [...] I thus highly recommend him for any work concering framing and analysis of huge data sets."