Cyrille Rossant

Setting up a blog with Pelican and GitHub Pages

2016-02-01

I describe how I set up my static blog/website in Python with Pelican, pandoc, Docker, Dockerhub, GitHub pages, and Travis CI.


Should you use HDF5?

2016-01-30

This is a follow-up on my post Moving away from HDF5 (see also Konrad Hinsen's post, and discussions on Twitter and Hacker News). Here are some further thoughts, in no particular order.


Moving away from HDF5

2016-01-06

Update [2016-01-30]: I wrote a follow-up here

In the research lab where I work, we've been developing a data processing pipeline for several years. This includes not only a program but also a new file format based on HDF5 for a specific type of data. While the choice of HDF5 was looking compelling on paper, we found many issues with it. Recently, despite the high costs, we decided to abandon this format in our software.

In this post, I'll describe what is HDF5 and what are the issues that made us move away from it.


New year

2016-01-01

I didn't write a lot lately: only 2 posts in 2015! I'll try to do better this year: more and shorter posts about programming, technology, and science. There's no shortage of topics to discuss.

Happy New Year.


A compiler infrastructure for data visualization

2015-07-24

There are many data visualization tools out there. Yet, I believe we're still lacking a robust, scalable, and cross-platform visualization toolkit that can handle today's massive datasets.

Most existing tools target simple plots with a few hundreds or thousands of points: bar plots, scatter plots, histograms and the like. Typically, these figures represent aggregated statistical quantities. Maps are also particularly popular, and there are now really great open source tools.

Perhaps contrary to a common belief, this is not the end of the story. There are much more complex visualization needs in academia and industry, and I've always been unsatisfied by the tools at our disposal.