ODG: Omics Database Generator has been published in BMC Bioinformatics and is available online now. ODG is a tool that allows users to supply -omics data and ODG will integrate the data into a coherent database and generate a web-based user-interface. Advanced users can query the database directly, through a programming language or by using the CYPHER query language. ODG uses Neo4j’s… Read More »
I needed to analyze a large batch of samples (~300) of genes.fpkm_tracking files in Clojure and Incanter. This guide will show you how I read the files in, only looked at the FPKMs, and converted it into a single dataset. You need a project.clj file somewhere with the dependencies below (incanter, me.raynes.fs).
I’ve now started using Pulsar instead of trying out core.async as I needed a lightweight barrier to entry. The problem: My database of choice (Neo4j) takes batch insertions in a single thread only, but clojure is by its very nature multi-threaded/concurrent/parallel (the exact wording of which I am no longer certain!). I process many files when building the… Read More »