ODG, the Omics Database Generator, has been published

ODG: Omics Database Generator has been published in BMC Bioinformatics and is available online now. ODG is a tool that allows users to supply -omics data and ODG will integrate the data into a coherent database and generate a web-based user-interface. Advanced users can query the database directly, through a programming language or by using the CYPHER query language.… Read More »

Bio* Library for Clojure.

Biotools is my basic bioinformatics file parsing library. You can find it at GitHub. It can parse BLAST+ Tab output (-outfmt “6 std qlen slen”), ExPASY ENZYME.dat, FASTA, GFF3, FPKM Tracking files from Cufflinks, Interproscan tab delimited output, Gene Ontology/Plant Ontology OBO format (any Ontology in OBO format), PMN Pathways format, and a PSI-MITAB 2.5 format.… Read More »

Experimenting with Pulsar in Clojure

I’ve now started using Pulsar instead of trying out core.async as I needed a lightweight barrier to entry. The problem: My database of choice (Neo4j) takes batch insertions in a single thread only, but clojure is by its very nature multi-threaded/concurrent/parallel (the exact wording of which I am no longer certain!). I process many files… Read More »

Graph Database example using Gene Ontology – Part 1

The Gene Ontology project is a useful tool for anyone doing genomics. It’s a highly relational and controlled vocabulary, making it ideal for use in a graph database. In this example I will show you what a graph database is, and throughout this series we will create a graph database of GO terms, properly linked,… Read More »