Samposium

Ghostbusting

Sam Nicholls 3rd May 2015 / 18th January 2016 No Comments yet Meta

Shortly after setting up this blog, I embedded Google Analytics tracking, primarily because I like numbers but also in the hope of discovering that at least one other person who isn’t me or one of my supervisors is interested in my adventures. It’s also great writing practice and gives me the chance to properly think through the […]

google analytics, spam, stats

Aligned Annihilation

Sam Nicholls 1st May 2015 / 26th October 2015 No Comments yet AU-PhD

This afternoon, in a coffee-fueled fugue, I nuked every directory containing output from any attempt so far to align the limpet contigs to any form of database. Here’s why, and what I did next.

Pipelines

Sam Nicholls 29th April 2015 / 26th October 2015

[text: ‘pipelines’, photograph of a standard bioinformatics pipeline transforming data from one mess to another]

What am I doing?

Sam Nicholls 27th April 2015 / 13th January 2016 No Comments yet AU-PhD

A week ago I had a progress meeting with Amanda and Wayne, who make up the supervisory team for the computational face of my project. I talked about how computers are terrible and where the project is heading. As Wayne had been away from meetings for a few weeks, I began with a roundup of […]

introduction, phd, project

`memblame`

Sam Nicholls 26th April 2015 / 1st November 2015 No Comments yet System Administration, Tools

As a curious and nosy individual who likes to know everything, I wrote a script, dubbed memblame, which is responsible for naming and shaming authors of “inefficient” jobs on our cluster here in IBERS. It takes patience, often days and sometimes longer, to see large-input jobs executed on a node on the compute cluster […]

Scratch

Sam Nicholls 25th April 2015 / 26th October 2015

[text: ‘~/scratch/’, photograph of you desperately trying to pack your data for a programmatic excursion only to find that the airline charges by the bit for hold luggage]

TrEMBLing

Sam Nicholls 24th April 2015 / 1st November 2015 No Comments yet Bioinformatics, Mysteries

Something appears amiss with TrEMBL: millions of sequences are “missing”. Where did they go? At the end of last month, to build a database of bacterial sequences with known hydrolase activity, I extracted around 2.9 million sequences from UniProtKB/TrEMBL, a popular database which contains sequences that have been automatically annotated and are awaiting manual curation […]

The Story so Far: Part I, A Toy Dataset

Sam Nicholls 21st April 2015 / 26th October 2015 No Comments yet AU-PhD

In this somewhat long and long-overdue post, I’ll attempt to explain the work done so far, give an overview of the many issues encountered along the way, and offer an insight into why doing science is much harder than it ought to be. This post got a little longer than anticipated, so I’ve sharded […]

fastq, fastqc, introduction, limpet, quality control

Exit codes, core dumps, `set -e` and `expr`

Sam Nicholls 17th April 2015 / 25th October 2015 One Comment AU-PhD

The kernels on our cluster clients have recently been updated after I inadvertently stumbled across an old kernel bug that caused erratic behaviour when NFS tries to open a directory containing many files that are being written to simultaneously (more on which is another post in itself really, as usual). The update seems to have […]

My First Research Proposal

Sam Nicholls 10th March 2015 / 1st November 2015

[text: “my first research proposal”, photograph of the front cover of my new fantasy fairytale]