Archive For July 26, 2016
Recently I had to remove all small RNAs from some Cufflinks data. There are lots of ways to do this, but this one was relatively painless (aside from figuring it out). The HUGO website annotates genes by subcategory, and we can grab the data from there. It’s available as either JSON or plain text. I found…
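As a rough illustration of the filtering step (not necessarily the approach the full post takes), here is a minimal sketch that drops rows from a tab-separated Cufflinks table when the gene symbol appears in a list. It assumes the small RNA symbols have already been saved one per line, and that the gene symbol sits in column 5 (gene_short_name) of the tracking file. The function name and file names are hypothetical.

```shell
#!/usr/bin/env bash
# drop_genes SYMBOLS_FILE TRACKING_FILE
# Prints TRACKING_FILE (tab-separated, one header row) without the rows
# whose 5th column appears in SYMBOLS_FILE.
# Assumption: column 5 holds the gene symbol (gene_short_name).
drop_genes() {
  awk -F'\t' -v list="$1" '
    # Load every symbol to remove into a lookup table.
    BEGIN { while ((getline sym < list) > 0) drop[sym] = 1 }
    # Always keep the header; keep data rows whose symbol is not listed.
    NR == 1 || !($5 in drop)
  ' "$2"
}

# Example call (hypothetical file names):
# drop_genes small_rna_symbols.txt genes.fpkm_tracking > genes.filtered.fpkm_tracking
```

Matching is exact and case-sensitive, so the symbols in the list need to be written the same way as in the Cufflinks output.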
Spin up an EC2 instance. For this demo, I used a t2.micro instance with the base Amazon Linux AMI. First, connect to the machine. You need to find your EC2 instance’s public DNS name. It will be something like: ec2-1-2-3-4.us-west-1.compute.amazonaws.com. On Amazon Linux the user is ec2-user. ssh -i /path/to/pem_file ec2-user@ec2-1-2-3-4.us-west-1.compute.amazonaws.com Once on the EC2 machine, I needed…
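The connection step can be sketched as a small script. The key path and hostname below are placeholders, and the helper function name is hypothetical; it only prints the command so you can check it before running anything.

```shell
#!/usr/bin/env bash
# Placeholder values; substitute your own key file and instance DNS name.
PEM_FILE="/path/to/pem_file"
EC2_HOST="ec2-1-2-3-4.us-west-1.compute.amazonaws.com"
EC2_USER="ec2-user"   # default login on the Amazon Linux AMI

# build_ssh_command KEY USER HOST
# Prints the ssh command line without executing it.
build_ssh_command() {
  printf 'ssh -i %s %s@%s\n' "$1" "$2" "$3"
}

build_ssh_command "$PEM_FILE" "$EC2_USER" "$EC2_HOST"

# To actually connect, restrict the key's permissions first
# (ssh refuses private keys that other users can read):
# chmod 400 "$PEM_FILE" && ssh -i "$PEM_FILE" "$EC2_USER@$EC2_HOST"
```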
A good use for Ensembl’s BioMart is pulling a list of gene names linked to Ensembl transcript IDs, as needed when using Sleuth. We will need Bioconductor and the biomaRt library, if you don’t have them already:
source("http://bioconductor.org/biocLite.R")
biocLite("biomaRt")
library(biomaRt)
The usual bioC way of doing things like this is to create an object that…