Tuesday, October 21, 2014

GRC track hub arrives!

The GRC are now providing assembly-related tracks for reference genomes via a track hub, which will allow you to view that data in a range of genome browsers, including Ensembl and UCSC. The location of this hub is: http://ngs.sanger.ac.uk/production/grit/track_hub/hub.txt

What tracks are available?
The GRC generates a range of annotations on the reference genomes it curates. These tracks describe the assembly of the genome, as well as quality issues with the genome, and provides information relevant to their resolution or planned improvement. Much of this annotation is already available via the gEVAL browser. However, the track hub allows you to view this annotation in the genome browser of your choice.

The individual tracks currently available are:
  • Genome issues under review by the GRC
  • Genomic regions defined by the GRC
  • Alignments between the primary assembly and alternate loci or patches
  • Clone sequence anomalies
  • Human regions with clones from the CHORI-17 library (CHM1tert)
These tracks are updated on a weekly basis. We will be adding to this range of information as time goes on.

How does this look?
Here's how this data looks in Ensembl:
 Here's how it looks in UCSC:

What is a track hub?
A track hub is a means of attaching multiple annotation tracks to a genome browser via a single URL. Full documentation on track hubs is available here.

How do I attach the track hub in my favourite genome browser?
You need to specify the following URL to the genome browser:
http://ngs.sanger.ac.uk/production/grit/track_hub/hub.txt
  • In Ensembl: Go to "Add/Manage your data", select "Add your data" (if necessary), then select the data format "TrackHub", and add the hub URL.
  • In UCSC: select the "Track Hubs" button just beneath the main browser area, then select the "My Hubs" tab, and add the hub URL.

Tuesday, October 14, 2014

GRCh38.p1 has arrived!

The first patch release for the GRCh38 reference assembly is now available. The GRCh38.p1 release includes 16 scaffolds: 13 FIX patches and 3 NOVEL patches. The FIX patch scaffolds correct existing assembly sequences, while the NOVEL patch scaffolds provide new alternate sequence representations. You can download the GRCh38.p1 assembly, including the alignments of the patches to GRCh38, from the GenBank FTP site: ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA_000001405.16_GRCh38.p1/.

Stay tuned for upcoming blog posts on individual patches!