Title

Overview

The Ruminant Genome Database (RGD) is a comprehensive repository that integrates genomic, transcriptomic, and epigenetic data for Ruminantia. This suborder includes traditional livestock such as cattle, sheep, and goats; endangered species such as the milu; species adapted to Arctic and desert environments, such as reindeer and gemsbok; and species of biomedical interest, such as antlered deer. Ruminants are therefore of great significance for agriculture, conservation, adaptation research, and biomedicine. RGD currently hosts all 55 published ruminant genomes; 1,232 RNA-seq datasets (goat: 131, sheep: 832, cattle: 163, yak: 27, water buffalo: 39, roe deer: 20, and sika deer: 20); epigenomic signals predicted from 32 ruminant and 726 human epigenomic datasets; comparative genomic analysis results such as synteny blocks and orthologous genes annotated with Gene Ontology terms and pathways; and trait data from AnimalQTLdb. In addition, a set of analysis tools (including BLAT, BLAST, and Table Browser), visualization tools (the UCSC Genome Browser and heatmaps), and user-friendly query interfaces have been implemented in RGD to help the community use these large-scale data.

01. Phylogenetic tree

The ruminant phylogenetic tree supports a sister-group relationship between Antilocapridae and Giraffidae, as well as between Moschidae and Bovidae. Using fossil calibrations, the emergence of crown Ruminantia is dated to 39.1 - 32.3 Mya (late Oligocene), and the origin of Pecora to 23.3 - 20.8 Mya (Neogene).

Figure 1. Phylogeny and trait evolution of ruminants (Chen et al.).

02. Comparative Genomics and Epigenomic Signals

The UCSC Genome Browser is a feature-rich graphical viewing tool that is especially suited to displaying comparative genomic data. RGD releases "Genes and Gene Predictions", "Expression and Regulation", and "Comparative Genomics" tracks against the goat genome assembly (ARS1). These tracks display the improved goat gene annotation (1), gene expression profiles (2), epigenomic signals (3), five conservation scores (4), and a Multiz multiple sequence alignment of 67 species (5), comprising 55 ruminants and 12 mammalian outgroup species. Users can type a genomic region, a gene symbol, or a transcript name in the search box and click the "go" button to visualize these data. In particular, users can find clade-specific or species-specific sequence differences that relate to livestock production or ruminant morphological characteristics by zooming in on the alignments to the nucleotide or amino acid level. Users can also click the "PDF/PS" item under the "View" menu of the navigation bar to generate a high-quality image in PostScript or PDF format.
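As a rough illustration of how a search box can tell a genomic region apart from a gene symbol or transcript name, the sketch below parses a UCSC-style position string. The function name `parse_position` and the accepted pattern are assumptions made for this example; they do not describe how RGD or the UCSC Genome Browser handles input internally.

```python
import re

# Assumed pattern for a UCSC-style position such as "chr1:1,000-2,000".
# Sequence names may also be accessions containing dots or underscores.
_POSITION = re.compile(
    r"^(?P<chrom>[\w.]+):(?P<start>\d[\d,]*)-(?P<end>\d[\d,]*)$"
)


def parse_position(text):
    """Return (chrom, start, end) for a region string, or None otherwise.

    A None result means the input should be treated as a gene symbol
    or transcript name rather than as coordinates.
    """
    match = _POSITION.match(text.strip())
    if match is None:
        return None
    # Thousands separators are allowed in the input and removed here.
    start = int(match.group("start").replace(",", ""))
    end = int(match.group("end").replace(",", ""))
    if start > end:
        return None
    return match.group("chrom"), start, end


region = parse_position("chr1:1,000-2,000")
not_a_region = parse_position("MSTN")
```

Here `region` holds the chromosome name and two integer coordinates, while `not_a_region` is `None` because the input contains no coordinates.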

Figure 2. Main Genome Browser display page on the goat assembly (ARS1), showing tracks of gene annotations, expreBar, epigenomics signals, five conservation scores, and 67-way multiple alignment.

03. Gene Expression Heatmap

The expression atlas includes 1,232 RNA-seq datasets (goat: 131, sheep: 832, cattle: 163, yak: 27, water buffalo: 39, roe deer: 20, and sika deer: 20). Click a species and enter a gene symbol in the search box to view the gene's expression as a heatmap and in the Genome Browser.

04. Gene Ontology & Pathway Annotation

Users can enter a gene symbol to retrieve results in three parts: Orthologous Genes, the corresponding Gene Ontology terms, and KEGG & Wiki Pathways.

Results:

  • Orthologous Genes includes:
    Gene symbol, Transcript name, Species, CDS Length, CDS sequence, and Protein sequence.
  • Gene Ontology includes:
    Molecular Function: GO ID, GO term, and Evidence
    Cellular Component: GO ID, GO term, and Evidence
    Biological Process: GO ID, GO term, and Evidence
    Each GO ID links to the AmiGO 2 database.
  • Pathway includes:
    KEGG pathway items and Pathway ID. Users can click the Pathway ID to get a detailed KEGG pathway figure.
    WikiPathway items. Users can click "Pathway show" to get a detailed WikiPathway figure.

05. Quantitative Trait Locus

We filtered out QTL entries that were not anchored to a chromosome, leaving 1,082 sheep QTLs and 97,914 cattle QTLs. We provide three ways to query the QTL data: search by gene symbol, find QTLs by genome location, and find associated genes by trait name or keyword. Users can retrieve traits together with their annotated genes to help confirm gene function.
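The filtering step and two of the query modes can be sketched in a few lines. This is a minimal illustration under stated assumptions: the `Qtl` record, its field names, and the three example rows are invented for this sketch and are not taken from AnimalQTLdb or from the RGD schema.

```python
from typing import NamedTuple, Optional


class Qtl(NamedTuple):
    qtl_id: str            # made-up identifier
    trait: str
    chrom: Optional[str]   # None when the QTL is not anchored to a chromosome
    start: int
    end: int


def anchored(qtls):
    """Drop QTLs that have no chromosome assignment."""
    return [q for q in qtls if q.chrom]


def find_by_location(qtls, chrom, start, end):
    """Return QTLs that overlap the closed interval [start, end] on chrom."""
    return [q for q in qtls
            if q.chrom == chrom and q.start <= end and q.end >= start]


def find_by_trait(qtls, keyword):
    """Return QTLs whose trait name contains the keyword (case-insensitive)."""
    needle = keyword.lower()
    return [q for q in qtls if needle in q.trait.lower()]


# Invented example rows, for illustration only.
records = [
    Qtl("Q1", "Milk yield", "chr6", 1000, 5000),
    Qtl("Q2", "Body weight", None, 0, 0),
    Qtl("Q3", "Milk fat percentage", "chr14", 200, 900),
]

kept = anchored(records)
hits = find_by_location(kept, "chr6", 4000, 6000)
milk = find_by_trait(kept, "milk")
```

With these rows, `kept` excludes the unanchored record, `hits` contains only the chr6 QTL, and `milk` contains both milk-related QTLs.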

06. Tools

6.1 Local UCSC Table Browser

The UCSC Table Browser is a powerful tool for retrieving raw data and performing intersections and unions between data in different tracks. For basic queries, users select the clade, genome, assembly, group, track, table, regions of interest, output format, and output file name to get results as tab-delimited text or in a compressed format. For advanced queries, users can filter and refine queries, intersect query results from different tables, and configure the resulting output. The Table Browser can retrieve and download all data from the tracks of the goat, sheep, and cattle genomes for further analysis.
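To make the idea of intersecting two tracks concrete, here is a minimal sketch. It assumes each track is a list of `(chrom, start, end)` tuples with BED-style half-open coordinates; the function name `intersect` and the example intervals are invented for illustration and do not reproduce the Table Browser's own implementation.

```python
def intersect(track_a, track_b):
    """Return the overlapping pieces of intervals from two tracks.

    Intervals are (chrom, start, end) with half-open coordinates, so
    intervals that only touch at an endpoint do not overlap.
    """
    pieces = []
    for chrom_a, start_a, end_a in track_a:
        for chrom_b, start_b, end_b in track_b:
            if chrom_a != chrom_b:
                continue
            start = max(start_a, start_b)
            end = min(end_a, end_b)
            if start < end:
                pieces.append((chrom_a, start, end))
    return sorted(pieces)


# Invented example intervals, for illustration only.
genes = [("chr1", 100, 200), ("chr2", 50, 80)]
peaks = [("chr1", 150, 250), ("chr1", 190, 195), ("chr2", 80, 90)]

overlaps = intersect(genes, peaks)

# Tab-delimited lines, similar in spirit to a text export.
lines = ["\t".join(str(field) for field in piece) for piece in overlaps]
```

The nested loop is quadratic in the number of intervals, which is acceptable for a small example; a sorted sweep or an interval tree would be a better fit for whole-genome tracks.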

6.2 Blat

webBlat is a web-based version of BLAT, developed by Jim Kent. Users can enter a DNA, mRNA, or protein sequence and search it against the genome assembly of goat (ARS1), sheep (Oar_v4.0), or cattle (ARS-UCD1.2_Btau5.0.1Y). The search returns a list of links to all genome positions that share 95% or greater identity with the input sequence. Clicking the "Gbrowse" link displays the aligned region in the genome browser.
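The 95% cutoff can be illustrated with a simplified filter. This sketch assumes identity is the fraction of aligned bases that match, which is a simplification and not BLAT's own scoring; the hit dictionaries and their keys (`name`, `matches`, `mismatches`) are invented for this example.

```python
def percent_identity(matches, mismatches):
    """Simplified identity: matching bases as a percentage of aligned bases."""
    aligned = matches + mismatches
    return 100.0 * matches / aligned if aligned else 0.0


def keep_hits(hits, threshold=95.0):
    """Keep hits whose simplified identity meets or exceeds the threshold."""
    return [hit for hit in hits
            if percent_identity(hit["matches"], hit["mismatches"]) >= threshold]


# Invented example hits, for illustration only.
hits = [
    {"name": "hit_a", "matches": 95, "mismatches": 5},
    {"name": "hit_b", "matches": 90, "mismatches": 10},
    {"name": "hit_c", "matches": 200, "mismatches": 0},
]

reported = [hit["name"] for hit in keep_hits(hits)]
```

In this example `hit_b` falls below the cutoff and is dropped, while a hit at exactly 95% is retained.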

6.3 Blast

NCBI's wwwBLAST is also available in RGD as an online tool. Users can search DNA query sequences against the goat (ARS1), sheep (Oar_v4.0), and cattle (ARS-UCD1.2_Btau5.0.1Y) genome databases to obtain a set of high-scoring pairwise alignments.

6.4 LiftOver

RGD provides a local installation of the LiftOver tool, which was created by the UCSC Genome Browser Group. This tool converts genome coordinates and genome annotation files between assemblies. The following conversions are available:

  • Cattle (ARS_UCD_addY) --> Goat (ARS1)
    Cattle (Btau_5.0.1) --> Cattle (UMD_3.1.1)
    Cattle (Btau_5.0.1) --> Cattle (ARS_UCD_addY)
  • Goat (CHIR_2) --> Goat (ARS1)
  • Human (hg38) --> Cattle (ARS_UCD_addY)
    Human (hg38) --> Goat (ARS1)
    Human (hg38) --> Sheep (Oar_rambouillet_v1_0)
    Human (hg38) --> Sheep (Oar_v3.1)
    Human (hg38) --> Sheep (Oar_v4.0)
  • Sheep (Oar_v3.1) --> Sheep (Oar_v4.0)
    Sheep (Oar_v4.0) --> Sheep (Oar_rambouillet_v1_0)
    Sheep (Oar_v4.0) --> Goat (ARS1)
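Because conversions are directional, it can help to treat the list above as a directed graph and check whether a target assembly is reachable from a source assembly, directly or through intermediate steps. The sketch below encodes the listed pairs and runs a breadth-first search. The names `CONVERSIONS` and `conversion_path` are invented for this example, and whether chaining several conversions is advisable is an assumption: each LiftOver step can drop regions that fail to map.

```python
from collections import deque

# Directed conversions transcribed from the list above (source -> targets).
CONVERSIONS = {
    "ARS_UCD_addY": ["ARS1"],
    "Btau_5.0.1": ["UMD_3.1.1", "ARS_UCD_addY"],
    "CHIR_2": ["ARS1"],
    "hg38": ["ARS_UCD_addY", "ARS1", "Oar_rambouillet_v1_0",
             "Oar_v3.1", "Oar_v4.0"],
    "Oar_v3.1": ["Oar_v4.0"],
    "Oar_v4.0": ["Oar_rambouillet_v1_0", "ARS1"],
}


def conversion_path(source, target):
    """Return the shortest list of assemblies from source to target, or None."""
    queue = deque([[source]])
    seen = {source}
    while queue:
        path = queue.popleft()
        if path[-1] == target:
            return path
        for nxt in CONVERSIONS.get(path[-1], []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None


direct = conversion_path("hg38", "ARS1")
two_step = conversion_path("Oar_v3.1", "ARS1")
unreachable = conversion_path("ARS1", "hg38")
```

Here `direct` is a single step, `two_step` passes through Oar_v4.0, and `unreachable` is `None` because no conversion starts from the goat ARS1 assembly.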

6.5 Batch Download

RGD data can be downloaded through the Table Browser and the Download page.