2.1.1.1. Introduction to G-OnRamp Walkthrough

2.1.1.1.1. Introduction

The overall goal of this walkthrough is to illustrate the use of the G-OnRamp workflow to create a genome browser. The walkthrough will cover topics including how to create an account and login, how to import the G-OnRamp workflow into your workspace, how to modify the workflow by changing tool parameters and adding new tools, and how to run the workflow. This walkthrough assumes that the reader has some basic familiarity with Galaxy. (See the “Overview of Galaxy” presentation for a detailed introduction to Galaxy).

2.1.1.1.2. Login

Start the G-OnRamp virtual image and start the Galaxy server. Click on the link http://192.168.56.11:8080/gonramp/ to navigate to the G-OnRamp instance.

First of all, you need to log into your Galaxy account in order to access the full functionality of Galaxy. First click on “User” at the menu bar, and then choose “Login” (Fig. 2.1). Then enter your username and password (Fig. 2.2).

  • username: galaxyadmin
  • password: 12341234
../_images/Login1.png

Fig. 2.1 Open the login form from the “User” menu at the main menu bar

../_images/Login2.png

Fig. 2.2 Enter the email address that you used to create your Galaxy account and your account password to log into Galaxy

2.1.1.1.3. Import G-OnRamp workflow

The G-OnRamp workflow has been shared with you in the “Shared Data”. Click on “Shared Data” in the top menu bar and choose “Workflows” in the drop down menu (Fig. 2.3). Find the workflow called “G-OnRamp” in the published workflows and click on the down arrow on the right side. Choose “import” to import the G-OnRamp workflow into your Galaxy account (Fig. 2.4).

../_images/workflow.png

Fig. 2.3 Find the G-OnRamp workflow at “Shared Data” in the top menu bar

../_images/import-workflow.png

Fig. 2.4 Click on the down arrow on the right side and import the workflow

You can access the imported workflow via the “Workflow” menu item on the menu bar. The “imported: G-OnRamp” workflow will appear under the “Your workflows” section. When you right click (control-click on the Mac) on the workflow or click on the down arrow, a context-sensitive menu will appear where you can edit, run, share or download, copy, rename, view, and delete the workflow (Fig. 2.5).

../_images/workflow2.png

Fig. 2.5 Context-sensitive menu that specifies the different ways that you can manipulate the imported workflow

Enter “G-OnRamp: D. biarmipes F element” into the “Workflow Name” field and then click on “Rename” to change the name of the workflow (Fig. 2.6). Go back to the home page.

../_images/rename.png

Fig. 2.6 Rename the workflow

2.1.1.1.4. Run the G-OnRamp workflow

2.1.1.1.4.1. Upload your datasets

The test datasets that we will use in this walkthrough are available at the “Data Libraries” within the “Shared Datasets” (Fig. 2.7). Click on the folder “Dbia3”. There are four test datasets in the folder: the reference genome sequence from the Drosophila biarmipes Muller F element, a collection of Drosophila melanogaster protein sequences, and the forward and reverse paired-end reads from contig16 of D. biarmipes RNA-Seq data. Select all the datasets in the foder(Fig. 2.8). Import them to a new History, which you can name it “Dbia3” (Fig. 2.9).

../_images/data-lib.png

Fig. 2.7 Go to “Shared Data” and choose “Data Libraries” from the drop down menu

../_images/import-data.png

Fig. 2.8 Select all the datasets in the Dbia3 folder

../_images/import-data2.png

Fig. 2.9 Create a new History named Dbia3 and import the datasets to the new History

Once all of your datasets have been import to History, you can use the G-OnRamp workflow to analyze your datasets. However, before running the G-OnRamp workflow, you need to verify that Galaxy has assigned the correct data format to each of the file. In some cases, Galaxy incorrectly assigned the “dmel-all-translation-r6.11.fasta” file to the sdf format instead of the fasta format. Therefore, you need to manually edit its datatype by clicking on “pencil” icon at the top right corner of this dataset (Fig. 2.10). You can skip this datatype editing by going directly to Run the workflow if all your files have been assigned the correct data format.

../_images/datatype.png

Fig. 2.10 Edit the attributes of the dataset by clicking on the “pencil” button

Click on the Datatype tab, click on the down arrow under the “New Type” field, and then select “fasta” (Fig. 2.11). Don’t forget to click on the “Save” button (Fig. 2.12).

../_images/datatype2.png

Fig. 2.11 Change the datatype of the dmel-all-translation-r6.11.fasta dataset to fasta

../_images/datatype3.png

Fig. 2.12 Save the new datatype for the dmel-all-translation-r6.11.fasta dataset

2.1.1.1.4.2. Run the workflow

Click on the “Workflow” menu on the menu bar. Click on the “G-OnRamp:D. biarmipes F element” workflow and then select “Run” from the drop-down menu to run the workflow (Figure 15).

../_images/run-workflow.png

Fig. 2.13 Click on the drop-down menu for the “G-OnRamp: D. biarmipes F element” workflow to run the workflow

The workflow consists of 16 different steps. In order to run the workflow, you need to specify the datasets that should be used in Step 1 to Step 4 (Fig. 2.14). Select the reference genome for Step 1. Select the protein sequence(s) that you would like to align against the assembly in Step 2. Select the forward and reverse reads from your RNA-Seq experiments for Step 3 and Step 4.

../_images/specify-data.png

Fig. 2.14 Specify the datasets that should be used in Step 1 to Step 4

You can also use this form to examine and edit the parameter settings for each step of the workflow. Let’s run the workflow with the default settings this time. We will cover how to modify the G-OnRamp workflow in a subsequent walkthrough. (See the “Customize the Genome Browsers produced by G-OnRamp” walkthrough for details.)

Finally, scroll down to the bottom of the page. Check the box to send the results to a new History and then click on the “Run workflow” button to start the analysis (Fig. 2.15) .

../_images/run-workflow2.png

Fig. 2.15 Click on the “Run workflow” button to start the analysis

2.1.1.1.4.3. View the results

After all the steps in the G-OnRamp workflow are complete (which will take a few minutes), you can view the genome browser for the D. biarmipes F element by expanding the “Hub Archive Creator” step (i.e. Step 18) and then click on the “main” link next to the “display at Track Hub UCSC” field (Fig. 2.16).

../_images/main.png

Fig. 2.16 View results on the UCSC genome browser

Because the RNA-Seq dataset that we used in this analysis only contains the RNA-Seq reads that mapped to contig16, we will examine this contig using the UCSC Genome Browser. Enter “contig16” into the “Position/Search Term” field and then click on the GO button (Fig. 2.17). Finally, you can then see the results of the G-OnRamp workflow as different evidence tracks on the UCSC genome browser (Fig. 2.18).

../_images/contig16.png

Fig. 2.17 Specify the contig number in the “Position/Search Term” field and then click on GO

../_images/browser.png

Fig. 2.18 View the genome assembly and the evidence tracks produced by the Hub Archive Creator on the UCSC genome browser