-
Notifications
You must be signed in to change notification settings - Fork 6
Create beerDEcoded.yaml #7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
PlushZ
wants to merge
7
commits into
StreetScienceCommunity:main
Choose a base branch
from
PlushZ:tourtest
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Draft
Changes from 1 commit
Commits
Show all changes
7 commits
Select commit
Hold shift + click to select a range
8b2621a
Create beerDEcoded.yaml
PlushZ 464a318
Update beerDEcoded.yaml
PlushZ 582a990
Update beerDEcoded.yaml
PlushZ fd93de0
Update beerDEcoded.yaml
PlushZ 196ae73
Update beerDEcoded.yaml
PlushZ 9f102af
Create readme
PlushZ 53a8253
Add files via upload
PlushZ File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,24 +1,13 @@ | ||
id: beerDEcoded | ||
name: BeerDEcoded - StreetScienceCommunity | ||
description: BeerDEcoded - StreetScienceCommunity | ||
description: Workflow finds yeast strains contained in a sequenced beer sample. | ||
title_default: <b>BeerDEcoded</b> | ||
steps: | ||
- title: <b>BeerDEcoded - StreetScienceCommunity</b> | ||
content: >- | ||
What is a beer microbiome? There are collections of small living | ||
creatures. These small creatures are called bacteria and they are | ||
everywhere. In our gut, in the soil, even on vending machines. Some of | ||
these bacteria are actually very good for us. And some others can make us | ||
very ill. Bacteria come in different shapes and sizes but they have the | ||
same components. One crucial component is the DNA, the blueprint of life. | ||
The DNA encodes the shape and size and many other things unique for a | ||
bacterial species. Because of the encoding information the DNA can be used | ||
to identify what kind of bacteria the DNA is from. Therefore, within a | ||
sample form soil, our gut or beer one can specify what kind of species are | ||
inside the sample. Follow this tour to learn more about this kind of | ||
analysis. <br><br> Read and Follow the instructions before clicking | ||
<b>'Next'</b>.<br><br> Click <b>'Prev'</b> in case you missed out on any | ||
step." | ||
Beer contains DNA that comes from its ingredients (hops, malts, yeast) and some hundred microbes. There are 1,000+ yeasts used for brewing and 200+ hop varieties, each one bearing a different DNA and contributing to differentiate its properties.<br> | ||
It is easier now to explore the potential of genome sequencing to understand the contribution of various species to product characteristics. The sequencing of the full genome of 157 brewing yeast strains was, for example, recently reported (<a href="https://pubmed.ncbi.nlm.nih.gov/27610566/">Gallone B, Steensels J, Prahl T, et al. 2016</a>).<br> | ||
Based on the identification of strains present in beer with desired characteristics, controlled experiments in which the microbial composition of the brew is altered could allow us to investigate if the presence of specific microorganisms affects flavour. The origin of each yeast species can be investigated; i.e. whether they come with the ingredients or from the environment at the production site. Furthermore, plant DNA, such as malt and hop varieties, can be found in beer samples, and the bacterial diversity can be mapped. | ||
backdrop: true | ||
- title: <b>Create a new history</b> | ||
element: '#history-new-button' | ||
|
@@ -33,7 +22,7 @@ steps: | |
intro: Change the name of the new history to <b>'BeerDEcoded'</b>. | ||
position: left | ||
- title: <b>Uploading the input data</b> | ||
content: We need to upload data. We will start with a <b>fastq</b> file. | ||
content: We need to upload data. We will start with a <b>fastq</b> file.<br>FASTQ format is a text-based format for storing both a biological sequence and its corresponding quality scores. Both the sequence letter and the quality score are encoded with a single ASCII character for brevity.<br>A FASTQ file normally uses four lines per sequence.<br><ul type="circle"> <li>Line 1 begins with a '@' character and is followed by a sequence identifier and an optional description.</li><li>Line 2 is the raw sequence letters.</li><li>Line 3 begins with a '+' character and is optionally followed by the same sequence identifier (and any description) again.</li><li>Line 4 encodes the quality values for the sequence in Line 2, and must contain the same number of symbols as letters in the sequence.</li> </ul><br>You can find more info in the <a href="https://en.wikipedia.org/wiki/FASTQ_format">Wikipedia article.</a> | ||
backdrop: true | ||
- title: <b>Uploading the input data</b> | ||
element: .upload-button | ||
|
@@ -56,22 +45,15 @@ steps: | |
The upload may take awhile.<br><br> Hit the <b>Close</b> button when you | ||
see that the files are uploaded into your history. | ||
- title: <b>Uploading the input data Complete !</b> | ||
content: 'Now that your data is ready, let''s use some tools.<br>' | ||
content: 'Now that your data is ready, let''s analyze it!<br>' | ||
backdrop: true | ||
- title: <b>Assign taxonomic classifications</b> | ||
content: >- | ||
One of the key steps in metagenomic data analysis is to identify the taxon | ||
to which the individual reads belong. Taxonomic classification tools are | ||
based on microbial genome databases to identify the origin of each | ||
sequence.<br> | ||
One of the key steps in metagenomic data analysis is to identify the taxon to which the individual read belongs. Taxonomic classification tools are using microbial genome databases to identify the origin of each sequence.<br> | ||
backdrop: true | ||
- title: <b>Taxonomic classification with Kraken2</b> | ||
content: >- | ||
To perform the taxonomic classification we will use Kraken2. This tool | ||
uses the minimizer method to sample the k-mers (all the read’s | ||
subsequences of length k) in a deterministic fashion in order to reduce | ||
memory constumption and processing time. In addition, it masks | ||
low-complexity sequences from reference sequences by using dustmasker.<br> | ||
To perform the taxonomic classification we will use Kraken2. This tool uses <a href="https://en.wikipedia.org/wiki/K-mer">k-mers</a> (the read’s subsequences of length k) to assign a taxonomic label to the sequence (if possible).<br>The taxonomic label is assigned based on matches of k-mer content of the considering sequence to the k-mer content of reference genome sequence. The result is a classification of the considering sequence to the most likely taxonomic label. If the k-mer content is not similar to any genomic sequence in the database used, it will not assign any taxonomic label.<br> | ||
backdrop: true | ||
- title: <b>Taxonomic classification with Kraken2</b> | ||
element: .toolMenuContainer | ||
|
@@ -104,7 +86,8 @@ steps: | |
file'</b>: Yes<br> <b>'Format report output like | ||
Kraken 1’s kraken-mpa-report'</b>: Yes<br> <b>'Select a Kraken2 | ||
database'</b>: fungi2019-03' <br><br>Click <b>'Next'</b> and the tour will | ||
<b>'Execute'</b> the Kraken2 tool for you.' | ||
<b>'Execute'</b> the Kraken2 tool for you.'<br><br><br> | ||
<small>Additional info:<br><b>Parameter ‘Single or paired reads’</b><br>Single-end reads are the fragments sequenced from one side. With paired-end sequencing, the fragments are sequenced from both sides. This approach results in two reads per fragment, with the first read in forward orientation and the second read in reverse-complement orientation. With this technique, we have the advantage to get more information about each DNA fragment compared to reads sequenced by only single-end sequencing<br><b>Parameter ‘Confidence'</b><br>A confidence score of 0.0 means that non-restrictive taxonomic assignation is desired. This value can be increased if a more restrictive taxonomic assignation is desired. For example, a confidence score of 0.1 means that at least 10% of the k-mers should match entries in the database. <br><b>Parameter ‘Select a Kraken2 database’</b><br>We need to identify the taxon to which the individual reads belong. To identify the origin of each sequence, taxonomic classification tools use microbial genome databases. For this tutorial, we will use the <i>fungi2019-03</i> database.<small/><br> | ||
|
||
postclick: | ||
- '#execute' | ||
- title: <b>Assign taxonomic labels with Kraken2</b> | ||
|
@@ -125,8 +108,7 @@ steps: | |
- title: <b>Adjust dataset format</b> | ||
element: '#__BVID__106' | ||
intro: >- | ||
But before that, we need to adjust the format of the data output from | ||
Kraken2.<br><br> Search and select the <b>'Reverse'</b> tool. | ||
It can happen, that the output format of the tool needs to be changed in order for the next tool to read the data.Galaxy offers several "manipulation" tools.<br><br>The format of the file created after executing Kraken2 has a tabular format with one column and one line per taxon. Every line contains the information divided by | symbol. In order to make it more readable and usable by next tools we need a tab-delimited format with one line per taxon. The fields of the output, from left-to-right, should be as follows:1) Number of fragments assigned directly to this taxon; 2) A rank code, indicating (U)nclassified, (R)oot, (D)omain, (K)ingdom, (P)hylum, (C)lass, (O)rder, (F)amily, (G)enus, or (S)pecies.<br><br>Here we need to adjust the format of the data output from Kraken2. <br><br> Search and select the <b>'Reverse'</b> tool. | ||
position: right | ||
- title: <b>Adjust dataset format</b> | ||
intro: >- | ||
|
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
idea maybe the introduction could contain a bit more info about the beer microbiome. Like what microorganisms can be found in a beer. Why are they important. A more general introduction is hopefully already done before and then we could have some repetition issues here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suggest this introduction for the tour:
"Beer contains DNA that comes from its ingredients (hops, malts, yeast) and some hundred microbes. There are 1,000+ yeasts used for brewing and 200+ hop varieties, each one bearing a different DNA and contributing to differentiate its properties.
Thanks for new technologies of sequencing it is easier nowadays to explore the potential of genome sequencing to understand the contribution of various species to product characteristics. The sequencing of the full genome of 157 brewing yeast strains was, for example, recently reported (Gallone B, Steensels J, Prahl T, et al. 2016).
Based on the identification of strains present in beer with desired characteristics, controlled experiments in which the microbial composition of the brew is altered could allow us to investigate if the presence of specific microorganisms affects flavour. The origin of each yeast species can be investigated; i.e. whether they come with the ingredients or from the environment at the production site. Furthermore, plant DNA, such as malt and hop varieties, can be found in beer samples, and the bacterial diversity can be mapped.
"
Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In general: sounds very good. Maybe break it up into two boxes, so it is not too much text in one.
Some hints:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What if we add also information about 3 steps of the process:

"The beerDeCoded process contains 3 consistent steps. The first step is DNA extraction from beer. Then, this DNA can be sequenced. That means that we can obtain the sequence of nucleotides for this specific DNA. Finally, we have to analyze received data in order to know which organisms this DNA is from." And add the process diagram from https://streetscience.community/projects/beerdecoded/
And point out that we are currently on the third step?
Or this would be redundant information?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That is a good idea.
In general: It would be good if each box of the tour gets a title, explaining the content of the box. Like this, it will be easy for us to see the flow of the tour.
To this box: It would be good to connect it to the explanation before. E.g. To study the microbiome of beer you need to find out what is inside the beer. Getting this inside can be found by extracting the DNAs of the living organisms (yeast) inside the beer. Now you would also like to 'read' this DNA. This can be achieved by sequencing the DNA.
Having this sequences now enabels us to do a Data analysis, which we will do in the following.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done