Collection of datasets containing the extension to SeqENV pipeline, Illumina sequencing data from sugarcane including subhabitats from soil, rhizosphere, stem and root and two marine datasets and analysis results
  • Description

    This dataset contains an extension to the SeqENV pipeline (Sinclair 2016), including its source code and associated toolkit. Preprocessed FASTA files from the sugarcane habitat are included, covering samples from the soil, rhizosphere, root and stem sub-habitats, along with FASTA files from two marine sub-habitats. Taxonomic annotation was performed on the FASTA files from both habitats using the TaxaSE pipeline. Using the included toolkit, datasets consisting of randomly selected sequences were generated and analysed in the SeqENV pipeline. Finally, the extension was used to generate Per Term Taxa Abundance and Per Taxa Term Abundance results.

    Sugarcane leaf, stalk, root and rhizosphere soil samples were collected by Dr. Kelly Hamonts of the Hawkesbury Institute for the Environment, Western Sydney University, Australia, in November 2014 from eight sugarcane fields growing three sugarcane varieties (KQ228, MQ239 and Q240) near Ingham, Queensland, Australia. In each field, 3 stools were randomly selected and samples were collected from 2 plants per stool. Samples were snap-frozen in liquid nitrogen in the field, transported to the laboratory on dry ice and stored at -80 °C.

    Frozen sugarcane tissue samples were ground with a mortar and pestle, and DNA was extracted from the resulting powder using the MoBio PowerPlant DNA extraction kit, following the manufacturer’s instructions. The MoBio PowerSoil DNA extraction kit was used to extract DNA from the soil samples. Bacterial 16S rRNA amplicon sequencing was performed by the NGS facility at Western Sydney University using Illumina MiSeq (2 × 301 bp, paired-end) and the 341F/805R primer set. The marine datasets came from Jeffries et al. (2015).

    Data can be downloaded from the link provided in this record. Supplemental information associated with the article "Extending SEQenv: a taxa-centric approach to environmental annotations of 16S rDNA sequences" is available in the Attachments section.
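
The description mentions datasets of randomly selected sequences generated from the FASTA files. The toolkit's actual implementation is not shown in this record, so the following is only a minimal sketch of how such a random subset could be drawn in Python. The function names (`read_fasta`, `random_subset`, `format_fasta`), the `seed` parameter and the example sequences are all hypothetical and are not taken from the toolkit.

```python
import random


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA text lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def random_subset(records, n, seed=None):
    """Draw n records without replacement (assumed sampling scheme).

    If fewer than n records are available, all of them are returned.
    """
    records = list(records)
    if n >= len(records):
        return records
    return random.Random(seed).sample(records, n)


def format_fasta(records, width=80):
    """Serialise (header, sequence) pairs as FASTA text, wrapping sequences."""
    out = []
    for header, seq in records:
        out.append(">" + header)
        out.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "".join(line + "\n" for line in out)


if __name__ == "__main__":
    # Hypothetical in-memory example; real input would be one of the FASTA files.
    text = ">seq1\nACGTACGT\n>seq2\nGGCC\n>seq3\nTTAA\n"
    subset = random_subset(read_fasta(text.splitlines()), 2, seed=42)
    print(format_fasta(subset), end="")
```

Passing a fixed `seed` makes a given subset reproducible, which helps when the same random dataset needs to be analysed more than once.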


    • Data type dataset
    • Keywords
      • NGS
      • Illumina
      • Taxonomy
      • Annotation
      • Pipeline
      • Community analysis
      • Saccharum spp.
    • Funding source
      • Western Sydney University and CRC-CARE
    • Grant number(s)
      • -
    • Temporal (time) coverage
      • Start date 2013/02/01
      • End date 2017/02/28
    • Related publications
      • Name Seqenv: linking sequences to environments through text mining. DOI: 10.7717/peerj.2690
        • URL http://doi.org/10.7717/peerj.2690
      • Name Extending SEQenv: a taxa-centric approach to environmental annotations of 16S rDNA sequences. DOI: 10.7717/peerj.3827
        • URL https://doi.org/10.7717/peerj.3827
      • Name Taxonomic and Environmental Annotation of Bacterial 16S rDNA sequences via Shannon Entropy and Database Metadata Terms
        • URL http://hdl.handle.net/1959.7/uws:47536
        • Notes Thesis (Ph.D.), Western Sydney University, 2017
      • Name Jeffries, TC, Ostrowski, M, Williams, RB, Xie, C, Jensen, RM et al. 2015, ‘Spatially extensive microbial biogeography of the Indian Ocean provides insights into the unique community structure of a pristine coral atoll’, Scientific Reports, vol. 5, article no. 15383.
        • URL https://doi.org/10.1038/srep15383
    • Citation Ijaz, Ali; Jeffries, Thomas; Hamonts, Kelly (2017): Collection of datasets containing the extension to SeqENV pipeline, Illumina sequencing data from sugarcane including subhabitats from soil, rhizosphere, stem and root and two marine datasets and analysis results. Western Sydney University. https://doi.org/10.4225/35/59224ad317fdc