db.rna16s
is a curated database of 16S sequences, obtained directly form db.rnacentral. This package contains code to filter the data from RNACentral releases, as well as pointers to the location of the data.
For each supported version of db.rnacentral, a single FASTA is available, containing a subset of the RNACentral sequences that are identified as 16S.
All the data in db.rna16s
is versioned following the RNACentral releases number scheme.
Each of these versions is encoded as an object that extends the sealed class Version
.
The Set
Version.all
contains all the releases supported and maintained through db.rna16s
.
The module db.rna16s.data
contains the pointers to the S3 objects where the actual data is stored. The paths of the S3 objects corresponding to the FASTA file and mappings file, respectively, can be accessed evaluating the following methods over a Version
object:
sequences : Version => S3Object
mappings : Version => S3Object
The path to the S3 object returned by those functions look something like the following:
s3://resources.ohnosequences.com/ohnosequences/db/rna16s/<version>/rna16s.fa
s3://resources.ohnosequences.com/ohnosequences/db/rna16s/<version>/mappings
- The code which generates the database is licensed under the AGPLv3 license
- The database itself is made available under the ODbLv1 license.
- The database contents are available under their respective licenses. As far as we can tell all data included in db.rna16s could be considered free for any use; do note that sequences and annotations coming from SILVA, which has a restrictive license, are excluded from db.rna16s.
See the open data commons FAQ for more on this distinction between database and contents.