首页 > sliva数据库简介--转载

sliva数据库简介--转载

sliva rRNA数据库(http://www.arb-silva.de/)用来检查和比对RNA序列,既可以针对16S/18S,SSU,也可以针对23S/28S, LSU,包括了Bacteria, Archaea and Eukarya。同时也是ARB的官方指定数据库。

LSU: Large subunit (23S/28Sribosomal RNAs)

SSU: Small subunit (16S/18Sribosomal RNAs)

1.下载

1.1 针对arb的下载

到目前(2015.2.4,最新的数据库为Realease119,网页版的已经到121版本了,但是现在不提供下载)

下载介绍http://www.arb-silva.de/download/arb-files/

下载地址:http://www.arb-silva.de/no_cache/download/archive/release_119/ARB_files/

我选择的是其中的 RefNR 99,他是Ref 119的无冗余版本。

wget –c 

http://www.arb-silva.de/fileadmin/silva_databases/release_119/ARB_files/SSURef_NR99_119_SILVA_14_07_14_opt.arb.tgz;

md5sum SSURef_NR99_119_SILVA_14_07_14_opt.arb.tgz;

wget –c ;http://www.arb-silva.de/fileadmin/silva_databases/release_119/ARB_files/SSURef_NR99_119_SILVA_14_07_14_opt.arb.tgz

 

1.2 仅仅是下载fasta文件

下载地址:http://www.arb-silva.de/no_cache/download/archive/release_119/Exports/

根据下载的需求,选择针对23S/28Sribosomal RNAs的LSU或者是针对16S/18Sribosomal RNAs的SSU;然后选择是否去冗余的,我选择去,即Nr99;然后选择是否trunc,即是否对名字缩写;选择是否全长比对结果;

*_tax_silva.fasta.gz

-----------------

Multi FASTA files of the SSU/LSU databases including the SILVAtaxonomy for

Bacteria, Archaea and Eukaryotes in the header.

REMARK: The sequences in the files are NOT truncated to theeffective LSU or

SSU genes. They contain the full entries as they have been depositedin the

public repositories (ENA/GenBank/DDBJ).

 

Fasta header:

>accession_number.start_position.stop_position taxonomic pathorganism name

 

*_tax_silva_full_align_trunc.fasta.gz

-----------------------

Multi FASTA files of the SSU/LSU databases including the SILVAtaxonomy for

Bacteria, Archaea and Eukaryotes in the header (including the FULLalignment).

REMARK: Sequences in these files haven been truncated. This meansthat all

nucleotides that have not been aligned were removed from thesequence.

 

*_tax_silva_trunc.fasta.gz

-----------------------

Multi FASTA files of the SSU/LSU database including the SILVAtaxonomy for

Bacteria, Archaea and Eukaryotes in the header.

REMARK: Sequences in these files haven been truncated. This meansthat all

nucleotides that have not been aligned were removed from thesequence.

 

生成使用与Mothur的silva数据库:http://blog.mothur.org/2014/08/08/SILVA-v119-reference-files/

 

文献:

Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, PepliesJ, Glöckner FO (2013) The SILVA ribosomal RNA gene database project: improveddata processing and web-based tools. Opens external link in new windowNucl.Acids Res. 41 (D1): D590-D596

转载于:https://www.cnblogs.com/nkwy2012/p/6396427.html

更多相关:

  • 上篇笔记中梳理了一把 resolver 和 balancer,这里顺着前面的流程走一遍入口的 ClientConn 对象。ClientConn// ClientConn represents a virtual connection to a conceptual endpoint, to // perform RPCs. // //...

  • 我的实验是基于PSPNet模型实现二维图像的语义分割,下面的代码直接从得到的h5文件开始往下做。。。 也不知道是自己的检索能力出现了问题还是咋回事,搜遍全网都没有可以直接拿来用的语义分割代码,东拼西凑,算是搞成功了。 实验平台:Windows、VS2015、Tensorflow1.8 api、Python3.6 具体的流程为:...

  • Path Tracing 懒得翻译了,相信搞图形学的人都能看得懂,2333 Path Tracing is a rendering algorithm similar to ray tracing in which rays are cast from a virtual camera and traced through a s...

  • configure_file( [COPYONLY] [ESCAPE_QUOTES] [@ONLY][NEWLINE_STYLE [UNIX|DOS|WIN32|LF|CRLF] ]) 我遇到的是 configure_file(config/config.in ${CMAKE_SOURCE_DIR}/...

  •     直接复制以下代码创建一个名为settings.xml的文件,放到C:UsersAdministrator.m2下即可  http...

  • 首先运行easy_install pymongo命令安装pymongo驱动。然后执行操作: 创建连接 1 In [1]: import pymongo 2 3 In [2]: connection = pymongo.Connection('localhost', 27017) 切换到数据库malware In [3]: db...

  • 代码: public class Person{public int ID { get; set; }public string Name { get; set; }public int Age { get; set; } }public class Dog{public int ID { get; set; }...