NIF | Searching in Community Resources

NEXTDB

RRID:SCR_004480

http://nematode.lab.nig.ac.jp/

Expression pattern map of the 100Mb genome of the nematode Caenorhabditis elegans through EST analysis and systematic whole mount in situ hybridization. NEXTDB is the database to integrate all information from their expression pattern project and to make the data available to the scientific community. Information available in the current version is as follows: * Map: Visual expression of the relationships among the cosmids, predicted genes and the cDNA clones. * Image: In situ hybridization images that are arranged by their developmental stages. * Sequence: Tag sequences of the cDNA clones are available. * Homology: Results of BLASTX search are available. Users of the data presented on our web pages should not publish the information without our permission and appropriate acknowledgment. Methods are available for: * In situ hybridization on whole mount embryos of C.elegans * Protocols for large scale in situ hybridization on C.elegans larvae

Proper citation: NEXTDB (RRID:SCR_004480) Copy

Source: SciCrunch Registry

SOrt-ITEMS

RRID:SCR_004716

http://metagenomics.atc.tcs.com/binning/SOrt-ITEMS/

Sequence orthology based software for improved taxonomic estimation of metagenomic sequences., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.

Proper citation: SOrt-ITEMS (RRID:SCR_004716) Copy

Source: SciCrunch Registry

AbundanceBin

RRID:SCR_004648

http://omics.informatics.indiana.edu/AbundanceBin/

An abundance-based software tool for binning metagenomic sequences, such that the reads classified in a bin belong to species of identical or very similar abundances. AbundanceBin also gives estimations of species abundances and their genome sizes -two important characteristic parameters for a microbial community.

Proper citation: AbundanceBin (RRID:SCR_004648) Copy

Source: SciCrunch Registry

Phymm and PhymmBL

RRID:SCR_004751

http://www.cbcb.umd.edu/software/phymm/

Software for Phylogenetic Classification of Metagenomic Data with Interpolated Markov Models to taxonomically classify DNA sequences and accurately classify reads as short as 100 bp. PhymmBL, the hybrid classifier included in this distribution which combines analysis from both Phymm and BLAST, produces even higher accuracy.

Proper citation: Phymm and PhymmBL (RRID:SCR_004751) Copy

Source: SciCrunch Registry

GeneDB Tbrucei

RRID:SCR_004786

http://www.genedb.org/Homepage/Tbruceibrucei927

Database of the most recent sequence updates and annotations for the T. brucei genome. New annotations are constantly being added to keep up with published manuscripts and feedback from the Trypanosomatid research community. You may search by Protein Length, Molecular Mass, Gene Type, Date, Location, Protein Targeting, Transmembrane Helices, Product, GO, EC, Pfam ID, Curation and Comments, and Dbxrefs. BLAST and other tools are available. T. brucei possesses a two-unit genome, a nuclear genome and a mitochondrial (kinetoplast) genome with a total estimated size of 35Mb/haploid genome. The nuclear genome is split into three classes of chromosomes according to their size on pulsed-field gel electrophoresis, 11 pairs of megabase chromosomes (0.9-5.7 Mb), intermediate (300-900 kb) and minichromosomes (50-100 kb). The T. brucei genome contains a ~0.5Mb segmental duplication affecting chromosomes 4 and 8, which is responsible for some 75 gene duplicates unique to this species. A comparative chromosome map of the duplicons can be accessed here (PubmedID 18036214). Protozoan parasites within the species Trypanosoma brucei are the etiological agent of human sleeping sickness and Nagana in animals. Infections are limited to patches of sub-Saharan Africa where insects vectors of the Glossina genus are endemic. The most recent estimates indicate between 50,000 - 70,000 human cases currently exist, with 17 000 new cases each year (WHO Factsheet, 2006). In collaboration with GeneDB, the EuPathDB genomic sequence data and annotations are regularly deposited on TriTrypDB where they can be integrated with other datasets and queried using customized queries.

Proper citation: GeneDB Tbrucei (RRID:SCR_004786) Copy

Source: SciCrunch Registry

NCBI Sequence Read Archive (SRA)

RRID:SCR_004891

http://www.ncbi.nlm.nih.gov/sra

Repository of raw sequencing data from next generation of sequencing platforms including including Roche 454 GS System, Illumina Genome Analyzer, Applied Biosystems SOLiD System, Helicos Heliscope, Complete Genomics, and Pacific Biosciences SMRT. In addition to raw sequence data, SRA now stores alignment information in form of read placements on reference sequence. Data submissions are welcome. Archive of high throughput sequencing data,part of international partnership of archives (INSDC) at NCBI, European Bioinformatics Institute and DNA Database of Japan. Data submitted to any of this three organizations are shared among them.

Proper citation: NCBI Sequence Read Archive (SRA) (RRID:SCR_004891) Copy

Source: SciCrunch Registry

cortex var

RRID:SCR_005081

http://cortexassembler.sourceforge.net/index_cortex_var.html

A tool for genome assembly and variation analysis from sequence data. You can use it to discover and genotype variants on single or multiple haploid or diploid samples. If you have multiple samples, you can use Cortex to look specifically for variants that distinguish one set of samples (eg phenotype=X, cases, parents, tumour) from another set of samples (eg phenotype=Y, controls, child, normal). cortex_var features * Variant discovery by de novo assembly - no reference genome required * Supports multicoloured de Bruijn graphs - have multiple samples loaded into the same graph in different colours, and find variants that distinguish them. * Capable of calling SNPs, indels, inversions, complex variants, small haplotypes * Extremely accurate variant calling - see our paper for base-pair-resolution validation of entire alleles (rather than just breakpoints) of SNPs, indels and complex variants by comparison with fully sequenced (and finished) fosmids - a level of validation beyond that demanded of any other variant caller we are aware of - currently cortex_var is the most accurate variant caller for indels and complex variants. * Capable of aligning a reference genome to a graph and using that to call variants * Support for comparing cases/controls or phenotyped strains * Typical memory use: 1 high coverage human in under 80Gb of RAM, 1000 yeasts in under 64Gb RAM, 10 humans in under 256 Gb RAM

Proper citation: cortex var (RRID:SCR_005081) Copy

Source: SciCrunch Registry

European Genome phenome Archive

RRID:SCR_004944

http://www.ebi.ac.uk/ega/

Web service for permanent archiving and sharing of all types of personally identifiable genetic and phenotypic data resulting from biomedical research projects. The repository allows you to explore datasets from numerous genotype experiments, supplied by a range of data providers. The EGA''s role is to provide secure access to the data that otherwise could not be distributed to the research community. The EGA contains exclusive data collected from individuals whose consent agreements authorize data release only for specific research use or to bona fide researchers. Strict protocols govern how information is managed, stored and distributed by the EGA project. As an example, only members of the EGA team are allowed to process data in a secure computing facility. Once processed, all data are encrypted for dissemination and the encryption keys are delivered offline. The EGA also supports data access only for the consortium members prior to publication.

Proper citation: European Genome phenome Archive (RRID:SCR_004944) Copy

Source: SciCrunch Registry

NCBI Popset

RRID:SCR_005049

http://www.ncbi.nlm.nih.gov/popset

Database containing a set of DNA sequences that have been collected to analyse the evolutionary relatedness of a population. The population could originate from different members of the same species, or from organisms from different species. Users may submit a Popset using Sequin.

Proper citation: NCBI Popset (RRID:SCR_005049) Copy

Source: SciCrunch Registry

M-pick

RRID:SCR_004995

http://plaza.ufl.edu/xywang/Mpick.htm

A modularity-based clustering software for Operational Taxonomic Unit (OTU) picking of 16S rRNA sequences. The algorithm does not require a predetermined cut-off level, and our simulation studies suggest that it is superior to existing methods that require specified distance or variance levels to define OTUs.

Proper citation: M-pick (RRID:SCR_004995) Copy

Source: SciCrunch Registry

Classifier for Metagenomic Sequences

RRID:SCR_004929

http://clams.jgi-psf.org/

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on May 2nd, 2023. Sequence composition based classifier for metagenomic sequences. It works by capturing signatures of each sequence based on the sequence composition. Each sequence is modeled as a walk in a de Bruijn graph with underlying Markov chain properties. ClaMS captures stationary parameters of the underlying Markov chain as well as structural parameters of the underlying de Bruijn graph to form this signature. In practice, for each sequence to binned, such a signature is computed and matched to similar signatures computed for the training sets. The best match that also qualifies the normalized distance cut-off wins. In the case that the best match does not qualify this cut-off, the sequence remains un-binned.

Proper citation: Classifier for Metagenomic Sequences (RRID:SCR_004929) Copy

Source: SciCrunch Registry

RetroSeq

RRID:SCR_005133

https://github.com/tk2/RetroSeq

A tool for discovery and genotyping of transposable element variants (TEVs) (also known as mobile element insertions) from next-gen sequencing reads aligned to a reference genome in BAM format. The goal is to call TEVs that are not present in the reference genome but present in the sample that has been sequenced. It should be noted that RetroSeq can be used to locate any class of viral insertion in any species where whole-genome sequencing data with a suitable reference genome is available. RetroSeq is a two phase process, the first being the read pair discovery phase where discorandant mate pairs are detected and assigned to a TE class (Alu, SINE, LINE, etc.) by using either the annotated TE elements in the reference and/or aligned with Exonerate to the supplied library of viral sequences.

Proper citation: RetroSeq (RRID:SCR_005133) Copy

Source: SciCrunch Registry

SeqAnt

RRID:SCR_005186

http://seqant.genetics.emory.edu/

A free web service and open source software package that performs rapid, automated annotation of DNA sequence variants (single base mutations, insertions, deletions) discovered with any sequencing platform. Variant sites are characterized with respect to their functional type (Silent, Replacement, 5' UTR, 3' UTR, Intronic, Intergenic), whether they have been previously submitted to dbSNP, and their evolutionary conservation. Annotated variants can be viewed directly on the web browser, downloaded in a tab delimited text file, or directly uploaded in a Browser Extended Data (BED) format to the UCSC genome browser. SeqAnt further identifies all loci harboring two or more coding sequence variants that help investigators identify potential compound heterozygous loci within exome sequencing experiments. In total, SeqAnt resolves a significant bottleneck by allowing an investigator to rapidly prioritize the functional analysis of those variants of interest.

Proper citation: SeqAnt (RRID:SCR_005186) Copy

Source: SciCrunch Registry

BWA

RRID:SCR_010910

http://bio-bwa.sourceforge.net/

Software for aligning sequencing reads against large reference genome. Consists of three algorithms: BWA-backtrack, BWA-SW and BWA-MEM. First for sequence reads up to 100bp, and other two for longer sequences ranged from 70bp to 1Mbp.

Proper citation: BWA (RRID:SCR_010910) Copy

Source: SciCrunch Registry

GeneTack

RRID:SCR_011953

http://topaz.gatech.edu/GeneTack/cgi/print_page.cgi?fn=db_home.html&title=Frameshift%20Database

Tools for frameshift prediction and a frameshift database.

Proper citation: GeneTack (RRID:SCR_011953) Copy

Source: SciCrunch Registry

JiffyNet

RRID:SCR_011954

http://www.jiffynet.org/

Web based instant protein network modeler for newly sequenced species. Web server designed to instantly construct genome scale protein networks using protein sequence data. Provides network visualization, analysis pages and solution for instant network modeling of newly sequenced species.

Proper citation: JiffyNet (RRID:SCR_011954) Copy

Source: SciCrunch Registry

MODELLER

RRID:SCR_008395

http://salilab.org/modeller/modeller.html

Software tool as Program for Comparative Protein Structure Modelling by Satisfaction of Spatial Restraints. Used for homology or comparative modeling of protein three dimensional structures. User provides alignment of sequence to be modeled with known related structures and MODELLER automatically calculates model containing all non hydrogen atoms.

Proper citation: MODELLER (RRID:SCR_008395) Copy

Source: SciCrunch Registry

G protein receptor interaction feature finding instrument

RRID:SCR_008343

http://griffin.cbrc.jp/

Griffin (G-protein-receptor interacting feature finding instrument) is a high-throughput system to predict GPCR - G-protein coupling selectively with the input of GPCR sequence and ligand molecular weight. This system consists of two parts: 1) HMM section using family specific multiple alignment of GPCRs, 2) SVM section using physico-chemical feature vectors in GPCR sequence. G-protein coupled receptors (GPCR), which is composed of seven transmembrane helices, play a role as interface of signal transduction. The external stimulation for GPCR, induce the coupling with G-protein (Gi/o, Gq/11, Gs, G12/13) followed by different kinds of signal transduction to inner cell. About half of distributed drugs are intending to control this GPCR - G-protein binding system, and therefore this system is important research target for the development of effective drug. For this purpose, it is necessary to monitor, effectively and comprehensively, of the activation of G-protein by identifying ligand combined with GPCR. Since, at present, it is difficult to construct such biochemical experiment system, if the answers for experimental results can be prepared beforehand by using bioinformatics techniques, large progress is brought to G-protein related drug design. Previous works for predicting GPCR-G protein coupling selectivity are using sequence pattern search, statistical models, and HMM representations showed high sensitivity of predictions. However, there are still no works that can predict with both high sensitivity and specificity. In this work we extracted comprehensively the physico-chemical parameters of each part of ligand, GPCR and G-protein, and choose the parameters which have strong correlation with the coupling selectivity of G-protein. These parameters were put as a feature vector, used for GPCR classification based on SVM.

Proper citation: G protein receptor interaction feature finding instrument (RRID:SCR_008343) Copy

Source: SciCrunch Registry

Distributed Annotation System

RRID:SCR_008427

http://www.biodas.org

The Distributed Annotation System (DAS) defines a communication protocol used to exchange annotations on genomic or protein sequences. It is motivated by the idea that such annotations should not be provided by single centralized databases, but should instead be spread over multiple sites. Data distribution, performed by DAS servers, is separated from visualization, which is done by DAS clients. The advantages of this system are that control over the data is retained by data providers, data is freed from the constraints of specific organisations and the normal issues of release cycles, API updates and data duplication are avoided. DAS is a client-server system in which a single client integrates information from multiple servers. It allows a single machine to gather up sequence annotation information from multiple distant web sites, collate the information, and display it to the user in a single view. Little coordination is needed among the various information providers. DAS is heavily used in the genome bioinformatics community. Over the last years we have also seen growing acceptance in the protein sequence and structure communities. A DAS-enabled website or application can aggregate complex and high-volume data from external providers in an efficient manner. For the biologist, this means the ability to plug in the latest data, possibly including a user''s own data. For the application developer, this means protection from data format changes and the ability to add new data with minimal development cost. Here are some examples of DAS-enabled applications or websites for end users: :- Dalliance Experimental Web/Javascript based Genome Viewer :- IGV Integrative Genome Viewer java based browser for many genomes :- Ensembl uses DAS to pull in genomic, gene and protein annotations. It also provides data via DAS. :- Gbrowse is a generic genome browser, and is both a consumer and provider of DAS. :- IGB is a desktop application for viewing genomic data. :- SPICE is an application for projecting protein annotations onto 3D structures. :- Dasty2 is a web-based viewer for protein annotations :- Jalview is a multiple alignment editor. :- PeppeR is a graphical viewer for 3D electron microscopy data. :- DASMI is an integration portal for protein interaction data. :- DASher is a Java-based viewer for protein annotations. :- EpiC presents structure-function summaries for antibody design. :- STRAP is a STRucture-based sequence Alignment Program. Hundreds of DAS servers are currently running worldwide, including those provided by the European Bioinformatics Institute, Ensembl, the Sanger Institute, UCSC, WormBase, FlyBase, TIGR, and UniProt. For a listing of all available DAS sources please visit the DasRegistry. Sponsors: The initial ideas for DAS were developed in conversations with LaDeana Hillier of the Washington University Genome Sequencing Center.

Proper citation: Distributed Annotation System (RRID:SCR_008427) Copy

Source: SciCrunch Registry

Augustus

RRID:SCR_008417

http://bioinf.uni-greifswald.de/augustus/

Software for gene prediction in eukaryotic genomic sequences. Serves as a basis for further steps in the analysis of sequenced and assembled eukaryotic genomes.

Proper citation: Augustus (RRID:SCR_008417) Copy

Source: SciCrunch Registry

Searching the RRID Resource Information Network

Our searching services are busy right now. Please try again later

Log in

Leaving Community

About

Community Resources

More Resources

Literature

Log in

Tools Select Another Resource Report Type

Options

Current Facets and Filters

Facets

Recent searches

RRID:SCR_004480

RRID:SCR_004716

RRID:SCR_004648

RRID:SCR_004751

RRID:SCR_004786

RRID:SCR_004891

RRID:SCR_005081

RRID:SCR_004944

RRID:SCR_005049

RRID:SCR_004995

RRID:SCR_004929

RRID:SCR_005133

RRID:SCR_005186

RRID:SCR_010910

RRID:SCR_011953

RRID:SCR_011954

RRID:SCR_008395

RRID:SCR_008343

RRID:SCR_008427

RRID:SCR_008417

Neuroscience Information Framework Resources

Navigation

Logging in and Registering

Searching

Save Your Search

Query Expansion

Collections

Sources

Categories

Subcategories

Further Questions

Category Graph

About

Recent News Entries

Contact Us

SciCrunch