Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
The E. coli Genome Project has the goal of completely sequencing the E. coli and human genomes. They began isolation of an overlapping lambda clonebank of E. coli K-12 strain MG1655. Those clones served as the starting material in our initial efforts to sequence the whole genome. Improvements in sequencing technology have since reached the point where whole-genome sequencing of microbial genomes is routine, and the human genome has in fact been completed. They initiated additional sequencing efforts, concentrating on pathogenic members of the family Enterobacteriaceae -- to which E. coli belongs. They also began a systematic functional characterization of E. coli K-12 genes and their regulation, using the whole genome sequence to address how the over 4000 genes of this organism act together to enable its survival in a wide range of environments.
Proper citation: E. coli Genome project (RRID:SCR_008139) Copy
MitoRes, is a comprehensive and reliable resource for massive extraction of sequences and sub-sequences of nuclear genes and encoded products targeting mitochondria in metazoa. It has been developed for supporting high-throughput in-silico analyses aimed to studies of functional genomics related to mitochondrial biogenesis, metabolism and to their pathological dysfunctions. It integrates information from the most accredited world-wide databases to bring together gene, transcript and encoded protein sequences associated to annotations on species name and taxonomic classification, gene name, functional product, organelle localization, protein tissue specificity, Enzyme Classification (EC), Gene Ontology (GO) classification and links to other related public databases. The section Cluster, has been dedicated to the collection of data on protein clustering of the entire catalogue of MitoRes protein sequences based on all versus all global pair-wise alignments for assessing putative intra- and inter-species functional relationships. The current version of MitoRes is based on the UniProt release 4 and contains 64 different metazoan species. The incredible explosion of knowledge production in Biology in the past two decades has created a critical need for bioinformatic instruments able to manage data and facilitate their retrieval and analysis. Hundreds of biological databases have been produced and the integration of biological data from these different resources is very important when we want to focus our efforts towards the study of a particular layer of biological knowledge. MitoRes is a completely rebuilt edition of MitoNuc database, which has been extensively modified to deal successfully with the challenges of the post genomic era. Its goal is to represent a comprehensive and reliable resource supporting high-quality in-silico analyses aimed to the functional characterization of gene, transcript and amino acid sequences, encoded by the nuclear genome and involved in mitochondrial biogenesis, metabolism and pathological dysfunctions in metazoa. The central features of MitoRes are: # an integrated catalogue of protein, transcript and gene sequences and sub-sequences # a Web-based application composed of a wide spectrum of search/retrieval facilities # a sequence export manager allowing massive extraction of bio-sequences (genes, introns, exons, gene flanking regions, transcripts, UTRs, CDS, proteins and signal peptides) in FASTA, EMBL and GenBank formats. It is an interconnected knowledge management system based on a MySQL relational database, which ensures data consistency and integrity, and on a Web Graphical User Interface (GUI), built in Seagull PHP Framework, offering a wide range of search and sequence extraction facilities. The database is compiled extracting and integrating information from public resources and data generated by the MitoRes team. The MitoRes database consists of comprehensive sequence entries whose core data are protein, transcript and gene sequences and taxonomic information describing the biological source of the protein. Additional information include: bio-sequences structure and location, biological function of protein product and dynamic links to both, external public databases used as data resources and public databases reporting complementary information. The core entity of the MitoRes database is represented by the protein so that each MitoRes entry is generated for each protein reported in the UniProt database as a nuclear encoded protein involved in mitochondrial biogenesis and function. Sponsors: MitoRes has been supported by Ministero Universit e Ricerca Scientifica, Italy (PRIN, Programma Biotecnologie legge 95/95-MURST 5, Proiect MURST Cluster C03/2000, CEGBA). Currently it is supported by operating grants from the Ministero dellIstruzione, dellUniversit e della Ricerca (MIUR), Italy (PNR 2001-2003 (FIRB art.8) D.M. 199, Strategic Program: Post-genome, grant 31-063933 and Project n.2, Cluster C03 L. 488/929).
Proper citation: MitoRes (RRID:SCR_008208) Copy
http://www.ebi.ac.uk/parasites/parasite-genome.html
This website contains information about the genomic sequence of parasites. It also contains multiple search engines to search six frame translations of parasite nucleotide databases for motifs, parasite protein databases for motifs, and parasite protein databases for keywords and text terms. * Guide to Internet Access to Parasite Genome Information * Guide to web-based analysis tools * Parasite Genome BLAST Server: Search a range of parasite specific nucleotide sequence databases with your own sequence. * Parasite Proteome Keyword Search Facility: Search parasite protein databases for keywords and text terms * Parasite Proteome Motif Search Facility: Search parasite protein databases for motifs * Parasite Six Frame Translation Motif Search Facility: Search six frame translations of parasite nucleotide databases for motifs * Genome computing resources: A list of ftp and gopher sites where genome computing applications and other resources can be found.
Proper citation: Parasite genome databases and genome research resources (RRID:SCR_008150) Copy
Database of information about restriction enzymes and related proteins containing published and unpublished references, recognition and cleavage sites, isoschizomers, commercial availability, methylation sensitivity, crystal, genome, and sequence data. DNA methyltransferases, homing endonucleases, nicking enzymes, specificity subunits and control proteins are also included. Several tools are available including REBsites, BLAST against REBASE, NEBcutter and REBpredictor. Putative DNA methyltransferases and restriction enzymes, as predicted from analysis of genomic sequences, are also listed. REBASE is updated daily and is constantly expanding. Users may submit new enzyme and/or sequence information, recommend references, or send them corrections to existing data. The contents of REBASE may be browsed from the web and selected compilations can be downloaded by ftp (ftp.neb.com). Additionally, monthly updates can be requested via email.,
Proper citation: REBASE (RRID:SCR_007886) Copy
http://www.broad.mit.edu/mammals/dog
The genome of the domesticated dog, a close evolutionary relation to human, is a powerful new tool for understanding the human genome. Comparison of the dog with human and other mammals reveals key information about the structure and evolution of genes and genomes. The unique breeding history of dogs, with their extraordinary behavioral and physical diversity, offers the opportunity to find important genes underlying diseases shared between dogs and humans, such as cancer, diabetes, and epilepsy. The Canine Genome Sequencing Project produced a high-quality draft sequence of a female boxer named Tasha. By comparing Tasha with many other breeds, the project also compiled a comprehensive set of SNPs (single nucleotide polymorphisms) useful in all dog breeds. These closely spaced genomic landmarks are critical for disease mapping. By comparing the dog, rodent, and human lineages, researchers at the Broad Institute uncovered exciting new information about human genes, their evolution, and the regulatory mechanisms governing their expression. Using SNPs, researchers describe the strikingly different haplotype structure in dog breeds compared with the entire dog population. In addition, they show that by understanding the patterns of variation in dog breeds, scientists can design powerful gene mapping experiments for complex diseases that are difficult to map in human populations. Contribute Although the astounding generosity of Eli and Edythe L. Broad and several other venture philanthropists empowers our scientists to tackle many of the most important problems at the cutting edge of genomic medicine, there are many other critical challenges that they cannot yet pursue because of limited resources. We need additional visionary partners to join the Broads and the Broad Institute in transforming medicine with the power of genomics.
Proper citation: Dog Genome Project (RRID:SCR_008486) Copy
http://www.hgsc.bcm.tmc.edu/content/bovine-genome-project
Downloadable files of the bos taurus genome. Draft assemblies available for download as contigs or linearized scaffolds of the genomic sequence of cow, Bos taurus, including the final draft assembly (7.1 coverage) and the two previous assemblies. The genome is sequenced to 6- to 8-fold sequence depth, with high-quality finished sequence in some areas. Accompanying EST and SNP analyses is also included. The bovine genome assembly and analysis and the study of cattle genetic history were published in April 24, 2009 issue of Science. The Human Genome Sequencing Center provides BLAST searches of the genome assemblies, either as contigs or as linearized chromosome sequences. The WGS sequence enriched BAC assemblies and the unassembled reads (sequencing reads that did not end up in the genome assembly) can also be searched by BLAST. Traces are available from the NCBI Trace Archive by using the link in the sidebar or by using NCBI MegaBLAST with a same species or cross species query.
Proper citation: Bovine Genome Project (RRID:SCR_008370) Copy
A promoter database of Saccharomyces cerevisiae. Users can explore the promoter regions of ~6000 genes and ORFs in yeast genome, annotate putative regulatory sites of all genes and ORFs, locate intergenic regions, and retrieve sequence of the promoter region. In regards to regulatory elements and transcription factors, users can provide information on transcriptionally related genes, browse matrix and consensus sequences, view the correlation between elements, observe binding affinity and expression, and look at genomewise distribution. SCPD also provides some simple but useful tools for promoter sequence analysis. Gene, consensus and matrix records may be submitted.
Proper citation: SCPD - Saccharomyces cerevisiae promoter database (RRID:SCR_004412) Copy
A collaborative ontology for the definition of sequence features used in biological sequence annotation. SO was initially developed by the Gene Ontology Consortium. Contributors to SO include the GMOD community, model organism database groups such as WormBase, FlyBase, Mouse Genome Informatics group, and institutes such as the Sanger Institute and the EBI. Input to SO is welcomed from the sequence annotation community. The OBO revision is available here: http://sourceforge.net/p/song/svn/HEAD/tree/ SO includes different kinds of features which can be located on the sequence. Biological features are those which are defined by their disposition to be involved in a biological process. Biomaterial features are those which are intended for use in an experiment such as aptamer and PCR_product. There are also experimental features which are the result of an experiment. SO also provides a rich set of attributes to describe these features such as polycistronic and maternally imprinted. The Sequence Ontologies use the OBO flat file format specification version 1.2, developed by the Gene Ontology Consortium. The ontology is also available in OWL from Open Biomedical Ontologies. This is updated nightly and may be slightly out of sync with the current obo file. An OWL version of the ontology is also available. The resolvable URI for the current version of SO is http://purl.obolibrary.org/obo/so.owl.
Proper citation: SO (RRID:SCR_004374) Copy
http://bix.ucsd.edu/projects/singlecell/
Software package for short read data from single cells that improves assembly through use of progressively increasing coverage cutoff. Used for single cell Illumina sequences, allows variable coverage datasets to be utilized with assembly of E. coli and S. aureus single cell reads. Assembles single cell genome of uncultivated SAR324 clade of Deltaproteobacteria.
Proper citation: Velvet-SC (RRID:SCR_004377) Copy
http://www.genedb.org/Homepage/Lmajor
Database of the most recent sequence updates and annotations for the L. major genome. New annotations are constantly being added to keep up with published manuscripts and feedback from the Trypanosomatid research community. You may search by Protein Length, Molecular Mass, Gene Type, Date, Location, Protein Targeting, Transmembrane Helices, Product, GO, EC, Pfam ID, Curation and Comments, and Dbxrefs. BLAST and other tools are available. Leishmania species cause a spectrum of human diseases in tropical and subtropical regions of the world. We have sequenced the 36 chromosomes of the 32.8-megabase haploid genome of Leishmania major (Friedlin strain) and predict 911 RNA genes, 39 pseudogenes, and 8272 protein-coding genes, of which 36% can be ascribed a putative function. These include genes involved in host-pathogen interactions, such as proteolytic enzymes, and extensive machinery for synthesis of complex surface glycoconjugates. The Pathogen Genomics group at the Wellcome Trust Sanger Institute played a major role in sequencing the genome of Leishmania major (see Ivens et al.) Details of the centres involved and which chromosomes they sequenced, are given. The sequence data were obtained by adopting several parallel approaches, including complete cosmid sequencing, whole chromosome shotguns and/or BAC sequencing/skimming. The Leishmania parasite is an intracellular pathogen of the immune system targeting macrophages and dendritic cells. The disease Leishmaniasis affects the populations of 88 counties worldwide with symptoms ranging from disfiguring cutaneous and muco-cutaneous lesions that can cause widespread destruction of mucous membranes to visceral disease affecting the haemopoetic organs. In collaboration with GeneDB, the EuPathDB genomic sequence data and annotations are regularly deposited on TriTrypDB where they can be integrated with other datasets and queried using customized queries.
Proper citation: GeneDB Lmajor (RRID:SCR_004613) Copy
http://blast.ncbi.nlm.nih.gov/Blast.cgi
Web search tool to find regions of similarity between biological sequences. Program compares nucleotide or protein sequences to sequence databases and calculates statistical significance. Used for identifying homologous sequences.
Proper citation: NCBI BLAST (RRID:SCR_004870) Copy
http://www.ncbi.nlm.nih.gov/nucest
Nucleotide database as collection of sequences from several sources, including GenBank, RefSeq, TPA and PDB. Genome, gene and transcript sequence data provide the foundation for biomedical research and discovery.
Proper citation: Nucleotide database (RRID:SCR_004630) Copy
System that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function even in absence of direct experimental evidence. Orthologs view is curated orthology relationships between genes for human, mouse, rat, fish, worm, and fly., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: PANTHER (RRID:SCR_004869) Copy
https://datashare.ed.ac.uk/handle/10283/3844
Genome transcriptome atlas by RNA in situ hybridization on sagittal sections of developing mouse at embryonic day 14.5. Consists of searchable database of annotated images that can be interactively viewed. Anatomy based expression profiles for coding genes and microRNAs, tissue specific genes. Expression data generated by using human and murine tissue arrays.
Proper citation: Eurexpress (RRID:SCR_005093) Copy
Software tool for identification and annotation of genetically mobile domains and analysis of domain architectures.
Proper citation: SMART (RRID:SCR_005026) Copy
http://dgv.tcag.ca/dgv/app/home
Public repository that accepts direct submissions and provides archiving, accessioning and distribution of publicly available genomic structural variants, in all species. Variants are accessioned at the study and sample level, granting stable identifiers that can be used in publications. DGVa data is integrated with other EBI resources, including comprehensive EBI search and Ensembl genome browser. Exchanges data with companion database, dbVar, at National Center for Biotechnology Information.NOTE: since 2019 DGVa doesn't accept submissions. Please send the data for submission to European Variation Archive (EVA).
Proper citation: Database of Genomic Variants Archive (DGVa) (RRID:SCR_004896) Copy
A web server designed to rapidly and accurately identify, annotate and graphically display prophage sequences within bacterial genomes or plasmids. It accepts either raw DNA sequence data or partially annotated GenBank formatted data and rapidly performs a number of database comparisons as well as phage cornerstone feature identification steps to locate, annotate and display prophage sequences and prophage features. Relative to other prophage identification tools, PHAST is up to 40 times faster and up to 15% more sensitive. It is also able to process and annotate both raw DNA sequence data and Genbank files, provide richly annotated tables on prophage features and prophage quality and distinguish between intact and incomplete prophage. PHAST also generates downloadable, high quality, interactive graphics that display all identified prophage components in both circular and linear genomic views. Databases available for download include Virus DB, Prophage and virus DB, Bacteria DB, and PHAST result DB. Pre-calculated genomes for viewing are also available.
Proper citation: PHAge Search Tool (RRID:SCR_005184) Copy
Portal supporting the North East Bioinformatics Collaborative''s project to sequence the genome of the Little Skate. Provided is a clearinghouse for Little Skate Genome Project and other publicly available Skate and Ray (Batoidea) genome data, and tools for data visualization and analysis. Little Skate Genome Project The little skate (Leucoraja erinacea) is a chondrichthyan (cartilaginous) fish native to the east coast of North America. Elasmobranchs (Skates, Rays, and Sharks) exhibit many fundamental vertebrate characteristics, including a neural crest, jaws and teeth, an adaptive immune system, and a pressurized circulatory system. These characteristics have been exploited to promote understanding about human physiology, immunology, stem cell biology, toxicology, neurobiology and regeneration. The development of standardized experimental protocols in elasmobranchs such as L. erinacea and the spiny dogfish shark (Squalus acanthias) has further positioned these organisms as important biomedical and developmental models. Despite this distinction, the only reported chondrichthyan genome is the low coverage (1.4x) draft genome of the elephant shark (Callorhinchus milii). To close the evolutionary gaps in available elasmobranch genome sequence data, and generate critical genomic resources for future biomedical study, the genome of L. erinacea is being sequenced by the North East Bioinformatics Collaborative (NEBC). As close evolutionary relatives, the little skate sequence will facilitate studies that employ dogfish shark and other elasmobranchs as model organisms. Skate tools include the SkateBLAST and the Skate Genome Browsers: Little Skate Mitochondrion, Thorny Skate Mitochondrion, and Ocellate Spot Skate Mitochondrion.
Proper citation: SkateBase (RRID:SCR_005302) Copy
http://en.wikipedia.org/wiki/Gene_Wiki
The Gene Wiki is a project that facilitates transferring information on human genes to Wikipedia article stubs with the goal of promoting collaboration and expansion of the articles. Number of gene articles The human genome contains an estimated 20,00025,000 protein-coding genes. The goal of the Gene Wiki project is to create seed articles for every notable human gene, that is, every gene whose function has been assigned in the peer-reviewed scientific literature. Approximately half of human genes have assigned function, therefore the total number of articles seeded by the Gene Wiki project would be expected to be in the range of 10,000 - 15,000. To date, approximately 10,271 articles have been created or augmented to include Gene Wiki project content. Expansion Once seed articles have been established, the hope and expectation is that these will be annotated and expanded by editors ranging in experience from the lay audience to students to professionals and academics. Proteins encoded by genes The majority of genes encode proteins hence understanding the function of a gene generally requires understanding of the function of the corresponding protein. In addition to including basic information about the gene, the project therefore also includes information about the protein encoded by the gene. Stubs for the Gene Wiki project are created by a bot and contain links to the following primary gene/protein databases * HUGO Gene Nomenclature Committee official gene name * Entrez Gene database * OMIM (Mendelian Inheritance in Man) database that catalogues all the known diseases with a genetic component * Amigo Gene Ontology * HomoloGene gene homologs in other species * SymAtlasRNA gene expression pattern in tissues * Protein Data Bank 3D structure of protein encoded by the gene * Uniprot (universal protein resource) a central repository of protein data
Proper citation: Gene Wiki (RRID:SCR_005317) Copy
Portal providing access to all JGI genomic databases and analytical tools, sequencing projects and their status, search for and download assemblies and annotations of sequenced genomes, and interactively explore those genomes and compare them with other sequenced microbes, fungi, plants or metagenomes using specialized systems tailored to each particular class of organisms. The Department of Energy (DOE) Joint Genome Institute (JGI) is a national user facility with massive-scale DNA sequencing and analysis capabilities dedicated to advancing genomics for bioenergy and environmental applications. Beyond generating tens of trillions of DNA bases annually, the Institute develops and maintains data management systems and specialized analytical capabilities to manage and interpret complex genomic data sets, and to enable an expanding community of users around the world to analyze these data in different contexts over the web.
Proper citation: JGI Genome Portal (RRID:SCR_002383) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.