Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://www.ncbi.nlm.nih.gov/gene
Database for genomes that have been completely sequenced, have active research community to contribute gene-specific information, or that are scheduled for intense sequence analysis. Includes nomenclature, map location, gene products and their attributes, markers, phenotypes, and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases. All entries follow NCBI's format for data collections. Content of Entrez Gene represents result of curation and automated integration of data from NCBI's Reference Sequence project (RefSeq), from collaborating model organism databases, and from many other databases available from NCBI. Records are assigned unique, stable and tracked integers as identifiers. Content is updated as new information becomes available.
Proper citation: Entrez Gene (RRID:SCR_002473) Copy
http://www.tigr.org/tdb/fungal/
This website contains a list of five fungal genome databases from The J. Craig Venter Institute. Aspergillus genomes: -Aspergillus fumigatus (strain-Af 293) -Aspergillus clavatus -Neosartorya fischeri Other Fungal Genomes: -Cryptococcus neoformans (strain-JEC21) -Coccidioides posadasii
Proper citation: Fungal Databases (RRID:SCR_002350) Copy
Database of transcriptional start sites (TSSs) representing exact positions in the genome based on a unique experimentally validated TSS sequencing method, TSS Seq. A major part of human adult and embryonic tissues are covered. DBTSS contains 491 million TSS tag sequences collected from a total of 20 tissues and 7 cell cultures. Also integrated is generated RNA-seq data of subcellular- fractionated RNAs and ChIP Seq data of histone modifications, RNA polymerase II and several transcriptional regulatory factors in cultured cell lines. Also included is external epigenomic data, such as chromatin map of the ENCODE project. They associated those TSS information with public and original SNV data, in order to identify single nucleotide variations (SNVs) in the regulatory regions.
Proper citation: DBTSS: Database of Transcriptional Start Sites (RRID:SCR_002354) Copy
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 14,2026. Database to access gene information through common names and allows identification of homologs and paralogs for a given gene. This publicly available tool leverages public sequence data, gene metadata information, and other publicly available data to calculate and display orthologous and paralogous gene relationships for all genes from several species, including yeasts, insects, worms, vertebrates, mammals, and primates such as humans.
Proper citation: GeneSeer (RRID:SCR_002626) Copy
http://machibase.gi.k.u-tokyo.ac.jp/
Database for Drosophila melanogaster transcription profiling that allows users to search the Drosophilia genome, see sequence overviews, and look at various transcripts. The data were generated in conjunction with the recently developed high-throughput genome sequencer Illumina / Solexa using a newly developed 5'-end mRNA collection method. Approximately 25 million 25-27 nucleotide (nt) 5'-end mRNA tags from the embryos, larvae, young males, young females, old males, old females, and S2 (culture cell line) of D. melanogaster were collected. By arranging this vast amount of expression tag with other annotated data, they have built a one-stop service for Drosophila melanogaster transcription profiling.
Proper citation: MachiBase (RRID:SCR_003078) Copy
http://www.ncbi.nlm.nih.gov/proteinclusters
Database of related protein sequences (clusters) consisting of proteins derived from the annotations of whole genomes, organelles and plasmids. It currently limited to Archaea, Bacteria, Plants, Fungi, Protozoans, and Viruses. It contains annotation information, publications, domains, structures, and external links and analysis tools including multiple alignments, phylogenetic trees, and genomic neighborhoods (ProtMap). Data is available for download via Protein Clusters FTP
Proper citation: Protein Clusters (RRID:SCR_003459) Copy
http://www.ncbi.nlm.nih.gov/mapview/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 4, 2023. Database that provides special browsing capabilities for a subset of organisms in Entrez Genomes. Map Viewer allows users to view and search an organism's complete genome, display chromosome maps, and zoom into progressively greater levels of detail, down to the sequence data for a region of interest. If multiple maps are available for a chromosome, it displays them aligned to each other based on shared marker and gene names, and, for the sequence maps, based on a common sequence coordinate system.
Proper citation: MapViewer (RRID:SCR_003092) Copy
http://xavante.fmrp.usp.br/mammibase/
Database developed to assist the phylogeneticist user in retrieving individual gene sequence alignments for genes in complete mammalian mitochondrial genomes. Data retrieval in MamMiBase requires three stages. At the first stage, the user must select the mammalian species or group that (s)he wishes to study. In the second stage, the user will select the outgroup from a list that included all species selected in the first stage plus Xenopus laevis and Gallus gallus. Finally, at the third stage, the user will select individual mitochondrial gene alignments or a phylogenetic tree that (s)he wishes to download.
Proper citation: Mammalian Mitochondrial Genomics Database (RRID:SCR_003084) Copy
http://www.ncbi.nlm.nih.gov/genomes/FLU/
Database of data obtained from the NIAID Influenza Genome Sequencing Project as well as from GenBank, combined with tools for flu sequence analysis and annotation. In addition, it provides links to other resources that contain flu sequences, publications and general information about flu viruses. Users can search the Flu database, build queries, retrieve sequences, and apply analysis tools. This includes selecting influenza sequences by virus, subtype, host, and other criteria, finding complete genome sets, aligning sequence and others in the database (up to 1000 sequences), viewing clustering and phylogenetic trees, BLAST searching a flu sequence against the database, and more.
Proper citation: Influenza Virus Resource (RRID:SCR_002984) Copy
A Wiki-based database for transcription factor-binding data generated by the ENCODE consortium.
Proper citation: Factorbook (RRID:SCR_004086) Copy
The Ensembl Genomes project produces genome databases for important species from across the taxonomic range, using the Ensembl software system. Five sites are now available, one of which is Ensembl Protists, which houses protists species. Sponsors: EnsembProtists is a project run by EMBL - EBI to maintain annotation on selected genomes, based on the software developed in the Ensembl project developed jointly by the EBI and the Wellcome Trust Sanger Institute.
Proper citation: Ensembl Protists (RRID:SCR_013154) Copy
GiardiaDB is a resource for information on Giardia lamblia. It contains gene information, including genomic attributes, protein expression patterns, evolution, and EST sequence information. The website provides tools for BLASTing, sequence retrieval, graphic visualization, and PubMed information.
Proper citation: GiardiaDB (RRID:SCR_013377) Copy
http://epsf.bmad.bii.a-star.edu.sg/cube/db/html/home.html
Cube-DB is a database of pre-evaluated conservation and specialization scores for residues in paralogous proteins belonging to multi-member families of human proteins. Protein family classification follows (largely) the classification suggested by HUGO Gene Nomenclature Committee. Sets of orhtologous protein sequences were generated by mutual-best-hit strategy using full vertebrate genomes available in Ensembl. The scores, described on documentation page, are assigned to each individual residue in a protein, and presented in the form of a table (html or downloadable xls formats) and mapped, when appropriate, onto the related structure (Jmol, Pymol, Chimera).
Proper citation: Cube-DB (RRID:SCR_013233) Copy
An international collaborative effort to develop and enrich new and existing reference ontologies for plants, improve ontology use and cross-references, and to develop data annotation standards. Users can search for ontology terms and bioentities and submit the ontology-related term requests by visiting the following GitHub request trackers.
Proper citation: Planteome (RRID:SCR_014411) Copy
http://signal.salk.edu/cgi-bin/RiceGE
Gene database for Japonica rice. RiceGE is associated with SIGnAL at the Salk Institute.
Proper citation: RiceGE (RRID:SCR_015061) Copy
http://hb.flatironinstitute.org/
Formerly known as GIANT (Genome-scale Integrated Analysis of gene Networks in Tissues), HumanBase applies machine learning algorithms to learn biological associations from massive genomic data collections. These integrative analyses reach beyond existing "biological knowledge" represented in the literature to identify novel, data-driven associations.
Proper citation: HumanBase (RRID:SCR_016145) Copy
http://software.broadinstitute.org/gsea/msigdb/index.jsp
Collection of annotated gene sets for use with Gene Set Enrichment Analysis (GSEA) software.
Proper citation: Molecular Signatures Database (RRID:SCR_016863) Copy
http://www.broad.mit.edu/node/305
The Connectivity Map aims to generate a detailed map that links gene patterns associated with disease to corresponding patterns produced by drug candidates and a variety of genetic manipulations. The Connectivity Map is the most comprehensive effort yet for using genomics in a drug-discovery framework. It allows researchers to screen compounds against genome-wide disease signatures, rather than a pre-selected set of target genes. Drugs are paired with diseases using sophisticated pattern-matching methods with a high level of resolution and specificity. To build a Connectivity Map, the Broad Institute brings together molecular biologists, genomics specialists, computational scientists, pharmacologists, chemists and chemical biologists, as well as expertise from across the breadth and depth of medicine.Connectivity map is a large public database of signatures of drugs and genes, and pattern-matching tools to detect similarities among these signatures.The parent site for the Broad Institute at MIT has a software library of software applications developed for use in genetic analysis.
Proper citation: National Institute of Mental Health (NIMH) Human Genetics Initiative (RRID:SCR_007436) Copy
A web-based software package for comparative genomics.
Proper citation: Sybil (RRID:SCR_005593) Copy
http://www.hgsc.bcm.tmc.edu/content/honey-bee-genome-project
The HGSC has sequenced the honey bee, Apis mellifera. The version 4.0 assembly was released in March 2006 and published in October 2006. The genome sequence is being upgraded with additional sequence coverage. The honey bee is important in the agricultural community as a producer of honey and as a facilitator of pollination. It is a model organism for studying the following human health issues: immunity, allergic reaction, antibiotic resistance, development, mental health, longevity and diseases of the X chromosome. In addition, biologists are interested in the honey bee's social organization and behavioral traits. This project was proposed to the HGSC by a group of dedicated insect biologists, headed by Gene Robinson. Following a workshop at the HGSC and a honey bee white paper, the HGSC began the project in 2002. A 6-fold coverage WGS, BAC sequence from pooled arrays, and an initial genome assembly (Amel_v1.0) were released beginning in 2003. This has been a challenging project with difficulty in recovering AT-rich regions. The WGS data had lower coverage in AT-rich regions and BAC data from clones showed evidence of internal deletions. Additional reads from AT enriched DNA addressed these underrepresented regions. The current assembly Amel_4.0 was produced with Atlas and includes 2.7 million reads (1.8 Gb) or 7.5x coverage of the (clonable) genome. About 97% of STSs, 98% of ESTs, and 96% of cDNAs are represented in the 231 Mb assembly. About 2,500 reads were also produced from a strain of Africanized honey bee and SNPs were extracted. These were released in dbSNP and the NCBI Trace Archive. Analysis of the genome by a consortium of 20 labs has been completed. This produced a gene list derived from five different methods melded through the GLEAN software. Publications include a main paper in Nature and up to forty companion papers in Genome Research and Insect Molecular Biology. Sponsors: Sequencing of the honey bee is jointly funded by National Human Genome Research Institute (NHGRI) and the Department of Agriculture (USDA). Multiple drones from the same queen (strain DH4) were obtained from Danny Weaver of B. Weaver Apiaries. All libraries were made from DNA isolated from these drones. The honey bee BAC library (CHORI-224) was prepared by Pieter de Jong and Katzutoyo Osoegawa at the Children's Hospital Oakland Research Institute.
Proper citation: Honey Bee Genome Project (RRID:SCR_002890) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.