Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
Resource for reuse, sharing and meta-analysis of expression profiling data. Database and set of tools for meta analysis, reuse and sharing of genomics data. Targeted at analysis of gene expression profiles. Users can search, access and visualize coexpression and differential expression results.
Proper citation: Gemma (RRID:SCR_008007) Copy
Database that provides access to mRNA sequences and associated regulatory elements that were processed from Genbank. These mRNA sequences include complete genomes, which are divided into 5-prime UTRs, 3-prime UTRs, initiation sequences, termination regions and full CDS sequences. This data can be searched for a range of properties including specific mRNA sequences, mRNA motifs, codon usage, RSCU values, information content, etc.
Proper citation: Transterm (RRID:SCR_008244) Copy
http://www.bioinformatics2.wsu.edu/cgi-bin/Athena/cgi/home.pl
Athena is a web-based application that warehouses disparate datatypes related to the control of gene expression. Athena provides several features to enable exploration of the regulatory mechanisms of Arabidopsis gene control. The first main tool we provide is visualization of promoter domains of selected genes. Database crossreference for these transcription factors is provided as well as a statistical test for enrichment of binding activity within the set of selected promoters. The data mining tools in Athena allow for selection of sets of genes based on two different factors. -Genes can be select by specifying a set of binding factors whose putative sites must be present within all of those genes'' promoter regions. -Alternatively, genes can be selected using Gene Ontology annotations. Both GO (Gene Ontology) Slim terms and Gene Ontology terms are available. One can select a set of genes by either choosing a union of the genes annotated by a selected set of Slim terms or Gene Ontology terms. The selected gene''s putative binding factors are listed, including enrichment data. Furthermore, enriched presence of Gene Ontology terms is given. The analysis suite provides both enhanced data mining tools for selecting genes as well as several data displays., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: Athena (RRID:SCR_008110) Copy
http://andromeda.gsf.de/litminer
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. The LitMiner software is a literature data-mining tool that facilitates the identification of major gene regulation key players related to a user-defined field of interest in PubMed abstracts. The prediction of gene-regulatory relationships is based on co-occurrence analysis of key terms within the abstracts. LitMiner predicts relationships between key terms from the biomedical domain in four categories (genes, chemical compounds, diseases and tissues). The usefulness of the LitMiner system has been demonstrated recently in a study that reconstructed disease-related regulatory networks by promoter modeling that was initiated by a LitMiner generated primary gene list. To overcome the limitations and to verify and improve the data, we developed WikiGene, a Wiki-based curation tool that allows revision of the data by expert users over the Internet. It is based on the annotation of key terms in article abstracts followed by statistical co-citation analysis of annotated key terms in order to predict relationships. Key terms belonging to four different categories are used for the annotation process: -Genes: Names of genes and gene products. Gene name recognition is based on Ensembl . Synonyms and aliases are resolved. -Chemical Compounds: Names of chemical compounds and their respective aliases. -Diseases and Phenotypes: Names of diseases and phenotypes -Tissues and Organs: Names of tissues and organs LitMiner uses a database of disease and phenotype terms for literature annotation. Currently, there are 2225 diseases or phenotypes, 801 tissues and organs, and 10477 compounds in the database.
Proper citation: LitMiner (RRID:SCR_008200) Copy
http://pbil.univ-lyon1.fr/databases/homolens.php
Database of homologous genes from Ensembl organisms, structured under ACNUC sequence database management system. It allows to select sets of homologous genes among species, and to visualize multiple alignments and phylogenetic trees. It is possible to search for orthologous genes in a wide range of taxons. HOMOLENS is particularly useful for comparative sequence analysis, phylogeny and molecular evolution studies. More generally, HOMOLENS gives an overall view of what is known about a peculiar gene family. Note that HOMOLENS is split into two databases on this server: HOMOLENS contains the protein sequences while HOMOLENSDNA contains the nucleotide sequences. Protein sequences of HOMOLENS have been generated by translating the CDS of HOMOLENSDNA and using associated cross-references to generate the annotations.
Proper citation: Homologous Sequences in Ensembl Animal Genomes (RRID:SCR_008356) Copy
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 28,2023. Retrieve-ensembl-seq is included in the software suite regulatory sequence analysis tools (RSAT), allowing instant submission of retrieved sequences to further analysis tools. AVAILABILITY: retrieve-ensembl-seq is integrated in the RSAT suite: http://rsat.ulb.ac.be/rsat. Web site: http://rsat.ulb.ac.be/rsat/retrieve-ensembl-seq_form.cgi. Web services: http://rsat.ulb.ac.be/rsat/web_services/RSATWS.wsdl. Stand-alone distribution: freely available under an academic licence to download from the RSAT web site. The complete manual, a convenient tutorial and demos are available from the RSAT website. Additional help can be found on the RSAT public forum.
Proper citation: Regulatory Sequence Analysis Tools (RRID:SCR_008560) Copy
http://www.cmbi.ru.nl/GeneSeeker/
The GeneSeeker allows you to search across different databases simultaneously, given a known human genetic location and expression/phenotypic pattern. The GeneSeeker returns any found gene names which are located on the specified location and expressed in the specified tissue. To search for more expression location in one search, just enter them in the textbox for the expression location and separate them with logical operators (and, or, not). You can specify as many tissues as you want, the program starts 20 queries simultaneously, and then waits for a query to finish before starting another query, to keep server loads to a minimum. You can also search only for expression, just leave the cytogenetic location fields blank, and do the query. If you only want to look for one cytogenetic location, only fill in the first location field, and the GeneSeeker will search with only this one. Housekeeping genes , found in Swissprot can be excluded, or genes that are to be excluded can be specified. Human chromosome localizations are translated with an oxford-grid to mouse chromosome localizations, and then submitted to the Mgd. Sponsors: GeneSeeker is a service provided by the Centre for Molecular and Biomolecular Informatics (CMBI).
Proper citation: GeneSeeker (RRID:SCR_008347) Copy
DNAtraffic database is dedicated to be an unique comprehensive and richly annotated database of genome dynamics during the cell life. DNAtraffic contains extensive data on the nomenclature, ontology, structure and function of proteins related to control of the DNA integrity mechanisms such as chromatin remodeling, DNA repair and damage response pathways from eight model organisms commonly used in the DNA-related study: Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Escherichia coli and Arabidopsis thaliana. DNAtraffic contains comprehensive information on diseases related to the assembled human proteins. Database is richly annotated in the systemic information on the nomenclature, chemistry and structure of the DNA damage and drugs targeting nucleic acids and/or proteins involved in the maintenance of genome stability. One of the DNAtraffic database aim is to create the first platform of the combinatorial complexity of DNA metabolism pathway analysis. Database includes illustrations of pathway, damage, protein and drug. Since DNAtraffic is designed to cover a broad spectrum of scientific disciplines it has to be extensively linked to numerous external data sources. Database represents the result of the manual annotation work aimed at making the DNAtraffic database much more useful for a wide range of systems biology applications. DNAtraffic database is freely available and can be queried by the name of DNA network process, DNA damage, protein, disease, and drug.
Proper citation: DNAtraffic (RRID:SCR_008886) Copy
FINDbase Worldwide is an online repository of information about the frequency of different mutations leading to inherited disorders in various populations around the globe. Frequency data about 32 disorders, 25 genes within 98 populations covering 1226 mutations is now available. 28 curators worldwide contributed to this database containing data from 37 submissions.
Proper citation: FINDbase Worldwide (RRID:SCR_012744) Copy
http://data-analysis.charite.de/care/
Comprehensive database of cancer relevant proteins and compound interactions supported by experimental knowledge.Knowledgebase for drug-target relationships related to cancer as well as for supporting information or experimental data.
Proper citation: CancerResource (RRID:SCR_011945) Copy
http://evs.gs.washington.edu/EVS/
The goal of the project is to discover novel genes and mechanisms contributing to heart, lung and blood disorders by pioneering the application of next-generation sequencing of the protein coding regions of the human genome across diverse, richly-phenotyped populations and to share these datasets and findings with the scientific community to extend and enrich the diagnosis, management and treatment of heart, lung and blood disorders. The groups participating and collaborating in the NHLBI GO ESP include: Seattle GO - University of Washington, Seattle, WA Broad GO - Broad Institute of MIT and Harvard, Cambridge, MA WHISP GO - Ohio State University Medical Center, Columbus, OH Lung GO - University of Washington, Seattle, WA WashU GO - Washington University, St. Louis, MO Heart GO - University of Virginia Health System, Charlottesville, VA ChargeS GO - University of Texas Health Sciences Center at Houston
Proper citation: NHLBI Exome Sequencing Project (ESP) (RRID:SCR_012761) Copy
THIS RESOURCE IS NO LONGER IN SERVICE,documented on August 16, 2019. Fugu genome is among the smallest vertebrate genomes and has proved to be a valuable reference genome for identifying genes and other functional elements such as regulatory elements in the human and other vertebrate genomes, and for understanding the structure and evolution of vertebrate genomes. This site presents version 4 of the Fugu genome, released in October 2004 by the International Fugu Genome Consortium. Fugu rubripes has a very compact genome, with less than 15 consisting of dispersed repetitive sequence, which makes it ideal for gene discovery. A draft sequence of the fugu genome was determined by the International Fugu Genome Consortium in 2002 using the ''whole-genome shotgun'' sequencing strategy. Fugu is the second vertebrate genome to be sequenced, the first being the human genome. This webpage presents the annotation made on the fourth assembly by the IMCB team using the Ensembl annotation pipeline. We are continuing with the gap filling work and linking of the scaffolds to obtain super-contigs.
Proper citation: Fugu Genome Project (RRID:SCR_013014) Copy
http://www.ebi.ac.uk/thornton-srv/databases/MACiE/
MACiE, which stands for Mechanism, Annotation and Classification in Enzymes, is a collaborative project on enzyme reaction mechanisms. MACiE currently contains 223 fully annotated enzyme reaction mechanisms, which comprise 218 EC numbers (161 EC sub-subclasses) and 310 distinct CATH codes. It is a joint effortbetween the Mitchell Group at the Unilever Centre for Molecular Informatics part of the University of Cambridge and the Thornton Group at the European Bioinformatics Institute.
Proper citation: MACiE (RRID:SCR_013296) Copy
H-Invitational Database (H-InvDB) is an integrated database of human genes and transcripts. By extensive analyses of all human transcripts, we provide curated annotations of human genes and transcripts that include gene structures, alternative splicing isoforms, non-coding functional RNAs, protein functions, functional domains, sub-cellular localizations, metabolic pathways, protein 3D structure, genetic polymorphisms (SNPs, indels and microsatellite repeats) , relation with diseases, gene expression profiling, and molecular evolutionary features , protein-protein interactions (PPIs) and gene families/groups. This database is produced by the Genome Information Integration Project (2005-) based upon the annotation technology established in the H-Invitational Project for annotation of human full-length cDNAs.
Proper citation: H-InvDB (RRID:SCR_013265) Copy
A manually curated database of protein-protein interactions for Death Domain Superfamily. The Death Domain Database provides a detailed summary of PPI data, which fits into 3 categories: interaction, characterization, and functional role. Users can find in-depth information specified in the literature on relevant analytical methods, structural information. The DD superfamily currently comprises four subfamilies: * Death domain (DD) subfamily * Death effector domain (DED) subfamily * Caspase recruitment domain (CARD) subfamily * Pyrin domain (PYD) subfamily
Proper citation: Death Domain database (RRID:SCR_013231) Copy
http://unicarb-db.biomedicine.gu.se
An experimental glycomic MS database initially created to meet the in-house need to store structural and MS-glycomic data. Users can search by taxonomy and tissue, mass and composition, and MS/MS.
Proper citation: UniCarb-DB (RRID:SCR_014407) Copy
A genome and functional genomic database for the protozoan parasite Toxoplasma gondii. It incorporates the sequence and annotation of the T. gondii ME49 strain, as well as genome sequences for the GT1, VEG and RH (Chr Ia, Chr Ib) strains. Sequence information is integrated with various other genomic-scale data, including community annotation, ESTs, gene expression and proteomics data. Organisms * Toxoplasma gondii (ME49, RH, GT1, Veg strains) * Neospora caninum * environmental isolate sequences from numerous species Tools * BLAST: Identify Sequence Similarities * Sequence Retrieval: Retrieve Specific Sequences using IDs and coordinates * PubMed and Entrez: View the Latest Toxoplasma, Neospora Pubmed and Entrez Results * Genome Browser: View Sequences and Features in the genome browser * Ancillary Genome Browse: Access Additional info like Probeset data and Toxoplasma Array info
Proper citation: ApiDB ToxoDB (RRID:SCR_013453) Copy
The HUGE protein database has been created to publicize the Human cDNA project at the Kazusa DNA Research Institute. This project will sequence and analyze long (>4 kb) human cDNAs and establish methods by using the sequence data how to predict the primary structure of proteins of various biological activities. Currently, it focuses on the analysis of cDNA clones encoding particularly large proteins (>50 kDa). The HUGE protein database contains various types of information derived from the predicted primary structure data of newly identified human proteins. The HUGE protein database are expected to cover various sets of large human proteins of hitherto unidentified functions. They are likely to be involved in cellular structure/motility (such as cytoskeleton, membrane skeleton, and motor proteins), gene expression and nucleic acid metabolism, cell signaling/communication (such as cellular adhesion, signal transduction, channels, and receptors), and so on.
Proper citation: HUGE - Human Unidentified Gene-Encoded large proteins (RRID:SCR_013482) Copy
http://db.systemsbiology.net/kaviar/
A database containing a compilation of SNVs, indels, and complex variants observed in humans, designed to facilitate testing for the novelty and frequency of observed variants.
Proper citation: KAVIAR (RRID:SCR_013737) Copy
A database of protein disorder and mobility annotations. The database features three levels of annotation: manually curated data (which are extracted from the DisProt database), indirect data, and predicted data. Additional annotations are included from external sources, including UniProt, Pfam, PDB, and STRING.
Proper citation: MobiDB (RRID:SCR_014542) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.