Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
Database about gene regulation and gene expression in prokaryotes. It includes a manually curated and unique collection of transcription factor binding sites. A variety of bioinformatics tools for the prediction, analysis and visualization of regulons and gene reglulatory networks is included. The integrated approach provides information about molecular networks in prokaryotes with focus on pathogenic organisms. In detail this concerns: * transcriptional regulation (transcription factors and their DNA binding sites * signal transduction (two-component systems, phosphylation cascades) * protein interactions (complex formation, oligomerization) * biochemical pathways (chemical reactions) * other regulation events (e.g. codon usage, etc. ...) It aims to be a resource to model protein-host interactions and to be a suitable platform to analyze high-throughput data from proteomis and transcriptomics experiments (systems biology). Currently it mainly contains detailed information about operon and promoter structures including huge collections of transcription factor binding sites. If an appropriate number of regulatory binding sites is available, a position weight matrix (PWM) and a sequence logo is provided, which can be used to predict new binding sites. This data is collected manually by screening the original scientific literature. PRODORIC also handles protein-protein interactions and signal-transduction cascades that commonly occur in form of two-component systems in prokaryotes. Furthermore it contains metabolic network data imported from the KEGG database., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: PRODORIC (RRID:SCR_007074) Copy
http://arabidopsis.med.ohio-state.edu
An information resource of Arabidopsis promoter sequences, transcription factors and their target genes that contains three databases. *AtcisDB consists of approximately 33,000 upstream regions of annotated Arabidopsis genes (TAIR9 release) with a description of experimentally validated and predicted cis-regulatory elements. *AtTFDB contains information on approximately 1,770 transcription factors (TFs). These TFs are grouped into 50 families, based on the presence of conserved domains. *AtRegNet contains 11,355 direct interactions between TFs and target genes. They provide free download of Arabidopsis thaliana cis-regulatory database (AtcisDB) and transcription factor database (AtTFDB).
Proper citation: Arabidopsis Gene Regulatory Information Server (RRID:SCR_006928) Copy
http://www.ncbi.nlm.nih.gov/COG
A database for phylogenetic classification for proteins encoded in complete genomes. Clusters of Orthologous Groups of proteins (COGs) were delineated by comparing protein sequences encoded in complete genomes, representing major phylogenetic lineages. Each COG consists of individual proteins or groups of paralogs from at least 3 lineages and thus corresponds to an ancient conserved domain. Please be aware that COGs hasn't been updated in many years and will not be.
Proper citation: COG (RRID:SCR_007139) Copy
Database of collected known and predicted transcription factors (TF) of the black cottonwood tree, Populus trichocarpa. They have made extensive annotations, including similarity searches against major databases (Uniprot, RefSeq, EMBL, TRANSFAC et al) and EST expression information extraction from UniGene clusters and microarray expression, to provide comprehensive information for the putative TFs. In addition, multiple alignment of the DNA-binding domain of each family, Neighbor-Joining phylogenetic tree of each family, the GO annotation, homolog with the Database of Arabidopsis Transcription Factors (DATF), the Database of Rice Transcription Factors (DRTF) are included.
Proper citation: Database of Poplar Transcription Factors (RRID:SCR_007080) Copy
A database and interactive web site for manipulating and displaying annotations on genomes. Features include: detailed views of the genome; use of a variety of premade or personally made glyphs ; customizable order and appearance of tracks by administrators and end-users; search by annotation ID, name, or comment; support of third party annotation using GFF formats; DNA and GFF dumps; connectivity to different databases, including BioSQL and Chado; and a customizable plug-in architecture (e.g. run BLAST, find oligonucleotides, design primers, etc.). GBrowse is distributed as source code for Macintosh OS X, UNIX and Linux platforms, and as pre-packaged binaries for Windows machines. It can be installed using the standard Perl module build procedure, or automated using a network-based install script. In order to use the net installer, you will need to have Perl 5.8.6 or higher and the Apache web server installed. The wiki portion accepts data submissions.
Proper citation: GBrowse (RRID:SCR_006829) Copy
Database of compiled, public, deep sequencing miRNA data and several novel tools to facilitate exploration of massive data. The miR-seq browser supports users to examine short read alignment with the secondary structure and read count information available in concurrent windows. Features such as sequence editing, sorting, ordering, import and export of user data are of great utility for studying iso-miRs, miRNA editing and modifications. miRNA����??target relation is essential for understanding miRNA function. Coexpression analysis of miRNA and target mRNAs, based on miRNA-seq and RNA-seq data from the same sample, is visualized in the heat-map and network views where users can investigate the inverse correlation of gene expression and target relations, compiled from various databases of predicted and validated targets.
Proper citation: miRGator (RRID:SCR_007793) Copy
Database to explore known and predicted interactions of chemicals and proteins. It integrates information about interactions from metabolic pathways, crystal structures, binding experiments and drug-target relationships. Inferred information from phenotypic effects, text mining and chemical structure similarity is used to predict relations between chemicals. STITCH further allows exploring the network of chemical relations, also in the context of associated binding proteins. Each proposed interaction can be traced back to the original data sources. The database contains interaction information for over 68,000 different chemicals, including 2200 drugs, and connects them to 1.5 million genes across 373 genomes and their interactions contained in the STRING database.
Proper citation: Search Tool for Interactions of Chemicals (RRID:SCR_007947) Copy
European website providing information about orphan drugs and rare diseases. It contains content both for physicians and for patients. Reference portal for rare diseases and orphan drugs to help improve diagnosis, care and treatment of patients with rare diseases.
Proper citation: Orphanet (RRID:SCR_006628) Copy
Database containing information on marketed medicines and their recorded adverse drug reactions. The information is extracted from public documents and package inserts. The available information include side effect frequency, drug and side effect classifications as well as links to further information, for example drug-target relations. The SIDER Side Effect Resource represents an effort to aggregate dispersed public information on side effects. To our knowledge, no such resource exist in machine-readable form despite the importance of research on drugs and their effects. The creation of this resource was motivated by the many requests for data that we received related to our paper (Campillos, Kuhn et al., Science, 2008, 321(5886):263-6.) on the utilization of side effects for drug target prediction. Inclusion of side effects as readouts for drug treatment should have many applications and we hope to be able to enhance the respective research with this resource. You may browse the drugs by name, browse the side effects by name, download the current version of SIDER, or use the search interface.
Proper citation: SIDER (RRID:SCR_004321) Copy
A database of genomic and protein data for Drosophila site-specific transcription factors.
Proper citation: FlyTF.org (RRID:SCR_004123) Copy
A curated database that provides comprehensive integrated biological information for Saccharomyces cerevisiae along with search and analysis tools to explore these data. SGD allows researchers to discover functional relationships between sequence and gene products in fungi and higher organisms. The SGD also maintains the S. cerevisiae Gene Name Registry, a complete list of all gene names used in S. cerevisiae which includes a set of general guidelines to gene naming. Protein Page provides basic protein information calculated from the predicted sequence and contains links to a variety of secondary structure and tertiary structure resources. Yeast Biochemical Pathways allows users to view and search for biochemical reactions and pathways that occur in S. cerevisiae as well as map expression data onto the biochemical pathways. Literature citations are provided where available.
Proper citation: SGD (RRID:SCR_004694) Copy
A database of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). Users can analyze protein sequences for Pfam matches, view Pfam family annotation and alignments, see groups of related families, look at the domain organization of a protein sequence, find the domains on a PDB structure, and query Pfam by keywords. There are two components to Pfam: Pfam-A and Pfam-B. Pfam-A entries are high quality, manually curated families that may automatically generate a supplement using the ADDA database. These automatically generated entries are called Pfam-B. Although of lower quality, Pfam-B families can be useful for identifying functionally conserved regions when no Pfam-A entries are found. Pfam also generates higher-level groupings of related families, known as clans (collections of Pfam-A entries which are related by similarity of sequence, structure or profile-HMM).
Proper citation: Pfam (RRID:SCR_004726) Copy
http://www.ebi.ac.uk/biosamples/
Database that aggregates sample information for reference samples (e.g. Coriell Cell lines) and samples for which data exist in one of the EBI''''s assay databases such as ArrayExpress, the European Nucleotide Archive or PRoteomics Identificates DatabasE. It provides links to assays for specific samples, and accepts direct submissions of sample information. The goals of the BioSample Database include: # recording and linking of sample information consistently within EBI databases such as ENA, ArrayExpress and PRIDE; # minimizing data entry efforts for EBI database submitters by enabling submitting sample descriptions once and referencing them later in data submissions to assay databases and # supporting cross database queries by sample characteristics. The database includes a growing set of reference samples, such as cell lines, which are repeatedly used in experiments and can be easily referenced from any database by their accession numbers. Accession numbers for the reference samples will be exchanged with a similar database at NCBI. The samples in the database can be queried by their attributes, such as sample types, disease names or sample providers. A simple tab-delimited format facilitates submissions of sample information to the database, initially via email to biosamples (at) ebi.ac.uk. Current data sources: * European Nucleotide Archive (424,811 samples) * PRIDE (17,001 samples) * ArrayExpress (1,187,884 samples) * ENCODE cell lines (119 samples) * CORIELL cell lines (27,002 samples) * Thousand Genome (2,628 samples) * HapMap (1,417 samples) * IMSR (248,660 samples)
Proper citation: BioSample Database at EBI (RRID:SCR_004856) Copy
http://www.ncbi.nlm.nih.gov/biosample
Database containing descriptions of biological source materials used in experimental assays. Sources include: GenBank, Sequence Read Archive (SRA), Coriell, ATCC. Submissions are supported by a web-based Submission Portal that guides users through a series of forms for input of rich metadata describing their samples. As the capacity and complexity of biological data sets expands, databases face new challenges in ensuring that the information is adequately organized and described. The NCBI BioSample database is being developed to help address the challenges by providing the means by which data generators can organize and describe a broad range of sample types, and link to corresponding sets of experimental data in archival databases.
Proper citation: NCBI BioSample (RRID:SCR_004854) Copy
http://www.ncbi.nlm.nih.gov/Structure/ibis/ibis.cgi
A web server and database that organizes, analyzes and predicts interactions between proteins and other biomolecules. For a given protein sequence or structure query, it reports protein-protein, protein-small molecule, protein nucleic acids and protein-ion interactions observed in experimentally-determined structural biological assemblies. It also infers/predicts interacting partners and binding sites by homology, by inspecting the protein complexes formed by close homologs of a given query. To ensure biological relevance of inferred binding sites, the IBIS algorithm clusters binding sites formed by homologs based on binding site sequence and structure conservation.
Proper citation: IBIS: Inferred Biomolecular Interactions Server (RRID:SCR_004886) Copy
https://sites.google.com/site/jpopgen/dbNSFP
A database for functional prediction and annotation of all potential non-synonymous single-nucleotide variants (nsSNVs) in the human genome. Version 2.0 is based on the Gencode release 9 / Ensembl version 64 and includes a total of 87,347,043 nsSNVs and 2,270,742 essential splice site SNVs. It compiles prediction scores from six prediction algorithms (SIFT, Polyphen2, LRT, MutationTaster, MutationAssessor and FATHMM), three conservation scores (PhyloP, GERP++ and SiPhy) and other related information including allele frequencies observed in the 1000 Genomes Project phase 1 data and the NHLBI Exome Sequencing Project, various gene IDs from different databases, functional descriptions of genes, gene expression and gene interaction information, etc. Some dbNSFP contents (may not be up-to-date though) can also be accessed through variant tools, ANNOVAR, KGGSeq, UCSC Genome Browser''s Variant Annotation Integrator, Ensembl Variant Effect Predictor and HGMD.
Proper citation: dbNSFP (RRID:SCR_005178) Copy
http://exon.cshl.org/cgi-bin/atprobe/atprobe.pl
Arabidopsis thaliana promoter binding element database that focuses on specific binding elements on known genes, found with experimental methods.
Proper citation: AtProbe (RRID:SCR_005412) Copy
http://cgi-www.daimi.au.dk/cgi-chili/datfap/frontdoor.py
A database of transcription factors from 13 plant species, and PCR primers for around 90% of them.
Proper citation: DATFAP (RRID:SCR_005413) Copy
http://bioinfozen.uncc.edu/tfindit/
A database and web service for structural bioinformatics studies of transcription factor (TF)-DNA interactions. Various datasets can be generated based on one or more search options specified by users.
Proper citation: TFinDIT (RRID:SCR_005411) Copy
http://swissregulon.unibas.ch/fcgi/sr/swissregulon
A database of genome-wide annotations of regulatory sites. The predictions are based on Bayesian probabilistic analysis of a combination of input information including: * Experimentally determined binding sites reported in the literature. * Known sequence-specificities of transcription factors. * ChIP-chip and ChIP-seq data. * Alignments of orthologous non-coding regions. Predictions were made using the PhyloGibbs, MotEvo, IRUS and ISMARA algorithms developed in their group, depending on the data available for each organism. Annotations can be viewed in a Gbrowse genome browser and can also be downloaded in flat file format.
Proper citation: SwissRegulon (RRID:SCR_005333) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.