Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://core.biotech.hawaii.edu/Bioinformatics.htm
THIS RESOURCE IS NO LONGER IN SERVCE, documented January 28, 2019. Core Facility provides the software and support for computer assisted protein and DNA sequence analysis and database access. The Genetics Computer Group GCG-Wisconsin package is currently available on PBRC's UNIX platform that is accessible via modem or direct connection. The package can be accessed via three interfaces: the command-line interface (UNIX C-shell), the web-based interface (SeqWeb) and the X-Windows based graphics interface (SeqLab). Applications in the package include sequence editing, alignment, comparison, primer design, restriction analysis, mapping, data presentation, database browsing, etc. In addition to local databases, access to remote databases (BLAST) is integrated into the package. The local databases are updated quarterly. Databases available include GenBank, EMBL, PIR-Protein, SWISS-PROT and Restriction Enzymes (REBASE).
Proper citation: GCG/SeqWeb (RRID:SCR_003454) Copy
https://services.healthtech.dtu.dk/
Center for Biological Sequence Analysis of the Technical University of Denmark conducts basic research in the field of bioinformatics and systems biology and directs its research primarily towards topics related to the elucidation of the functional aspects of complex biological mechanisms. A large number of computational methods have been produced, which are offered to others via WWW servers. Several data sets are also available. The center also has experimental efforts in gene expression analysis using DNA chips and data generation in relation to the physical and structural properties of DNA. The on-line prediction services at CBS are available as interactive input forms. Most of the servers are also available as stand-alone software packages with the same functionality. In addition, for some servers, programmatic access is provided in the form of SOAP-based Web Services. The center also educates engineering students in biotechnology and systems biology and offers a wide range of courses in bioinformatics, systems biology, human health, microbiology and nutrigenomics.
Proper citation: DTU Center for Biological Sequence Analysis (RRID:SCR_003590) Copy
http://www.agedbrainsysbio.eu/
Consortium focused on identifying the foundational pathways responsible for the aging of the brain, with a focus on Late Onset Alzheimer's disease. They aim to identify the interactions through which the aging phenotype develops in normal and in disease conditions; modeling novel pathways and their evolutionary properties to design experiments that identify druggable targets. As early steps of neurodegenerative disorders are expected to impact synapse function the project will focus in particular on pre- or postsynaptic protein networks. The concept is to identify subsets of pathways with two unique druggable hallmarks, the validation of interactions occurring locally in subregions of neurons and a human and/or primate accelerated evolutionary signature. The consortium will do this through six approaches: * identification of interacting protein networks from recent Late-Onset Alzheimer Disease-Genome Wide Association Studies (LOAD-GWAS) data, * experimental validation of interconnected networks working in subregion of a neuron (such as dendrites and dendritic spines), * inclusion of these experimentally validated networks in larger networks obtained from available databases to extend possible protein interactions, * identification of human and/or primate positive selection either in coding or in regulatory gene sequences, * manipulation of these human and/or primate accelerated evolutionary interacting proteins in human neurons derived from induced Pluripotent Stem Cells (iPSCs) * modeling predictions in drosophila and novel mouse transgenic models * validation of new druggable targets and markers as a proof-of-concept towards the prevention and cure of aging cognitive defects. The scientists will share results and know-how on Late-Onset Alzheimer Disease-Genome Wide Association Studies (LOAD-GWAS) gene discovery, comparative functional genomics in mouse and drosophila models, in mouse transgenic approaches, research on human induced pluripotent stem cells (hiPSC) and their differentiation in vitro and modeling pathways with emphasis on comparative and evolutionary aspects. The four European small to medium size enterprises (SMEs) involved will bring their complementary expertise and will ensure translation of project results to clinical application.
Proper citation: AgedBrainSYSBIO (RRID:SCR_003825) Copy
http://tigger.uic.edu/~cjeffery/
The moonlighting protein database is not yet available publicly. Stay tuned. Moonlighting proteins have multiple, seemingly unrelated functions not due to gene fusions or alternative splicing. Like PGI, which is a cytosolic enzyme and an extracellular cytokine, dozens of other proteins have been found to moonlight. Connie coined the term moonlighting proteins and has written several review articles that develop the idea of moonlighting proteins and describe additional moonlighting proteins from the literature, how they switch between functions, how they might have evolved, and how they might benefit the cell. She is currently writing two additional invited articles and planning computational studies of the sequences and structures of known moonlighting proteins.
Proper citation: MoonProt (RRID:SCR_008803) Copy
http://clipserve.clip.ubc.ca/topfind
An integrated knowledgebase focused on protein termini, their formation by proteases and functional implications. It contains information about the processing and the processing state of proteins and functional implications thereof derived from research literature, contributions by the scientific community and biological databases. It lists more than 120,000 N- and C-termini and almost 10,000 cleavages. TopFIND is a resource for comprehensive coverage of protein N- and C-termini discovered by all available in silico, in vitro as well as in vivo methodologies. It makes use of existing knowledge by seamless integration of data from UniProt and MEROPS and provides access to new data from community submission and manual literature curating. It renders modifications of protein termini, such as acetylation and citrulination, easily accessible and searchable and provides the means to identify and analyse extend and distribution of terminal modifications across a protein. The data is presented to the user with a strong emphasis on the relation to curated background information and underlying evidence that led to the observation of a terminus, its modification or proteolytic cleavage. In brief the protein information, its domain structure, protein termini, terminus modifications and proteolytic processing of and by other proteins is listed. All information is accompanied by metadata like its original source, method of identification, confidence measurement or related publication. A positional cross correlation evaluation matches termini and cleavage sites with protein features (such as amino acid variants) and domains to highlight potential effects and dependencies in a unique way. Also, a network view of all proteins showing their functional dependency as protease, substrate or protease inhibitor tied in with protein interactions is provided for the easy evaluation of network wide effects. A powerful yet user friendly filtering mechanism allows the presented data to be filtered based on parameters like methodology used, in vivo relevance, confidence or data source (e.g. limited to a single laboratory or publication). This provides means to assess physiological relevant data and to deduce functional information and hypotheses relevant to the bench scientist. TopFIND PROVIDES: * Integration of protein termini with proteolytic processing and protein features * Displays proteases and substrates within their protease web including detailed evidence information * Fully supports the Human Proteome Project through search by chromosome location CONTRIBUTE * Submit your N- or C-termini datasets * Contribute information on protein cleavages * Provide detailed experimental description, sample information and raw data
Proper citation: TopFIND (RRID:SCR_008918) Copy
PDBj (Protein Data Bank Japan) maintains a centralized PDB archive of macromolecular structures and provides integrated tools, in collaboration with the RCSB, the BMRB in USA and the PDBe in EU.
Proper citation: PDBj - Protein Data Bank Japan (RRID:SCR_008912) Copy
APID Interactomes (Agile Protein Interactomes DataServer) provides information on the protein interactomes of numerous organisms, based on the integration of known experimentally validated protein-protein physical interactions (PPIs). The interactome data includes a report on quality levels and coverage over the proteomes for each organism included. APID integrates PPIs from primary databases of molecular interactions (BIND, BioGRID, DIP, HPRD, IntAct, MINT) and also from experimentally resolved 3D structures (PDB) where more than two distinct proteins have been identified. This collection references protein interactors, through a UniProt identifier.
Proper citation: Agile Protein Interactomes DataServer (RRID:SCR_008871) Copy
http://www.proconsortium.org/pro/
An ontological representation of protein-related entities by explicitly defining them and showing the relationships between them. Each PRO term represents a distinct class of entities (including specific modified forms, orthologous isoforms, and protein complexes) ranging from the taxon-neutral to the taxon-specific. The ontology has a meta-structure encompassing three areas: proteins based on evolutionary relatedness (ProEvo); protein forms produced from a given gene locus (ProForm); and protein-containing complexes (ProComp). NOTICE: The PRO ID format has changed from PRO: to PR: (e.g. PRO:000000563 is now PR:000000563).
Proper citation: PR (RRID:SCR_004964) Copy
GenMAPP is a free computer application designed to visualize gene expression and other genomic data on maps representing biological pathways and groupings of genes. Integrated with GenMAPP are programs to perform a global analysis of gene expression or genomic data in the context of hundreds of pathway MAPPs and thousands of Gene Ontology Terms (MAPPFinder), import lists of genes/proteins to build new MAPPs (MAPPBuilder), and export archives of MAPPs and expression/genomic data to the web. The main features underlying GenMAPP are: *Draw pathways with easy to use graphics tools *Color genes on MAPP files based on user-imported genomic data *Query data against MAPPs and the GeneOntology Enhanced features include the simultaneous view of multiple color sets, expanded species-specific gene databases and custom database options.
Proper citation: Gene Map Annotator and Pathway Profiler (RRID:SCR_005094) Copy
https://www.rostlab.org/services/snpdbe/
A database to fill the annotation gap left by the high cost of experimental testing for functional significance of protein variants. It joins related bits of knowledge, currently distributed throughout various databases, into a consistent, easily accessible, and updatable resource. It currently covers over 155,000 protein sequences which come from more than 2,600 organisms. Overall more than one million single amino acid substitutions (SAASs) are referenced consisting of natural variants, SAASs from mutagenesis experiments and sequencing conflicts. SNPdbe offers the following pieces of information (if available) on each SAAS: * Experimentally derived functional and structural impact * Predicted functional effect * Associated disease * Average heterozygosity * Experimental evidence of the nsSNP * Evolutionary conservation of wildtype and mutant amino acid * Link-outs to external databases A convenient webinterface to query SAASs on the following levels is offered: * Protein and gene identifiers and keywords * Disease keywords * Protein sequence on different sequence identity thresholds * Variant identifier (dbSNP rs, SwissVar, PMD) or specific mutant like XposY and specified sequence They offer the possibility to submit protein sequences along with experimentally substantiated mutations in order to predict their functional effect and inclusion into our database.
Proper citation: SNPdbe (RRID:SCR_005190) Copy
http://www.jcvi.org/cgi-bin/tigrfams/index.cgi
Consists curated multiple sequence alignments, Hidden Markov Models (HMMs) for protein sequence classification, and associated information designed to support automated annotation of (mostly prokaryotic) proteins. Starting with release 10.0, TIGRFAMs models use HMMER3, which provides excellent search speed as well as exquisite search sensitivity. See the "TIGRFAMs Complete Listing" page to review the accession, protein name, model type, and EC number (if assigned) of all models. TIGRFAMs is a member database in InterPro. The HMM libraries and supporting files are available to download and use for free from our FTP site.
Proper citation: TIGRFAMS (RRID:SCR_005493) Copy
http://llama.mshri.on.ca/synergizer/translate/
The Synergizer database is a growing repository of gene and protein identifier synonym relationships. This tool facilitates the conversion of identifiers from one naming scheme (a.k.a namespace) to another. The Synergizer is a service for translating between sets of biological identifiers. It can, for example, translate Ensembl Gene IDs to Entrez Gene IDs, or IPI IDs to HGNC gene symbols, and much more. Unlike some other tools for this purpose, The Synergizer is simple and easy to learn. The Synergizer works via a web interface (for users who are not programmers) or through a web service (for programmatic access).
Proper citation: Synergizer (RRID:SCR_005308) Copy
http://www.ncbi.nlm.nih.gov/gtr/
Central location for voluntary submission of genetic test information by providers including the test''s purpose, methodology, validity, evidence of the test''s usefulness, and laboratory contacts and credentials. GTR aims to advance the public health and research into the genetic basis of health and disease. GTR is accepting registration of clinical tests for Mendelian disorders, complex tests and arrays, and pharmacogenetic tests. These tests may include multiple methods and may include multiple major method categories such as biochemical, cytogenetic, and molecular tests. GTR is not currently accepting registration of tests for somatic disorders, research tests or direct-to-consumer tests.
Proper citation: Genetic Testing Registry (RRID:SCR_005565) Copy
http://noble.gs.washington.edu/proj/sdp-svm/
A statistical framework for genomic data fusion is a computational framework for integrating and drawing inferences from a collection of genome-wide measurements. Each dataset is represented via a kernel function, which defines generalized similarity relationships between pairs of entities, such as genes or proteins. The kernel representation is both flexible and efficient, and can be applied to many different types of data. Furthermore, kernel functions derived from different types of data can be combined in a straightforward fashion. Recent advances in the theory of kernel methods have provided efficient algorithms to perform such combinations in a way that minimizes a statistical loss function. These methods exploit semidefinite programming techniques to reduce the problem of finding optimizing kernel combinations to a convex optimization problem. Computational experiments performed using yeast genome-wide datasets, including amino acid sequences, hydropathy profiles, gene expression data and known protein-protein interactions, demonstrate the utility of this approach. A statistical learning algorithm trained from all of these data to recognize particular classes of proteins--membrane proteins and ribosomal proteins--performs significantly better than the same algorithm trained on any single type of data. Matlab code to center a kernel matrix and Matlab code for normalization are available.
Proper citation: A statistical framework for genomic data fusion (RRID:SCR_007219) Copy
http://www.ncbi.nlm.nih.gov/guide/sitemap/
The National Center for Biotechnology Information''s listing of resources. Sort by alphabetical character, Databases, Downloads, Submissions, Tools and How-To; or by Topic: Chemicals & Bioassays; Data & Software; DNA & RNA; Domains & Structures; Genes & Expression; Genetics & Medicine; Genomes & Maps; Homology; Literature; Proteins; Sequence Analysis; Taxonomy; Training & Tutorials; Variation.
Proper citation: NCBI Resource List (RRID:SCR_005628) Copy
http://wiki.c2b2.columbia.edu/califanolab/index.php/BCellInteractome.htm
A network of protein-protein, protein-DNA and modulatory interactions in human B cells. The network contains known interactions (reported in public databases) and predicted interactions by a Bayesian evidence integration framework which integrates a variety of generic and context specific experimental clues about protein-protein and protein-DNA interactions with inferences from different reverse engineering algorithms, such as GeneWays and ARACNE. Modulatory interactions are predicted by the MINDY, an algorithm for the prediction of modulators of transcriptional interactions (please refer to the publication section for more information). The BCI can be downloaded as one tab delimited file containing the complete network (BCI.txt) with each type of interaction explicitly defined.
Proper citation: B Cell Interactome (RRID:SCR_008655) Copy
http://www.ch.embnet.org/software/COILS_form.html
COILS is a program that compares a sequence to a database of known parallel two-stranded coiled-coils and derives a similarity score. By comparing this score to the distribution of scores in globular and coiled-coil proteins, the program then calculates the probability that the sequence will adopt a coiled-coil conformation.
Proper citation: COILS: Prediction of Coiled Coil Regions in Proteins (RRID:SCR_008440) Copy
https://github.com/TransDecoder/TransDecoder
Software tool to identify candidate coding regions within transcript sequences, such as those generated by de novo RNA-Seq transcript assembly using Trinity, or constructed based on RNA-Seq alignments to genome using Tophat and Cufflinks.Starts from FASTA or GFF file. Can scan and retain open reading frames (ORFs) for homology to known proteins by using BlastP or Pfam search and incorporate results into obtained selection. Predictions can then be visualized by using genome browser such as IGV.
Proper citation: TransDecoder (RRID:SCR_017647) Copy
https://www.synapse.org/#!Synapse:syn4921369/wiki/235539
Portal of PsychENCODE Consortium to study role of rare genetic variants involved in several psychiatric disorders. Database of regulatory elements, epigenetic modifications, RNA and protein in brain.
Proper citation: PsychENCODE Knowledge Portal (RRID:SCR_017500) Copy
http://pathways.mcdb.ucla.edu/algal/
Tools to search gene lists for functional term enrichment as well as to dynamically visualize proteins onto pathway maps. Additionally, integrated expression data may be used to discover similarly expressed genes based on a starting gene of interest.
Proper citation: Algal Functional Annotation Tool (RRID:SCR_012034) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.