Switch to GRS_UTRdb
   

GRSDB Help: Search & Analysis

1. QGRS Definition
2. Search & Analysis
3. Workbench Help
4. Understanding G-Scores
5. Dealing with overlaps
6. Glossary

The search function contains the following user choices which can be combined with a Boolean operator.(Note: all searches are case-independent.)

Search Field

There are fourteen different fields that the user may search on:
Search FieldComments
Gene IDThe unique identifier used in Entrez Gene
Gene SymbolStandard abbreviation of gene name
AliasesOther names the gene is known by
Gene NameThe official NCBI gene name. The user may search for a gene having precisely this name, containing the name, starting or ending with the name.
GIThe unique 'GeneInfo' identifier used in NCBI
Accession #NCBI accession number
ChromosomeThe chromosome on which the gene is located
Gene size A range of values for the length (nucelotides) of the gene
Number of ProductsAllows user to look for genes having a number of mRNA products in a given range
Number of PolyA SignalsAllows user to look for genes having a number of polyA signals in a given range
OrganismThe user may perform a search on a particular set of organisms or all organisms (which is the default).
GO functionGene Ontology function
GO processGene Ontology process
GO componentGene Ontology component

Query results

Suppose that we wanted to consider alternatively spliced human or rat genes involved in apoptosis.

G-Quadruplex Search Options

This results in several hits. The first page of results is shown below. The results may be sorted using any of the columns. For example, sorting by Gene Size arranges the genes in increasing order of gene size. G-Quadruplex Search Results

The user may further analyze genes by putting them into a Workbench, where various programs are available to study QGRS distribution patterns for a particular set of genes.

There are several OPTIONS that control how the user wishes to display results for a specific gene. The options are set in the following window:

G-Quadruplex Filter Options

The following options are available to the user:
Search options Comments
Number of genes per page On the results pages, 12 genes are displayed per page by default.
Type of QGRS to display There are two categories of QGRS that may be selected from (although the user has additional QGRS filters described below). (30, 2) selects all QGRS of length at most 30 nucleotides and having at least 2 G's per G-group (45,3) selects all QGRS of length at most 45 and having at least 3 G's per G-group.
G-score Only QGRS having a G-score in the selected range are displayed
QGRS Max Length The maximum length of QGRS.
Min G-group The minimum number of G's in each G-group. 2 G's are the default value.
Loop Size Only QGRS having all three loops in the selected range are displayed. Default range is 0 to 36. However, only up to one loop is allowed to be of length zero. Please see QGRS Definition in the Help.

QGRS Analysis

G-quadruplex may be displayed in five different views: Gene View, Data View (no overlaps), Data View (with overlaps), Sequence View, Graphics View.

Gene View

To analyze a particular gene one clicks on the entry in the Analyze Genes column. Suppose we select the PCBP4 gene. This leads to a page entitled Gene View. This page lists basic gene information, including gene ontology. The number of QGRS (either overlapping or non-overlapping) are listed in both categories (30,2), (45,3).

For each alternatively spliced product a map of the exon and intron structure is given together with QGRS information for that product.

PCBP4 Gene View

The user can next look at the Data View or Sequence View, either for collection of ALL Products or for any single product. If either view is selected for ALL Products, then information is displayed for each product. Any of the options for filtering QGRS can be set at the Gene View page. Suppose that the options are set so that the each loop must be between 1 and 7 bases.

Data View (no overlaps)

Suppose that the Data View for non-overlapping QGRS for one product has been selected. The QGRS displayed in this view are non-overlapping. Please see Dealing with overlaps to learn about selection of non-overlapping QGRS.

A table of basic gene information is given. Each QGRS in that product is listed as to whether it is in an exon, intron, or near a polyA signal (which is defined to be within 200 nucleotides of the signal).

For QGRS in exons the following information is given: length of the sequence, which exon the QGRS is located in, distance from 3' and 5' splice sites, the specific nucleotide sequence, and the G-score of the sequence. Each QGRS is assigned a numerical value, called its G-score. The higher the G-score the more likely will the sequence form a stable unimolecular G-quadruplex. For more on G-scores see the help section Understanding G-Scores.

For QGRS in introns the following information is given: length of the sequence, which intron the QGRS is located in, distance from 3' and 5' splice sites, the specific nucleotide sequence, and the G-score of the sequence.

For QGRS near polyA signals the following information is given: length of the sequence, which polyA signal the QGRS is near, where the polyA signal is located, the specific nucleotide sequence, and the G-score of the sequence.

In case no QGRS found with the current settings, nothing is displayed in relevant section of the table.

In addition, the Data View shows any user options that have been selected and allows the user to export the data to Excel. Shown below is the Data View for the product NM_00105158.1 of the PCBP4 gene using loop sizes between 1 and 7 nucleotides.

CD74 Gene Data View

Data View (with overlaps)

Gives a display similar to the above, except that all QGRS overlapping or not, are displayed.

Sequence View

The nucleotide sequence for the entire gene is displayed. Exons are listed in purple and each QGRS is shown in yellow. Basic gene information is given and any filters that have been set are shown. Shown below is the Sequence View for the product NM_00105158.1 of the PCBP4 gene with loop sizes between 1 and 7 nucleotides.

CD74 Gene Sequence View

Graphics View

The user can also choose the Graphics View to give a visual display of the location of QGRS. This allows the user to see the location of QGRS relative to exons and introns (if that information is available). The Graphics View has the following components: CD74 Gene Graphics View