GeNMR


GeNMR method is the first fully automated template-based method of protein structure determination that utilizes both NMR chemical shifts and NOE -based distance restraints.
In addition to the template-based approach, the GeNMR webserver also offers an ab initio protein folding mode that starts folding from an extended structure. The GeNMR web server produces an ensemble of PDB coordinates within a period ranging from 20 minutes to 4 hours, depending on protein size, server load, quality and type of experimental information, and selected protocol options. GeNMR webserver is composed of two parts, a front-end web-interface and a back-end consisting of eight different alignment, structure generation and structure optimization programs along with three local databases.

Input

GeNMR accepts and processes backbone and side chain 1H, 13C or 15N chemical shift data of almost any combination. This allows GeNMR to handle small peptides to large proteins.
The input files must include chemical shift data in NMR-STAR 2.1 format and distance restraints in / format. The minimum sequence length is 30 residues.

Output

The output for a typical GeNMR structure calculation consists of a user-defined set of lowest energy PDB coordinates in a simple, downloadable text format. In addition, details about the overall energy score and chemical shift correlations is provided at the top of the output page. If score failed to decrease below a certain threshold, a warning is printed at the top of the page.

Sub-programs

A flow chart describing the processing logic used in GeNMR is shown on the right. GeNMR makes use of a number of well-known programs and databases. These include to perform structural modeling, to calculate torsion angles from chemical shifts, for comparative modeling and alignment and CS23D to calculate protein structures from chemical shifts only. GeNMR also uses several well-known external programs, including for ab initio folding without NOEs and for NOE-based simulated annealing and refinement. A more complete list of GeNMR sub-programs is listed on the CS23D page.

Homology modelling

GeNMR uses homology modeling and sequence/structure threading to rapidly generate a first-pass model of the query protein. The use of homology modeling/threading in GeNMR allows a considerable speed-up in its structure calculations since homology models can often be generated and refined in a minute or two.

Genetic algorithm

GeNMR also makes use of genetic algorithms to allow configurational sampling and structural refinement using non-differentiable scores, such as ShiftX chemical shift scores. GeNMR's genetic algorithm creates a population of initial structures and then uses combinations of mutations, cross-overs, segment swaps and writhe movements to comprehensively sample conformation space. The 25 lowest energy structures are then selected, duplicated and carried to the next round of conformational sampling.

Scoring functions

The potential functions used in GeNMR are derived from those used in and . The knowledge-based potentials include information on predicted/known secondary structure, radius of gyration, hydrogen bond energies, number of hydrogen bonds, allowed backbone and side chain torsion angles, atom contact radii, disulfide bonding information and a modified threading energy based on the . The chemical shift component of the GeNMR potential uses weighted correlation coefficients calculated between the observed and calculated shifts of the structure being refined.

Calculation scenarios

There are six different kinds of calculation scenarios that GeNMR can currently accommodate. These scenarios include:
  1. chemical shift only—query has homologue in database;
  2. chemical shift only—query has no homologue in database;
  3. NOE only—query has homologue in database;
  4. NOE only—query has no homologue in database;
  5. NOE and chemical shift—query has homologue in database;
  6. NOE and chemical shift—query has no homologue in database.